getProbesetsFromRegionOfInterest: Get Probeset IDs From Region Of Interest

Description Usage Arguments Details Value Author(s) See Also Examples

View source: R/creationFunctions.R

Description

Function that return the probesets located within a given region of the genome.

Usage

1
    getProbesetsFromRegionOfInterest(annotation, chromosome, start, end, pythonPath=NULL, transcriptClustersFile=NULL, mpsToPsFile=NULL)

Arguments

annotation

A characther string giving the type of array for which probesets are needed. Will be used to load the .db file from bioconductor. Optional if mps to ps and transcript cluster file is given.

chromosome

The chromosome of interest. Should be given as a character string of the type "Chr1", "Chr2", "ChrY".

start

A character string with the start location of interest in the chromosome.

end

A character string with the end location of interest in the chromosome.

pythonPath

Optional character string with the path for Python software. This is only needed for exon arrays. If Python is in path this will be recognised automatically. Python can be downloaded from http://www.python.org.

transcriptClustersFile

The location of a transcript cluster file such as HuEx-1\_0-st-v2.na26.hg18.transcript.csv. These can be downloaded from http://www.affymetrix.com.

mpsToPsFile

The location of a transcript cluster file such as HuEx-1\_0-st-v2.r2.dt1.hg18.full.mps. These can be downloaded from http://www.affymetrix.com. Best to use the "full" type file.

Details

While working with 3'IVT type affymetrix arrays, for which .db files exist within the bioconductor environment, this function works in a pretty simple way. However if exon arrays are used, it will change to parsing the relevant library files from the http://www.affymetrix.com website using a python script. Admittedly this is not optional from an R-only point-of-view, but it works and its fast. The function will be updated when more R-centric ways of parsing exon array annotation data is available.

Alternatively this data can just as well be retrieved from the web, but in some cases this function is faster an easier.

Value

A list of all probesets found in the given range on the given chromosome.

Author(s)

Lasse Folkersen

See Also

getLocalProbeIntensities, getMetaprobesetsFromRegionOfInterest

Examples

1
2
3
4
5
	## Not run: 
	#must supply mpsToPsFile and transcriptClustersFile for this to work
	probesets<-getProbesetsFromRegionOfInterest("notusedhere", chromosome=2, start="215889955", end="216106710", transcriptClustersFile=transcriptClustersFile, mpsToPsFile=mpsToPsFile)
	
## End(Not run)

GeneRegionScan documentation built on Nov. 8, 2020, 8:28 p.m.