excludeDoubleMatchingProbes: Exclude ProbeLevelSet probes that match more than once in...

Description Usage Arguments Details Value Author(s) See Also Examples

Description

Function that will remove probes from ProbeLevelSet if they have more than one match in a given genome.

Usage

1
2
3
    excludeDoubleMatchingProbes(object, genome="BSgenome.Hsapiens.UCSC.hg18", verbose=TRUE,
    directions=c("matchForwardSense", "matchForwardAntisense", "matchReverseSense", "matchReverseAntisense"),
    previousData = NULL)

Arguments

object

A ProbeLevelSet class object.

genome

character string with the name of the BSGenome in which sequences should be found. Defaults to the human genome.

verbose

TRUE or FALSE.

directions

character string with elements from c("matchForwardSense", "matchForwardAntisense", "matchReverseSense", "matchReverseAntisense"). Defines which directions (complementary and reverse mirrorings) that should be scanned. Defaults to all directions.

previousData

Optional: The output from a call to findSequenceInGenome. If given the scanning will be skipped, and the probes will be omitted directly. Useful in cases were datasets from the same region needs to be processed.

Details

This function will take quite a while to run, so if you have many sequences, overnight runs are recommended. BSgenome contains some alternative versions of chromosomes. They are marked with an underscore. This function automatically disregards chromosome names with an underscore, and this is known to work for the human genome. Nevertheless, check the output printed to terminal if all chromosomes are included. The function is a wrapper around findSequenceInGenome, which can be used for purposes that are more flexible (although that function really is just following the example in the BSgenome package)

At present, there is no functionality to check matches with known SNP or known splice forms taken into account.

Value

The ProbeLevelSet class object provided as argument, with all double matching probes removed. Double matching probes are probes whose sequence are found twice or more in the genome. In addition, the output of the matching investigation is saved in the notes of the ProbeLevelSet and can be further examined for information on the locations of the probe sequences in relation to the BSgenome sequences.

Author(s)

Lasse Folkersen

See Also

findSequenceInGenome, BSgenome, excludeDoubleMatchingProbes

Examples

1
2
3
4
5
	## Not run: 
	#you can run this, but it takes a lot of time
	probelevelsetwithnodoubles<-excludeDoubleMatchingProbes(probelevelset)
	
## End(Not run)

GeneRegionScan documentation built on Nov. 8, 2020, 8:28 p.m.