createVirtualFragmentLibrary: Create a virtual fragment library from a provided genome and...

Description Usage Arguments Details Value Note Author(s) Examples

Description

Basic4Cseq can create virtual fragment libraries from any BSgenome package or DNAString object. Two restriction enzymes have to be specified to cut the DNA, the read length is needed to check the fragment ends of corresponding length for uniqueness. Filter options (minimum and maximum size) are provided on fragment level and on fragment end level.

Usage

1
createVirtualFragmentLibrary(chosenGenome, firstCutter, secondCutter, readLength, onlyNonBlind = TRUE, useOnlyIndex = FALSE, minSize = 0, maxSize = -1, minFragEndSize = 0, maxFragEndSize = 10000000, useAllData = TRUE, chromosomeName = "chr1", libraryName = "default")

Arguments

chosenGenome

The genome that is to be digested in silico with the provided enzymes; can be an instance of BSgenome or DNAString

firstCutter

First of two restriction enzymes

secondCutter

Second of two restriction enzymes

readLength

Read length for the experiment

onlyNonBlind

Variable that is TRUE (default) if only non-blind fragments are considered (i.e. all blind fragments are removed)

useOnlyIndex

Convenience function to adapt the annotation style of the chromosomes ("chr1", ... "chrY" or "1", ..., "Y"); parameter has to be set to match the BAM file in question

minSize

Filter option that allows to delete fragments below a certain size (in bp)

maxSize

Filter option that allows to delete fragments above a certain size (in bp)

minFragEndSize

Filter option that allows to delete fragment ends below a certain size (in bp)

maxFragEndSize

Filter option that allows to delete fragment ends above a certain size (in bp)

useAllData

Variable that indicates if all data of a BSgenome package is to be used. If FALSE, chromosome names including a "_" are removed, reducing the set of chromosomes to (1 ... 19, X, Y, MT) for the mouse genome or (1 ... 22, X, Y, MT) for the human genome

chromosomeName

Chromosome name for the virtual fragment library if a DNAString object is used instead of a BSgenome object.

libraryName

Name of the file the created virtual fragment library is written to. Per default the file is called "fragments_firstCutter_secondCutter.csv". The fragment data is returned as a data frame if and only if an empty character string is chosen as libraryName.

Details

Value

A tab-separated file with the specified virtual fragment library (containing fragment position, length, presence of second restriction enzyme and uniqueness of the fragment ends)

Note

Author(s)

Carolin Walter

Examples

1
2
3
4
    if(interactive()) {
        library(BSgenome.Ecoli.NCBI.20080805)
        fragmentData = createVirtualFragmentLibrary(chosenGenome = Ecoli$NC_002655, firstCutter = "catg", secondCutter = "gtac", readLength = 30,  onlyNonBlind = TRUE, chromosomeName = "NC_002655", libraryName = "fragments_Ecoli.csv")
    }

Basic4Cseq documentation built on Nov. 8, 2020, 6:53 p.m.