findChunks: Identifies 'chunks' of data within a set of aligned reads.
In segmentSeq: Methods for identifying small RNA loci from high-throughput sequencing data

Description Usage Arguments Details Value Author(s) Examples

This function identifies chunks of data within a set of aligned reads by looking for gaps within the alignments; regions where no reads align. If we assume that a locus should not contain a gap of sufficient length, then we can separate the analysis of the data into chunks defined by these gaps, reducing the complexity of the problem of segmentation.

1	findChunks(alignments, gap, checkDuplication = TRUE, justChunks = FALSE)

`alignments`	A `GRanges` object defining a set of aligned reads.
`gap`	The minimum length of a gap across which it is assumed that no locus can exist.
`checkDuplication`	Should we check whether or not reads are duplicated within a chunk? Defaults to TRUE.
`justChunks`	If TRUE, returns a vector of the chunks rather than the GRanges object with chunks attached. Defaults to FALSE.

This function is called by the readGeneric and readBAM functions but may usefully be called again if filtering of an linkS4class{alignmentData} object has altered the data present, or to increase the computational effort required for subsequent analysis. The lower the ‘gap’ parameter used to define the chunks, the faster (though potentially less accurate) any subsequent analyses will be.

A modified GRanges object, now containing columns ‘chunk’ and ‘chunkDup’ (if 'checkDuplication' is TRUE), identifying the chunk to which the alignment belongs and whether the alignment of the tag is duplicated within the chunk respectively.

Thomas J. Hardcastle

# Define the files containing sample information.

datadir <- system.file("extdata", package = "segmentSeq")
libfiles <- c("SL9.txt", "SL10.txt", "SL26.txt", "SL32.txt")

# Establish the library names and replicate structure.

libnames <- c("SL9", "SL10", "SL26", "SL32")
replicates <- c(1,1,2,2)

# Read the files to produce an `alignmentData' object.

alignData <- readGeneric(file = libfiles, dir = datadir, replicates =
replicates, libnames = libnames, gap = 100)

# Filter the data on number of matches of each tag to the genome

alignData <- alignData[values(alignData@alignments)$matches < 5,]

# Redefine the chunking structure of the data.

alignData <- findChunks(alignData@alignments, gap = 100)

segmentSeq documentation built on Nov. 8, 2020, 5:18 p.m.

segmentSeq index

Package overview segmentSeq: small RNA locus detection segmentsSeq: Methylation locus identification

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

segmentSeq
Methods for identifying small RNA loci from high-throughput sequencing data

findChunks: Identifies 'chunks' of data within a set of aligned reads.
In segmentSeq: Methods for identifying small RNA loci from high-throughput sequencing data

Description

Usage

Arguments

Details

Value

Author(s)

Examples

Related to findChunks in segmentSeq...

R Package Documentation

Browse R Packages

We want your feedback!

segmentSeq Methods for identifying small RNA loci from high-throughput sequencing data

findChunks: Identifies 'chunks' of data within a set of aligned reads. In segmentSeq: Methods for identifying small RNA loci from high-throughput sequencing data

Description

Usage

Arguments

Details

Value

Author(s)

Examples

Related to findChunks in segmentSeq...

R Package Documentation

Browse R Packages

We want your feedback!

segmentSeq
Methods for identifying small RNA loci from high-throughput sequencing data

findChunks: Identifies 'chunks' of data within a set of aligned reads.
In segmentSeq: Methods for identifying small RNA loci from high-throughput sequencing data