aggregateTagClusters: Aggregating tag clusters across multiple CAGE datasets
In ge11232002/CAGEr: Analysis of CAGE (Cap Analysis of Gene Expression) sequencing data for precise mapping of transcription start sites and promoterome mining

Description Usage Arguments Details Value Author(s) See Also Examples

Aggregates tag clusters (TCs) across all CAGE dataset within the CAGEset object to create a referent set of consensus clusters.

1
2
3

aggregateTagClusters(object, tpmThreshold = 5, 
					 excludeSignalBelowThreshold = TRUE, 
					 qLow = NULL, qUp = NULL, maxDist = 100)

`object`	A `CAGEset` object
`tpmThreshold`	Only tag clusteres with normalized signal `>= tpmThreshold` will be used to construct consensus clusters.
`excludeSignalBelowThreshold`	When `TRUE` only tag clusteres with normalized signal `>= tpmThreshold` will contribute to the total CAGE signal of a consensus cluster, i.e. only the TCs that are used to construct consensus cluster. When set to `FALSE` all TCs that overlap consensus cluster will contribute to the total signal (regardless whether they pass the threshold or not), however only the TCs above the threshold will be used to define consensus cluster boundaries. Thus, it that case the TCs above the threshold are first used to construct consensus clusters and define their boundaries, but then CAGE signal from all TCs that fall within those boundaries is used to calculate total signal of a particular consensus cluster.
`qLow`	Position of which "lower" quantile should be used as 5' boundary of the tag cluster. If `qLow = NULL` start position of the TC is used. See Details.
`qUp`	Position of which "upper" quantile should be used as 3' boundary of the tag cluster. If `qUp = NULL` end position of the TC is used. `qUp` has to be `>= qLow`. See Details.
`maxDist`	Maximal length of the gap (in base-pairs) between two tag clusters for them to be part of the same consensus clusters. See Details.

Tag clusters (TCs) returned by clusterCTSS function are constructed for every CAGE dataset within CAGEset object separatelly, based on the CAGE signal in that sample. Thus, TCs from two CAGE datasets can differ both in their number, genomic coordinates, position of dominant TSS and overall signal. To be able to compare all samples at the level of clusters of TSSs, TCs from all CAGE datasets are aggregated into a single set of consensus clusters. First, TCs with signal >= tpmThreshold from all CAGE datasets are selected, and their 5' and 3' boundaries are determined based on provided qLow and qUp parameters. If qLow = NULL and qUp = NULL the start and end coordinates, i.e. the full span of the TC is used, otherwise the positions of qLow and qUp quantiles are used as 5' and 3' boundary, respectively. Finally, the defined set of TCs from all CAGE datasets is reduced to a non-overlapping set of consensus clusters by merging overlapping TCs and TCs <= maxDist base-pairs apart. Consensus clusters represent a referent set of promoters that can be further used for expression profiling or detecting "shifting" (differentially used) promoters between different CAGE samples.

The slots consensusClusters, tagClustersInConsensusClusters and consensusClustersTpmMatrix of the provided CAGEset object will be occupied by the genomic coordinates of consensus clusters, information on containing TCs and the total CAGE signal across all CAGE datasets, respectively.

Vanja Haberle

clusterCTSS

load(system.file("data", "exampleCAGEset.RData", package="CAGEr"))

aggregateTagClusters(object = exampleCAGEset, tpmThreshold = 50,
excludeSignalBelowThreshold = FALSE, qLow = 0.1, qUp = 0.9, maxDist = 100)

ge11232002/CAGEr documentation built on May 17, 2019, 12:13 a.m.

ge11232002/CAGEr index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

ge11232002/CAGEr
Analysis of CAGE (Cap Analysis of Gene Expression) sequencing data for precise mapping of transcription start sites and promoterome mining

aggregateTagClusters: Aggregating tag clusters across multiple CAGE datasets
In ge11232002/CAGEr: Analysis of CAGE (Cap Analysis of Gene Expression) sequencing data for precise mapping of transcription start sites and promoterome mining

Description

Usage

Arguments

Details

Value

Author(s)

See Also

Examples

Related to aggregateTagClusters in ge11232002/CAGEr...

R Package Documentation

Browse R Packages

We want your feedback!

ge11232002/CAGEr Analysis of CAGE (Cap Analysis of Gene Expression) sequencing data for precise mapping of transcription start sites and promoterome mining

aggregateTagClusters: Aggregating tag clusters across multiple CAGE datasets In ge11232002/CAGEr: Analysis of CAGE (Cap Analysis of Gene Expression) sequencing data for precise mapping of transcription start sites and promoterome mining

Description

Usage

Arguments

Details

Value

Author(s)

See Also

Examples

Related to aggregateTagClusters in ge11232002/CAGEr...

R Package Documentation

Browse R Packages

We want your feedback!

ge11232002/CAGEr
Analysis of CAGE (Cap Analysis of Gene Expression) sequencing data for precise mapping of transcription start sites and promoterome mining

aggregateTagClusters: Aggregating tag clusters across multiple CAGE datasets
In ge11232002/CAGEr: Analysis of CAGE (Cap Analysis of Gene Expression) sequencing data for precise mapping of transcription start sites and promoterome mining