Description Usage Arguments Value Use of multi-width regions of interest Author(s) See Also Examples
View source: R/signal_counting.R
Get the sum of the signal in dataset.gr that overlaps each position
within each range in regions.gr. If binning is used (i.e. positions
are wider than 1 bp), any function can be used to summarize the signal
overlapping each bin. For a description of the critical difference between
expand_ranges = FALSE and expand_ranges = TRUE, see
getCountsByRegions.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 |
dataset.gr |
A GRanges object in which signal is contained in metadata (typically in the "score" field), or a named list of such GRanges objects. |
regions.gr |
A GRanges object containing regions of interest. |
binsize |
Size of bins (in bp) to use for counting within each range of
|
FUN |
If |
simplify.multi.widths |
A string indicating the output format if the
ranges in |
field |
The metadata field of |
NF |
An optional normalization factor by which to multiply the counts.
If given, |
blacklist |
An optional GRanges object containing regions that should be excluded from signal counting. |
NA_blacklisted |
A logical indicating if NA values should be returned
for blacklisted regions. By default, signal in the blacklisted sites is
ignored, i.e. the reads are excluded. If |
melt |
A logical indicating if the count matrices should be melted. If
set to |
expand_ranges |
Logical indicating if ranges in |
ncores |
Multiple cores will only be used if |
If the widths of all ranges in regions.gr are equal, a matrix
is returned that contains a row for each region of interest, and a column
for each position (each base if binsize = 1) within each region. If
dataset.gr is a list, a parallel list is returned containing a
matrix for each input dataset.
If the input
regions.gr contains ranges of varying widths, setting
simplify.multi.widths = "list" will output a list of variable-length
vectors, with each vector corresponding to an individual input region. If
simplify.multi.widths = "pad 0" or "pad NA", the output is a
matrix containing a row for each range in regions.gr, but the number
of columns is determined by the largest range in regions.gr. For
each region of interest, columns that correspond to positions outside of
the input range are set, depending on the argument, to 0 or
NA.
Mike DeBerardine
getCountsByRegions,
metaSubsample
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 | data("PROseq") # load included PROseq data
data("txs_dm6_chr4") # load included transcripts
#--------------------------------------------------#
# counts from 0 to 50 bp after the TSS
#--------------------------------------------------#
txs_pr <- promoters(txs_dm6_chr4, 0, 50) # first 50 bases
countsmat <- getCountsByPositions(PROseq, txs_pr)
countsmat[10:15, 41:50] # show only 41-50 bp after TSS
#--------------------------------------------------#
# redo with 10 bp bins from 0 to 100
#--------------------------------------------------#
# column 5 is sums of rows shown above
txs_pr <- promoters(txs_dm6_chr4, 0, 100)
countsmat <- getCountsByPositions(PROseq, txs_pr, binsize = 10)
countsmat[10:15, ]
#--------------------------------------------------#
# same as the above, but with the average signal in each bin
#--------------------------------------------------#
countsmat <- getCountsByPositions(PROseq, txs_pr, binsize = 10, FUN = mean)
countsmat[10:15, ]
#--------------------------------------------------#
# standard deviation of signal in each bin
#--------------------------------------------------#
countsmat <- getCountsByPositions(PROseq, txs_pr, binsize = 10, FUN = sd)
round(countsmat[10:15, ], 1)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.