Description Usage Arguments Value Author(s) See Also Examples
Counts reads in binding site intervals. Files must be one of bam, bed and gzip-compressed bed. File suffixes must be ".bam", ".bed", or ".bed.gz" respectively.
1 2 3 4 5 6 7 | dba.count(DBA, peaks, minOverlap=2, score=DBA_SCORE_NORMALIZED,
fragmentSize=DBA$config$fragmentSize,
summits=200, filter=1, bRemoveDuplicates=FALSE, bScaleControl=TRUE,
bSubControl = is.null(DBA$greylist),
mapQCth=DBA$config$mapQCth, filterFun=max, minCount=0,
bLog=FALSE, bUseSummarizeOverlaps=TRUE,
readFormat=DBA_READS_DEFAULT, bParallel=DBA$config$RunParallel)
|
DBA |
DBA object | |||||||||||||||||||||||
peaks |
If GRanges, RangedData, dataframe, or matrix, this parameter contains the intervals to use for counting. If character string, it specifies a file containing the intervals to use (with the first three columns specifying chromosome, startpos, endpos).If missing or a mask, generates a consensus peakset using minOverlap parameter (after applying the mask if present). If NULL, the | |||||||||||||||||||||||
minOverlap |
only include peaks in at least this many peaksets when generating consensus peakset (i.e. when peaks parameter is missing). If minOverlap is between zero and one, peak will be included from at least this proportion of peaksets. | |||||||||||||||||||||||
score |
which score to use in the binding affinity matrix. Note that all raw read counts are maintained for use by
| |||||||||||||||||||||||
fragmentSize |
This value will be used as the length of the reads.
Each read will be extended from its endpoint along the appropriate strand by this many bases.
If set to zero, the read size indicated in the BAM/BED file will be used.
| |||||||||||||||||||||||
summits |
unless set to If the value of Note that if | |||||||||||||||||||||||
filter |
value to use for filtering intervals with low read counts.
The NB: the filtering will be based on RPKM values.
If | |||||||||||||||||||||||
bRemoveDuplicates |
logical indicating if duplicate reads (ones that map to exactly the same genomic position) should be removed.
If | |||||||||||||||||||||||
bScaleControl |
logical indicating if the Control reads should be scaled based on relative library sizes.
If | |||||||||||||||||||||||
bSubControl |
logical indicating whether Control read counts are subtracted for
each site in each sample.
If there are more overlapping control reads than ChIP reads,
the count will be set to the If | |||||||||||||||||||||||
mapQCth |
for filtering by mapping quality (mapqc).
Only alignments with mapping scores of at least this value will be included.
Only applicable for bam files when | |||||||||||||||||||||||
filterFun |
function that will be invoked for each interval with a
vector of scores for each sample.
Returns a score that will be evaluated against the | |||||||||||||||||||||||
minCount |
minimum read count value. Any interval with fewer than this many overlapping reads will be set to have this count. Also applies to scores. | |||||||||||||||||||||||
bLog |
logical indicating whether log2 of score should be used (only applies to DBA_SCORE_RPKM_FOLD and DBA_SCORE_READS_FOLD). | |||||||||||||||||||||||
bUseSummarizeOverlaps |
logical indicating that See notes for when the Five additional parameters for
| |||||||||||||||||||||||
readFormat |
Specify the file type of the read files, over-riding the file extension. Possible values:
Note that if | |||||||||||||||||||||||
bParallel |
if |
DBA object with binding affinity matrix based on read count scores.
Rory Stark and Gordon Brown
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 | # These won't run unless you have the reads available in a BAM or BED file
data(tamoxifen_peaks)
## Not run: tamoxifen <- dba.count(tamoxifen)
# Count using a peakset made up of only peaks in all responsive MCF7 replicates
data(tamoxifen_peaks)
mcf7Common <- dba.overlap(tamoxifen,tamoxifen$masks$MCF7&tamoxifen$masks$Responsive)
## Not run: tamoxifen <- dba.count(tamoxifen,peaks=mcf7Common$inAll)
tamoxifen
#First make consensus peaksets from each set of replicates,
#then derive master consensus set for counting from those
data(tamoxifen_peaks)
tamoxifen <- dba.peakset(tamoxifen,consensus = -DBA_REPLICATE)
## Not run: tamoxifen <- dba.count(tamoxifen, peaks=tamoxifen$masks$Consensus)
tamoxifen
# Change binding affinity scores
data(tamoxifen_counts)
dba.peakset(tamoxifen, bRetrieve=TRUE) # default: DBA_SCORE_NORMALIZED
tamoxifen <- dba.count(tamoxifen,peaks=NULL,score=DBA_SCORE_READS)
dba.peakset(tamoxifen, bRetrieve=TRUE)
tamoxifen <- dba.count(tamoxifen,peaks=NULL,score=DBA_SCORE_RPKM_MINUS)
dba.peakset(tamoxifen, bRetrieve=TRUE)
# Plot effect of a range of filter values and then apply filter
data(tamoxifen_counts)
rate.max <- dba.count(tamoxifen, peaks=NULL, filter=0:250)
rate.sum <- dba.count(tamoxifen, peaks=NULL, filter=0:250,filterFun=sum)
plot(0:250,rate.max/rate.max[1],type='l',xlab="Filter Value",ylab="Proportion Retained Sites")
lines(0:250,rate.sum/rate.sum[1],col=2)
tamoxifen <- dba.count(tamoxifen,peaks=NULL,filter=125,filterFun=sum)
tamoxifen
# Calculate summits
data(tamoxifen_counts)
# pre-counted with summits=250 or 501bp intervals
as.numeric(dba.show(tamoxifen)$FRiP)
## Not run: tamoxifen <- dba.count(tamoxifen,peaks=NULL,summits=50)
# re-counted with summits=50 or 101bp intervals
as.numeric(dba.show(tamoxifen)$FRiP)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.