Description Usage Arguments Details Value Note Author(s) References See Also Examples
View source: R/bPeaksAnalysis.R
This function allows to detect basic peaks (bPeaks) using the procedure described in the function peakDetection
. Chromosomes are analyzed successively. Several values(regarding thresholds T1, T2, T3 and T4 and other parameters) can be specified simultaneously in order to rapidly compare the obtained results and evaluate parameter relevance.
1 2 3 4 5 6 7 | bPeaksAnalysis(IPdata, controlData, cdsPositions = NULL,
smoothingValue = 20,
windowSize = 150, windowOverlap = 50,
IPcoeff = 2, controlCoeff = 2,
log2FC = 2, averageQuantiles = 0.9,
resultName = "bPeaks",
peakDrawing = TRUE, promSize = 800, withoutOverlap = FALSE)
|
IPdata |
A dataframe with sequencing results of the IP sample. This dataframe has three columns (chromosome, position, number of sequences) and should have been created with the |
controlData |
A dataframe with sequencing results of the control sample. This dataframe has three columns (chromosome, position, number of sequences) and should have been created with the |
cdsPositions |
Not mandatory. A table (matrix) with positions of CDS (genes). Four columns are required (chromosome, starting position, ending position, strand (W or C), description). CDS positions for several yeast species are stored in bPeaks package
(see the dataset |
smoothingValue |
The number (n/2) of surrounding positions to use for mean calculation in the |
windowSize |
Size of the sliding windows to scan chromosomes |
windowOverlap |
Size of the overlap between two successive windows |
IPcoeff |
Threshold T1. Value for the multiplicative parameter that will be combined with the value of the mean genome-wide read depth (see |
controlCoeff |
Threshold T2. Value for the multiplicative parameter that will be combined with the value of the mean genome-wide read depth (see |
log2FC |
Threshold T3. Threshold to consider log2(IP/control) values as sufficiently important to be interesting. Note that a vector with different values can be specified, the bPeaks analysis will be therefore repeated using successively each value for peak detection |
averageQuantiles |
Threshold T4. Threshold to consider (log2(IP) + log2(control)) / 2 as sufficiently important to be interesting. This parameter ensures that the analyzed genomic region has enough sequencing coverage to be reliable. These threshold should be between [0, 1] and refers to the quantile value of the global distribution observed with the analyzed chromosome |
resultName |
Name for output files created during bPeaks procedure |
peakDrawing |
TRUE or FLASE. If TRUE, the function |
promSize |
Size of the genomic regions to be considered as "upstream" to the annotated genomic features (see documentation of the function |
withoutOverlap |
If TRUE, this option allows to filter peak that are located in a promoter AND a CDS. |
More information together with tutorials can be found online http://bpeaks.gene-networks.net/
.
BED files for each chromosomes and a final BED file combining all the results with information regarding detected peaks (genomic positions, mean IP signal, etc.). These files are all saved in the R working directory. Summaries of parameter calculations and peak detection criteria are shown in PDF files (saved in the working directory).
Detailed information and tutorials can be found online http://bpeaks.gene-networks.net/
Gaelle LELANDAIS
http://bpeaks.gene-networks.net/
peakDetection
dataReading
dataSmoothing
baseLineCalc
peakDrawing
peakLocation
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 | # get library
library(bPeaks)
# STEP 1: get PDR1 data
data(dataPDR1)
# STEP 2 : bPeaks analysis (only 10 kb of chrIV are analyzed here,
# as an illustration)
bPeaksAnalysis(IPdata = dataPDR1$IPdata[40000:50000,],
controlData = dataPDR1$controlData[40000:50000,],
windowSize = 150, windowOverlap = 50,
IPcoeff = 4, controlCoeff = 2, log2FC = 1,
averageQuantiles = 0.5,
resultName = "bPeaks_example",
peakDrawing = TRUE, promSize = 800)
## Not run:
# STEP 2 : bPeaks analysis (all chromosome)
bPeaksAnalysis(IPdata = dataPDR1$IPdata, controlData = dataPDR1$controlData,
cdsPositions = dataPDR1$cdsPositions,
smoothingValue = c(20),
windowSize = c(150), windowOverlap = 50,
IPcoeff = c(2), controlCoeff = c(2), log2FC = c(2),
averageQuantiles = c(0.9),
resultName = "bPeaks_PDR1_chr4",
peakDrawing = TRUE, promSize = 800)
# To repeat the bPeaks analysis with different parameters
bPeaksAnalysis(IPdata = dataPDR1$IPdata, controlData = dataPDR1$controlData,
cdsPositions = dataPDR1$cdsPositions,
smoothingValue = c(20),
windowSize = c(150), windowOverlap = 50,
IPcoeff = c(2, 4, 6), controlCoeff = c(2, 4, 6), log2FC = c(2, 3),
averageQuantiles = c(0.7, 0.9),
resultName = "bPeaks_PDR1_chr4_paremeterEval",
peakDrawing = FALSE, promSize = 800)
# -> Summary table is created and saved as "peakStats.Robject" in the working directory
# as well as a text file named "_bPeaks_parameterSummary.txt"...
load("peakStats.Robject")
# This table comprises different information regarding peak detection (number of peaks,
# mean size of peaks, mean IP signal, mean control signal, etc.)
peakStats[1:2,]
# smoothingValue windowSize windowOverlap IPcoeff controlCoeff log2FC
# [1,] 20 150 50 1 1 1
# [2,] 20 150 50 1 1 1
# averageQuantiles bPeakNumber meanSize meanIPsignal meanControlSignal
# [1,] 0.5 308 209.091 276.047 71.534
# [2,] 0.7 294 205.782 287.808 74.002
# meanLog2FC bPeakNumber_beforeFeatures bPeakNumber_afterFeatures
# [1,] 1.571 99 80
# [2,] 1.589 94 77
# bPeakNumber_inFeatures
# [1,] 52
# [2,] 53
## End(Not run)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.