Description Usage Arguments Details Value Note Author(s) References See Also Examples
This function is the main interface to most essential functions of RIPSeeker package.
1 2 3 4 5 |
bamPath |
Either a path to all of the bam files or a list of paths to individual BAM files. BED and SAM files are also accepted. |
cNAME |
An identifer pattern found in the control alignment files. Once specified, these files will be used as control and the remaining files as RIP for discriminative analysis (see |
binSize |
Size to use for binning the read counts across each chromosome. If NULL, optimal bin size within a range (default: minBinSize=200, maxBinSize=1200) will be automatically selected (see |
strandType |
Type of strand can be +, -, or * as in GAlignments, GAlignmentPairs, or GRanges (see |
paired |
Binary to indicate whether the library is paired-end (TRUE) or single-end (FALSE by default) (see |
biomaRt_dataset |
The dataset name used in biomaRt for retrieving genomic information for a given species name (see |
goAnno |
GO dataset name used for GO enrichment analysis (See |
exportFormat |
Format to export the RIP predictions. The commonly used ones are GFF and BED, which can be directly imported as a track to a genomic viewer such as Integrative Genomic Viewer, SAVANT or USCSC browser. |
annotateFormat |
Format to export the annotated RIP predictions. The default "txt" is a tab-delimited format, recommanded for viewing in Excel. |
annotateType |
Type of genomic information in association with the RIP predictions that can be retrieved from Ensembl database (Default: TSS; See |
outDir |
Output directory to save the results. The output data include ... |
padjMethod |
Method to adjust multiple testing (Benjamini-Hocherge method by default). |
logOddCutoff |
Threshold for the log odd ratio of posterior for the RIP over the background states (See |
pvalCutoff |
Threshold for the p-value for the logOdd score. Only peaks with p-value less than the |
pvalAdjCutoff |
Threshold for the adjusted p-value for the logOdd score. Only peaks with adjusted p-value less than the |
eFDRCutoff |
Threshold for the empirical false discovery rate (eFDR). Only peaks with eFDR less than the |
... |
Arguments passed to |
This is the main front-end function of RIPSeeker and in many cases the only function that users need to get RIP predictions and all relevant information.
A list is returned with the following items:
mainSeekOutputRIP |
A (inner) list comprising three items:
|
mainSeekOutputCTL |
Same as |
RIPGRList |
The results as |
annotatedRIPGR |
If |
You may only want to know the expression/abundance of known transcripts/genes or the foldchange between two conditions. In that case, use rulebaseRIPSeek
and computeRPKM
, respectively. Both singl-end and paired-end alignments are supported in these functions.
Yue Li
Zhao, J., Ohsumi, T. K., Kung, J. T., Ogawa, Y., Grau, D. J., Sarma, K., Song, J. J., et al. (2010). Genome-wide Identification of Polycomb-Associated RNAs by RIP-seq. Molecular Cell, 40(6), 939D953. doi:10.1016/j.molcel.2010.12.011
The RIPSeeker manuscript has been submitted to NAR for review.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 | if(interactive()) { # need internet connection
# Retrieve system files
extdata.dir <- system.file("extdata", package="RIPSeeker")
bamFiles <- list.files(extdata.dir, ".bam$", recursive=TRUE, full.names=TRUE)
bamFiles <- grep("PRC2", bamFiles, value=TRUE)
cNAME <- "SRR039214" # specify control name
# output file directory
outDir <- paste(getwd(), "ripSeek_example", sep="/")
# Parameters setting
binSize <- NULL # automatically determine bin size
minBinSize <- 10000 # min bin size in automatic bin size selection
maxBinSize <- 12000 # max bin size in automatic bin size selection
multicore <- TRUE # use multicore
strandType <- "-" # set strand type to minus strand
biomart <- "ENSEMBL_MART_ENSEMBL" # use archive to get ensembl 65
dataset <- "mmusculus_gene_ensembl" # mouse dataset id name
host <- "dec2011.archive.ensembl.org" # use ensembl 65 for annotation
goAnno <- "org.Mm.eg.db"
################ run main function ripSeek to predict RIP ################
seekOut <- ripSeek(bamPath=bamFiles, cNAME=cNAME,
binSize=binSize, minBinSize = minBinSize,
maxBinSize = maxBinSize, strandType=strandType,
outDir=outDir, silentMain=FALSE,
verbose=TRUE, reverseComplement=TRUE, genomeBuild="mm9",
biomart=biomart, host=host,
biomaRt_dataset = dataset,
goAnno = goAnno,
uniqueHit = TRUE, assignMultihits = TRUE,
rerunWithDisambiguatedMultihits = TRUE, multicore=multicore)
################ visualization ################
viewRIP(seekOut$RIPGRList$chrX, seekOut$mainSeekOutputRIP$alignGalFiltered,
seekOut$mainSeekOutputCTL$alignGalFiltered, scoreType="eFDR")
}
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.