BiocStyle::markdown() suppressPackageStartupMessages({ library(GenomicAlignments) library(BSgenome.Hsapiens.UCSC.hg19) library(TxDb.Hsapiens.UCSC.hg19.knownGene) })
Genomic ranges describe...
Packages
r Biocpkg("GenomicRanges")
for essential genomic ranges; depends
on r Biocpkg("IRanges")
, r Biocpkg("S4Vectors")
and other
packagesr Biocpkg("GenomicAlignments")
for aligned reads using genomic
range-based concepts. Depends on r Biocpkg("Rsamtools")
, r
Biocpkg("Biostrings")
and other packageslibrary(GenomicRanges) library(GenomicAlignments) sessionInfo()
GRanges
: simple genomic ranges
mcols()
of associated data, e.g., 'score', 'id', ...seqname()
, e.g., chromosome, but could be, e.g., contig, ...start()
, end()
: 1-based, closed intervalsstrand()
: +, -, or * (does not matter)mcols()
GRangesList
: nested genomic ranges
*List
objects: lists, but all elements of the
same type. E.g., start()
returns an IntegerList()
.unlist()
Intra-range operations
range()
, flank()
Inter-range operations
reduce()
, disjoin()
Between-object
psetdiff()
, findOverlaps()
, countOverlaps()
PLoS Comput Biol 9(8): e1003118
What can I do with my GRanges
instance?
methods(class="GRanges")
What type of object(s) can I use findOverlaps()
on (what methods
exist for the findOverlaps()
generic)?
methods(findOverlaps)
How can I get help on functions, generics, and methods?
?"findOverlaps" ## generic ?"findOverlaps,<tab>" ## specific method
Other help?
browseVignettes("GenomicRanges")
GAlignments
and friends (r Biocpkg("GenomicAlignments")
)
GAlignments
: Single-end aligned reads, e.g., from BAM filesGAlignmentPairs
, GAlignmentsList
: paired-end aligned
reads. *Pairs
is more restrictive on what pairs can be representedDNAString
and DNAStringSet
(r Biocpkg("Biostrings")
)
SummarizedExperiment
(r Biocpkg("GenomicRanges")
)
assays
of rows (regions of interest; genomic ranges) x columns
(samples, including integrated phenotypic information)TxDb
(r Biocpkg("AnnotationDb")
)
transcripts()
interfaceselect()
interfaceVCF
(r Biocpkg("VariantAnnotation")
)
Lower-level classes
DataFrame
(r Biocpkg("S4Vectors")
) (like a data.frame
, but can
contain S4 objects)IRanges
(r Biocpkg("IRanges")
), Rle
(r Biocpkg("S4Vectors")
)R works efficiently on vectors
GRanges
as a collection of vectors, not as a collection
of recordsgetClass("GRanges")
Vector
and Annotated
[
, length()
, names()
, etc.mcols()
List
-like[[
elementLengths()
Implementation: Vector
plus partitioning
unlist()
and relist()
are very inexpensiveIngredients
r Biocpkg("TxDb.Hsapiens.UCSC.hg19")
TxDb packageexons()
, and exonsBy()
functionswidth()
, elementLengths()
accessorshist()
Goals
Ingredients
- r Biocpkg("BSgenome.Hsapiens.UCSC.hg19")
BSGenome package
- r Biocpkg("TxDb.Hsapiens.UCSC.hg19")
TxDb package
- ?"getSeq,BSgenome-method"
, letterFrequency()
Goapls
Ingredients
DNAStringSet()
to construct CG island sequencematchPDict()
to find CG islands on BSgenome chromosomescoverage()
, tileGenome()
, Views()
, following
HintstileGenome()
, findOverlaps()
, splitAsList()
, mean()
Goal
Ingredients
r Biocexptpkg("RNAseqData.HNRNPC.bam.chr14")
and
RNAseqData.HNRNPC.bam.chr14_BAMFILES
readGAlignments()
, readGAlignmentPairs()
, and
readGAlignmentsList()
BamFile()
Goals
ScanBamParam()
which
and what
to selective input datacountOverlaps()
between reads and known genesBamFile()
yieldSize
argument to iterate through fileAdd the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.