Description Usage Arguments Details Author(s) See Also Examples
Methods for comparing and/or ordering GenomicRanges objects.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 | ## duplicated()
## ------------
## S4 method for signature 'GenomicRanges'
duplicated(x, incomparables=FALSE, fromLast=FALSE,
nmax=NA, method=c("auto", "quick", "hash"))
## match() & selfmatch()
## ---------------------
## S4 method for signature 'GenomicRanges,GenomicRanges'
match(x, table, nomatch=NA_integer_, incomparables=NULL,
method=c("auto", "quick", "hash"), ignore.strand=FALSE)
## S4 method for signature 'GenomicRanges'
selfmatch(x, method=c("auto", "quick", "hash"), ignore.strand=FALSE)
## order() and related methods
## ----------------------------
## S4 method for signature 'GenomicRanges'
is.unsorted(x, na.rm=FALSE, strictly=FALSE, ignore.strand=FALSE)
## S4 method for signature 'GenomicRanges'
order(..., na.last=TRUE, decreasing=FALSE,
method=c("auto", "shell", "radix"))
## S4 method for signature 'GenomicRanges'
sort(x, decreasing=FALSE, ignore.strand=FALSE, by)
## S4 method for signature 'GenomicRanges'
rank(x, na.last=TRUE,
ties.method=c("average", "first", "last", "random", "max", "min"),
ignore.strand=FALSE)
## Generalized parallel comparison of 2 GenomicRanges objects
## ----------------------------------------------------------
## S4 method for signature 'GenomicRanges,GenomicRanges'
pcompare(x, y)
|
x, table, y |
GenomicRanges objects. |
incomparables |
Not supported. |
fromLast, method, nomatch, nmax, na.rm, strictly, na.last, decreasing |
See |
ignore.strand |
Whether or not the strand should be ignored when comparing 2 genomic ranges. |
... |
One or more GenomicRanges objects. The GenomicRanges objects after the first one are used to break ties. |
ties.method |
A character string specifying how ties are treated. Only |
by |
An optional formula that is resolved against |
Two elements of a GenomicRanges derivative (i.e. two genomic ranges)
are considered equal iff they are on the same underlying sequence and strand,
and share the same start and width. duplicated()
and unique()
on a GenomicRanges derivative are conforming to this.
The "natural order" for the elements of a GenomicRanges derivative
is to order them (a) first by sequence level, (b) then by strand, (c) then
by start, (d) and finally by width.
This way, the space of genomic ranges is totally ordered.
Note that, because we already do (c) and (d) for regular ranges (see
?`IPosRanges-comparison`
), genomic ranges that
belong to the same underlying sequence and strand are ordered like regular
ranges.
pcompare()
, ==
, !=
, <=
, >=
, <
and >
on GenomicRanges derivatives behave accordingly to this
"natural order".
is.unsorted()
, order()
, sort()
, rank()
on
GenomicRanges derivatives also behave accordingly to this
"natural order".
Finally, note that some inter range transformations like
reduce
or disjoin
also use this "natural order" implicitly when operating on
GenomicRanges derivatives.
H. Pag<c3><a8>s, is.unsorted
contributed by Pete Hickey
The GenomicRanges class.
IPosRanges-comparison in the IRanges package for comparing and ordering genomic ranges.
findOverlaps-methods for finding overlapping genomic ranges.
intra-range-methods and inter-range-methods for intra range and inter range transformations of a GRanges object.
setops-methods for set operations on GenomicRanges objects.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 | gr0 <- GRanges(
Rle(c("chr1", "chr2", "chr1", "chr3"), c(1, 3, 2, 4)),
IRanges(c(1:9,7L), end=10),
strand=Rle(strand(c("-", "+", "*", "+", "-")), c(1, 2, 2, 3, 2)),
seqlengths=c(chr1=11, chr2=12, chr3=13)
)
gr <- c(gr0, gr0[7:3])
names(gr) <- LETTERS[seq_along(gr)]
## ---------------------------------------------------------------------
## A. ELEMENT-WISE (AKA "PARALLEL") COMPARISON OF 2 GenomicRanges OBJECTS
## ---------------------------------------------------------------------
gr[2] == gr[2] # TRUE
gr[2] == gr[5] # FALSE
gr == gr[4]
gr >= gr[3]
## ---------------------------------------------------------------------
## B. match(), selfmatch(), %in%, duplicated(), unique()
## ---------------------------------------------------------------------
table <- gr[1:7]
match(gr, table)
match(gr, table, ignore.strand=TRUE)
gr %in% table
duplicated(gr)
unique(gr)
## ---------------------------------------------------------------------
## C. findMatches(), countMatches()
## ---------------------------------------------------------------------
findMatches(gr, table)
countMatches(gr, table)
findMatches(gr, table, ignore.strand=TRUE)
countMatches(gr, table, ignore.strand=TRUE)
gr_levels <- unique(gr)
countMatches(gr_levels, gr)
## ---------------------------------------------------------------------
## D. order() AND RELATED METHODS
## ---------------------------------------------------------------------
is.unsorted(gr)
order(gr)
sort(gr)
is.unsorted(sort(gr))
is.unsorted(gr, ignore.strand=TRUE)
gr2 <- sort(gr, ignore.strand=TRUE)
is.unsorted(gr2) # TRUE
is.unsorted(gr2, ignore.strand=TRUE) # FALSE
## TODO: Broken. Please fix!
#sort(gr, by = ~ seqnames + start + end) # equivalent to (but slower than) above
score(gr) <- rev(seq_len(length(gr)))
## TODO: Broken. Please fix!
#sort(gr, by = ~ score)
rank(gr, ties.method="first")
rank(gr, ties.method="first", ignore.strand=TRUE)
## ---------------------------------------------------------------------
## E. GENERALIZED ELEMENT-WISE COMPARISON OF 2 GenomicRanges OBJECTS
## ---------------------------------------------------------------------
gr3 <- GRanges(c(rep("chr1", 12), "chr2"), IRanges(c(1:11, 6:7), width=3))
strand(gr3)[12] <- "+"
gr4 <- GRanges("chr1", IRanges(5, 9))
pcompare(gr3, gr4)
rangeComparisonCodeToLetter(pcompare(gr3, gr4))
|
Loading required package: stats4
Loading required package: BiocGenerics
Loading required package: parallel
Attaching package: 'BiocGenerics'
The following objects are masked from 'package:parallel':
clusterApply, clusterApplyLB, clusterCall, clusterEvalQ,
clusterExport, clusterMap, parApply, parCapply, parLapply,
parLapplyLB, parRapply, parSapply, parSapplyLB
The following objects are masked from 'package:stats':
IQR, mad, sd, var, xtabs
The following objects are masked from 'package:base':
Filter, Find, Map, Position, Reduce, anyDuplicated, append,
as.data.frame, basename, cbind, colMeans, colSums, colnames,
dirname, do.call, duplicated, eval, evalq, get, grep, grepl,
intersect, is.unsorted, lapply, lengths, mapply, match, mget,
order, paste, pmax, pmax.int, pmin, pmin.int, rank, rbind,
rowMeans, rowSums, rownames, sapply, setdiff, sort, table, tapply,
union, unique, unsplit, which, which.max, which.min
Loading required package: S4Vectors
Attaching package: 'S4Vectors'
The following object is masked from 'package:base':
expand.grid
Loading required package: IRanges
Loading required package: GenomeInfoDb
[1] TRUE
[1] FALSE
[1] FALSE FALSE FALSE TRUE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
[13] FALSE TRUE FALSE
[1] FALSE FALSE TRUE TRUE FALSE FALSE TRUE TRUE TRUE TRUE TRUE FALSE
[13] FALSE TRUE TRUE
[1] 1 2 3 4 5 6 7 NA NA NA 7 6 5 4 3
[1] 1 2 3 4 5 6 7 NA NA 7 7 6 5 4 3
[1] TRUE TRUE TRUE TRUE TRUE TRUE TRUE FALSE FALSE FALSE TRUE TRUE
[13] TRUE TRUE TRUE
[1] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE TRUE TRUE
[13] TRUE TRUE TRUE
GRanges object with 10 ranges and 0 metadata columns:
seqnames ranges strand
<Rle> <IRanges> <Rle>
A chr1 1-10 -
B chr2 2-10 +
C chr2 3-10 +
D chr2 4-10 *
E chr1 5-10 *
F chr1 6-10 +
G chr3 7-10 +
H chr3 8-10 +
I chr3 9-10 -
J chr3 7-10 -
-------
seqinfo: 3 sequences from an unspecified genome
Hits object with 12 hits and 0 metadata columns:
queryHits subjectHits
<integer> <integer>
[1] 1 1
[2] 2 2
[3] 3 3
[4] 4 4
[5] 5 5
... ... ...
[8] 11 7
[9] 12 6
[10] 13 5
[11] 14 4
[12] 15 3
-------
queryLength: 15 / subjectLength: 7
[1] 1 1 1 1 1 1 1 0 0 0 1 1 1 1 1
Hits object with 13 hits and 0 metadata columns:
queryHits subjectHits
<integer> <integer>
[1] 1 1
[2] 2 2
[3] 3 3
[4] 4 4
[5] 5 5
... ... ...
[9] 11 7
[10] 12 6
[11] 13 5
[12] 14 4
[13] 15 3
-------
queryLength: 15 / subjectLength: 7
[1] 1 1 1 1 1 1 1 0 0 1 1 1 1 1 1
[1] 1 1 2 2 2 2 2 1 1 1
[1] TRUE
[1] 6 12 1 5 13 2 3 15 4 14 7 11 8 10 9
GRanges object with 15 ranges and 0 metadata columns:
seqnames ranges strand
<Rle> <IRanges> <Rle>
F chr1 6-10 +
L chr1 6-10 +
A chr1 1-10 -
E chr1 5-10 *
M chr1 5-10 *
. ... ... ...
G chr3 7-10 +
K chr3 7-10 +
H chr3 8-10 +
J chr3 7-10 -
I chr3 9-10 -
-------
seqinfo: 3 sequences from an unspecified genome
[1] FALSE
[1] TRUE
[1] TRUE
[1] FALSE
[1] 3 6 7 9 4 1 11 13 15 14 12 2 5 10 8
[1] 1 6 7 9 2 4 11 14 15 12 13 5 3 10 8
[1] -6 -5 -4 -4 -1 2 3 4 4 5 6 -11 42
[1] a b c c f i j k k l m X X
Levels: a b c d e f g h i j k l m X
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.