atenaParam-class | R Documentation |
This is a class for storing parameters to quantify TE (and gene) expression using the atena method. It is a subclass of the 'QuantifyParam-class'.
Build an object of the class atenaParam
.
atenaParam(
bfl,
teFeatures,
aggregateby = character(0),
ovMode = "ovUnion",
geneFeatures = NULL,
singleEnd = TRUE,
strandMode = 1L,
ignoreStrand = FALSE,
fragments = TRUE,
pi_prior = 0L,
theta_prior = 0L,
em_epsilon = 1e-07,
maxIter = 100L,
reassign_mode = "exclude",
conf_prob = 0.9,
verbose = TRUE
)
## S4 method for signature 'atenaParam'
show(object)
bfl |
A |
teFeatures |
A |
aggregateby |
Character vector with column names from the annotation
to be used to aggregate quantifications. By default, this is an empty
vector, which means that the names of the input |
ovMode |
Character vector indicating the overlapping mode. Available options are: "ovUnion" (default) and "ovIntersectionStrict", which implement the corresponding methods from HTSeq (https://htseq.readthedocs.io/en/release_0.11.1/count.html). Ambiguous alignments (alignments overlapping > 1 feature) are not counted. |
geneFeatures |
(Default NULL) A |
singleEnd |
(Default TRUE) Logical value indicating if reads are single
( |
strandMode |
(Default 1) Numeric vector which can take values 0, 1 or
2.
The strand mode is a per-object switch on
|
ignoreStrand |
(Default FALSE) A logical which defines if the strand
should be taken into consideration when computing the overlap between reads
and annotated features. When |
fragments |
(Default TRUE) A logical; applied to paired-end data only.
When |
pi_prior |
(Default 0) A positive numeric object indicating the prior
on pi. The same prior can be specified for all features setting
|
theta_prior |
(Default 0) A positive numeric object indicating the
prior on Q. The same prior can be specified for all features setting
|
em_epsilon |
(Default 1e-7) A numeric scalar indicating the EM Algorithm Epsilon cutoff. |
maxIter |
A positive integer scalar storing the maximum number of
iterations of the EM SQUAREM algorithm (Du and Varadhan, 2020). Default
is 100 and this value is passed to the |
reassign_mode |
(Default 'exclude') Character vector indicating
reassignment mode after EM step.
Available methods are 'exclude' (reads with more than one best
assignment are excluded from the final counts), 'choose' (when reads have
more than one best assignment, one of them is randomly chosen), 'average'
(the read count is divided evenly among the best assignments) and 'conf'
(only assignments that exceed a certain threshold -defined by
|
conf_prob |
(Default 0.9) Minimum probability for high confidence assignment. |
verbose |
(Default |
object |
A atenaParam object. |
This is the constructor function for objects of the class
atenaParam-class
. This type of object is the input to the
function qtex()
for quantifying expression of transposable
elements, which will call the atena method with this type of object. The
atena method uses a multiple '__no_feature' approach in which as many
'__no_feature' features as different overlapping patterns of multimapping
reads in the overlapping matrix are used to represent alignments mapping
outside annotations.
A atenaParam object.
singleEnd
(Default TRUE) Logical value indicating if reads are single
(TRUE
) or paired-end (FALSE
).
strandMode
(Default 1) Numeric vector which can take values 0, 1 or 2.
The strand mode is a per-object switch on
GAlignmentPairs
objects that controls the behavior of the strand getter. See
GAlignmentPairs
class for further detail. If singleEnd = TRUE
, then strandMode
is ignored.
ignoreStrand
(Default FALSE) A logical which defines if the strand
should be taken into consideration when computing the overlap between reads
and annotated features. When ignoreStrand = FALSE
, an aligned read
is considered to be overlapping an annotated feature as long as they
have a non-empty intersecting genomic range on the same strand, while when
ignoreStrand = TRUE
the strand is not considered.
fragments
(Default TRUE) A logical; applied to paired-end data only.
When fragments=FALSE
, the read-counting method only counts
‘mated pairs’ from opposite strands (non-ambiguous properly paired reads),
while when fragments=TRUE
same-strand pairs, singletons, reads with
unmapped pairs and other ambiguous or not properly paired fragments
are also counted (see "Pairing criteria" in
readGAlignments()
).
For further details see
summarizeOverlaps()
.
pi_prior
(Default 0) A positive numeric object indicating the prior
on pi. The same prior can be specified for all features setting
pi_prior
as a scalar, or each feature can have a specific prior by
setting pi_prior
as a vector with names()
corresponding to
all feature names. Setting a pi prior is equivalent to adding n unique
reads.
theta_prior
(Default 0) A positive numeric object indicating the
prior on Q. The same prior can be specified for all features setting
theta_prior
as a scalar, or each feature can have a specific prior by
setting theta_prior
as a vector with names()
corresponding to
all feature names. Equivalent to adding n non-unique reads.
em_epsilon
(Default 1e-7) A numeric scalar indicating the EM Algorithm Epsilon cutoff.
maxIter
A positive integer scalar storing the maximum number of
iterations of the EM SQUAREM algorithm (Du and Varadhan, 2020). Default
is 100 and this value is passed to the maxiter
parameter of the
squarem()
function.
reassign_mode
(Default 'exclude') Character vector indicating
reassignment mode after EM step.
Available methods are 'exclude' (reads with more than one best
assignment are excluded from the final counts), 'choose' (when reads have
more than one best assignment, one of them is randomly chosen), 'average'
(the read count is divided evenly among the best assignments) and 'conf'
(only assignments that exceed a certain threshold -defined by
conf_prob
parameter- are accepted, then the read count is
proportionally divided among the assignments above conf_prob
).
conf_prob
(Default 0.9) Minimum probability for high confidence assignment.
bamfiles <- list.files(system.file("extdata", package="atena"),
pattern="*.bam", full.names=TRUE)
## Not run:
## use the following two instructions to fetch annotations, they are here
## commented out to enable running this example quickly when building and
## checking the package
rmskat <- annotaTEs(genome="dm6", parsefun=rmskatenaparser,
strict=FALSE, insert=500)
rmskLTR <- getLTRs(rmskat, relLength=0.8,
fullLength=TRUE,
partial=TRUE,
otherLTR=TRUE)
## End(Not run)
## DO NOT TYPE THIS INSTRUCTION, WHICH JUST LOADS A PRE-COMPUTED ANNOTATION
## YOU SHOULD USE THE INSTRUCTIONS ABOVE TO FETCH ANNOTATIONS
rmskLTR <- readRDS(system.file("extdata", "rmskatLTRrlen80flenpartoth.rds",
package="atena"))
## build a parameter object for the atena method
atpar <- atenaParam(bfl=bamfiles,
teFeatures=rmskLTR,
singleEnd=TRUE,
ignoreStrand=TRUE)
atpar
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.