View source: R/readBiokitAsDGEList.R
readBiokitFeatureAnnotation | R Documentation |
Read feature annotation from Biokit directory
readBiokitFeatureAnnotation(dir, anno = c("refseq", "ensembl"))
dir |
Character string, a Biokit output directory. |
anno |
Character, indicating the annotation type. |
A data.frame
containing feature annotation, with feature IDs
as characters in rownames. The data frame contains following columns
depending on the anno
parameter:
FeatureName, the primary key of feature name as characters
GeneID (refseq only) or EnsemblID (ensembl only)
GeneSymbol
mean: mean length
median: median length
longest_isoform: longest isoform
merged: total length of merged exons
The function depends on the refseq.annot.gz
(ensembl.annot.gz
)
and refseq.geneLength.gz
(ensembl.geneLength.gz
) files in the
biokit directory.
If .annot.gz
file is not found (which can be the case, for instance,
when older biokit output directories are used), feature annotation is
read from the count GCT file. The resulting data.frame
will only
contain two columns: FeatureName
and Description
.
If .geneLength.gz
file is not found, no gene length information is
appended.
## TODO add small example files
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.