initREMP | R Documentation |
initREMP
is used to initialize annotation database for RE methylation prediction.
Three RE types in human, Alu element (Alu), LINE-1 (L1), and endogenous retrovirus (ERV) are available.
initREMP( arrayType = c("450k", "EPIC", "Sequencing"), REtype = c("Alu", "L1", "ERV"), annotation.source = c("AH", "UCSC"), genome = c("hg19", "hg38"), RE = NULL, Seq.GR = NULL, ncore = NULL, BPPARAM = NULL, export = FALSE, work.dir = tempdir(), verbose = FALSE )
arrayType |
Illumina methylation array type. Currently |
REtype |
Type of RE. Currently |
annotation.source |
Character parameter. Specify the source of annotation databases, including
the RefSeq Gene annotation database and RepeatMasker annotation database. If |
genome |
Character parameter. Specify the build of human genome. Can be either |
RE |
A |
Seq.GR |
A |
ncore |
Number of cores used for parallel computing. By default max number of cores
available in the machine will be utilized. If |
BPPARAM |
An optional |
export |
Logical. Should the returned |
work.dir |
Path to the directory where the generated data will be saved. Valid when
|
verbose |
Logical parameter. Should the function be verbose? |
Currently, we support two major types of RE in the human genome, Alu and L1. The main purpose of
initREMP
is to generate and annotate CpG/RE data using the refSeq Gene (hg19)
annotation database (provided by AnnotationHub
). These annotation data are crucial to
RE methylation prediction in remp
. Once generated, the data can be reused in the future
(data can be very large). Therefore, we recommend the user to save the output from
initREMP
to the local machine, so that user only need to run this function once
as long as there is no change to the RE database. To minimize the size of the resulting data file, the generated
annotation data are only for REs that contain RE-CpGs with neighboring profiled CpGs. By default, the
neighboring CpGs are confined within 1200 bp flanking window. This window size can be modified using
remp_options
. Note that the refSeq Gene database from UCSC is dynamic (updated periodically)
and reflecting the latest knowledge of gene, whereas the database from AnnotationHub is static and classic.
Using different sources will have a slight impact on the prediction results of RE methylation and gene annotation
of final results. For sequencing methylation data, please specify the genomic location of CpGs
in a GenomicRanges
object and specify it in Seq.GR
. For an example of Seq.GR
, Please
run minfi::getLocations(IlluminaHumanMethylation450kanno.ilmn12.hg19)
(the row names of the CpGs in
Seq.GR
can be NULL
). The user should make sure the genome build of Seq.GR
match the
build specified in genome
parameter (default is "hg19"
).
An REMParcel
object containing data needed for RE methylation prediction.
See remp
for RE methylation prediction.
if (!exists("remparcel")) { data(Alu.hg19.demo) remparcel <- initREMP(arrayType = "450k", REtype = "Alu", annotation.source = "AH", genome = "hg19", RE = Alu.hg19.demo, ncore = 1, verbose = TRUE) }
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.