| splitDataByChromatin | R Documentation |
This function splits the methylation data into regions
based on the chromatin states predicted by ChromHMM software
(Ernst and Kellis (2012)).
The annotations come from the Bioconductor package annnotatr.
Chromatin states determined by chromHMM are
available in hg19 for nine cell lines (Gm12878, H1hesc, Hepg2, Hmec, Hsmm,
Huvec, K562, Nhek, and Nhlf).
splitDataByChromatin( dat, chr, cell.line, states, gap = -1, min.cpgs = 50, max.cpgs = 2000, verbose = TRUE )
dat |
a data frame with rows as individual CpGs appearing
in all the samples. The first 4 columns should contain the information of
|
chr |
character vector containing the chromosome information. Its length
should be equal to the number of rows in |
cell.line |
character defining the cell line of interest. Nine cell lines are available:
|
states |
character vector defining the chromatin states of interest among the following available options:
|
gap |
this integer defines the maximum gap that is allowed between
two regions to be considered as overlapping.
According to the |
min.cpgs |
positive integer defining the minimum number of CpGs within a region for the algorithm to perform optimally. The default value is 50. |
max.cpgs |
positive integer defining the maximum number of CpGs within a region for the algorithm to perform optimally. The default value is 2000. |
verbose |
logical indicates if the algorithm should provide progress report information. The default value is TRUE. |
A list of data.frame containing the data of each
independent region.
Audrey Lemaçon
#------------------------------------------------------------# data(RAdat) RAdat.f <- na.omit(RAdat[RAdat$Total_Counts != 0, ]) results <- splitDataByChromatin(dat = RAdat.f, cell.line = "huvec", chr = rep(x = "chr4", times = nrow(RAdat.f)), states = "Insulator", verbose = FALSE)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.