splitDataByChromatin | R Documentation |
This function splits the methylation data into regions
based on the chromatin states predicted by ChromHMM software
(Ernst and Kellis (2012)).
The annotations come from the Bioconductor package annnotatr
.
Chromatin states determined by chromHMM are
available in hg19 for nine cell lines (Gm12878, H1hesc, Hepg2, Hmec, Hsmm,
Huvec, K562, Nhek, and Nhlf).
splitDataByChromatin( dat, chr, cell.line, states, gap = -1, min.cpgs = 50, max.cpgs = 2000, verbose = TRUE )
dat |
a data frame with rows as individual CpGs appearing
in all the samples. The first 4 columns should contain the information of
|
chr |
character vector containing the chromosome information. Its length
should be equal to the number of rows in |
cell.line |
character defining the cell line of interest. Nine cell lines are available:
|
states |
character vector defining the chromatin states of interest among the following available options:
|
gap |
this integer defines the maximum gap that is allowed between
two regions to be considered as overlapping.
According to the |
min.cpgs |
positive integer defining the minimum number of CpGs within a region for the algorithm to perform optimally. The default value is 50. |
max.cpgs |
positive integer defining the maximum number of CpGs within a region for the algorithm to perform optimally. The default value is 2000. |
verbose |
logical indicates if the algorithm should provide progress report information. The default value is TRUE. |
A list
of data.frame
containing the data of each
independent region.
Audrey Lemaçon
#------------------------------------------------------------# data(RAdat) RAdat.f <- na.omit(RAdat[RAdat$Total_Counts != 0, ]) results <- splitDataByChromatin(dat = RAdat.f, cell.line = "huvec", chr = rep(x = "chr4", times = nrow(RAdat.f)), states = "Insulator", verbose = FALSE)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.