Description Usage Arguments Details Value Author(s) See Also Examples
The function uses a segmentation algorithm (fastseg
)
to segment the methylation profiles. Following that, it uses gaussian mixture
modelling to cluster the segments into k components. This process uses mean
methylation value of each segment in the modeling phase. Each component
ideally indicates quantitative classification of segments, such as high or
low methylated regions.
1 2 3 4 5 6 7 |
obj |
|
diagnostic.plot |
if TRUE a diagnostic plot is plotted. The plot shows methylation and length statistics per segment group. In addition, it shows diagnostics from mixture modeling: the density function estimated and BIC criterion used to decide the optimum number of components in mixture modeling. |
join.neighbours |
if TRUE neighbouring segments that cluster to the same seg.group are joined by extending the ranges, summing up num.marks and averaging over seg.means. |
initialize.on.subset |
a numeric value indicating either percentage or absolute value of regions to subsample from segments before performing the mixture modeling. The value can be either between 0 and 1, e.g. 0.1 means that 10 integer higher than 1 to define the number of regions to sample. By default uses the whole dataset, which can be time consuming on large datasets. (Default: 1) |
... |
arguments to |
To be sure that the algorithm will work on your data, the object should have at least 5000 records
After initial segmentation with fastseg(), segments are clustered
into self-similar groups based on their mean methylation profile
using mixture modeling. Mixture modeling estimates the initial
parameters of the distributions by using the whole dataset.
If "initialize.on.subset" argument set as described, the function
will use a subset of the data to initialize the parameters of the
mixture modeling prior to the Expectation-maximization (EM) algorithm
used by Mclust
package.
A GRanges
object with segment
classification and information.
'seg.mean' column shows the mean methylation per segment.
'seg.group' column shows the segment groups obtained by mixture modeling
Altuna Akalin, contributions by Arsene Wabo and Katarzyna Wreczycka
methSeg2bed
, joinSegmentNeighbours
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 | download.file(
"https://raw.githubusercontent.com/BIMSBbioinfo/compgen2018/master/day3_diffMeth/data/H1.chr21.chr22.rds",
destfile="H1.chr21.chr22.rds",method="curl")
mbw=readRDS("H1.chr21.chr22.rds")
# it finds the optimal number of componets as 6
res=methSeg(mbw,diagnostic.plot=TRUE,maxInt=100,minSeg=10)
# however the BIC stabilizes after 4, we can also try 4 componets
res=methSeg(mbw,diagnostic.plot=TRUE,maxInt=100,minSeg=10,G=1:4)
# get segments to BED file
methSeg2bed(res,filename="H1.chr21.chr22.trial.seg.bed")
unlink(list.files(pattern="H1.chr21.chr22",full.names=TRUE))
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.