Description Usage Arguments Details Value References
This routine selects a subset of CpGs sites used for MeDeCom analysis. Different selection methods are supported.
1 2 3 4 5 6 | prepare_CG_subsets(meth.data = NULL, rnb.set = NULL, MARKER_SELECTION,
N_MARKERS = 5000, REMOVE_CORRELATED = FALSE,
COR_THRESHOLD = "quantile", WRITE_FILES = FALSE, WD = NA,
REF_DATA_SET = NULL, REF_PHENO_COLUMN = NULL, N_PRIN_COMP = 10,
RANGE_DIFF = 0.05, CUSTOM_MARKER_FILE = "", store.heatmaps = F,
heatmap.sample.col = NULL, K.prior = NULL)
|
meth.data |
A |
rnb.set |
An object of type |
MARKER_SELECTION |
A vector of strings representing marker selection methods. Available method are
|
N_MARKERS |
The number of sites to be selected. Defaults to 5000. |
REMOVE_CORRELATED |
Flag indicating if highly correlated sites are to be removed |
COR_THRESHOLD |
Numeric indicating a threshold above which sites are not to be considered in the feature selection.
If |
WRITE_FILES |
Flag indicating if the selected sites are to be stored on disk. |
WD |
Path to the working directory used for analyis, or data preparation. |
REF_DATA_SET |
An object of type |
REF_PHENO_COLUMN |
Optional argument stating the column name of the phenotypic table of |
N_PRIN_COMP |
Optional argument deteriming the number of prinicipal components used for selecting the most important sites. |
RANGE_DIFF |
Optional argument specifying the difference between maximum and minimum required. |
CUSTOM_MARKER_FILE |
Optional argument containing an absolute path to a file that specifies the indices used for employing MeDeCom. Can be provided
either as an |
store.heatmaps |
Flag indicating if a heatmap of the selected input sites is to be create from the input methylation matrix. The files are then stored in the 'heatmaps' folder in WD. |
heatmap.sample.col |
Column name in the phenotypic table of |
K.prior |
K determined from visual inspection. Only has an influence, if |
For methods "houseman2012
" and "jaffe2014
", a predefined set of markers is used. Since those correspond to
absolute indices on the chip, the provided rnb.set
must not be preprocessed and therefore still contain all sites.
For the other metods, you may used prepare_data
to filter sites for quality and context.
List of indices, one entry for each marker selection method specified by MARKER_SELECTION
. The indices correspond
to the sites that should be used in rnb.set
.
1. Houseman, E. A., Accomando, W. P., Koestler, D. C., Christensen, B. C., Marsit, C. J., Nelson, H. H., ..., Kelsey, K. T. (2012). DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinformatics, 13.
2. Houseman, E. A., Molitor, J., & Marsit, C. J. (2014). Reference-free cell mixture adjustments in analysis of DNA methylation data. Bioinformatics, 30(10), 1431-1439. https://doi.org/10.1093/bioinformatics/btu029
3. Jaffe, A. E., & Irizarry, R. A. (2014). Accounting for cellular heterogeneity is critical in epigenome-wide association studies. Genome Biology, 15(2), R31. https://doi.org/10.1186/gb-2014-15-2-r31
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.