meffil.most.variable.cpgs: Most variable CpG sites

View source: R/most-variable.r

meffil.most.variable.cpgsR Documentation

Most variable CpG sites

Description

Returns the most variable CpG sites (rows) in the methylation matrix.

Usage

meffil.most.variable.cpgs(
  beta,
  n = 1000,
  sites = NULL,
  samples = NULL,
  autosomal = T,
  winsorize.pct = NA,
  outlier.iqr.factor = NA
)

Arguments

beta

Output from meffil.normalize.samples(), either a matrix or a GDS filename.

n

Number of CpG sites to return.

sites

Subset of CpG sites to consider (row names of beta) (Default: NULL).

samples

Subset of samples to consider (column names of beta) (Default: NULL).

autosomal

If true, remove probes on sex chromosomes (Default: TRUE).

winsorize.pct

Apply to methylation levels winsorized to the given level. Set to NA to avoid winsorizing (Default: NA).

outlier.iqr.factor

Apply to methylation after setting, for each CpG site, values less than Q1 - outlier.iqr.factor * IQR or more than Q3 + outlier.iqr.factor * IQR to NA. Here IQR is the inter-quartile range of the methylation levels at the CpG site, i.e. Q3-Q1. Set to NA to skip this step (Default: NA).

Value

The n CpG site identifiers (rownames of x) with the greatest variance in x.


perishky/meffil documentation built on June 9, 2024, 5:59 p.m.