Description Usage Arguments Details Value Author(s) References
clustering
is a wrapper of the RAMClustR::ramclustR from RAMClustR
package. It performs a clustering of features with a given sigma for
retention time similarity st
and for correlation similarity sr
. Note
that, in addition to the sr
, the argument deepSplit = TRUE
might be
critical to avoid several metabolites in a single cluster.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 | clustering(
xcmsObj = NULL,
ms = NULL,
idmsms = NULL,
taglocation = "filepaths",
MStag = NULL,
idMSMStag = NULL,
featdelim = "_",
timepos = 2,
st = NULL,
sr = NULL,
maxt = NULL,
deepSplit = FALSE,
blocksize = 2000,
mult = 5,
hmax = NULL,
sampNameCol = 1,
collapse = TRUE,
usePheno = TRUE,
mspout = TRUE,
ExpDes = NULL,
normalize = "TIC",
qc.inj.range = 20,
order = NULL,
batch = NULL,
qc = NULL,
minModuleSize = 2,
linkage = "average",
mzdec = 3,
cor.method = "pearson",
rt.only.low.n = TRUE,
fftempdir = NULL,
replace.zeros = TRUE
)
|
... |
Arguments passed on to
|
Main clustering function output - see citation for algorithm description or vignette('RAMClustR') for a walk through. batch.qc. normalization requires input of three vectors (1) batch (2) order (3) qc. This is a feature centric normalization approach which adjusts signal intensities first by comparing batch median intensity of each feature (one feature at a time) QC signal intensity to full dataset median to correct for systematic batch effects and then secondly to apply a local QC median vs global median sample correction to correct for run order effects.
$featclus: integer vector of cluster membership for each feature
$frt: feature retention time, in whatever units were fed in (xcms uses seconds, by default)
$fmz: feature retention time, reported in number of decimal points selected in ramclustR function
$xcmsOrd: the original XCMS (or csv) feature order for cross referencing, if need be
$clrt: cluster retention time
$clrtsd: retention time standard deviation of all the features that comprise that cluster
$nfeat: number of features in the cluster
$nsing: number of 'singletons' - that is the number of features which clustered with no other feature
$ExpDes: the experimental design object used when running ramclustR. List of two dataframes.
$cmpd: compound name. C#### are assigned in order of output by dynamicTreeCut. Compound with the most features is classified as C0001...
$ann: annotation. By default, annotation names are identical to 'cmpd' names. This slot is a placeholder for when annotations are provided
$MSdata: the MSdataset provided by either xcms or csv input
$MSMSdata: the (optional) MSe/idMSMS dataset provided be either xcms or csv input
$SpecAbund: the cluster intensities after collapsing features to clusters
$SpecAbundAve: the cluster intensities after averaging all samples with identical sample names
- 'spectra' directory is created in the working directory. In this directory a .msp is (optionally) created, which contains the spectra for all compounds in the dataset following clustering. if MSe/idMSMS data are provided, they are listed witht he same compound name as the MS spectrum, with the collision energy provided in the ExpDes object provided to distinguish low from high CE spectra.
Corey Broeckling
Broeckling CD, Afsar FA, Neumann S, Ben-Hur A, Prenni JE. RAMClust: a novel feature clustering method enables spectral-matching-based annotation for metabolomics data. Anal Chem. 2014 Jul 15;86(14):6812-7. doi: 10.1021/ac501530d. Epub 2014 Jun 26. PubMed PMID: 24927477.
Broeckling CD, Ganna A, Layer M, Brown K, Sutton B, Ingelsson E, Peers G, Prenni JE. Enabling Efficient and Confident Annotation of LC-MS Metabolomics Data through MS1 Spectrum and Time Prediction. Anal Chem. 2016 Sep 20;88(18):9226-34. doi: 10.1021/acs.analchem.6b02479. Epub 2016 Sep 8. PubMed PMID: 7560453.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.