In WMBEdmands/MetMSLine: an automated and fully integrated pipeline for rapid processing of high-resolution LC-MS metabolomic datasets.

The following illustrates the MetMSLine workflow with example data:

1. Read in peak table and co-variate information and pre-process the data.

library(MetMSLine)

# file path example peakTable in comma delimited csv file 
# (see ?example_Exp_MS1features for details).
peakTable <- system.file("extdata", "MS1features_example.csv", package = "MetMSLine")
peakTable <- read.csv(peakTable, header=T, stringsAsFactors=F)


# load co-variates table in comma delimited csv file
coVariates <- system.file("extdata", "coVariates.csv", package = "MetMSLine")
coVariates <- read.csv(coVariates, header=T)
# observation names (i.e. sample names)
obsNames <- colnames(peakTable)[grep('ACN_|MeOH_', colnames(peakTable))]

# zero fill
peakTable <- zeroFill(peakTable, obsNames)

# Normalize (median fold change/ probabilistic quotient), total ion signal 
#  also available ?signNorm
peakTable <- signNorm(peakTable, obsNames, method="medFC")

# data deconvolution based on retention time and interfeature correlation
# calculation of weighted mean (see ?weigthed.mean) within each pseudospectral 
# cluster (i.e. the sum of mass spectral intensities across all samples are used 
#          to weight the contribution of each feature to the average).
wMeanPeakTable <- rtCorrClust(peakTable, obsNames, rtThresh=2, corrThresh=0.9, 
                              minFeat=1)

# extract weighted mean pseudospectra table
wMeanPspec <- wMeanPeakTable$wMeanPspec

# log transform (base 2)
wMeanPspec <- logTrans(wMeanPspec, obsNames, base=2)

2. PCA projection, automatic outlier removal and score plot cluster identification.

# add dummy blank to illustrate pca outlier detection
wMeanPspec$blank_1 <- 0.0001
# observation names (i.e. sample names)
obsNames <- colnames(wMeanPspec)[grep('ACN_|MeOH_|blank_', colnames(wMeanPspec))]

#  PCA projection using pca of pcaMethods and automatic outlier removal based
#  on proportional expansion of the Hotellings T2 ellipse
pcaOutResults <- pcaOutId(wMeanPspec, obsNames, cv="q2", outTol=1.05, 
                          scale="pareto")

# Plot PCA displaying any outliers and expanded Hotelling's ellipse, colour according
# to any potential outliers detected. function modified from pcaMethods ?plotPcs.
plotPcsEx(pcaOutResults$pcaResults[[1]]$pcaResult, 
          pcaOutResults$pcaResults[[1]]$exHotEllipse, type="scores", 
          col=pcaOutResults$pcaResults[[1]]$possOut+2)

# plot second PCA model iteration after outlier removal
plotPcsEx(pcaOutResults$pcaResults[[2]]$pcaResult, 
          pcaOutResults$pcaResults[[2]]$exHotEllipse, type="scores", 
          col=pcaOutResults$pcaResults[[2]]$possOut+2)

# show PCA results iteration 2
pcaOutResults$pcaResults[[2]]$pcaResult
# show Q2 cross-validation statistic
pcaOutResults$pcaResults[[2]]$pcaResult@cvstat

# label by extraction type using co-variates table
plotPcsEx(pcaOutResults$pcaResults[[2]]$pcaResult, 
          pcaOutResults$pcaResults[[2]]$exHotEllipse, type="scores", 
          col=as.numeric(as.factor(coVariates$extractionType)) + 2)

# Automatically identify potential cluster membership given the table of co-variates
finalPca <- pcaOutResults$pcaResults[[length(pcaOutResults$pcaResults)]]$pcaResult
clustIdentity <- pcaClustId(finalPca, coVarTable=coVariates)
# plot pam cluster model (partioning around the medoids), minimisation of 
# dissimilarities.
plot(cluster::pam(finalPca@scores, clustIdentity[[1]]$nc))

3. Univariate statistical analysis by co-variate based automatic test type selection.

The most appropriate univariate statistical method is selected based on frequency of factor levels of a co-variate (y-variable) supplied. This provides objective and automatable means of test selection. Multiple comparison adjustment can also be performed (e.g. Bonferroni).

# outliers removed peak table from pcaOutId output
outRemPeakTable <- pcaOutResults$outRem
obsNames <- colnames(outRemPeakTable)[grep('ACN_|MeOH_', colnames(outRemPeakTable))]

# automatic univariate statistical method selection and mean/median fold calculation
statResult <- coVarTypeStat(outRemPeakTable, obsNames, 
                            coVariate=coVariates$extractionType, 
                            Logged=T, base=2)
# volcanoPlot  
volcanoPlot(log2(statResult[[5]]$FoldChange), statResult[[5]]$p.value)

WMBEdmands/MetMSLine documentation built on May 9, 2019, 10:03 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

WMBEdmands/MetMSLine
an automated and fully integrated pipeline for rapid processing of high-resolution LC-MS metabolomic datasets.

In WMBEdmands/MetMSLine: an automated and fully integrated pipeline for rapid processing of high-resolution LC-MS metabolomic datasets.

1. Read in peak table and co-variate information and pre-process the data.

2. PCA projection, automatic outlier removal and score plot cluster identification.

3. Univariate statistical analysis by co-variate based automatic test type selection.

R Package Documentation

Browse R Packages

We want your feedback!

WMBEdmands/MetMSLine an automated and fully integrated pipeline for rapid processing of high-resolution LC-MS metabolomic datasets.

In WMBEdmands/MetMSLine: an automated and fully integrated pipeline for rapid processing of high-resolution LC-MS metabolomic datasets.

1. Read in peak table and co-variate information and pre-process the data.

2. PCA projection, automatic outlier removal and score plot cluster identification.

3. Univariate statistical analysis by co-variate based automatic test type selection.

R Package Documentation

Browse R Packages

We want your feedback!

WMBEdmands/MetMSLine
an automated and fully integrated pipeline for rapid processing of high-resolution LC-MS metabolomic datasets.