w4mclstrpeakpics
packagew4mclstrpeakpics::pool_peak_assessment
produces a figure comprising four
graphs to assess the similarities and differences among peaks in a cluster
of samples using XCMS-preprocessed data files as input.
The figures are described in sections below.
The purpose of the w4mclstrpeakpics R package is to provide the computational back-end of a Galaxy tool for inclusion in Workflow4Metabolomics (W4M).
Galaxy tools are file-oriented; because of this, the
w4mclstrpeakpics::pool_peak_assessment
method reads from and writes to
files. General-purpose R packages usually use data structures in memory for
their input and output, which may mean that this R package is not generally
useful outside of the context of Galaxy.
pool_peak_assessment
functionA Galaxy tool wrapper invokes pool_peak_assessment
. For exploratory or
debugging purposes, the package may be installed loaded into R and help may
then be obtained with the following command:
?w4mclstrpeakpics::pool_peak_assessment
W4M uses the XCMS and CAMERA packages to preprocess GC-MS or LC-MS data,
producing three files (which are documented in detail on the
Workflow4Metabolomics (W4M) web site).
In summary:
sampleMetadata.tsv
: a tab-separated file with metadata for the samples,
one line per sample
One column of this file indicates the class of the sample.
It is the class that is used by this function to determine whether to include the sample in, or exclude the sample from, further analysis.
variableMetadata.tsv
: a tab-separated file with metadata for the features
detected, one line per feature
A feature is a location in the two dimensional space defined by the GC-MS or LC-MS data set, which corresponds to a compound or a group of compounds.
m/z
.The other dimension is the retention time
, i.e., how long until the
solvent gradient eluted the compound(s) from the column.
dataMatrix.tsv
: a tab separated file with the MS intensities for each
sample for each feature:
There is one column per sample.
NA
.The pool_peak_assessment
function reads these files and produces four
graphs. Inputs arguments are as follows:
sample_selector_column_name
- string input: column of W4M/XCMS sampleMetadata holding selector string values (default: "sampleType").
sample_selector_value
- string input: value within selector column to identify samples for analysis (default: "pool").
sample_metadata_path
- string input: path to W4M/XCMS sampleMetadata tab-separated values file.
variable_metadata_path
- string input: path to W4M/XCMS variableMetadata tab-separated values file.
data_matrix_path
- string input: path to W4M/XCMS dataMatrix tab-separated values file.
output_pdf
- string output: path to write assessment figure PDF.
output_tsv
- string output: path to write assessment summary tab-separated values file.
output_rdata
- string output: (optional) path to write RData containing all processing and plotting intermediates.
Feature Number and Likelihood
graphThe upper left graph in the output figure shows the following:
Ideally, there would be an upward trend from left to right; if not, XCMS peak-picking parameters may need to be adjusted to suppress low-intensity "noise" peaks or to address peak-splitting.
Peak Intensity
graphThe lower left graph in the output figure presents the data in the upper figure without summarization, so that "the eye" can do the interpretation. It shows the following:
Ideally, there would be an upward trend from left to right, with more points on the right; if not, XCMS peak-picking parameters may need to be adjusted to suppress low-intensity "noise" peaks or to address peak-splitting.
Symbol area/intensity reflect ion intensity
graphThe upper right graph in the output figure shows the following:
Consequently, if the graph has a lot of large, dark, blueish symbols for repeated runs of a pooled sample, there is strong evidence that the XCMS peak-picking parameters need adjustment to make peak-picking more consistent.
Symbol size/shape reflects prevalence
graphBy contrast with the area/intensity graph, lower right graph's primary purpose is to communicate prevalence of a feature among the samples. It`shows the following:
Consequently, if the graph has a lot of small, vivid symbols for repeated runs of a pooled sample, there is strong evidence that the XCMS peak-picking parameters need adjustment to make peak-picking more consistent.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.