Simulated Metabolomics Data
This is a simulated dataset to show the format of the metabolomics data; patterns of missing data are generated roughly from a real metabolomics experiment. Rows represent metabolites and columns represent samples. The file contains 100 metabolites (rows) and 505 samples (480 biological sample columns and 25 pooled plasma columns) sorted by injection order. There are 20 biological samples between pooled plasma runs. Pooled plasma columns have prefix ‘PPP’ and biological samples are simple integers with no prefix.
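The column counts above can be checked with a short base R sketch. It assumes the run starts and ends with a pooled plasma column, which is the arrangement that gives 24 blocks of 20 biological samples between 25 pooled runs. The exact IDs (`PPP1` to `PPP25`, `1` to `480`) are illustrative and may not match the IDs in the actual file.

```r
# Sketch of the injection-order layout under the assumptions stated above.
n_pooled   <- 25   # pooled plasma columns
block_size <- 20   # biological samples between consecutive pooled runs

pooled_ids <- paste0("PPP", seq_len(n_pooled))                  # hypothetical IDs
bio_ids    <- as.character(seq_len((n_pooled - 1) * block_size))
bio_blocks <- split(bio_ids, rep(seq_len(n_pooled - 1), each = block_size))

# Interleave: pooled run, 20 biological samples, pooled run, ...
ids <- pooled_ids[1]
for (k in seq_len(n_pooled - 1)) {
  ids <- c(ids, bio_blocks[[k]], pooled_ids[k + 1])
}

is_pooled <- startsWith(ids, "PPP")
n_total   <- length(ids)      # all sample columns
n_bio     <- sum(!is_pooled)  # biological sample columns
```

Here `is_pooled` uses the ‘PPP’ prefix to tell the two column types apart, the same rule the file format relies on.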
The first row (Date) contains the date of processing. The second row (Inject) contains the injection number and is ordered from 1 to 505. The third row contains the column headers:
Metab is the metabolite ID.
Meth is the type of metabolite.
HMDB is the HMDB ID of the metabolite, if it exists.
m/z is the mass-to-charge ratio of the metabolite.
rt is the retention time.
Com contains any comments.
ProcID is the processing ID of the metabolite.
The remaining columns are either pooled plasma samples (prefix ‘PPP’) or biological samples (no prefix). The csv file therefore has three header rows (Date, Inject, and the column headers) followed by one row per metabolite.
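As a rough illustration of this layout, the base R sketch below writes a tiny made-up file with the same three header rows and reads it back, splitting pooled plasma columns from biological columns by the ‘PPP’ prefix. All values are invented, and placing the `Date` and `Inject` labels in the ProcID column is an assumption about the file. This is not how `read.met` is implemented.

```r
# Toy file with the three header rows described above (all values invented).
# Assumption: the Date and Inject labels sit in the ProcID column.
toy <- c(
  ",,,,,,Date,25,25,25,25",
  ",,,,,,Inject,1,2,3,4",
  "Metab,Meth,HMDB,m/z,rt,Com,ProcID,PPP1,1,2,PPP2",
  "M1,Lipid,HMDB00001,100.1,1.2,,P1,10.5,9.8,,11.0",
  "M2,Lipid,,200.2,2.3,,P2,20.1,,19.7,20.4"
)
path <- tempfile(fileext = ".csv")
writeLines(toy, path)

# Read everything as text first; empty cells become NA.
raw <- read.csv(path, header = FALSE, stringsAsFactors = FALSE,
                na.strings = "", colClasses = "character")

header      <- unlist(raw[3, ], use.names = FALSE)  # third row: column headers
sample_cols <- 8:ncol(raw)                          # columns after ProcID
ids         <- header[sample_cols]
is_pooled   <- startsWith(ids, "PPP")               # pooled plasma vs biological

# Abundance matrix: metabolites in rows, samples in columns.
values <- as.matrix(raw[-(1:3), sample_cols])
storage.mode(values) <- "numeric"
dimnames(values) <- list(raw[-(1:3), 1], ids)

pooled     <- values[, is_pooled, drop = FALSE]
biological <- values[, !is_pooled, drop = FALSE]
inject     <- as.integer(unlist(raw[2, sample_cols]))  # injection order

# Proportion of missing values per metabolite among biological samples.
missing_rate <- rowMeans(is.na(biological))
```

Reading the file as text and converting only the sample columns keeps the annotation columns (HMDB, Com, and so on) from being coerced to numbers.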
See read.met for an example of reading this csv file.
See MetProc-package for examples of running the full process.