by: David Hughes and Laura Corbin date: June 3rd 2019
Dec 22, 2022
To install do the following
quick install
install the metaboprep package with
R
devtools::install_github("MRCIEU/metaboprep")
from this repo download a copy of the following files
parameter_file.txt
You can also download or clone the entire repo with
R
git clone https://github.com/MRCIEU/metaboprep.git
alternatively you can download the package manually
install R package with:
R
devtools::install("metaboprep")
A common installation error is produced by installation errors of dependent packages. If you experience this, install those dependent packages manually with BiocManager, and then attempt the installation of metaboprep again. You might have to repeat this step more than once.
R
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("MISSINGPACKAGENAME")
Run the metaboprep pipeline on a terminal command line as follows:
R
Rscript run_metaboprep_pipeline.R /FULL/PATH/TO/example_data/excel/parameter_file.txt
If you experienced issues with the geneartion of the html report on an HPC, the report can be generated on a local machine as follows.
produce report with the function metaboprep::generate_report() as
```R output_dir_path = paste0("FULL/PATH/TO/DIR/OF/CHOICE/") rdfile = paste0(output_dir_path, "ReportData.Rdata") generate_report( full_path_2_Rdatafile = rdfile, dir_4_report = output_dir_path )
```
An example data set can be found in the folder "example_data" here on the repository. It is a simulated data set of 100 metabolites, for 100 samples. There is a (1) metabolon like (v1 format) excel file of the data set, and a (2) flat text (tab delim) version of the data set. Both are accompanied by a parameter file to help guide you with the example. This example data includes data from two hypothetical mass spectrometry run modes or platforms, "neg" and "pos". As such, a subset of the metabolites were run (simulated) in the "neg" run mode in two batches, and the second subset of metabolites were run (simulated) in the "pos" run mode in three batches. Each batch was simulated with different mean abundance values to help illustrate possible batch effects and the normalization procedure. If looking at the flat text version of the example, information on which metabolites were simulated in which run mode or platform can be found in the flat text file "feature_data.txt" in the column "platform". Further, to identify which samples belonged to which batch, for each run mode, you would use the columns "neg" and "pos" in the flat text file "sample_data.txt".
NOTE: Derived variable are those that are ratios or percentanges of two or more features already present in a data set, such as those found in Nightingale data.
---example figures provided for illustration---
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.