mzrf: Random Forests Analysis of LCMS data.
In brgordon17/coralclass: Metabolomics Classification of Stressed Coral From Heron Island 2011

Description Usage Arguments Details Value Note Author(s) See Also

mzrf() was used to perform the random forests analysis of the LCMS data (./data/mzdata.rda)

1
2
3

mzrf(parallel = TRUE, save.model = FALSE, view.plot = TRUE,
  save.plot = FALSE, plot.name = "mzrf_cv_plot",
  model.name = "mzrf_model", seed = 1978, pred.results = TRUE, ...)

`parallel`	Logical indicating if parallel processing should be used.
`save.model`	Logical indicating if the model should be saved.
`view.plot`	Logical indicating if a plot of accuracy vs mtry should be printed to the plot viewer.
`save.plot`	Logical indicating if plot should be saved to a `.pdf` in the `./figs` directory.
`plot.name`	Name of plot if `save.plot = TRUE`.
`model.name`	Name of model if `save.model = TRUE`.
`seed`	An integer for setting the RNG state.
`pred.results`	Logical indicating if the results of predicting the test data should be printed to the console.
`...`	Other arguments passed on to individual methods.

mzrf() loads mzdata and performs a RF analysis of the data using mzdata$class as outcomes. The process is outlined as follows:

The data is split into training and test sets using an 80:20 stratified split according to class and day mzdata$class_day.
A list of random seeds is produced for each iteration of the CV process. For the 10-fold, repeated (3 times) CV used here, we require 10 * 3 seeds for each mtry value assesed (tune grid length).
Define a tuning grid of mtry values. In this case we assess mtry values c(25, 75, 100, seq(from = 100, to = 500, by = 50)) giving a tunegrid length of 12.
Define the CV parameters. 10 folds, 3 repeats, default summary. We also define the method for selecting the best tune. In this case, the best tune is the simplest model within one standard error of the empirically optimal model. This rule, as described by Breiman et al. (1984), may avoid overfitting the model. Note that k-fold CV as performed using trainControl(method = "repeatedcv") stratifies sampling according to class.
The data is centred by subtracting the mean of the predictor's data from the predictor values
The data is scaled by dividing the predictor's by the standard deviation.
The model is run.

returns a list with class train.

Although this function is exported, mzrf() was not intended to be used outside of this package.

Benjamin R. Gordon

train ggplot The caret Package by Max Kuhn (2017)

brgordon17/coralclass documentation built on June 15, 2020, 9:21 p.m.

brgordon17/coralclass index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

brgordon17/coralclass
Metabolomics Classification of Stressed Coral From Heron Island 2011

mzrf: Random Forests Analysis of LCMS data.
In brgordon17/coralclass: Metabolomics Classification of Stressed Coral From Heron Island 2011

Description

Usage

Arguments

Details

Value

Note

Author(s)

See Also

Related to mzrf in brgordon17/coralclass...

R Package Documentation

Browse R Packages

We want your feedback!

brgordon17/coralclass Metabolomics Classification of Stressed Coral From Heron Island 2011

mzrf: Random Forests Analysis of LCMS data. In brgordon17/coralclass: Metabolomics Classification of Stressed Coral From Heron Island 2011

Description

Usage

Arguments

Details

Value

Note

Author(s)

See Also

Related to mzrf in brgordon17/coralclass...

R Package Documentation

Browse R Packages

We want your feedback!

brgordon17/coralclass
Metabolomics Classification of Stressed Coral From Heron Island 2011

mzrf: Random Forests Analysis of LCMS data.
In brgordon17/coralclass: Metabolomics Classification of Stressed Coral From Heron Island 2011