fastcforest | R Documentation |
Parallelized version of cforest
function from party
package, which is an implementation of the random forest and bagging ensemble algorithms utilizing conditional inference trees as base learners.
fastcforest(formula, data = list(), subset = NULL, weights = NULL,
controls = party::cforest_unbiased(),
xtrafo = ptrafo, ytrafo = ptrafo, scores = NULL,
parallel = TRUE)
formula |
a symbolic description of the model to be fit. Note that symbols like |
data |
a data frame containing the variables in the model |
subset |
an optional vector specifying a subset of observations to be used in the fitting process |
weights |
an optional vector of weights to be used in the fitting process. Non-negative integer valued weights are allowed as well as non-negative real weights. Observations are sampled (with or without replacement) according to probabilities |
controls |
an object of class |
xtrafo |
a function to be applied to all input variables. By default, the |
ytrafo |
a function to be applied to all response variables. By default, the |
scores |
an optional named list of scores to be attached to ordered factors |
parallel |
Logical indicating whether or not to run |
See cforest
documentation for details.
The code for parallelization is inspired by https://stackoverflow.com/questions/36272816/train-a-cforest-in-parallel
An object of class RandomForest-class
.
Nicolas Robette
Leo Breiman (2001). Random Forests. Machine Learning, 45(1), 5–32.
Torsten Hothorn, Berthold Lausen, Axel Benner and Martin Radespiel-Troeger (2004). Bagging Survival Trees. Statistics in Medicine, 23(1), 77–91.
Torsten Hothorn, Peter Buhlmann, Sandrine Dudoit, Annette Molinaro and Mark J. van der Laan (2006a). Survival Ensembles. Biostatistics, 7(3), 355–373.
Torsten Hothorn, Kurt Hornik and Achim Zeileis (2006b). Unbiased Recursive Partitioning: A Conditional Inference Framework. Journal of Computational and Graphical Statistics, 15(3), 651–674. Preprint available from https://www.zeileis.org/papers/Hothorn+Hornik+Zeileis-2006.pdf
Carolin Strobl, Anne-Laure Boulesteix, Achim Zeileis and Torsten Hothorn (2007). Bias in Random Forest Variable Importance Measures: Illustrations, Sources and a Solution. BMC Bioinformatics, 8, 25. https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-8-25
Carolin Strobl, James Malley and Gerhard Tutz (2009). An Introduction to Recursive Partitioning: Rationale, Application, and Characteristics of Classification and Regression Trees, Bagging, and Random forests. Psychological Methods, 14(4), 323–348.
cforest
, fastvarImp
## classification
data(iris)
iris2 = iris
iris2$Species = factor(iris$Species=="versicolor")
iris.cf = fastcforest(Species~., data=iris2, parallel=FALSE)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.