RFcluster | R Documentation |
This function adapts random forests to work (albeit clumsily and inefficiently) with clustered categorical outcome data. For example, there may be multiple observations on individuals (clusters). Predictions are made fof the OOB (out of bag) clusters
RFcluster(formula, id, data, nfold = 15,
ntree=500, progress=TRUE, printit = TRUE, seed = 29)
formula |
Model formula |
id |
numeric, identifies clusters |
data |
data frame that supplies the data |
nfold |
numeric, number of folds |
ntree |
numeric, number of trees (number of bootstrap samples) |
progress |
Print information on progress of calculations |
printit |
Print summary information on accuracy |
seed |
Set seed, if required, so that results are exactly reproducible |
Bootstrap samples are taken of observations in the in-bag clusters. Predictions are made for all observations in the OOB clusters.
class |
Predicted values from cross-validation |
OOBaccuracy |
Cross-validation estimate of accuracy |
confusion |
Confusion matrix |
John Maindonald
https://maths-people.anu.edu.au/~johnm/nzsr/taws.html
## Not run:
library(mlbench)
library(randomForest)
data(Vowel)
RFcluster(formula=Class ~., id = V1, data = Vowel, nfold = 15,
ntree=500, progress=TRUE, printit = TRUE, seed = 29)
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.