Random forest based clustering
Creates a clustering of random forest training instances. Random forest provides proximity of its training instances based on their out-of-bag classification. This information is usually passed to visualizations (e.g., scaling) and attribute importance measures.
a random forest model returned by
number of clusters
The method calls
pam function for clustering, initializing its distance matrix with random forest based similarity by calling
rfProximity with argument
An object of class
pam representing the clustering (see
?pam.object for details),
the most important being a vector of cluster assignments (named
cluster) to training instances used to generate the
John Adeyanju Alao (as a part of his BSc thesis) and Marko Robnik-Sikonja (thesis supervisor)
Leo Breiman: Random Forests. Machine Learning Journal, 45:5-32, 2001
1 2 3 4 5
Want to suggest features or report bugs for rdrr.io? Use the GitHub issue tracker.