inst/help/mlClusteringRandomForest.md

Random Forest Clustering

Random Forest clustering is a hard partitioning algorithm which aims to partition data into several clusters, where each observation belongs to only one group. This clustering method uses the Random Forest algorithm in an unsupervised way, with the outcome variable 'y' set to NULL. The Random Forest algorithm generates a proximity matrix which gives an estimate of the distance between observations based on the frequency of observations ending up in the same leaf node.

Assumptions

Input

Assignment Box

Tables

Plots

Training Parameters

Algorithmic Settings

Cluster Determination

Add Predicted Clusters to Data

Generates a new column in your dataset with the cluster labels of your cluster result. This gives you the option to inspect, classify, or predict the generated cluster labels.

Output

Random Forest Clustering Model Table

Random Forest Cluster Information

Evaluation Metrics Table

References

R-packages

Example



jasp-stats/jaspMachineLearning documentation built on April 5, 2025, 3:52 p.m.