nguforche/UnsupRF: Unsupervised Random Forest Clustering

Cluster data using randomForest proximities. A Random forest classifier is trained to predict the data labeled as class "True.Data" and a synthetic data labeled as class "Synthetic.Data". The synthetic data is generated by random sampling from the emperical distribution of the true data or by permuting true data. The proximities between observations in the true data is converted to a dissimilarity matrix and can be used by any clustering algorithm that accepts a dissimilarity matrix. Several routines for cluster validation and determination of optimal number of clusters are also implemented.

Getting started

Package details

AuthorChe Ngufor
MaintainerChe Ngufor <Ngufor.Che@mayo.edu>
LicenseGPL(>= 2)
Version1.0
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("nguforche/UnsupRF")
nguforche/UnsupRF documentation built on May 5, 2019, 4:51 p.m.