generate_synthetic_data: Generate synthetic data for random forest
In talegari/forager: Compute auxiliary information (proximity, dissimilarity, outlyingness, depth) and imputation from tree ensembles on new data

View source: R/generate_synthetic_data.R

Unsupervised learning of randomforest as suggested by Brieman (https://www.stat.berkeley.edu/~breiman/RandomForests/cc_home.htm#unsup) involves creating synthetic data by sampling randomly from unvariate distributions of each covariate(feature). This supports two methods: First, where proportions or distribition is taken into account when sampling at random, second where the data is sampled assuming uniform distribution. The former corresponds to "Addcl1" from Horvath's paper (Unsupervised Learning With Random Forest Predictors: Tao Shi & Steve Horvath) and latter corresponds to "addc2".

1	generate_synthetic_data(dataset, prop, seed)

`dataset`	A dataframe
`prop`	Random sampling of covariates (when prop = TRUE) to generate synthetic data. Else, uniform sampling is used.
`seed`	Seed for sampling.

A dataframe with synthetic data.

talegari/forager documentation built on May 3, 2019, 4:01 p.m.

talegari/forager index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

talegari/forager
Compute auxiliary information (proximity, dissimilarity, outlyingness, depth) and imputation from tree ensembles on new data

generate_synthetic_data: Generate synthetic data for random forest
In talegari/forager: Compute auxiliary information (proximity, dissimilarity, outlyingness, depth) and imputation from tree ensembles on new data

Description

Usage

Arguments

Value

Related to generate_synthetic_data in talegari/forager...

R Package Documentation

Browse R Packages

We want your feedback!

talegari/forager Compute auxiliary information (proximity, dissimilarity, outlyingness, depth) and imputation from tree ensembles on new data

generate_synthetic_data: Generate synthetic data for random forest In talegari/forager: Compute auxiliary information (proximity, dissimilarity, outlyingness, depth) and imputation from tree ensembles on new data

Description

Usage

Arguments

Value

Related to generate_synthetic_data in talegari/forager...

R Package Documentation

Browse R Packages

We want your feedback!

talegari/forager
Compute auxiliary information (proximity, dissimilarity, outlyingness, depth) and imputation from tree ensembles on new data

generate_synthetic_data: Generate synthetic data for random forest
In talegari/forager: Compute auxiliary information (proximity, dissimilarity, outlyingness, depth) and imputation from tree ensembles on new data