semiArtificial: Generator of Semi-Artificial Data

Package semiArtificial contains methods to generate and evaluate semi-artificial data sets. Starting from a given data set, the methods learn its properties with machine learning algorithms and generate new data with the same properties.

The package currently includes the following data generators:
- an RBF network based generator using rbfDDA from the RSNNS package,
- a random forest based generator for both classification and regression problems,
- a density forest based generator for unsupervised data.

Data evaluation support tools include:
- single attribute based statistical evaluation: mean, median, standard deviation, skewness, kurtosis, medcouple, L/RMC, KS test, Hellinger distance,
- evaluation based on clustering using the Adjusted Rand Index (ARI) and FM,
- evaluation based on classification performance with various learning models, e.g., random forests.
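As an illustration of the single attribute evaluation listed above, the following is a minimal Python sketch of a Hellinger distance between one numeric attribute in the original data and the same attribute in generated data. It is not the package's implementation: the function name `hellinger_distance`, the equal-width binning, and the default of 10 bins are assumptions made for this example, and the two "generated" samples are simply drawn from normal distributions rather than produced by any of the generators.

```python
import math
import random


def hellinger_distance(original, generated, bins=10):
    """Hellinger distance between equal-width histograms of two numeric samples.

    Hypothetical helper for illustration; binning scheme is an assumption.
    Returns a value in [0, 1]: 0 for identical histograms, 1 for disjoint ones.
    """
    lo = min(min(original), min(generated))
    hi = max(max(original), max(generated))
    # Fall back to width 1.0 when all values are equal, to avoid dividing by zero.
    width = (hi - lo) / bins or 1.0

    def histogram(values):
        counts = [0] * bins
        for v in values:
            # The maximum value would fall just past the last bin, so clamp it.
            idx = min(int((v - lo) / width), bins - 1)
            counts[idx] += 1
        return [c / len(values) for c in counts]

    p, q = histogram(original), histogram(generated)
    # Bhattacharyya coefficient: overlap between the two histograms.
    bc = sum(math.sqrt(pi * qi) for pi, qi in zip(p, q))
    # max() guards against tiny negative values caused by rounding.
    return math.sqrt(max(0.0, 1.0 - bc))


rng = random.Random(0)
original = [rng.gauss(0.0, 1.0) for _ in range(500)]
similar = [rng.gauss(0.0, 1.0) for _ in range(500)]   # same distribution
shifted = [rng.gauss(3.0, 1.0) for _ in range(500)]   # mean moved by 3

d_similar = hellinger_distance(original, similar)
d_shifted = hellinger_distance(original, shifted)
print(f"similar: {d_similar:.3f}, shifted: {d_shifted:.3f}")
```

A generated attribute that follows the original distribution closely should give a distance near 0, while a poorly matched one moves toward 1. The result depends on the number of bins, so distances are only comparable when computed with the same binning.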

Author: Marko Robnik-Sikonja
Date of publication: 2015-09-04 01:11:01
Maintainer: Marko Robnik-Sikonja <marko.robnik@fri.uni-lj.si>
License: GPL-3
Version: 2.0.1
http://lkm.fri.uni-lj.si/rmarko/software/
