README.md

dcme

Overview

The dcme package provides functions to compute data complexity measures.

Installation

dcme is under development and not yet available on CRAN. You can install the development version using the devtools package as follows:

# install.packages("devtools")
devtools::install_github("RomeroBarata/dcme")

Data Complexity Measures

The following complexity measures are currently implemented:

Simple Measures

num_examples_majority, num_examples_minority, proportion_examples_majority, proportion_examples_minority, and IR are defined only for binary data sets.

Statistical Measures

Measures of Overlap of Individual Feature Values

Unfortunately the F1 and F2 measures are implemented only for binary data sets. General versions will be made available soon.

Measures of Separability of Classes

Measures of Geometry, Topology, and Density of Manifolds

References

Definitions and explanations of most functions implemented in the dcme package can be found in the following literature:

[1] Michie, D., Spiegelhalter, D. J., & Taylor, C. C. (1994). Machine learning, neural and statistical classification.

[2] Ho, T. K., & Basu, M. (2002). Complexity measures of supervised classification problems. IEEE transactions on pattern analysis and machine intelligence, 24(3), 289-300.



RomeroBarata/dcme documentation built on May 9, 2019, 2:24 p.m.