regression_summary: Summary statistics for the regression data sets

Description Usage Format Source

Description

Summary statistics for the regression data sets

Usage

1

Format

A data frame with 13 variables:

did:

OpenML identifier (see https://www.openml.org/)

MajorityClassSize:

Number of instances in majority class of target variable (always -1)

MaxNominalAttDistinctValues:

Maximum number of distinct values among attributes of the nominal type

MinorityClassSize:

Number of instances in minority class of target variable (always -1)

NumberOfClasses:

Number of classes in target variable

NumberOfFeatures:

Total number of features (equal to number of columns)

NumberOfInstances:

Number of data observations (equal to number of rows)

NumberOfInstancesWithMissingValues:

Number of instances with missing values (always 0)

NumberOfMissingValues:

Number of missing values (always 0)

NumberOfNumericFeatures:

Number of numeric features

NumberOfSymbolicFeatures:

Number of symbolic features

name:

Dataset name

status:

All datasets are currently categorised as 'active'

For further details, see https://www.openml.org/

Source

https://github.com/EpistasisLab/penn-ml-benchmarks


makeyourownmaker/pmlblite documentation built on Feb. 13, 2020, 11:46 p.m.