neighborhood: Measures of neighborhood

In victorhb/ImbCoL: Data Complexity Measures for Imbalanced Classification Tasks


View source: R/neighborhood.R

Neighborhood measures for classification tasks. They analyze the neighborhoods of the data items to capture class overlap and the shape of the decision boundary. They operate on a distance matrix that stores the distances between all pairs of data points in the dataset.
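
As a rough illustration of the kind of input these measures operate on, the sketch below builds a pairwise distance matrix for iris with base R's `dist()`. Euclidean distance is an assumption made for the example; the package may compute its distances differently.

```r
# Pairwise distance matrix over the input attributes (illustrative only).
# Assumption: Euclidean distance; the package's own metric may differ.
data(iris)
x <- iris[, 1:4]
d <- as.matrix(dist(x, method = "euclidean"))

dim(d)          # one row and one column per data point
isSymmetric(d)  # distance from i to j equals distance from j to i
```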

neighborhood(...)

## Default S3 method:
neighborhood(x, y, measures = "all", ...)

## S3 method for class 'formula'
neighborhood(formula, data, measures = "all", ...)

`...`	Not used.
`x`	A data.frame containing only the input attributes.
`y`	A factor response vector with one label for each row/component of x.
`measures`	A list of measure names, or `"all"` to include all of them.
`formula`	A formula to define the class column.
`data`	A data.frame dataset containing the input attributes and the class.

The following measures are allowed for this method:

"N1": Fraction of borderline points (N1) computes the fraction of vertices incident to edges connecting examples of different classes in a Minimum Spanning Tree (MST). The default package for building the MST is igraph. If your data contain duplicated instances, we suggest using ape.
"N2": Ratio of intra/extra class nearest neighbor distance (N2) computes the ratio of two sums: intra-class and inter-class. The former is the sum of the distances between each example and its closest neighbor from the same class. The latter is the sum of the distances between each example and its closest neighbor from another class (its nearest enemy).
"N3": Error rate of the nearest neighbor classifier (N3) is the error rate of a one Nearest Neighbor (1NN) classifier, estimated using a leave-one-out procedure on the dataset.
"N4": Non-linearity of the nearest neighbor classifier (N4) creates a new dataset by randomly interpolating pairs of training examples of the same class, then induces a 1NN classifier on the original data and measures its error rate on the new data points.
"T1": Fraction of hyperspheres covering data (T1) builds hyperspheres centered at each of the training examples, whose radii grow until the hypersphere reaches an example of another class. Afterwards, smaller hyperspheres contained in larger hyperspheres are eliminated. T1 is finally defined as the ratio between the number of remaining hyperspheres and the total number of examples in the dataset.
"LSC": Local Set Average Cardinality (LSC) is based on the Local Set (LS) of an example, defined as the set of points in the dataset whose distance to that example is smaller than the distance from the example to its nearest enemy (the closest example of a different class). LSC is the average cardinality of these local sets.
"X_partial": X in [N1, N2, N3, N4, T1]. The decomposed version of the corresponding X measure: instead of a single complexity value for the dataset, it returns one complexity value per class.
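
To make the nearest-neighbor quantities behind N2, N3 and LSC more concrete, here is a minimal base-R sketch. It is not the package's implementation: `nn_sketch` is a hypothetical helper, Euclidean distance is assumed, and the package may normalize or transform these raw values before returning them.

```r
# Illustrative sketch of the quantities behind N2, N3 and LSC.
# Assumptions: Euclidean distance; ties resolved by first index;
# `nn_sketch` is a hypothetical helper, not part of the package.
nn_sketch <- function(x, y) {
  y <- as.character(y)
  d <- as.matrix(dist(x))
  diag(d) <- Inf                 # an example is not its own neighbor
  n <- nrow(d)
  same <- outer(y, y, "==")      # TRUE where two examples share a class

  # Distance to the nearest same-class neighbor and to the nearest enemy
  intra <- sapply(seq_len(n), function(i) min(d[i, same[i, ]]))
  inter <- sapply(seq_len(n), function(i) min(d[i, !same[i, ]]))

  # N3-style quantity: leave-one-out error rate of a 1NN classifier
  nn <- apply(d, 1, which.min)
  loo_error <- mean(y[nn] != y)

  # Local set size: points closer to example i than its nearest enemy
  ls_size <- sapply(seq_len(n), function(i) sum(d[i, ] < inter[i]))

  list(ratio = sum(intra) / sum(inter),  # raw intra/inter ratio (N2-style)
       loo_error = loo_error,
       mean_ls = mean(ls_size))
}

data(iris)
res <- nn_sketch(iris[, 1:4], iris$Species)
str(res)
```

A small ratio and a low leave-one-out error both suggest well-separated classes, while large local sets indicate that examples sit far from their nearest enemies.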

A list named by the requested neighborhood measure.

Other complexity-measures: linearity.class, overlapping


## Extract all neighborhood measures
library(ImbCoL)
data(iris)
neighborhood(Species ~ ., iris)

victorhb/ImbCoL documentation built on May 20, 2021, 12:18 p.m.
