Description Usage Arguments Details Value References See Also Examples
The function finds the points in the dataset that are tomek link using 1-NN and then removes only majority class instances that are tomek links.
1 |
X |
the input variables of the unbalanced dataset. |
Y |
the response variable of the unbalanced dataset. It must be a binary factor where the majority class is coded as 0 and the minority as 1. |
verbose |
print extra information (TRUE/FALSE) |
In order to compute nearest neighbors, only numeric features are allowed.
The function returns a list:
X |
input variables |
Y |
response variable |
id.rm |
index of instances removed |
I. Tomek. Two modifications of cnn. IEEE Trans. Syst. Man Cybern., 6:769-772, 1976.
1 2 3 4 5 6 7 8 | library(unbalanced)
data(ubIonosphere)
n<-ncol(ubIonosphere)
output<-ubIonosphere$Class
input<-ubIonosphere[ ,-n]
data<-ubTomek(X=input, Y= output)
newData<-cbind(data$X, data$Y)
|
Loading required package: mlr
Loading required package: ParamHelpers
Loading required package: foreach
Loading required package: doParallel
Loading required package: iterators
Loading required package: parallel
Instances removed 32 : 14.22 % of 0 class ; 9.12 % of training ; Time needed 0
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.