Description Usage Arguments Details Value References See Also Examples
One Side Selection is an undersampling method resulting from the application of Tomek links followed by the application of Condensed Nearest Neighbor.
1 |
X |
the input variables of the unbalanced dataset. |
Y |
the response variable of the unbalanced dataset. It must be a binary factor where the majority class is coded as 0 and the minority as 1. |
verbose |
print extra information (TRUE/FALSE) |
In order to compute nearest neighbors, only numeric features are allowed.
The function returns a list:
X |
input variables |
Y |
response variable |
M. Kubat, S. Matwin, et al. Addressing the curse of imbalanced training sets: one-sided selection. In MACHINE LEARNING-INTERNATIONAL WORKSHOP THEN CONFERENCE-, pages 179-186. MORGAN KAUFMANN PUBLISHERS, INC., 1997.
1 2 3 4 5 6 7 8 | library(unbalanced)
data(ubIonosphere)
n<-ncol(ubIonosphere)
output<-ubIonosphere$Class
input<-ubIonosphere[ ,-n]
data<-ubOSS(X=input, Y= output)
newData<-cbind(data$X, data$Y)
|
Loading required package: mlr
Loading required package: ParamHelpers
Loading required package: foreach
Loading required package: doParallel
Loading required package: iterators
Loading required package: parallel
Instances removed 31 : 14.62 % of 0 class ; 9.17 % of training ; Time needed 0
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.