RWO | R Documentation |
Resampling with RWO
RWO(x, y)
x |
feature matrix. |
y |
a factor class variable with two classes. |
RWO (Zhang and Li, 2014) is an oversampling method which generates data using variable standard error in a way that it preserves the variances of all variables.
Can work with classes more than 2.
a list with resampled dataset.
x_new |
Resampled feature matrix. |
y_new |
Resampled target variable. |
x_syn |
Generated synthetic feature data. |
y_syn |
Generated synthetic label data. |
Fatih Saglam, saglamf89@gmail.com
Zhang, H., & Li, M. (2014). RWO-Sampling: A random walk over-sampling approach to imbalanced data classification. Information Fusion, 20, 99-116.
set.seed(1)
x <- rbind(matrix(rnorm(2000, 3, 1), ncol = 2, nrow = 1000),
matrix(rnorm(100, 5, 1), ncol = 2, nrow = 50))
y <- as.factor(c(rep("negative", 1000), rep("positive", 50)))
plot(x, col = y)
# resampling
m <- RWO(x = x, y = y)
plot(m$x_new, col = m$y_new)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.