s-kganz/scutr: Balancing Multiclass Datasets for Classification Tasks

Imbalanced training datasets impede many popular classifiers. To balance training data, a combination of oversampling minority classes and undersampling majority classes is useful. This package implements the SCUT (SMOTE and Cluster-based Undersampling Technique) algorithm as described in Agrawal et. al. (2015) <doi:10.5220/0005595502260234>. Their paper uses model-based clustering and synthetic oversampling to balance multiclass training datasets, although other resampling methods are provided in this package.

Getting started

Package details

MaintainerKeenan Ganz <ganzkeenan1@gmail.com>
LicenseMIT + file LICENSE
Version0.2.0
URL https://github.com/s-kganz/scutr
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("s-kganz/scutr")
s-kganz/scutr documentation built on Nov. 23, 2023, 11:41 p.m.