scutr: Balancing Multiclass Datasets for Classification Tasks

Imbalanced training datasets impede many popular classifiers. To balance training data, a combination of oversampling minority classes and undersampling majority classes is useful. This package implements the SCUT (SMOTE and Cluster-based Undersampling Technique) algorithm as described in Agrawal et. al. (2015) <doi:10.5220/0005595502260234>. Their paper uses model-based clustering and synthetic oversampling to balance multiclass training datasets, although other resampling methods are provided in this package.

Getting started

Package details

AuthorKeenan Ganz [aut, cre]
MaintainerKeenan Ganz <ganzkeenan1@gmail.com>
LicenseMIT + file LICENSE
Version0.2.0
URL https://github.com/s-kganz/scutr
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("scutr")

Try the scutr package in your browser

Any scripts or data that you put into this service are public.

scutr documentation built on Nov. 18, 2023, 1:08 a.m.