themis: Extra Recipes Steps for Dealing with Unbalanced Data

A dataset with an uneven number of cases in each class is said to be unbalanced. Many models produce a subpar performance on unbalanced datasets. A dataset can be balanced by increasing the number of minority cases using SMOTE 2011 <arXiv:1106.1813>, BorderlineSMOTE 2005 <doi:10.1007/11538059_91> and ADASYN 2008 <>. Or by decreasing the number of majority cases using NearMiss 2003 <> or Tomek link removal 1976 <>.

Package details

AuthorEmil Hvitfeldt [aut, cre] (<>)
MaintainerEmil Hvitfeldt <>
LicenseMIT + file LICENSE
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:

Try the themis package in your browser

Any scripts or data that you put into this service are public.

themis documentation built on June 13, 2021, 1:06 a.m.