utiml: utiml: Utilities for Multi-Label Learning
In utiml: Utilities for Multi-Label Learning

Description Details Notes Cite as Author(s)

The utiml package is a framework for the application of classification algorithms to multi-label data. Like the well known MULAN used with Weka, it provides a set of multi-label procedures such as sampling methods, transformation strategies, threshold functions, pre-processing techniques and evaluation metrics. The package was designed to allow users to easily perform complete multi-label classification experiments in the R environment.

Currently, the main methods supported are:

Classification methods: ML Baselines, Binary Relevance (BR), BR+, Classifier Chains, Calibrated Label Ranking (CLR), Dependent Binary Relevance (DBR), Ensemble of Binary Relevance (EBR), Ensemble of Classifier Chains (ECC), Ensemble of Pruned Set (EPS), Hierarchy Of Multilabel classifiER (HOMER), Label specIfic FeaTures (LIFT), Label Powerset (LP), Meta-Binary Relevance (MBR or 2BR), Multi-label KNN (ML-KNN), Nested Stacking (NS), Pruned Problem Transformation (PPT), Pruned and Confident Stacking Approach (Prudent), Pruned Set (PS), Random k-labelsets (RAkEL), Recursive Dependent Binary Relevance (RDBR), Ranking by Pairwise Comparison (RPC)
Evaluation methods: Performing a cross-validation procedure, Confusion Matrix, Evaluate, Supported measures
Pre-process utilities: Fill sparse data, Normalize data, Remove attributes, Remove labels, Remove skewness labels, Remove unique attributes, Remove unlabeled instances, Replace nominal attributes
Sampling methods: Create holdout partitions, Create k-fold partitions, Create random subset, Create subset, Partition fold
Threshold methods: Fixed threshold, Cardinality threshold, MCUT, PCUT, RCUT, SCUT, Subset correction

However, there are other utilities methods not previously cited as as.bipartition, as.mlresult, as.ranking, multilabel_prediction, etc. More details and examples are available on utiml repository.

We use the mldr package, to manipulate multi-label data. See its documentation to more information about handle multi-label dataset.

  @article{RJ-2018-041,
     author = {Adriano Rivolli and Andre C. P. L. F. de Carvalho},
     title = {{The utiml Package: Multi-label Classification in R}},
     year = {2018},
     journal = {{The R Journal}},
     doi = {10.32614/RJ-2018-041},
     url = {https://doi.org/10.32614/RJ-2018-041},
     pages = {24--37},
     volume = {10},
     number = {2}
 }