Description Details Notes Cite as Author(s)
The utiml package is a framework for the application of classification algorithms to multi-label data. Like the well known MULAN used with Weka, it provides a set of multi-label procedures such as sampling methods, transformation strategies, threshold functions, pre-processing techniques and evaluation metrics. The package was designed to allow users to easily perform complete multi-label classification experiments in the R environment.
Currently, the main methods supported are:
Classification methods:
ML Baselines,
Binary Relevance (BR),
BR+,
Classifier Chains,
Calibrated Label Ranking (CLR),
Dependent Binary Relevance (DBR),
Ensemble of Binary Relevance (EBR),
Ensemble of Classifier Chains (ECC),
Ensemble of Pruned Set (EPS),
Hierarchy Of Multilabel classifiER (HOMER),
Label specIfic FeaTures (LIFT),
Label Powerset (LP),
Meta-Binary Relevance (MBR or 2BR),
Multi-label KNN (ML-KNN),
Nested Stacking (NS),
Pruned Problem Transformation (PPT),
Pruned and Confident Stacking Approach (Prudent),
Pruned Set (PS),
Random k-labelsets (RAkEL),
Recursive Dependent Binary Relevance (RDBR),
Ranking by Pairwise Comparison (RPC)
Evaluation methods:
Performing a cross-validation procedure,
Confusion Matrix,
Evaluate,
Supported measures
Pre-process utilities:
Fill sparse data,
Normalize data,
Remove attributes,
Remove labels,
Remove skewness labels,
Remove unique attributes,
Remove unlabeled instances,
Replace nominal attributes
Sampling methods:
Create holdout partitions,
Create k-fold partitions,
Create random subset,
Create subset,
Partition fold
Threshold methods:
Fixed threshold,
Cardinality threshold,
MCUT,
PCUT,
RCUT,
SCUT,
Subset correction
However, there are other utilities methods not previously cited as
as.bipartition, as.mlresult,
as.ranking, multilabel_prediction, etc. More
details and examples are available on
utiml repository.
We use the mldr package, to manipulate multi-label data.
See its documentation to more information about handle multi-label dataset.
1 2 3 4 5 6 7 8 9 10 11 | @article{RJ-2018-041,
author = {Adriano Rivolli and Andre C. P. L. F. de Carvalho},
title = {{The utiml Package: Multi-label Classification in R}},
year = {2018},
journal = {{The R Journal}},
doi = {10.32614/RJ-2018-041},
url = {https://doi.org/10.32614/RJ-2018-041},
pages = {24--37},
volume = {10},
number = {2}
}
|
Adriano Rivolli <rivolli@utfpr.edu.br>
This package is a result of my PhD at Institute of Mathematics and Computer Sciences (ICMC) at the University of Sao Paulo, Brazil.
PhD advisor: Andre C. P. L. F. de Carvalho
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.