Description Details Notes Cite as Author(s)
The utiml package is a framework for the application of classification algorithms to multi-label data. Like the well known MULAN used with Weka, it provides a set of multi-label procedures such as sampling methods, transformation strategies, threshold functions, pre-processing techniques and evaluation metrics. The package was designed to allow users to easily perform complete multi-label classification experiments in the R environment.
Currently, the main methods supported are:
Classification methods:
ML Baselines
,
Binary Relevance (BR)
,
BR+
,
Classifier Chains
,
Calibrated Label Ranking (CLR)
,
Dependent Binary Relevance (DBR)
,
Ensemble of Binary Relevance (EBR)
,
Ensemble of Classifier Chains (ECC)
,
Ensemble of Pruned Set (EPS)
,
Hierarchy Of Multilabel classifiER (HOMER)
,
Label specIfic FeaTures (LIFT)
,
Label Powerset (LP)
,
Meta-Binary Relevance (MBR or 2BR)
,
Multi-label KNN (ML-KNN)
,
Nested Stacking (NS)
,
Pruned Problem Transformation (PPT)
,
Pruned and Confident Stacking Approach (Prudent)
,
Pruned Set (PS)
,
Random k-labelsets (RAkEL)
,
Recursive Dependent Binary Relevance (RDBR)
,
Ranking by Pairwise Comparison (RPC)
Evaluation methods:
Performing a cross-validation procedure
,
Confusion Matrix
,
Evaluate
,
Supported measures
Pre-process utilities:
Fill sparse data
,
Normalize data
,
Remove attributes
,
Remove labels
,
Remove skewness labels
,
Remove unique attributes
,
Remove unlabeled instances
,
Replace nominal attributes
Sampling methods:
Create holdout partitions
,
Create k-fold partitions
,
Create random subset
,
Create subset
,
Partition fold
Threshold methods:
Fixed threshold
,
Cardinality threshold
,
MCUT
,
PCUT
,
RCUT
,
SCUT
,
Subset correction
However, there are other utilities methods not previously cited as
as.bipartition
, as.mlresult
,
as.ranking
, multilabel_prediction
, etc. More
details and examples are available on
utiml repository.
We use the mldr
package, to manipulate multi-label data.
See its documentation to more information about handle multi-label dataset.
1 2 3 4 5 6 7 8 9 10 11 | @article{RJ-2018-041,
author = {Adriano Rivolli and Andre C. P. L. F. de Carvalho},
title = {{The utiml Package: Multi-label Classification in R}},
year = {2018},
journal = {{The R Journal}},
doi = {10.32614/RJ-2018-041},
url = {https://doi.org/10.32614/RJ-2018-041},
pages = {24--37},
volume = {10},
number = {2}
}
|
Adriano Rivolli <rivolli@utfpr.edu.br>
This package is a result of my PhD at Institute of Mathematics and Computer Sciences (ICMC) at the University of Sao Paulo, Brazil.
PhD advisor: Andre C. P. L. F. de Carvalho
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.