evalmod: Evaluate models and calculate performance evaluation measures
In precrec: Calculate Accurate Precision-Recall and ROC (Receiver Operator Characteristics) Curves

evalmod

R Documentation

Evaluate models and calculate performance evaluation measures

Description

The evalmod function calculates ROC and Precision-Recall curves for specified prediction scores and binary labels. It also calculate several basic performance evaluation measures, such as accuracy, error rate, and precision, by specifying mode as "basic".

Usage

evalmod(
  mdat,
  mode = NULL,
  scores = NULL,
  labels = NULL,
  modnames = NULL,
  dsids = NULL,
  posclass = NULL,
  na_worst = TRUE,
  ties_method = "equiv",
  calc_avg = TRUE,
  cb_alpha = 0.05,
  raw_curves = FALSE,
  x_bins = 1000,
  interpolate = TRUE,
  ...
)

Arguments

`mdat`	An `S3` object created by the `mmdata` function. It contains formatted scores and labels. The `evalmod` function ignores the following arguments when `mdat` is specified. `scores` `labels` `modnames` `dsids` `posclass` `na_worst` `ties_method` These arguments are internally passed to the `mmdata` function when `mdat` is unspecified. In that case, both `scores` and `labels` must be at least specified.
`mode`	A string that specifies the types of evaluation measures that the `evalmod` function calculates. "rocprc" ROC and Precision-Recall curves "prcroc" Same as above "basic" Normalized ranks vs. accuracy, error rate, specificity, sensitivity, precision, Matthews correlation coefficient, and F-score. "aucroc" Fast AUC(ROC) calculation with the U statistic
`scores`	A numeric dataset of predicted scores. It can be a vector, a matrix, an array, a data frame, or a list. The `join_scores` function can be useful to make scores with multiple datasets.
`labels`	A numeric, character, logical, or factor dataset of observed labels. It can be a vector, a matrix, an array, a data frame, or a list. The `join_labels` function can be useful to make labels with multiple datasets.
`modnames`	A character vector for the names of the models. The `evalmod` function automatically generates default names as "m1", "m2", "m3", and so on when it is `NULL`.
`dsids`	A numeric vector for test dataset IDs. The `evalmod` function automatically generates the default ID as `1` when it is `NULL`.
`posclass`	A scalar value to specify the label of positives in `labels`. It must be the same data type as `labels`. For example, `posclass = -1` changes the positive label from `1` to `-1` when `labels` contains `1` and `-1`. The positive label will be automatically detected when `posclass` is `NULL`.
`na_worst`	A Boolean value for controlling the treatment of NAs in `scores`. TRUE All NAs are treated as the worst scores FALSE All NAs are treated as the best scores
`ties_method`	A string for controlling ties in `scores`. "equiv" Ties are equivalently ranked "first" Ties are ranked in an increasing order as appeared "random" Ties are ranked in random order
`calc_avg`	A logical value to specify whether average curves should be calculated. It is effective only when `dsids` contains multiple dataset IDs. For instance, the function calculates the average for the model "m1" when `modnames` is `c("m1", "m1", "m1")` and `dsids` is `c(1, 2, 3)`. The calculation points are defined by `x_bins`.
`cb_alpha`	A numeric value with range [0, 1] to specify the alpha value of the point-wise confidence bounds calculation. It is effective only when `calc_avg` is set to `TRUE`. For example, it should be `0.05` for the 95% confidence level. The calculation points are defined by `x_bins`.
`raw_curves`	A logical value to specify whether all raw curves should be discarded after the average curves are calculated. It is effective only when `calc_avg` is set to `TRUE`.
`x_bins`	An integer value to specify the number of minimum bins on the x-axis. It is then used to define supporting points For instance, the x-values of the supporting points will be `c(0, 0.5, 1)` and `c(0, 0.25, 0.5, 0.75, 1)` when `x_bins = 2` and `x_bins = 4`, respectively. All corresponding y-values of the supporting points are calculated. `x_bins` is effective only when `mode` is set to `rocprc` or `prcroc`.
`interpolate`	A Boolean value to specify whether or not interpolation of ROC and precision-recall curves are performed. `x_bins` and `calc_avg` are ignored and when `x_bins` is set to `FALSE`. `interpolate` is effective only when `mode` is set to `rocprc` or `prcroc`.
`...`	These additional arguments are passed to `mmdata` for data preparation.

Value

The evalmod function returns an S3 object that contains performance evaluation measures. The number of models and the number of datasets can be controlled by modnames and dsids. For example, the number of models is "single" and the number of test datasets is "multiple" when modnames = c("m1", "m1", "m1") and dsids = c(1, 2, 3) are specified.

Different S3 objects have different default behaviors of S3 generics, such as plot, autoplot, and fortify.

The evalmod function returns one of the following S3 objects when mode is "prcroc". The objects contain ROC and Precision-Recall curves.

S3 object # of models # of test datasets

sscurves single single

mscurves multiple single

smcurves single multiple

mmcurves multiple multiple
The evalmod function returns one of the following S3 objects when mode is "basic". They contain five different basic evaluation measures; error rate, accuracy, specificity, sensitivity, and precision.

S3 object # of models # of test datasets

sspoints single single

mspoints multiple single

smpoints single multiple

mmpoints multiple multiple
The evalmod function returns the aucroc S3 object when mode is "aucroc", which can be used with 'print' and 'as.data.frame'.

Examples


##################################################
### Single model & single test dataset
###

## Load a dataset with 10 positives and 10 negatives
data(P10N10)

## Generate an sscurve object that contains ROC and Precision-Recall curves
sscurves <- evalmod(scores = P10N10$scores, labels = P10N10$labels)
sscurves

## Generate an sspoints object that contains basic evaluation measures
sspoints <- evalmod(
  mode = "basic", scores = P10N10$scores,
  labels = P10N10$labels
)
sspoints


##################################################
### Multiple models & single test dataset
###

## Create sample datasets with 100 positives and 100 negatives
samps <- create_sim_samples(1, 100, 100, "all")
mdat <- mmdata(samps[["scores"]], samps[["labels"]],
  modnames = samps[["modnames"]]
)

## Generate an mscurve object that contains ROC and Precision-Recall curves
mscurves <- evalmod(mdat)
mscurves

## Generate an mspoints object that contains basic evaluation measures
mspoints <- evalmod(mdat, mode = "basic")
mspoints


##################################################
### Single model & multiple test datasets
###

## Create sample datasets with 100 positives and 100 negatives
samps <- create_sim_samples(4, 100, 100, "good_er")
mdat <- mmdata(samps[["scores"]], samps[["labels"]],
  modnames = samps[["modnames"]],
  dsids = samps[["dsids"]]
)

## Generate an smcurve object that contains ROC and Precision-Recall curves
smcurves <- evalmod(mdat)
smcurves

## Generate an smpoints object that contains basic evaluation measures
smpoints <- evalmod(mdat, mode = "basic")
smpoints


##################################################
### Multiple models & multiple test datasets
###

## Create sample datasets with 100 positives and 100 negatives
samps <- create_sim_samples(4, 100, 100, "all")
mdat <- mmdata(samps[["scores"]], samps[["labels"]],
  modnames = samps[["modnames"]],
  dsids = samps[["dsids"]]
)

## Generate an mmcurve object that contains ROC and Precision-Recall curves
mmcurves <- evalmod(mdat)
mmcurves

## Generate an mmpoints object that contains basic evaluation measures
mmpoints <- evalmod(mdat, mode = "basic")
mmpoints


##################################################
### N-fold cross validation datasets
###

## Load test data
data(M2N50F5)

## Speficy nessesary columns to create mdat
cvdat <- mmdata(
  nfold_df = M2N50F5, score_cols = c(1, 2),
  lab_col = 3, fold_col = 4,
  modnames = c("m1", "m2"), dsids = 1:5
)

## Generate an mmcurve object that contains ROC and Precision-Recall curves
cvcurves <- evalmod(cvdat)
cvcurves

## Generate an mmpoints object that contains basic evaluation measures
cvpoints <- evalmod(cvdat, mode = "basic")
cvpoints

## Specify mmdata arguments from evalmod
cvcurves2 <- evalmod(
  nfold_df = M2N50F5, score_cols = c(1, 2),
  lab_col = 3, fold_col = 4,
  modnames = c("m1", "m2"), dsids = 1:5
)
cvcurves2


##################################################
### AUC with the U statistic
###

## mode = "aucroc" returns 'aucroc' S3 object
data(P10N10)

# 'aucroc' S3 object
uauc1 <- evalmod(
  scores = P10N10$scores, labels = P10N10$labels,
  mode = "aucroc"
)

# print 'aucroc'
uauc1

# as.data.frame 'aucroc'
as.data.frame(uauc1)

## It is 2-3 times faster than mode = "rocprc"
# A sample of 100,000
samp1 <- create_sim_samples(1, 50000, 50000)

# a function to test mode = "rocprc"
func_evalmod_rocprc <- function(samp) {
  curves <- evalmod(scores = samp$scores, labels = samp$labels)
  aucs <- auc(curves)
}

# a function to test mode = "aucroc"
func_evalmod_aucroc <- function(samp) {
  uaucs <- evalmod(
    scores = samp$scores, labels = samp$labels,
    mode = "aucroc"
  )
  as.data.frame(uaucs)
}

# Process time
system.time(res1 <- func_evalmod_rocprc(samp1))
system.time(res2 <- func_evalmod_aucroc(samp1))

# AUCs
res1
res2

precrec documentation built on Oct. 12, 2023, 1:06 a.m.

precrec index

README.md Introduction to precrec

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

precrec
Calculate Accurate Precision-Recall and ROC (Receiver Operator Characteristics) Curves

evalmod: Evaluate models and calculate performance evaluation measures
In precrec: Calculate Accurate Precision-Recall and ROC (Receiver Operator Characteristics) Curves

Evaluate models and calculate performance evaluation measures

Description

Usage

Arguments

Value

See Also

Examples

Related to evalmod in precrec...

R Package Documentation

Browse R Packages

We want your feedback!

`S3` object	# of models	# of test datasets
sscurves	single	single
mscurves	multiple	single
smcurves	single	multiple
mmcurves	multiple	multiple

precrec Calculate Accurate Precision-Recall and ROC (Receiver Operator Characteristics) Curves

evalmod: Evaluate models and calculate performance evaluation measures In precrec: Calculate Accurate Precision-Recall and ROC (Receiver Operator Characteristics) Curves

Evaluate models and calculate performance evaluation measures

Description

Usage

Arguments

Value

See Also

Examples

Related to evalmod in precrec...

R Package Documentation

Browse R Packages

We want your feedback!

precrec
Calculate Accurate Precision-Recall and ROC (Receiver Operator Characteristics) Curves

evalmod: Evaluate models and calculate performance evaluation measures
In precrec: Calculate Accurate Precision-Recall and ROC (Receiver Operator Characteristics) Curves