Model evaluation audit
In auditor: Model Audit - Verification, Validation, and Error Analysis

knitr::opts_chunk$set(warning = FALSE)
knitr::opts_chunk$set(message = FALSE)

In this vignette we present plots for evaluation of classification models.

Data

We work on titanic_imputed dataset form the DALEX package.

library(DALEX)
head(titanic_imputed)

Models

We fit 2 models: glm and randomForest.

model_glm <- glm(survived ~ ., data = titanic_imputed, family = "binomial")

library(randomForest)
model_rf <- randomForest(survived ~ ., data = titanic_imputed)

Preparation for evaluation analysis

The first step is creating an object that can be used to audit a model. It wraps up a model with meta-data. An alternative way is to use explain function from the package DALEX.

library(auditor)
exp_glm <- audit(model_glm, data = titanic_imputed, y = titanic_imputed$survived)
exp_rf <- audit(model_rf, data = titanic_imputed, y = titanic_imputed$survived)

Second step is creating auditor_model_evaluation object that can be further used for validating a model.

eva_glm <- model_evaluation(exp_glm)
eva_rf <- model_evaluation(exp_rf)

Plots

Receiver operating characteristic (ROC)

Receiver operating characteristic (ROC) curve is a tool for visualizing a classifier’s performance. It answers the question of how well the model discriminates between the two classes. The boundary between classes is determined by a threshold value. ROC illustrates the performance of a classification model at various threshold settings.

The diagonal line y = x corresponds to a classifier that randomly guess the positive class half the time. Any model that appears in the lower right part of plot performs worse than random guessing. The closer the curve is to the the left border and top border of plot, the more accurate the classifier is.

plot(eva_glm, eva_rf, type = "roc")
# or
# plot_roc(eva_glm, eva_rf)

LIFT chart

The LIFT chart is a rate of positive prediction (RPP) plotted against true positive (TP) on a threshold t.

The chart illustrates varying performance of the model for different thresholds. A random and ideal models are represented by dashed curves (lower and upper respectively). The closer the LIFT curve gets to the upper dashed curve (ideal model), the better a model is.

plot(eva_glm, eva_rf, type = "lift")
# or
# plot_lift(eva_glm, eva_rf)

Other methods

Other methods and plots are described in vignettes:

Any scripts or data that you put into this service are public.

auditor documentation built on Nov. 2, 2023, 6:13 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

auditor
Model Audit - Verification, Validation, and Error Analysis

Model evaluation audit
In auditor: Model Audit - Verification, Validation, and Error Analysis

Data

Models

Preparation for evaluation analysis

Plots

Receiver operating characteristic (ROC)

LIFT chart

Other methods

Try the auditor package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

auditor Model Audit - Verification, Validation, and Error Analysis

Model evaluation audit In auditor: Model Audit - Verification, Validation, and Error Analysis

Data

Models

Preparation for evaluation analysis

Plots

Receiver operating characteristic (ROC)

LIFT chart

Other methods

Try the auditor package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

auditor
Model Audit - Verification, Validation, and Error Analysis

Model evaluation audit
In auditor: Model Audit - Verification, Validation, and Error Analysis