cutpointr: Bootstrapping"
In cutpointr: Determine and Evaluate Optimal Cutpoints in Binary Classification Tasks

knitr::opts_chunk$set(fig.width = 6, fig.height = 5, fig.align = "center")
options(rmarkdown.html_vignette.check_title = FALSE)

Bootstrapping is implemented in cutpointr with two goals:

Determine optimal cutpoints with bootstrapping (as an alternative to determining them without bootstrapping)
Validate (any) cutpoint optimization with bootstrapping

This vignette will briefly go through some examples for both approaches.

Determine optimal cutpoints

Without bootstrapping: `maximize_metric`

As a first basic example, the cutpoint optimization will be demonstrated without any bootstrapping by maximizing the Youden-Index. Using the method maximize_metric, this is performed on the full data set:

library(cutpointr)
data(suicide)
opt_cut <- cutpointr(
    data = suicide,
    x = dsi,
    class = suicide,
    method = maximize_metric,
    metric = youden,
    pos_class = "yes",
    direction = ">="
)
summary(opt_cut)

The fields in the resulting R object opt_cut are to be interpreted as follows:

$optimal_cutpoint: The optimal cutpoint determined by maximizing the Youden-Index on the full suicide dataset.
$sensitivity: The sensitivity when applying the cutpoint to the full dataset.
$specificity: The specificity when applying the cutpoint to the full dataset.
$youden: The maximal Youden-Index (= sensitivity + specificity - 1), determined by the optimization.

Bootstrap cutpoints: `maximize_boot_metric`

The determination of the optimal cutpoint can also be performed using bootstrapping. Therefore, the methods maximize_boot_metric/minimize_boot_metric need to be chosen. These functions provide further arguments that can be used to configure the bootstrapping. These arguments can be viewed with help("maximize_boot_metric", "cutpointr"). The most important arguments are:

boot_cut: The number of bootstrapping repetitions.
boot_stratify: If the bootstrap samples are drawn in both classes separately before combining them, keep the number of positives/negatives constant in every sample.
summary_func: The summary function to aggregate the optimal cutpoints from the bootstrapping to arrive at one final optimal cutpoint.

The cutpoint is optimized in n=boot_cut bootstrap samples by maximizing/ minimizing the respective metric (e.g., the Youden-index in this example) in each of these bootstrap samples. Finally, the summary function is applied to aggregate the optimal cutpoints from the n=boot_cut bootstrap samples into one final 'optimal' cutpoint.

set.seed(123)
opt_cut <- cutpointr(
    data = suicide,
    x = dsi,
    class = suicide,
    method = maximize_boot_metric,
    boot_cut = 200,
    summary_func = mean,
    metric = youden,
    pos_class = "yes",
    direction = ">="
)
summary(opt_cut)

The fields in the resulting R object opt_cut are to be interpreted as follows:

$optimal_cutpoint: The optimal cutpoint, which is the aggregated value (as defined with summary_func) over all n=boot_cut bootstrap samples. Please note that no uncertainty measure (standard deviation, 95%-CI, etc.) is available here (a bootstrap distribution of these cutpoints can be generated using outer bootstrapping with boot_runs > 0 and maximize_metric, as explained below).
$sensitivity: The sensitivity when applying the optimal cutpoint to the full dataset.
$specificity: The specificity when applying the optimal cutpoint to the full dataset.
$youden: The Youden-Index when applying the optimal cutpoint to the full dataset.

Validate cutpoint optimization with bootstrapping

Any chosen methods to find the optimal cutpoints can be subsequently validated with bootstrapping. This can easily be activated by setting the argument boot_runs > 0. Please be aware that the first steps to calculate the optimal cutpoints with the specified method (as described above) will be performed in the very same manner as above, resulting in the same outputs as above (depending on the seed when bootstrapping cutpoints).

However, the method to calculate the optimal cutpoints will then additionally be performed on n=boot_runs bootstrap samples. For each of these bootstrap samples, several metrics and performance measures are available from the resulting $boot object, both for the in-bag (suffix: '_b') and the out-of-bag (suffix: '_oob') bootstrap samples. Please note that the optimal cutpoint is determined on the in-bag samples only and then just applied to the out-of-bag samples for validation purposes, so its value is available only once in the $boot object without a suffix.

`maximize_metric`

opt_cut <- cutpointr(
    data = suicide,
    x = dsi,
    class = suicide,
    method = maximize_metric,
    metric = youden,
    pos_class = "yes",
    direction = ">=",
    boot_runs = 100
)

The interpretation of fields in the resulting R object opt_cut is the same as above. The results from the bootstrapping are available from $boot.

summary(opt_cut)
opt_cut$boot[[1]] |> 
  head()

`maximize_boot_metric`

When bootstrapping cutpoints and also using the validation with bootstrapping, the optimal cutpoint will again first be determined as above in n=boot_cut bootstrap samples by maximizing/ minimizing the respective metric in each of these bootstrap samples and then by applying the summary function to aggregate the optimal cutpoints from the n=boot_cut bootstrap samples into one final 'optimal' cutpoint. Hence, using the same seeds here results in the same outputs as above, where no outer bootstrapping is applied.

In the validation routine, the chosen cutpoint optimization is then repeated in each of the n=boot_runs (outer) bootstrap samples: the optimal cutpoint is determined in each bootstrap sample by optimizing the metric on n=boot_cut (inner) bootstrap samples and applying the summary_func to aggregate them into one value.

Since the (inner) bootstrapping of optimal cutpoints is performed in each of the (outer) validation bootstrap samples, this can be computational very expensive and take some time to finish. Therefore, parallelization is implemented in cutpointr by just setting its argument allowParallel = TRUE and initializing a parallel environment.

library(doParallel)
library(doRNG)
cl <- makeCluster(2) # 2 cores
registerDoParallel(cl)
registerDoRNG(12)
set.seed(123)
opt_cut <- cutpointr(
    data = suicide,
    x = dsi,
    class = suicide,
    method = maximize_boot_metric,
    boot_cut = 200,
    summary_func = mean,
    metric = youden,
    pos_class = "yes",
    direction = ">=",
    boot_runs = 100,
    allowParallel = TRUE
)
stopCluster(cl)

Again, the interpretation of fields in the resulting R object opt_cut is the same as above. The results from the bootstrapping are available from $boot.

summary(opt_cut)
opt_cut$boot[[1]] |> 
  head()

Some visualizations of the bootstrapping results are available with the plot function:

plot(opt_cut)

The two plots in the lower half can be generated separately with plot_cut_boot(opt_cut) and plot_metric_boot(opt_cut).

Any scripts or data that you put into this service are public.

cutpointr documentation built on June 13, 2025, 5:09 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

cutpointr
Determine and Evaluate Optimal Cutpoints in Binary Classification Tasks

cutpointr: Bootstrapping"
In cutpointr: Determine and Evaluate Optimal Cutpoints in Binary Classification Tasks

Determine optimal cutpoints

Without bootstrapping: `maximize_metric`

Bootstrap cutpoints: `maximize_boot_metric`

Validate cutpoint optimization with bootstrapping

`maximize_metric`

`maximize_boot_metric`

Try the cutpointr package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

cutpointr Determine and Evaluate Optimal Cutpoints in Binary Classification Tasks

cutpointr: Bootstrapping" In cutpointr: Determine and Evaluate Optimal Cutpoints in Binary Classification Tasks

Determine optimal cutpoints

Without bootstrapping: maximize_metric

Bootstrap cutpoints: maximize_boot_metric

Validate cutpoint optimization with bootstrapping

maximize_metric

maximize_boot_metric

Try the cutpointr package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

cutpointr
Determine and Evaluate Optimal Cutpoints in Binary Classification Tasks

cutpointr: Bootstrapping"
In cutpointr: Determine and Evaluate Optimal Cutpoints in Binary Classification Tasks

Without bootstrapping: `maximize_metric`

Bootstrap cutpoints: `maximize_boot_metric`

`maximize_metric`

`maximize_boot_metric`