R/addRecommendedEfficiency.R
In data.table.threads: Analyze Multi-Threading Performance for 'data.table' Functions

Documented in addRecommendedEfficiency

#' Function that adds recommended efficiency speedup lines and points to benchmarks
#'
#' This function adds to the timing results (or the benchmarked data). It computes the recommended efficiency speedup line and the point which denotes the recommended thread count, both being based on the specified efficiency value.
#'
#' @param benchmarkData A \code{data.table} of class \code{data_table_threads_benchmark} containing benchmarked results, which includes timings and speedup plot data (ideal and measured types) for each function.
#'
#' @param recommendedEfficiency A numeric value between 0 and 1 that defines the slope for the "Recommended" efficiency speedup line. (Default is 0.5)
#'
#' @return The input \code{data.table} with the recommended efficiency added to the plot data (attributes).
#'
#' @details This function allows users to add a "Recommended" efficiency line to previously computed benchmark data (without needing to recompute the timings). The recommended speedup is based on the provided efficiency value, which adjusts the slope of the speedup curve and correspondingly helps in the computation of the closest point of measured speedup to the "Recommended" speedup curve.
#'
#' @seealso \code{\link{findOptimalThreadCount}} for computing the benchmark data with measured and ideal speedup data.
#'
#' @export
#'
#' @examples
#' # Finding the best performing thread count for each benchmarked data.table function
#' # with a data size of 1000 rows and 10 columns:
#' benchmarks <- data.table.threads::findOptimalThreadCount(1e3, 10)
#' # Adding recommended efficiency to the plot data:
#' addRecommendedEfficiency(benchmarks, recommendedEfficiency = 0.6)

addRecommendedEfficiency <- function(benchmarkData, recommendedEfficiency = 0.5) 
{
  if(recommendedEfficiency <= 0 || recommendedEfficiency > 1)
  {
    stop("Recommended efficiency must be between 0 and 1.")
  }

  functions <- unique(benchmarkData$expr)
  systemThreadCount <- max(benchmarkData$threadCount)
  recommendedSpeedup <- seq(1, systemThreadCount * recommendedEfficiency, length.out = systemThreadCount)

  recommendedSpeedupData <- data.table(
    threadCount = seq(1, systemThreadCount, length.out = systemThreadCount),
    speedup = recommendedSpeedup,
    type = "Recommended"
  )

  speedupData <- data.table(
    expr = rep(functions, each = systemThreadCount),
    threadCount = rep(1:systemThreadCount, times = 2 * length(functions)),
    speedup = c(rep(seq(1, systemThreadCount), length(functions)), rep(recommendedSpeedup, length(functions))),
    type = rep(c("Ideal", "Recommended"), each = systemThreadCount * length(functions))
  )

  closestPoints <- benchmarkData[, {
    recommendedSubset <- recommendedSpeedupData[threadCount %in% .SD$threadCount]
    .SD[.SD$speedup >= recommendedSubset$speedup][which.max(speedup)]
  }, by = expr]
  closestPoints[, type := "Recommended"]

  # Using fill = TRUE for missing columns minTime, maxTime, and median in speedupData and maxSpeedup:
  combinedLineData <- rbind(speedupData, attr(benchmarkData, "lineData"), fill = TRUE)
  combinedPointData <- rbind(closestPoints, attr(benchmarkData, "pointData"), fill = TRUE)

  setattr(benchmarkData, "lineData", combinedLineData)
  setattr(benchmarkData, "pointData", combinedPointData)
  benchmarkData
}

Any scripts or data that you put into this service are public.

data.table.threads documentation built on April 3, 2025, 10:08 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

data.table.threads
Analyze Multi-Threading Performance for 'data.table' Functions

R/addRecommendedEfficiency.R
In data.table.threads: Analyze Multi-Threading Performance for 'data.table' Functions

Defines functions addRecommendedEfficiency

Documented in addRecommendedEfficiency

Try the data.table.threads package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

data.table.threads Analyze Multi-Threading Performance for 'data.table' Functions

R/addRecommendedEfficiency.R In data.table.threads: Analyze Multi-Threading Performance for 'data.table' Functions

Defines functions addRecommendedEfficiency

Documented in addRecommendedEfficiency

Try the data.table.threads package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

data.table.threads
Analyze Multi-Threading Performance for 'data.table' Functions

R/addRecommendedEfficiency.R
In data.table.threads: Analyze Multi-Threading Performance for 'data.table' Functions