R/calc.perf.R
In callsync: Recording Synchronisation, Call Detection and Assignment, Audio Analysis

Documented in calc.perf

#' @title calc.perf
#'
#' @description Calculates the performance of the detections. Detections are
#' true positive if they overlap to any extend with a ground truth selection.
#'
#' @param d data frame, detection selection table with start = start time in
#' seconds, end = end time in  seconds and file = file name
#' @param gt data frame, ground truth selection table with start = start time
#' in seconds, end = end time in seconds and file = file name
#'
#' @return Returns a named list with tp = the row numbers (in d) for the true
#' positives, fp = the row numbers (in d) for the false positives, fn = the row
#' numbers (in gt) for the false negatives, fp_rate = `length(fp)/nrow(d)`,
#' tp_rate = `length(tp)/nrow(gt)`, fn_rate = `length(fn)/nrow(gt)`.
#'
#' @examples
#' require(callsync)
#' require(seewave)
#' require(tuneR)
#' path_git = 'https://raw.githubusercontent.com'
#' path_repo = '/simeonqs/callsync/master/tests/testthat/files'
#' file_1 = '/audacity/chunk_15_ground_truth.txt'
#' url_1 = paste0(path_git, path_repo, file_1)
#' local_dir = paste(tempdir(), 'audacity', sep = '/')
#' local_file_1 = paste(tempdir(), file_1, sep = '/')
#' if(!dir.exists(local_dir)) dir.create(local_dir)
#' if(!file.exists(local_file_1))
#'   download.file(url_1, destfile = local_file_1, mode = 'wb',)
#' d = load.selection.tables.audacity(path_selection_tables = local_dir)
#' result = calc.perf(d, d)
#'
#' @export

calc.perf = function(d, gt){

  # Test and try to fix start and end columns
  ## if missing both types
  if(is.null(d$start) & is.null(d$Begin.time..s.))
    stop('No start column found in d.')
  if(is.null(d$end) & is.null(d$End.time..s.))
    stop('No end column found in d.')
  if(is.null(gt$start) & is.null(gt$Begin.Time..s.))
    stop('No start column found in gt.')
  if(is.null(gt$end) & is.null(gt$End.Time..s.))
    stop('No end column found in gt.')
  ## if missing end or start add from Raven format
  if(is.null(d$start) & !is.null(d$Begin.time..s.))
    d$start = d$Begin.time..s.
  if(is.null(d$end) & !is.null(d$End.time..s.))
    d$end = d$End.time..s.
  if(is.null(gt$start) & !is.null(gt$Begin.Time..s.))
    gt$start = gt$Begin.Time..s.
  if(is.null(gt$end) & !is.null(gt$End.Time..s.))
    gt$end = gt$End.Time..s.

  # Find the tp and fp
  tp = fp = c()
  for(i in seq_len(nrow(d))){
    sub = gt[which(gt$file == d$file[i]),]
    if(nrow(sub) == 0) {fp = c(fp, i); next}
    # keep if the start of the detection falls within a ground truth (after
    # start and before end)
    keep_start = sapply(1:nrow(sub), function(j)
      d$start[i] > sub$start[j] & d$start[i] < sub$end[j])
    # or if the end falls within detections -> either case there is some
    # overlap
    keep_end = sapply(1:nrow(sub), function(j)
      d$end[i] > sub$start[j] & d$end[i] < sub$end[j])
    # or if start and end fall around -> also overlap
    keep_around = sapply(1:nrow(sub), function(j)
      d$start[i] <= sub$start[j] & d$end[i] >= sub$end[j])
    keep = keep_start | keep_end | keep_around
    if(any(keep)) tp = c(tp, i) else fp = c(fp, i)
  }

  # Find the fn
  fn = c()
  for(i in seq_len(nrow(gt))){
    # fn is no detections for that file
    sub = d[which(d$file == gt$file[i]),]
    if(nrow(sub) == 0) {fn = c(fn, i); next}
    # keep if the start of the detection falls within a ground truth (after
    # start and before end)
    keep_start = sapply(1:nrow(sub), function(j)
      gt$start[i] > sub$start[j] & gt$start[i] < sub$end[j])
    # or if the end falls within detections -> either case there is some
    # overlap
    keep_end = sapply(1:nrow(sub), function(j)
      gt$end[i] > sub$start[j] & gt$end[i] < sub$end[j])
    # or if start and end fall around -> also overlap
    keep_around = sapply(1:nrow(sub), function(j)
      gt$start[i] <= sub$start[j] & gt$end[i] >= sub$end[j])
    keep = keep_start | keep_end | keep_around
    if(!any(keep)) fn = c(fn, i)
  }

  # Calculate rates
  fp_rate = length(fp)/nrow(d)
  tp_rate = length(tp)/nrow(gt)
  fn_rate = length(fn)/nrow(gt)

  # Return
  return(list(tp = tp,
              fp = fp,
              fn = fn,
              fp_rate = fp_rate,
              tp_rate = tp_rate,
              fn_rate = fn_rate))

}

Any scripts or data that you put into this service are public.

callsync documentation built on May 29, 2024, 5:30 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

callsync
Recording Synchronisation, Call Detection and Assignment, Audio Analysis

R/calc.perf.R
In callsync: Recording Synchronisation, Call Detection and Assignment, Audio Analysis

Defines functions calc.perf

Documented in calc.perf

Try the callsync package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

callsync Recording Synchronisation, Call Detection and Assignment, Audio Analysis

R/calc.perf.R In callsync: Recording Synchronisation, Call Detection and Assignment, Audio Analysis

Defines functions calc.perf

Documented in calc.perf

Try the callsync package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

callsync
Recording Synchronisation, Call Detection and Assignment, Audio Analysis

R/calc.perf.R
In callsync: Recording Synchronisation, Call Detection and Assignment, Audio Analysis