simpleRank: Implementation Mann-Kendall Test and Theil-Sen Approach

Documented in ts_score ts_slope ts_tau ts_test ts_variance

# TODO
# Unit Test to check if it really doesn't make any difference whether none, some or all ranks
#   are present in contingency table for ts_variance

#' Calculate Sen-Slope
#'
#' @description Calculation of Sen's slope. See Details.
#'
#' @param t A vector of ranks (numeric, or coercible to numeric).
#'   For Time Series the observation dates.
#' @param Y A vector of ranks. For Time Series the values observed at each
#'   \emph{t}.
#'
#' @export
#'
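#' @details \loadmathjax Sen's slope is the median of the pairwise slopes
#'   \mjseqn{x}. For an even number \mjseqn{m} of slopes, sorted in ascending
#'   order, this is:
#'   \mjsdeqn{b = \frac{x_{m/2} + x_{m/2 + 1}}{2}}
#'   For odd \mjseqn{m} it is the middle value \mjseqn{x_{(m+1)/2}}.
#'   The slopes are formed from the \mjseqn{\binom{n}{2}} pairs of
#'   observations, keeping only pairs with \mjseqn{t_{j} > t_{i}}:
#'   \mjsdeqn{x = (Y_{j} - Y_{i})/(t_{j} - t_{i}), 1 \le \, i < \, j \, \le n}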
#'
#' @references Sen, Pranab Kumar (1968): Estimates of the Regression
#'   Coefficient Based on Kendall's Tau. In: Journal of the American
#'   Statistical Association 63 (324), pp. 1379-1389. DOI: 10.2307/2285891
#'
#' @seealso \code{\link{ts_test}}
ts_slope <- function(t = NULL, Y) {
  N <- length(Y)
  t <- check_sequence(t, N)

  comb_t <- utils::combn(t, 2)
  comb_Y <- utils::combn(Y, 2)

  stats::median(
    # Because of the conditions Sen states for t_j and t_i, I cannot simply
    # form the sum here, even though that might be faster.
    purrr::pmap_dbl(
      list(comb_Y[2, ], comb_Y[1, ], comb_t[2, ], comb_t[1, ]),
      function(Yj, Yi, tj, ti) {
        # skip pairs that are tied or out of order in t
        if (tj <= ti) {
          return(NA_real_)
        }
        (Yj - Yi) / (tj - ti)
      }
    ),
    na.rm = TRUE
  )
}
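
# A minimal base-R sketch of the same idea, for illustration only: Sen's slope
# as the median of all pairwise slopes with i < j and t_j > t_i.
# `sketch_sen_slope` is a hypothetical helper, not part of simpleRank. It
# assumes numeric `t` and `Y` of equal length and skips the
# `check_sequence()` validation used above.
sketch_sen_slope <- function(t, Y) {
  stopifnot(length(t) == length(Y))
  dt <- outer(t, t, function(ti, tj) tj - ti) # dt[i, j] = t[j] - t[i]
  dY <- outer(Y, Y, function(Yi, Yj) Yj - Yi) # dY[i, j] = Y[j] - Y[i]
  keep <- upper.tri(dt) & dt > 0              # pairs with i < j and t_j > t_i
  stats::median(dY[keep] / dt[keep])
}
# Usage: sketch_sen_slope(1:5, c(2, 4, 5, 9, 10))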


#' Calculate Test Statistic for Sen-Slope
#'
#' Calculate Test Statistic of Sen's Slope.
#'
#' @param t A vector of ranks (numeric, or coercible to numeric).
#'   For Time Series the observation dates.
#' @param Y A vector of ranks (numeric, or coercible to numeric).
#'   For Time Series the values observed at each \emph{t}.
#' @param slope Sen's slope of \code{Y} on \code{t}, e.g. as returned by
#'   \code{\link{ts_slope}}.
#'
#' @export
#'
#' @seealso \code{\link{ts_test}}
ts_score <- function(t = NULL, Y, slope) {
  N <- length(Y)
  t <- check_sequence(t, N)

  coeff_T <- t * slope

  z <- Y - coeff_T

  result <- sgn(t, z)

  list("coeff" = coeff_T, "score" = result)
}
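
# A hedged sketch of the score, for illustration only. It ASSUMES that the
# package-internal `sgn(t, z)` returns the Kendall-type sum
#   sum_{i < j} sign(t_j - t_i) * sign(z_j - z_i);
# that helper is not shown in this file, so the real implementation may
# differ. `sketch_score` is a hypothetical name, not part of simpleRank.
sketch_score <- function(t, Y, slope) {
  z <- Y - slope * t                             # residuals after removing the trend
  st <- sign(outer(t, t, function(a, b) b - a))  # st[i, j] = sign(t[j] - t[i])
  sz <- sign(outer(z, z, function(a, b) b - a))  # sz[i, j] = sign(z[j] - z[i])
  sum((st * sz)[upper.tri(st)])                  # each pair i < j counted once
}
# Usage: sketch_score(1:4, c(1, 2, 3, 4), slope = 0)
# With slope = 1 the residuals of this series are all tied, so every pair
# contributes zero to the sum.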

#' Calculate Variance of Sen's Test Statistic
#'
#' Calculates the variance of Sen's test statistic.
#'
#' @param n A numeric. Length of the ranking.
#' @param u Contingency table of ranks in one ranking.
#'   Will be coerced to numeric vector.
#'
#' @export
#'
#' @details This is equivalent to the calculation of the
#'   variance of Kendall's test statistic, accounting for
#'   duplicates in only one of the rankings:
#'   \deqn{\sigma^{2}_{S} = \frac{n(n-1)(2n+5) - \sum_{t \in g_{e}}t(t-1)(2t+5)}{18}}
#'   where each \eqn{t} is the size of one group of tied ranks
#'   (the entries of \code{u}).
#'
#' @seealso \code{\link{ts_test}}
ts_variance <- function(n, u = rep(1, length.out = n)) {
  ((n * (n - 1) * (2 * n + 5)) - sum(u * (u - 1) * (2 * u + 5))) / 18
}
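
# Worked check of the tie correction, for illustration only. A group of size 1
# contributes u * (u - 1) * (2 * u + 5) = 0 to the sum, so listing or omitting
# untied ranks in `u` should give the same variance. `sketch_tie_variance` is
# a hypothetical stand-alone copy of the formula, not part of simpleRank.
sketch_tie_variance <- function(n, u = rep(1, n)) {
  u <- as.numeric(u) # also accepts the counts of a table() object
  (n * (n - 1) * (2 * n + 5) - sum(u * (u - 1) * (2 * u + 5))) / 18
}
# n = 5 without ties:       5 * 4 * 15 / 18       = 300 / 18
# n = 5 with one tied pair: (300 - 2 * 1 * 9) / 18 = 282 / 18
# Usage: sketch_tie_variance(5, c(2, 1, 1, 1)); sketch_tie_variance(5, 2)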

#' Calculate Theil-Sen Equivalent of Kendall's Tau
#'
#' @description Sen denotes this as \emph{U}. It's
#'   functionally equivalent to Kendall's Tau,
#'   however their calculations differ.
#'
#' @param t A vector of ranks (numeric, or coercible to numeric).
#'   For Time Series the observation dates.
#' @param n Length of ranking.
#' @param score Sen's Test statistic.
#'
#' @export
#'
#' @seealso \code{\link{ts_test}}
ts_tau <- function(t = NULL, n, score) {
  t <- check_sequence(t, n)

  N <- onesided_sgn(t)

  (1 / sqrt(N * choose(n, 2))) * score
}

#' Test Sen-Slope
#'
#' @description Tests if Sen's slope is significant. It's effectively
#'   a wrapper around all functions in this package starting
#'   with \code{ts_}.
#'
#' @param t A vector of ranks (numeric, or coercible to numeric).
#'   For Time Series the observation dates.
#' @param Y A vector of ranks. For Time Series the values observed at each
#'   \emph{t}.
#'
#' @return A list of class \code{"htest"}, the class used by built-in tests
#'   such as \code{t.test}. It holds the slope, variance and \emph{U}
#'   estimates, the test statistic \emph{Z} and the two-sided p-value.
#'
#' @export
#'
#' @references
#' Sen, Pranab Kumar (1968): Estimates of the Regression Coefficient
#' Based on Kendall's Tau. In: Journal of the American Statistical
#' Association 63 (324), pp. 1379–1389. DOI: 10.2307/2285891.
#'
#' Theil, H. (1950): A rank-invariant method of linear and polynomial
#' regression analysis, 1-2. Confidence regions for the parameters of
#' linear regression equations in two, three and more variables.
#' In: Indagationes Mathematicae XII (SP 5/49/R), 386-392, 521-525.
#'
#' @seealso \code{\link{ts_slope}} \code{\link{ts_variance}}
#'   \code{\link{ts_tau}} \code{\link{ts_score}}
ts_test <- function(t = NULL, Y) {
  name_t <- rlang::enexpr(t) # capture while it is still a promise
  name_Y <- rlang::enexpr(Y)

  N <- length(Y)

  t <- check_sequence(t, N)

  con_table_t <- table(t)

  slope_res <- ts_slope(t, Y)

  score_res <- ts_score(t, Y, slope_res)[["score"]]

  variance_res <- ts_variance(N, con_table_t)

  Z <- mk_statisitc(score_res, variance_res)

  tau <- ts_tau(t, N, score_res)

  return_list <- list(
    null.value = c("S" = 0),
    alternative = "two.sided",
    method = "Kendall's Test for Rank Correlation",
    estimates = c(
      "slope" = slope_res,
      "N" = onesided_sgn(t),
      "variance" = variance_res,
      "U [~Tau]" = tau
    ),
    data.name = paste0(
      "t = ", rlang::expr_deparse(name_t),
      ", Y = ", rlang::expr_deparse(name_Y)
    ),
    statistic = c("Z" = Z),
    parameters = c("n" = N), # the trend package reports n here as well
    # two-sided p-value; abs() keeps it within [0, 1] for negative Z
    p.value = 2 * stats::pnorm(abs(Z), lower.tail = FALSE)
  )

  class(return_list) <- "htest"

  return_list
}
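
# A small sketch of a two-sided p-value, for illustration only. It assumes the
# statistic Z is approximately standard normal under the null hypothesis, so
# the p-value is the probability mass in both tails beyond |Z|.
# `sketch_two_sided_p` is a hypothetical helper, not part of simpleRank.
sketch_two_sided_p <- function(z) {
  2 * stats::pnorm(abs(z), lower.tail = FALSE)
}
# Usage: sketch_two_sided_p(-1.96) and sketch_two_sided_p(1.96) agree, and the
# result always lies between 0 and 1.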

Florian-Katerndahl/simpleRank documentation built on Dec. 17, 2021, 8:28 p.m.

Florian-Katerndahl/simpleRank
Implementation Mann-Kendall Test and Theil-Sen Approach

R/theilsen.R
In Florian-Katerndahl/simpleRank: Implementation Mann-Kendall Test and Theil-Sen Approach

Defines functions ts_test ts_tau ts_variance ts_score ts_slope
