add_difference_row.tbl_summary: Add differences rows between groups
In ddsjoberg/gtsummary: Presentation-Ready Data Summary and Analytic Result Tables

add_difference_row.tbl_summary

R Documentation

Add differences rows between groups

Description

Adds difference to tables created by tbl_summary() as additional rows. This function is often useful when there are more than two groups to compare.

Pairwise differences are calculated relative to the specified by variable's specified reference level.

Usage

## S3 method for class 'tbl_summary'
add_difference_row(
  x,
  reference,
  statistic = everything() ~ "{estimate}",
  test = NULL,
  group = NULL,
  header = NULL,
  adj.vars = NULL,
  test.args = NULL,
  conf.level = 0.95,
  include = everything(),
  pvalue_fun = label_style_pvalue(digits = 1),
  estimate_fun = list(c(all_continuous(), all_categorical(FALSE)) ~ label_style_sigfig(),
    all_dichotomous() ~ label_style_sigfig(scale = 100, suffix = "%"), all_tests("smd")
    ~ label_style_sigfig()),
  ...
)

Arguments

`x`	(`tbl_summary`) table created with `tbl_summary()`
`reference`	(scalar) Value of the `tbl_summary(by)` variable value that is the reference for each of the difference calculations. For factors, use the character level.
`statistic`	(`formula-list-selector`) Specifies summary statistics to display for each variable. The default is `everything() ~ "{estimate}"`. The statistics available to include will depend on the method specified in the `test` argument, but are generally `"estimate"`, `"std.error"`, `"parameter"`, `"statistic"`, `"conf.low"`, `"conf.high"`, `"p.value"`.
`test`	(`formula-list-selector`) Specifies the tests/methods to perform for each variable, e.g. `list(all_continuous() ~ "t.test", all_dichotomous() ~ "prop.test", all_categorical(FALSE) ~ "smd")`. See below for details on default tests and ?tests for details on available tests and creating custom tests.
`group`	(`tidy-select`) Variable name of an ID or grouping variable. The column can be used to calculate p-values with correlated data. Default is `NULL`. See tests for methods that utilize the `group` argument.
`header`	(`string`) When supplied, a header row will appear above the difference statistics.
`adj.vars`	(`tidy-select`) Variables to include in adjusted calculations (e.g. in ANCOVA models). Default is `NULL`.
`test.args`	(`formula-list-selector`) Containing additional arguments to pass to tests that accept arguments. For example, add an argument for all t-tests, use `test.args = all_tests("t.test") ~ list(var.equal = TRUE)`.
`conf.level`	(`numeric`) a scalar in the interval `⁠(0, 1)⁠` indicating the confidence level. Default is 0.95
`include`	(`tidy-select`) Variables to include in output. Default is `everything()`.
`pvalue_fun`	(`function`) Function to round and format p-values. Default is `label_style_pvalue()`. The function must have a numeric vector input, and return a string that is the rounded/formatted p-value (e.g. `pvalue_fun = label_style_pvalue(digits = 2)`).
`estimate_fun`	(`formula-list-selector`) List of formulas specifying the functions to round and format differences and confidence limits.
`...`	These dots are for future extensions and must be empty.

Details

The default labels for the statistic rows will often not be what you need to display. In cases like this, use modify_table_body() to directly update the label rows. Use show_header_names() to print the underlying column names to identify the columns to target when changing the label, which in this case will always be the 'label' column. See Example 2.

Value

a gtsummary table of class "tbl_summary"

Examples


# Example 1 ----------------------------------
trial |>
  tbl_summary(
    by = grade,
    include = c(age, response),
    missing = "no",
    statistic = all_continuous() ~ "{mean} ({sd})"
  ) |>
  add_stat_label() |>
  add_difference_row(
    reference = "I",
    statistic = everything() ~ c("{estimate}", "{conf.low}, {conf.high}", "{p.value}")
  )

# Example 2 ----------------------------------
# Function to build age-adjusted logistic regression and put results in ARD format
ard_odds_ratio <- \(data, variable, by, ...) {
  cardx::construct_model(
    data = data,
    formula = reformulate(response = variable, termlabels = c(by, "age")), # adjusting model for age
    method = "glm",
    method.args = list(family = binomial)
  ) |>
    cardx::ard_regression_basic(exponentiate = TRUE) |>
    dplyr::filter(.data$variable == .env$by)
}

trial |>
  tbl_summary(by = trt, include = response, missing = "no") |>
  add_stat_label() |>
  add_difference_row(
    reference = "Drug A",
    statistic = everything() ~ c("{estimate}", "{conf.low}, {conf.high}", "{p.value}"),
    test = everything() ~ ard_odds_ratio,
    estimate_fun = everything() ~ label_style_ratio()
  ) |>
  # change the default label for the 'Odds Ratio'
  modify_table_body(
    ~ .x |>
      dplyr::mutate(
        label = ifelse(label == "Coefficient", "Odds Ratio", label)
      )
  ) |>
  # add footnote about logistic regression
  modify_footnote_body(
    footnote = "Age-adjusted logistic regression model",
    column = "label",
    rows = variable == "response-row_difference"
  )

ddsjoberg/gtsummary documentation built on June 11, 2025, 10:29 p.m.