mean_group_tbl: Summarize multiple response variables by group or pattern
In summarytabl: Generate Summary Tables for Categorical, Ordinal, and Continuous Data

mean_group_tbl

R Documentation

Summarize multiple response variables by group or pattern

Description

mean_group_tbl() calculates summary statistics (i.e., mean, standard deviation, minimum, maximum, and count of non-missing values) for continuous (i.e., interval and ratio-level) variables, grouped either by another variable in your dataset or by a matched pattern in the variable names.

Usage

mean_group_tbl(
  data,
  var_stem,
  group,
  var_input = "stem",
  regex_stem = FALSE,
  ignore_stem_case = FALSE,
  group_type = "variable",
  group_name = NULL,
  regex_group = FALSE,
  ignore_group_case = FALSE,
  remove_group_non_alnum = TRUE,
  na_removal = "listwise",
  only = NULL,
  var_labels = NULL,
  ignore = NULL
)

Arguments

`data`	A data frame.
`var_stem`	A character vector with one or more elements, where each represents either a variable stem or the complete name of a variable present in `data`. A variable 'stem' refers to a common naming pattern shared among related variables, typically reflecting repeated measures of the same idea or a group of items assessing a single concept.
`group`	A character string representing a variable name or a pattern used to search for variables in `data`.
`var_input`	A character string specifying whether the values supplied to `var_stem` should be treated as variable stems (`stem`) or as complete variable names (`name`). By default, this is set to `stem`, so the function searches for variables that begin with each stem provided. Setting this argument to `name` directs the function to look for variables that exactly match the provided names.
`regex_stem`	A logical value indicating whether to use Perl-compatible regular expressions when searching for variable stems. Default is `FALSE`.
`ignore_stem_case`	A logical value indicating whether the search for columns matching the supplied `var_stem` is case-insensitive. Default is `FALSE`.
`group_type`	A character string that defines how the `group` argument should be interpreted. Should be one of `pattern` or `variable`. Defaults to `variable`, which searches for a matching variable name in `data`.
`group_name`	An optional character string used to rename the `group` column in the final table When `group_type` is set to `variable`, the column name defaults to the matched variable name from `data`. When set to `pattern`, the default column name is `group`.
`regex_group`	A logical value indicating whether to use Perl-compatible regular expressions when searching for `group` variables or matching variable name patterns. Default is `FALSE`.
`ignore_group_case`	A logical value specifying whether the search for a grouping variable (if `group_type` is `variable`) or for variables matching a pattern (if `group_type` is `pattern`) should be case-insensitive. Default is `FALSE`. Set to `TRUE` to ignore case.
`remove_group_non_alnum`	A logical value indicating whether to remove all non-alphanumeric characters (i.e., anything that is not a letter or number) from `group`. Default is `TRUE`.
`na_removal`	A character string that specifies the method for handling missing values: `pairwise` or `listwise`. Defaults to `listwise`.
`only`	A character string or vector of character strings of the types of summary data to return. Default is `NULL`, which returns both counts and percentages. To return only counts or percentages, use `count` or `percent`, respectively.
`var_labels`	An optional named character vector or list used to assign custom labels to variable names. Each element must be named and correspond to a variable included in the returned table. If `var_input` is set to `stem`, and any element is either unnamed or refers to a variable not present in the table, all labels will be ignored and the table will be printed without them.
`ignore`	An optional named vector or list indicating values to exclude from variables matching specified stems (or names), and, if applicable, from a grouping variable in `data`. Defaults to `NULL`, indicating that all values are retained. To specify exclusions for variables identified by `var_stem`, use the corresponding stems or variable names as names in the vector or list. To exclude multiple values from these variables or a grouping variable, supply them as a named list.

Value

A tibble showing summary statistics for continuous variables, grouped either by a specified variable in the dataset or by matching patterns in variable names.

Author(s)

Ama Nyame-Mensah

Examples

sdoh_child_ages_region <- 
  dplyr::select(sdoh, c(REGION, ACS_PCT_AGE_0_4, ACS_PCT_AGE_5_9,
                        ACS_PCT_AGE_10_14, ACS_PCT_AGE_15_17))

mean_group_tbl(data = sdoh_child_ages_region,
               var_stem = "ACS_PCT_AGE",
               group = "REGION",
               group_name = "us_region",
               na_removal = "pairwise",
               var_labels = c(
                 ACS_PCT_AGE_0_4 = "% of population between ages 0-4",
                 ACS_PCT_AGE_5_9 = "% of population between ages 5-9",
                 ACS_PCT_AGE_10_14 = "% of population between ages 10-14",
                 ACS_PCT_AGE_15_17 = "% of population between ages 15-17"))

set.seed(0222)
grouped_data <-
  data.frame(
    symptoms.t1 = sample(c(0:10, -999), replace = TRUE, size = 50),
    symptoms.t2 = sample(c(NA, 0:10, -999), replace = TRUE, size = 50)
  )

mean_group_tbl(data = grouped_data,
               var_stem = "symptoms",
               group = ".t\\d",
               group_type = "pattern",
               na_removal = "listwise",
               ignore = c(symptoms = -999))

summarytabl documentation built on Nov. 6, 2025, 5:07 p.m.

summarytabl index

Package overview README.md Introduction to summarytabl

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

summarytabl
Generate Summary Tables for Categorical, Ordinal, and Continuous Data

mean_group_tbl: Summarize multiple response variables by group or pattern
In summarytabl: Generate Summary Tables for Categorical, Ordinal, and Continuous Data

Summarize multiple response variables by group or pattern

Description

Usage

Arguments

Value

Author(s)

Examples

Related to mean_group_tbl in summarytabl...

R Package Documentation

Browse R Packages

We want your feedback!

summarytabl Generate Summary Tables for Categorical, Ordinal, and Continuous Data

mean_group_tbl: Summarize multiple response variables by group or pattern In summarytabl: Generate Summary Tables for Categorical, Ordinal, and Continuous Data

Summarize multiple response variables by group or pattern

Description

Usage

Arguments

Value

Author(s)

Examples

Related to mean_group_tbl in summarytabl...

R Package Documentation

Browse R Packages

We want your feedback!

summarytabl
Generate Summary Tables for Categorical, Ordinal, and Continuous Data

mean_group_tbl: Summarize multiple response variables by group or pattern
In summarytabl: Generate Summary Tables for Categorical, Ordinal, and Continuous Data