summarize_data: Summarize Data by Groups
In Kifidi: Summary Table and Means Plots

View source: R/summarize_data.R

summarize_data

R Documentation

Summarize Data by Groups

Description

This function summarizes the provided data column by one or two grouping variables. It calculates the mean, standard deviation, sample size, minimum, maximum, median, and standard error.

Usage

summarize_data(column_data, group_var1, group_var2 = NULL)

Arguments

`column_data`	A numeric vector containing the data to be summarized.
`group_var1`	A factor or vector to group the data by.
`group_var2`	An optional second factor or vector to group the data by.

Details

If only one grouping variable is provided, the function will summarize the data by that variable. If two grouping variables are provided, it will summarize the data by both variables.

Value

A data frame with the following columns:

Group1: The first grouping variable (from group_var1).
Group2: The second grouping variable (from group_var2), if provided.
Mean: The mean of the column_data for each group.
SD: The standard deviation of the column_data for each group.
N: The sample size for each group.
Min: The minimum value of the column_data for each group.
Max: The maximum value of the column_data for each group.
Median: The median value of the column_data for each group.
SE: The standard error of the mean for each group.

Output

A data frame with the above columns.

Note

The grouping variables and the data column can be of different lengths.

Author(s)

Oswald Omuron

References

No references available.

Examples

  # Example data
  example_data <- c(
    445, 372, 284, 247, 328, 98.8, 108.7, 100.8, 123.6, 129.9, 133.3,
    130.1, 123.1, 186.6, 215, 19.4, 19.3, 27.8, 26, 22, 30.9, 19.8,
    16.5, 20.2, 31, 21.1, 16.5, 19.7, 18.9, 27, 161.8, 117, 94.6, 97.5,
    142.7, 109.9, 118.3, 111.4, 96.5, 109, 114.1, 114.9, 101.2, 112.7,
    111.1, 194.8, 169.9, 159.1, 100.8, 130.8, 93.6, 105.7, 178.4, 203,
    172.2, 127.3, 128.3, 110.9, 124.1, 179.1, 293, 197.5, 139.1, 98.1,
    84.6, 81.4, 87.2, 71.1, 70.3, 120.4, 194.5, 167.5, 121, 86.5, 81.7
  )

  example_group1 <- c(
    rep("Palm", 15), rep("Papyrus", 10), rep("Typha", 15),
    rep("Eucalyptus", 15), rep("Rice farm", 20)
  )

  example_group2 <- rep(c(50, 40, 30, 20, 10), 15)

  # Create dataframe
  example_df <- data.frame(
    Vegetation_types = example_group1,
    Depth_revised = example_group2,
    EC_uS_cm = example_data
  )

  # Summarize by one grouping variable
  summary_one_group <- summarize_data(
    example_df$EC_uS_cm,
    example_df$Vegetation_types
  )
  print(summary_one_group)

  # Summarize by two grouping variables
  summary_two_groups <- summarize_data(
    example_df$EC_uS_cm,
    example_df$Vegetation_types,
    example_df$Depth_revised
  )
  print(summary_two_groups)

Kifidi documentation built on Oct. 11, 2024, 9:08 a.m.