{colleyRstats}: Functions to Streamline Statistical Analysis and ReportingCreated by Mark Colley
| Status | Usage | Miscellaneous |
|----|----|----|
| |
|
|
|
|
|
|
colleyRstats is a collection of custom R functions that streamline statistical analysis and result reporting. Built upon popular R packages such as ggstatsplot and ARTool, this collection offers a wide array of tools for simplifying reproducible analyses, generating high-quality visualizations, and producing APA-compliant outputs.
The primary goal of this package is to significantly reduce repetitive coding efforts, allowing you to focus on interpreting results. Whether you're dealing with ANOVA assumptions, reporting effect sizes, or creating publication-ready visualizations, colleyRstats makes these tasks easier.
| Type | Command |
|:------------|:-----------------------------------------|
| Release | install.packages("colleyRstats") |
For a short end-to-end workflow covering setup, assumption checks, plotting, and reporting, see the vignette:
vignette("getting-started", package = "colleyRstats")checkAssumptionsForAnovaThis function suite checks normality and homogeneity of variance assumptions for ANOVA models. Takes a vector of factors. For details on assumptions checking, refer to Datanovia.
Example:
checkAssumptionsForAnova(data = main_df, y = "dependent_var", factors = c("factor1", "factor2"))
ggwithinstatsWithPriorNormalityCheckAsterisk and ggbetweenstatsWithPriorNormalityCheckAsteriskThese functions include APA-compliant asterisks (e.g., *** for p < 0.001) on your ggwithinstats or ggbetweenstats plots. They automatically adjust for the appropriate test based on the data's normality.
Note: Avoid using these functions if your data has more than two groups, as geom_signif does not support more than two groups.

generateEffectPlotGenerates a plot that emphasizes either main effects or interaction effects, with clear formatting and options for publication-ready visuals. This function supports customizing group colors, axis labels, and plot size.
Example:
generateEffectPlot(df = main_df, x = "factor1", y = "dependent_var", fillColourGroup = "group", ytext = "Y Label", xtext = "X Label", legendPos = c(0.1, 0.2), shownEffect = "interaction")

reportNPAVGenerates APA-compliant LaTeX output for within-subject designs analyzed using np.anova. The function handles both main and interaction effects. The necessary LaTeX commands are:
\newcommand{\F}[3]{$F({#1},{#2})={#3}$}
\newcommand{\p}{\textit{p=}}
\newcommand{\pminor}{\textit{p$<$}}
Deprecated: reportNPAV() will be removed in colleyRstats 0.1.0 (2025-12-31). Use reportART() with ARTool instead.
Example:
model <- np.anova(tlx_mental ~ factor1 * factor2 + Error(Subject / factor1), data = main_df)
reportNPAV(model, "Dependent Variable")
reportNPAVChiSimilar to reportNPAV, but for between-subject designs. This function generates formatted LaTeX output for non-parametric ANOVA results. LaTeX commands:
\newcommand{\F}[3]{$F({#1},{#2})={#3}$}
\newcommand{\p}{\textit{p=}}
\newcommand{\pminor}{\textit{p$<$}}
reportNparLDReports the model produced by nparLD in APA-compliant format. While still supported, this function has largely been superseded by np.anova for non-parametric variance analyses.
reportMeanAndSDFor each level of an independent variable, this function calculates the mean and standard deviation of a dependent variable and returns them in APA-compliant LaTeX format:
\newcommand{\m}{\textit{M=}}
\newcommand{\sd}{\textit{SD=}}
Example:
reportMeanAndSD(main_df, iv = "factor1", dv = "dependent_var")
reportDunnTest and reportDunnTestTableThis function summarizes the results of FSA::dunnTest objects in text or table form. Both versions output LaTeX-ready results:
\newcommand{\padjminor}{\textit{p$_{adj}<$}}
\newcommand{\padj}{\textit{p$_{adj}$=}}
Example:
d <- dunnTest(dependent_var ~ factor1, data = main_df, method = "holm")
reportDunnTest(main_df, d, iv = "factor1", dv = "dependent_var")
reportARTGenerates LaTeX-formatted results from art models for factorial designs. The necessary LaTeX commands are:
\newcommand{\F}[3]{$F({#1},{#2})={#3}$}
\newcommand{\p}{\textit{p=}}
\newcommand{\pminor}{\textit{p$<$}}
Example:
model <- art(formula = dependent_var ~ factor1 * factor2 + Error(Subject / (factor1 * factor2)), data = main_df) |> anova()
reportART(anova(model), "Dependent Variable")
add_pareto_emoa_columnThis function adds a Pareto front classification column to a dataset, useful in multi-objective optimization scenarios.
Attention: must be done per User - Condition etc group.
Example:
# This would do it over **all** participants and **all** conditions
objectives <- c("objective1", "objective2", "objective3")
main_df <- add_pareto_emoa_column(main_df, objectives)
# This would do it **per** participant and **per** condition combination
# (so far, does not natively support piping ``|>'')
main_df <- main_df |>
group_by(User_ID, ConditionID) |>
mutate(PARETO_EMOA = add_pareto_emoa_column(pick(everything()), objectives = objectives)$PARETO_EMOA) |>
ungroup()
generateMoboPlot and generateMoboPlot2Creates a multi-objective optimization plot, visualizing sampling and optimization phases. This is particularly useful for visualizing iterations in optimization problems.
generateMoboPlot2 is appropriate when using https://github.com/Pascal-Jansen/Bayesian-Optimization-for-Unity/releases starting version 1.1.0.
Example:
generateMoboPlot2(data = main_df, x = "Iteration", y = "objective1", fillColourGroup = "group", ytext = "Y Axis Label")
Example Plot:

remove_outliers_REICalculates the Response Entropy Index (REI) and flags suspicious entries based on their REI percentile. This function is useful for identifying outliers in Likert scale data.
Example:
result <- remove_outliers_REI(main_df, header = TRUE, variables = "var1,var2,var3", range = c(1, 5))
replace_valuesReplaces specified values in a data frame with custom replacements. This can be used to clean or preprocess your data.
Example:
new_df <- replace_values(main_df, to_replace = c("bad_val1", "bad_val2"), replace_with = c("good_val1", "good_val2"))
reportNPAV() formats results from Lüpsen’s nonparametric ANOVA (np.anova) output. Deprecated: reportNPAV() will be removed in colleyRstats 0.1.0 (2025-12-31). Use reportART() with ARTool instead. NPAV is not shipped with this package, and it is loaded manually by the user from Lüpsen’s site: https://www.uni-koeln.de/~luepsen/R/.
This step requires internet access, so it is documented here (not in @examples, which should run offline during package checks).
# Download Lüpsen's NPAV bundle (anova.lib) and load it into a dedicated environment
npav_file <- tempfile(fileext = ".lib")
utils::download.file(
url = "https://www.uni-koeln.de/~luepsen/R/anova.lib",
destfile = npav_file,
mode = "wb",
quiet = TRUE
)
npav_env <- new.env(parent = base::emptyenv())
base::load(npav_file, envir = npav_env)
# Example
set.seed(1)
main_df <- data.frame(
UserID = factor(rep(1:12, each = 8)),
Video = factor(rep(c("V1", "V2"), times = 48)),
gesture = factor(rep(c("g1", "g2"), each = 4, times = 12)),
eHMI = factor(rep(c("off", "on"), each = 2, times = 24)),
tlx_mental = rnorm(96)
)
model <- npav_env$np.anova(
tlx_mental ~ Video * gesture * eHMI + Error(UserID / (gesture * eHMI)),
data = main_df
)
reportNPAV(model, dv = "mental workload")
If download.file() is blocked in your environment, download anova.lib manually from the NPAV page and point npav_file to the local path
For questions or remarks, please contact Mark Colley.
@misc{colley2025rstats,
author = {Mark Colley},
title = {colleyRstats: Functions to Streamline Statistical Analysis and Reporting},
year = {2025},
howpublished = {\url{https://github.com/M-Colley/colleyRstats}},
note = {A collection of custom R functions for streamlining statistical analysis, visualizations, and APA-compliant reporting.},
doi = {10.5281/zenodo.18046754},
url = {https://doi.org/10.5281/zenodo.18046754},
}
I am happy to receive any bug reports, suggestions, questions, and contributions to fix problems and add features.
Please use the GitHub issues system. Pull Requests for contributions
are encouraged.
The following presents some simple ways in which you can contribute (in increasing order of commitment):
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.