knitr::opts_chunk$set( collapse = TRUE, comment = "#>", fig.width = 7, fig.height = 4 )
library(rms) library(rmsMD)
The modelsummary_rms
function is designed to process output from models fitted using the rms package and generate a summarised dataframe of the results. The goal is to produce publication-ready summaries of the models.
For standard use with rms
models that use restricted cubic splines, please refer to the vignette Standard_workflow_with_restricted_cubic_splines.
For these vignettes we will use a simulated dataset to predict the impact of age, BMI, Sex and Smoking status on outcome after surgery. The models are for illustration purposes only.
# Load in the simulated data data <- simulated_rmsMD_data() # Set the datadist which is required for rms modelling # (these two lines are standard) dd <- datadist(data) options(datadist='dd')
Here is a simple example using a linear regression model ("ordinary least squares"; OLS). We use simulated data for the impact of patient factors on length of stay after an operation.
The output dataframe contains the estimated coefficients, their 95% confidence intervals, and the associated p-values. These are in a publication ready format.
# Fit a linear regression model using the rms package: fit_ols <- ols(lengthstay ~ age + bmi + sex + smoking, data = data) # Generate a model summary and display it as a dataframe modelsummary_rms(fit_ols) # rmsMD dataframe as a table knitr::kable(modelsummary_rms(fit_ols))
By default, the function uses the following stylistic settings:
You can modify these defaults to adjust the appearance of the output.
# Generate a model summary with custom styling options # & store as summary_custom summary_custom <- modelsummary_rms(fit_ols, combine_ci = FALSE, round_dp_coef = 2, round_dp_p = 4) # to display the dataframe as a table knitr::kable(summary_custom)
Exponentiating the coefficients of certain models makes the interpretation more intuitive (e.g. as odds ratios in logistic regression and hazard ratios in Cox models).
The modelsummary_rms package does this automatically for the core rms models ols
, lrm
, and cph
. This ensures OR and HR are displayed for logistic regression and Cox regression models respectively.
Below is an example using modelsummary_rms on an rms logistic regression model for postoperative complications. Note this automatically provides OR:
# fitting the model fit_lrm <- lrm(majorcomplication ~ age + bmi + sex + smoking, data = data) # rmsMD summary modelsummary_rms(fit_lrm) # displaying as a table knitr::kable(modelsummary_rms(fit_lrm))
The modelsummary_rms from rmsMD package is also capable of working with non-rms models, such as those fitted using base R functions like lm(). However, in these cases the package does not automatically determine the appropriate value for exp_coef, so it must be set manually.
For example, when using a linear model (where exponentiation of coefficients is not required), you should explicitly set exp_coef = FALSE.
# Fit a simple linear model using lm() from base R # (an example model fit without using rms package) fit_lm <- lm(majorcomplication ~ age + bmi + sex + smoking, data = data) # Generate a model summary for the non-RMS model # by explicitly setting exp_coef = FALSE modelsummary_rms(fit_lm, exp_coef = FALSE) # display rmsMD results as a table knitr::kable(modelsummary_rms(fit_lm, exp_coef = FALSE))
Restricted Cubic Splines (RCS) are a flexible modelling tool used to capture non-linear relationships between predictors and outcomes. In medicine, for the majority of continuous variables (e.g. age, blood pressure, or biomarker levels) the assumption of linearity may not hold. A key highlight of the rms package is the ability to analyse variables using RCS.
The rmsMD package is designed to report and summarise models that include RCS terms. Individual coefficients for RCS terms are difficult to interpret in isolation. Instead, an overall p-value can be generated to assess whether the overall relationship between the RCS variable and outcome is significant. By default modelsummary_rms
removes the individual RCS coefficients, replacing them with the overall p-value for that variable. We recommend that these are then plotted using the ggrmsM
function, shown below.
Display an overall p-value for the spline terms using the rcs_overallp
option.
When this option is set to TRUE
(which is the default), the function computes a single p-value that tests the overall significance of the spline terms for each variable. This overall p-value provides insight into whether the relationship between the predictor and the dependent variable is significant.
Hide the individual spline coefficients using the hide_rcs_coef
option.
Hiding the individual spline coefficients can be advantageous because these lack straightforward clinical interpretation. Instead, the focus is on the overall association captured by all RCS terms for that specific variable. This helps simplify the output. If the variable has a signficant association with outcome, we recommend plotting this relationship.
Here is an example model predicting occurence of complications after surgery (binary), with the continuous variables age and BMI modelled using restricted cubic splines with 4 knots:
# Fit a logistic regression model including a RCS for Age & BMI (with 4 knots) fit_lrm <- lrm( majorcomplication ~ rcs(age, 4) + # Age modelled using RCS with 4 knots rcs(bmi, 4) + # BMI also modelled with RCS with 4 knots sex + # Binary variable for sex smoking, # Categorical variable for smoking status data = data, # set x = TRUE, y = TRUE to allow # subsequent LR tests for lrm() and cph() models x = TRUE, y = TRUE ) # Generate an rmsMD model summary using default settings modelsummary_rms(fit_lrm) # Outputting this as a table knitr::kable(modelsummary_rms(fit_lrm))
Displaying individual RCS coefficients is not the default behaviour. If individual RCS coefficients are required, these can be added in by setting hide_rcs_coef
to FALSE
:
# Generate a model summary with rcs_overallp set to TRUE # and hide_rcs_coef set to TRUE modelsummary_rms(fit_lrm, hide_rcs_coef = FALSE) # Outputting this as a table knitr::kable(modelsummary_rms(fit_lrm, hide_rcs_coef = FALSE))
If overall p-values for the variables modelled with RCS are not wanted, rcs_overallp
can be set to FALSE
:
# Fit an OLS model including a restricted cubic spline for Age (with 4 knots) summary_spline_hide <- modelsummary_rms(fit_lrm, hide_rcs_coef = FALSE, rcs_overallp = FALSE) # Outputting this as a table knitr::kable(summary_spline_hide)
The rms package allows interactions with variables modelled using restricted cubic splines. In this setting, the individual coefficients for RCS terms and their interactions are difficult to interpret. modelsummary_rms
handles this situation by providing overall p-values for RCS variables (which give the overall p-value taking into account all spline terms and all of their interaction terms), and overall p-values for the interactions (takes into account linear and non-linear terms), instead of the individual coefficients. As above, this can be altered by changing rcs_overallp
and hide_rcs_coef
.
# Fit an OLS model with a restricted cubic spline for Age # and an interaction between Age and Exer. fit_lrm_interaction <- lrm(majorcomplication ~ rcs(age,4)*smoking + rcs(bmi,4) + sex + smoking, data = data, x = TRUE, y = TRUE) # Generate a model summary with default RCS output modelsummary_rms(fit_lrm_interaction) # Format the output as a nice table knitr::kable(modelsummary_rms(fit_lrm_interaction))
To get further information, for example for the overall effect of smoking (the variable interacted with the spine for age here), use anova
as below:
anova(fit_lrm_interaction, test = "LR")
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.