In thomashielscher/nonnestcox: Comparing nonnested CoxPH models

knitr::opts_chunk$set(collapse = TRUE, warning = FALSE, message = FALSE, echo=TRUE, comment = "#>")

Introduction

This package provides an implementation of Fine's test for comparing non-nested Cox regression models for survival data (Fine, 2002, Biometrika). This test builds on Voung's methodology of comparing non-nested models (1989, Econometrica). The alternative hypothesis tested is whether one of two competing models is closer to the unknown true model. Closeness is defined by the Kullback-Leibler distance.

The package also includes the necessary test on distinguishability of competing models using the individual linear predictors as proposed by Fine (2002). The alternative likelihood-ratio test for nested models as proposed by Fine (2002) is implemented, too.

Download

devtools::install_github("thomashielscher/nonnestcox")

Example

Load the package and the necessary survival package. We use the same data set and candidate models as in Fine (2002) section 5, consisting of 312 patients with primary biliary cirrhosis disease, with overall survival as endpoint. Model 1 includes age as only predictor, model 2 and 3 include age, albumin, bilirubin, oedema and prothrombin time as predictors, with the difference that albumin, bilirubin and prothrombin time were log-transformed in model 3. Thus, model 2 and 3 have the same dimension.

library(nonnestcox)
library(survival)

pbc  <- subset(pbc, !is.na(trt))
mod1 <- coxph(Surv(time, status==2) ~ age, data=pbc, x=T)
mod2 <- coxph(Surv(time, status==2) ~ age + albumin + bili + edema + protime, data=pbc,  x=T)
mod3 <- coxph(Surv(time, status==2) ~ age + log(albumin) + log(bili) + edema + log(protime), data=pbc, x=T)

AIC and partial log-likelihood of all models are as follows.

AIC(mod1); AIC(mod2); AIC(mod3)
logLik(mod1); logLik(mod2); logLik(mod3)

The individual log-likelihood contributions sum up to the model log-likelihood as they should be

sum(llcont(mod3))
logLik(mod3)[1]

Nested Models

Models 1 and 3 are nested. Models are clearly distinguishable. Classical and alternative likelihood-ratio test both indicate highly significant improvement of the larger model.

plrtest(mod3, mod1, nested=T)

Non-nested Models

Models 2 and 3 are not nested. Models are clearly distinguishable. The likelihood-ratio test significantly favors the model with log-transformed predictors as shown in Fine (2002).

plrtest(mod3, mod2, nested=F)

For non-nested models with different numbers of parameters, the test statistic can be adjusted for the difference in model complexity using a AIC or BIC-type correction. Of course, the initial test on distinguishability of models is not affected.

mod5 <- coxph(Surv(time, status==2) ~ age, data=pbc,  x=T)
mod6 <- coxph(Surv(time, status==2) ~ albumin + bili + edema + protime, data=pbc,  x=T)
plrtest(mod6, mod5, nested=F)
plrtest(mod6, mod5, nested=F, adjusted="BIC")

Implementation

Merkle et al (2016, Psychological Methods) apply Voung's test to structural equation models. The corresponding R package nonnest2 (Merkle and You, 2018) allows to compute Vuong's test for various regression models with a full likelihood. We built our package for Cox regression models in a similar structure.

The function llcont.coxph computes individual contributions to the partial log-likelihood using Efron's method of handling ties, which is consistent with the default for the coxph function from the R package survival.

The function plrtest does the actual testing. The Imhof method (Imhof, 1961) implemented in R package CompQuadForm (Duchesne et al, 2010) is used to compute the distribution function of the weighted sum of $\chi^2$ distributions for theorems 1 and 3 (Fine, 2002). In the call of plrtest, one only needs to provide two fitted coxph objects and specify whether the models are nested or not. It is important that coxph is run with the option x=T. Both models need to be fitted with the same data, which might be an issue in case of missing data. For non-nested models, one can adjust (option adjusted) for the difference in the number of parameters in both models using the AIC or BIC correction factor (Vuong, 1989), but for BIC using number of events instead of number of observations.

The output of plrtest is a S3 object of class finetest for which a print function exists.

thomashielscher/nonnestcox documentation built on Dec. 14, 2020, 5:16 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Tweet to @rdrrHQ

GitHub issue tracker

ian@mutexlabs.com