Synthetic.2-data: Synthetic Dataset #2: p < n case


Dataset from simulated regression survival model #2 as described in Dazard et al. (2015). Here, the regression function uses some informative predictors. The rest represent uninformative noisy covariates, which are not part of the design matrix. Survival time was generated from an exponential model with rate parameter λ (and mean 1/λ) according to a Cox-PH model with hazard exp(eta), where eta(.) is the regression function. Censoring indicators were generated from a uniform distribution on [0, 3]. In this synthetic example, all covariates are continuous, i.i.d. from a multivariate uniform distribution on [0, 1].
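The generating scheme described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the code used to produce the dataset: the function name `simulate_survival`, the linear form of eta, and the coefficients in `beta` are hypothetical (the actual regression function is given in Dazard et al., 2015), and the uniform draw on [0, 3] is interpreted here as a censoring time from which the censoring indicator is derived.

```python
import math
import random


def simulate_survival(n=250, p=3, beta=(1.0, -1.0, 0.5), seed=0):
    """Simulate right-censored survival data (illustrative sketch only).

    ASSUMPTIONS (not taken from the dataset documentation):
      - eta(x) is linear in x with the hypothetical coefficients `beta`;
      - the uniform draw on [0, 3] is treated as a censoring time.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        # Covariates: i.i.d. uniform on [0, 1)
        x = [rng.random() for _ in range(p)]
        # Hypothetical linear regression function eta(x)
        eta = sum(b * xi for b, xi in zip(beta, x))
        # Cox-PH with constant baseline: hazard (exponential rate) is exp(eta)
        rate = math.exp(eta)
        # Exponential survival time with rate `rate`, i.e. mean 1/rate
        t = rng.expovariate(rate)
        # Censoring time drawn uniformly on [0, 3]
        c = rng.uniform(0.0, 3.0)
        # Observed time and event indicator (1 = event, 0 = censored)
        rows.append((x, min(t, c), int(t <= c)))
    return rows


data = simulate_survival()
```

Each element of `data` is a tuple of covariate values, observed (possibly censored) time, and event indicator, mirroring the layout of a covariate matrix plus the two survival columns.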




Each dataset consists of a numeric matrix with n=250 observations (samples) in rows and p=3 variables in columns, not including the censoring indicator and (censored) time-to-event variables. It comes as a compressed Rda data file.


Maintainer: "Jean-Eudes Dazard, Ph.D."

Acknowledgments: This project was partially funded by the National Institutes of Health (NIH), National Cancer Institute (R01-CA160593) to J-E. Dazard and J.S. Rao.


See simulated survival model #2 in Dazard et al., 2015.

