A subset of primary biliary cirrhosis (PBC) of the liver data in the book "Counting Process & Survival Analysis" by Fleming & Harrington (1991). This subset is used in Tibshirani (1997).
A data frame with 276 observations on the following 19 variables.
Survival times (either time to death or censoring) in days
Censoring indicator, 1=death, 0=censoring
Treatment indicator, 1=treatment by D-penicillamine, 0=placebo
Age in years (days divided by 365.25)
Sex, 0=male, 1=female
Presence of ascites, 0=no, 1=yes
Presence of hepatomegaly, 0=no, 1=yes
Presence of spiders, 0=no, 1=yes
Presence of edema, 0=no edema, 0.5=edema resolved by therapy, 1=edema not resolved by therapy
log(urine copper, mg/day)
log(SGOT, in U/ml)
log(triglycerides, in mg/dl)
log(platelet count, [the number of platelets per-cubic-milliliter of blood]/1000)
log(prothrombin time, in seconds)
Histologic stage of disease, graded 1, 2, 3, or 4
Survival data consisting of 276 patients with 17 covariates. Among them, 111 patients died (d=1) while others were censored (d=0). The covariates consist of a treatment indicator (trt), age, sex, 5 categorical variables (ascites, hepatomegaly, spider, edema, and stage of disease) and 9 log-transformed continuous variables (bilirubin, cholesterol, albumin, urine copper, alkarine, SGOT, triglycerides, platelet count, and prothrombine).
Fleming & Harrignton (1991); Tibshirani (1997)
Tibshirani R (1997), The Lasso method for variable selection in the Cox model, Statistics in Medicine, 385-395.
1 2 3
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.