prostate | R Documentation |
This dataset is used as an example in Hastie, Tibshirani & Freedman's Elements of Statistical Learning. It was included in the ElemStatLearn package which (at time of writing) is orphaned and no-longer available on CRAN.
prostate
An object of class data.frame
with 97 rows and 10 columns.
There are 8 predictors (columns 1:8), one outcome (column 9) and a marker for test/train data used in the textbook examples (column 10).
Observations are from 97 men who underwent prostatectomy. The original paper investigates the post-surgical characteristics that predict pre-surgical prostate-specific antigen (PSA) score (variable: lpsa).
Variables prefixed with 'l' have been log transformed.
The following descriptions have been adapted from Ryan Tibshirani's lecture notes on EDA #nolint
log PSA score
log cancer volume
log prostate weight
age of patient
log of the amount of benign prostatic hyperplasia
seminal vesicle invasion
log of capsular penetration
Gleason score
percent of Gleason scores 4 or 5
The dataset is provided in the original units, a scaled version can be
obtained with
sprostate <- data.frame(scale(prostate[,-10]), train = prostate[,10])
.
Observations are ordered by outcome.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.