tobacco | R Documentation |
A simulated datasets of 1,000 subjects, with the following variables:
data(tobacco)
A data frame with 1000 rows and 9 variables
gender Factor with 2 levels: “F” and “M”, having roughly 500 of each.
age Numerical.
age.gr Factor with 4 age categories.
BMI Body Mass Index (numerical).
smoker Factor (“Yes” / “No”).
cigs.per.day Number of cigarettes smoked per day (numerical).
diseased Factor (“Yes” / “No”).
disease Character.
samp.wgts Sampling weights (numerical).
A note on simulation: probability for an individual to fall into category “diseased” is based on an arbitrary function involving age, BMI and number of cigarettes per day.
A copy of this dataset is also available in French under the name “tabagisme”.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.