Framingham: Framingham Heart study dataset

Description Usage Format Source Examples

Description

This dataset contains 4135 patients with 16 risk factors. Some artificial missing data are created for TenYearCHD.

Usage

1

Format

A data frame with columns:

male

binary, male or female.

age

continuous, age of the patient.

education

categorial, levels coded 1 for some high school, 2 for a high school diploma or GED, 3 for some college or vocational school, and 4 for a college degree.

currentSmoker

binary, whether or not the patient is a current smoker.

cigsPerDay

continuous, the number of cigarettes that the person smoked on average in one day.

BPMeds

binary, whether or not the patient was on blood pressure medication.

prevalentStroke

binary, whether or not the patient had previously had a stroke.

prevalentHyp

binary, whether or not the patient was hypertensive.

diabetes

binary, whether or not the patient had diabetes.

totChol

continuous, total cholesterol level.

sysBP

continuous, systolic blood pressure.

diaBP

continuous, diastolic blood pressure.

BMI

continuous, Body Mass Index.

heartRate

continuous, heart rate.

glucose

continuous, glucose level.

TenYearCHD

binary, 10 year risk of coronary heart disease CHD.

Source

Kaggle Framingham Heart study dataset https://www.kaggle.com/amanajmera1/framingham-heart-study-dataset/

Examples

1
2

jialiwang1211/GCMlasso documentation built on May 14, 2019, 12:55 a.m.