PIMA: Pima Diabetes dataset

Description Usage Format Details Source

Description

Diabetes among women aged 21+ with Pima heritage

Usage

1
data("PIMA")

Format

A data frame with 392 observations on the following 8 variables.

Pregnant

a numeric vector, number of times the woman has been pregnant

Glucose

a numeric vector, plasma glucose concentration

BloodPressure

a numeric vector, diastolic blood pressure in mm Hg

BodyFat

a numeric vector, a measurement of the triceps skinfold thickness which is an indicator of body fat percentage

Insulin

a numeric vector, 2-hour serum insulin

BMI

a numeric vector, body mass index

Age

a numeric vector, years

Diabetes

a factor with levels No Yes

Details

Data on 768 women belonging to the Pima tribe. The purpose is to study the associations between having diabetes and various physiological characteristics. Although there are surely other factors (including genetic) that influence the chance of having diabetes, the hope is that by having women who are genetically similar (all from the Pima tribe), that these other factors are naturally accounted for.

Source

Adapted from the UCI data repository https://archive.ics.uci.edu/ml/datasets/Pima+Indians+Diabetes. A variable measuring the “diabetes pedigree function" has been omitted.


profpetrie/regclass documentation built on May 26, 2019, 8:33 a.m.