diabetes: Pima Indians Diabetes Database

Description Usage Format Source

Description

A dataset extracted from the National Institute of Diabetes and Digestive and Kidney Diseases. The dataset contains a number of medically relevant variables. It was created specifically (via extraction from a larger database) to allow users to experiment with methods to predict the occurence of diabetes in the patients. More information can be found in the URL provided below.

Usage

1

Format

A data frame with 768 rows and 9 columns:

Pregnancies

the number of times the patient has been pregnant

Glucose

the plasma glucose concentration measured for the patient after two hours in an oral glucose tolerance test (milligram/deciliter)

BloodPressure

the diastolic blood pressure of the patient (mm Hg)

SkinThickness

tricep skin fold thickness measured for the patient (mm)

Insulin

2-Hour serum insulin level measured for the patient (mu U/ml)

BMI

Body Mass Index of patient (weight in kg/(height in m)^2)

DiabetesPedigreeFunction

score assigned based on family history of diabetes

Age

age of the patient (years)

Outcome

binary indicating diabetes diagnosis for patient: 1 for yes and 0 for no

Source

https://www.kaggle.com/uciml/pima-indians-diabetes-database?select=diabetes.csv


samrabi1/ada.classification documentation built on May 23, 2020, 7:01 a.m.