diabetes: Pima Indians Diabetes Database
In samrabi1/ada.classification: Classification Methods: K-means and Random Forests

A dataset extracted from the National Institute of Diabetes and Digestive and Kidney Diseases. The dataset contains a number of medically relevant variables. It was created specifically (via extraction from a larger database) to allow users to experiment with methods to predict the occurence of diabetes in the patients. More information can be found in the URL provided below.

diabetes

A data frame with 768 rows and 9 columns:

Pregnancies: the number of times the patient has been pregnant
Glucose: the plasma glucose concentration measured for the patient after two hours in an oral glucose tolerance test (milligram/deciliter)
BloodPressure: the diastolic blood pressure of the patient (mm Hg)
SkinThickness: tricep skin fold thickness measured for the patient (mm)
Insulin: 2-Hour serum insulin level measured for the patient (mu U/ml)
BMI: Body Mass Index of patient (weight in kg/(height in m)^2)
DiabetesPedigreeFunction: score assigned based on family history of diabetes
Age: age of the patient (years)
Outcome: binary indicating diabetes diagnosis for patient: 1 for yes and 0 for no