MedicalCost: Medical Cost of 1,338 insureds.

Description Usage Format Details Source

Description

A dataset containing the medical costs ("charges") and other attributes of 1,338 insureds.

Usage

1

Format

A data frame with 1,338 rows and 10 variables:

age

age of the insured

sex

gender of the insured, binary (if female)

BMI

Body Mass Index of the insured

children

number of children covered as dependents

smoker

smoking status, binary (if the insured smokes)

region: northeast

binary (if the insured lives in that region)

region: southeast

binary (if the insured lives in that region)

region: southwest

binary (if the insured lives in that region)

region: northwest

binary (if the insured lives in that region)

charges

medical costs, in US dollars

Details

The original dataset contains 7 variables, but one-hot encoding was used on the "region" categorical variable. It is a dataset that was used in a Kaggle competition.

Source

https://github.com/stedy/Machine-Learning-with-R-datasets/blob/master/insurance.csv


mgoulet847/tagi documentation built on Dec. 21, 2021, 5:10 p.m.