hepatitis: Hepatitis data set

Description Usage Format Source Examples

Description

This data set is about hepatitus disease. The data set is obtained from UCI machine learning repository. There are 155 observations in total, however this data set consists of 80 observations after removing the observations with missing values. There are 19 features/ attributes where 13 attributes are binary while 6 attributes are discrete valued. The observations are catogarized in two classes classes die and live. There are 13 observations in class "die" and "67" in class live.

Usage

1

Format

A data frame with 80 observations on the following 20 variables.

Age

age of the patients in years, from 20 to 80 years.

Sex

Gender of patient, a factor at two levels coded by 1 (male) and 2(female)

Steroid

Steroid treatment, a factor at two levels coded by 1(yes) and 2(no) .

Antivirals

Antivirals medication, a factor at two levels 1 (yes) and 2 (no).

Fatigue

Fatigue is a frequent and disabling symptom reported by patients with chronic hepatitis, a factor at two levels 1 (yes) and 2 (no).

Malaise

Malaise one of the symptoms of hepatitis, a factor at two levels 1 (yes) and 2 (no).

Anorexia

Anorexia, loss of appetite, a factor at two levels 1 (yes) and 2 (no).

LiverBig

The size of liver increased or fatty, a factor at two levels 1 (yes) and 2 (no).

LiverFirm

A factor at two levels 1 (yes) and 2 (no).

SpleenPalpable

Splenomegaly is an enlargement of the spleen, a factor at two levels 1 (yes) and 2 (no).

Spiders

Enlarged blood vessels that resemble little spiders,a factor at two levels 1 (yes) and 2 (no).

Ascites

Ascites is the presence of excess fluid in the peritoneal cavity, a factor at two levels 1(yes) and 2(no)).

Varices

a factor at two levels 1(yes) and 2(no)).

Bilirubin

Bilirubin is a substance made when the body breaks down old red blood cells, continuous feature

AlkPhosphate

Alkaline phosphatase is an enzyme made in liver cells and bile ducts, a discrete valued feature reveals level Alkaline phosphatase.

Sgot

A discrete valued feature.

AlbuMin

A continous feature.

ProTime

A discrete valued feature.

Histology

a factor at two levels 1 (yes) and 2 (no).

Class

a factor at two levels 1(Die) or 2(Live).

Source

This data set is available on: https://archive.ics.uci.edu/ml/datasets/Hepatitis

Examples

1
2

Example output

'data.frame':	80 obs. of  20 variables:
 $ Age           : int  34 39 32 41 30 38 40 38 38 22 ...
 $ Sex           : int  1 1 1 1 1 1 1 1 1 2 ...
 $ Steroid       : int  2 1 2 2 2 1 1 2 1 2 ...
 $ Antivirals    : int  2 1 1 1 2 2 2 2 1 1 ...
 $ Fatigue       : int  2 2 1 1 1 1 1 2 2 1 ...
 $ Malaise       : int  2 2 2 2 2 1 2 2 2 2 ...
 $ Anorexia      : int  2 2 2 2 2 1 2 2 2 2 ...
 $ LiverBig      : int  2 1 2 2 2 2 2 2 1 2 ...
 $ LiverFirm     : int  2 1 1 1 1 2 1 2 1 2 ...
 $ SpleenPalpable: int  2 2 2 2 2 2 2 2 2 2 ...
 $ Spiders       : int  2 2 1 2 2 2 2 2 2 2 ...
 $ Ascites       : int  2 2 2 2 2 1 2 2 2 2 ...
 $ Varices       : int  2 2 2 2 2 2 2 2 2 2 ...
 $ Bilirubin     : num  0.9 1.3 1 0.9 2.2 2 0.6 0.7 0.7 0.9 ...
 $ AlkPhosphate  : int  95 78 59 81 57 72 62 53 70 48 ...
 $ Sgot          : int  28 30 249 60 144 89 166 42 28 20 ...
 $ AlbuMin       : num  4 4.4 3.7 3.9 4.9 2.9 4 4.1 4.2 4.2 ...
 $ ProTime       : int  75 85 54 52 78 46 63 85 62 64 ...
 $ Histology     : int  1 1 1 1 1 1 1 2 1 1 ...
 $ Class         : int  2 2 2 2 2 2 2 2 2 2 ...

ESKNN documentation built on May 2, 2019, 6:25 a.m.