hepatitis: The hepatitis dataset


Description

This is the hepatitis dataset from the UCI Machine Learning Database Repository. The data were donated by Gail Gong.

Usage

data(hepatitis)

Format

A data frame with 155 observations on the following 20 variables. This dataset contains a large number of missing values.

V1: Histology (no, yes)
V2: Age
V3: Sex (male, female)
V4: Steroid (no, yes)
V5: Antivirals (no, yes)
V6: Fatigue (no, yes)
V7: Malaise (no, yes)
V8: Anorexia (no, yes)
V9: Liver big (no, yes)
V10: Liver firm (no, yes)
V11: Spleen palpable (no, yes)
V12: Spiders (no, yes)
V13: Ascites (no, yes)
V14: Varices (no, yes)
V15: Bilirubin
V16: Alk phosphate
V17: SGOT
V18: Albumin
V19: Protime
V20: Class (Die, Live)
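
Because the columns are named only V1 to V20, it can help to attach descriptive names before analysis. The following is a minimal sketch, not part of dprep: the name vector `hepatitis_names` and the stand-in data frame `toy` are hypothetical, and the names are simply taken from the variable list above in order.

```r
# Descriptive names taken from the variable list, in order V1..V20.
# (Hypothetical helper vector; dprep itself does not provide it.)
hepatitis_names <- c(
  "histology", "age", "sex", "steroid", "antivirals", "fatigue",
  "malaise", "anorexia", "liver_big", "liver_firm", "spleen_palpable",
  "spiders", "ascites", "varices", "bilirubin", "alk_phosphate",
  "sgot", "albumin", "protime", "class"
)

# Hypothetical stand-in with default column names V1..V20;
# substitute the real data frame after data(hepatitis) if dprep is installed.
toy <- as.data.frame(matrix(NA_real_, nrow = 3, ncol = 20))

# Guard against a mismatch between the name vector and the columns.
stopifnot(length(hepatitis_names) == ncol(toy))
names(toy) <- hepatitis_names
str(toy)
```

The `stopifnot()` check is there so that a renaming attempt fails early if the data frame does not have exactly 20 columns.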

Details

The original dataset has the class labels in the first column.
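
As an illustration of relocating a label column, the sketch below moves the first column of a data frame to the last position. The data frame `raw` and its values are hypothetical, and the sketch does not claim to reproduce how the dprep version was built; the page does not say which column reordering was used.

```r
# Hypothetical miniature of a class-first layout.
raw <- data.frame(
  class = c(2, 2, 1),
  age   = c(30, 50, 78),
  sex   = c(2, 1, 1)
)

# Move the first column to the end, keeping the other columns in order.
reordered <- raw[, c(seq_len(ncol(raw))[-1], 1)]
names(reordered)
```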

Source

The UCI Machine Learning Database Repository at:

References

Diaconis, P. & Efron, B. (1983). Computer-Intensive Methods in Statistics. Scientific American, Volume 248.

Examples

#------Report and plot of missing values ------
library(dprep)  # the hepatitis data and imagmiss() come from dprep
data(hepatitis)
imagmiss(hepatitis, "Hepatitis")

Example output

Report on missing values for  Hepatitis :

Number of missing values overall: 167
Percent of missing values overall: 5.670628
Features with missing values (percent): 
        V4         V6         V7         V8         V9        V10        V11 
 0.6451613  0.6451613  0.6451613  0.6451613  6.4516129  7.0967742  3.2258065 
       V12        V13        V14        V15        V16        V17        V18 
 3.2258065  3.2258065  3.2258065  3.8709677 18.7096774  2.5806452 10.3225806 
       V19 
43.2258065 

Percent of features with missing values: 78.94737
Number of instances with missing values: 75
Percent of instances with missing values: 48.3871
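
The figures in this report are consistent with counting missing values over the 19 feature columns only, leaving out the class column: 167 / (155 × 19) ≈ 5.67%, 15 / 19 ≈ 78.95%, and 75 / 155 ≈ 48.39%. The sketch below reproduces the same kind of summary in base R under that assumption. It is not the imagmiss() implementation, and the data frame `toy` and all variable names are hypothetical.

```r
# Hypothetical toy data: three features plus a class label in the last column.
toy <- data.frame(
  V1    = c(1, 2, NA, 2),
  V2    = c(NA, 0.7, NA, 1.2),
  V3    = c(30, 50, 78, 31),
  class = c(2, 2, 1, 2)
)

# Assumption: the class column (last) is excluded from the counts.
features <- toy[, -ncol(toy), drop = FALSE]
na_mask  <- is.na(features)          # logical matrix, TRUE where missing

n_missing       <- sum(na_mask)                      # missing cells overall
pct_missing     <- 100 * n_missing / length(na_mask) # share of all feature cells
pct_by_feature  <- 100 * colMeans(na_mask)           # per-column percentage
pct_by_feature  <- pct_by_feature[pct_by_feature > 0]
pct_features_na <- 100 * mean(colSums(na_mask) > 0)  # features with any NA
n_rows_na       <- sum(rowSums(na_mask) > 0)         # instances with any NA
pct_rows_na     <- 100 * n_rows_na / nrow(features)

print(pct_by_feature)
```

Applied to the real data frame in place of `toy`, the same expressions should give values comparable to the report above, provided the class column is indeed the last one.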

dprep documentation built on May 29, 2017, 11:01 a.m.