german: German credit scoring data

germanR Documentation

German credit scoring data

Description

See website for details of data attributes

Usage

german

Format

A data frame with 1000 observations on the following 21 variables.

V1

a factor with levels A11 A12 A13 A14

V2

a numeric vector

V3

a factor with levels A30 A31 A32 A33 A34

V4

a factor with levels A40 A41 A410 A42 A43 A44 A45 A46 A48 A49

V5

a numeric vector

V6

a factor with levels A61 A62 A63 A64 A65

V7

a factor with levels A71 A72 A73 A74 A75

V8

a numeric vector

V9

a factor with levels A91 A92 A93 A94

V10

a factor with levels A101 A102 A103

V11

a numeric vector

V12

a factor with levels A121 A122 A123 A124

V13

a numeric vector

V14

a factor with levels A141 A142 A143

V15

a factor with levels A151 A152 A153

V16

a numeric vector

V17

a factor with levels A171 A172 A173 A174

V18

a factor with levels good bad

V19

a factor with levels A191 A192

V20

a factor with levels A201 A202

V21

a numeric vector

Details

700 good and 300 bad credits with 20 predictor variables. Data from 1973 to 1975. Stratified sample from actual credits with bad credits heavily oversampled. A cost matrix can be used.

Source

http://archive.ics.uci.edu/datasets

References

Grömping, U. (2019). South German Credit Data: Correcting a Widely Used Data Set. Report 4/2019, Reports in Mathematics, Physics and Chemistry, Department II, Beuth University of Applied Sciences Berlin.

Examples

data(german)

gamclass documentation built on Aug. 21, 2023, 5:07 p.m.