DONOR: DONOR dataset


Description

Adapted from the KDD-CUP-98 data set, which records donations made to a national veterans organization.

Usage

data("DONOR")
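
A minimal sketch of loading and inspecting the data, assuming the regclass package is installed. The `requireNamespace` guard and the choice of columns to preview are illustrative and not part of the original help page.

```r
# Load DONOR from regclass only if that package is available.
if (requireNamespace("regclass", quietly = TRUE)) {
  data("DONOR", package = "regclass")

  print(dim(DONOR))           # number of rows and columns
  str(DONOR[, 1:6])           # structure of the first six variables
  print(table(DONOR$Donate))  # counts of non-donors and donors
}
```

The explicit `print()` calls are needed because values are not auto-printed inside the `if` block.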

Format

A data frame with 19372 observations on the following 50 variables.

Donate

a factor with levels No Yes

Donation.Amount

a numeric vector

ID

a numeric vector

MONTHS_SINCE_ORIGIN

a numeric vector, number of months donor has been in the database

DONOR_AGE

a numeric vector

IN_HOUSE

a numeric vector, 1 if person has donated to the charity's "In House" program

URBANICITY

a factor with levels ? C R S T U

SES

a factor with levels ? 1 2 3 4, one of five possible codes indicating socioeconomic status

CLUSTER_CODE

a factor with levels . 01 02 ... 53, one of 54 possible cluster codes, which are unique in terms of socioeconomic status, urbanicity, ethnicity, and other demographic characteristics

HOME_OWNER

a factor with levels H U

DONOR_GENDER

a factor with levels A F M U

INCOME_GROUP

a numeric vector whose values are codes for one of 7 possible income groups inferred from demographics

PUBLISHED_PHONE

a numeric vector, listed (1) vs not listed (0)

OVERLAY_SOURCE

a factor with levels B M N P, source from which the donor was matched; B is both sources and N is neither

MOR_HIT_RATE

a numeric vector, number of known times donor has responded to a mailed solicitation from a group other than the charity

WEALTH_RATING

a numeric vector whose values are codes for one of 10 groups based on demographics

MEDIAN_HOME_VALUE

a numeric vector, inferred from other variables

MEDIAN_HOUSEHOLD_INCOME

a numeric vector, inferred from other variables

PCT_OWNER_OCCUPIED

a numeric vector, percent of owner-occupied housing near where person lives

PER_CAPITA_INCOME

a numeric vector, per capita income of the neighborhood in which the person lives

PCT_ATTRIBUTE1

a numeric vector, percent of residents in person's neighborhood that are male and active military

PCT_ATTRIBUTE2

a numeric vector, percent of residents in person's neighborhood that are male and veterans

PCT_ATTRIBUTE3

a numeric vector, percent of residents in person's neighborhood that are Vietnam veterans

PCT_ATTRIBUTE4

a numeric vector, percent of residents in person's neighborhood that are WW2 veterans

PEP_STAR

a numeric vector, 1 if the donor has achieved STAR donor status and 0 otherwise

RECENT_STAR_STATUS

a numeric vector, 1 if the donor achieved STAR status within the last 4 years

RECENCY_STATUS_96NK

a factor with levels A (active) E (inactive) F (first time) L (lapsing) N (new) S (star donor), as of 1996

FREQUENCY_STATUS_97NK

a numeric vector indicating the number of times the individual donated in the last period (the period is determined by RECENCY_STATUS_96NK)

RECENT_RESPONSE_PROP

a numeric vector, proportion of (card or other) solicitations from the charitable organization since four years ago to which the individual responded

RECENT_AVG_GIFT_AMT

a numeric vector, average donation from the individual to the charitable organization since four years ago

RECENT_CARD_RESPONSE_PROP

a numeric vector, proportion of card solicitations from the charitable organization since four years ago to which the individual responded

RECENT_AVG_CARD_GIFT_AMT

a numeric vector, average donation from the individual in response to a card solicitation from the charitable organization since four years ago

RECENT_RESPONSE_COUNT

a numeric vector, number of times the individual has responded to a promotion (card or other) from the charitable organization since four years ago

RECENT_CARD_RESPONSE_COUNT

a numeric vector, number of times the individual has responded to a card solicitation from the charitable organization since four years ago

MONTHS_SINCE_LAST_PROM_RESP

a numeric vector, number of months since the individual has responded to a promotion by the charitable organization

LIFETIME_CARD_PROM

a numeric vector, total number of card promotions sent to the individual by the charitable organization

LIFETIME_PROM

a numeric vector, total number of promotions sent to the individual by the charitable organization

LIFETIME_GIFT_AMOUNT

a numeric vector, total lifetime donation amount from the individual to the charitable organization

LIFETIME_GIFT_COUNT

a numeric vector, total number of donations from the individual to the charitable organization

LIFETIME_AVG_GIFT_AMT

a numeric vector, lifetime average donation from the individual to the charitable organization

LIFETIME_GIFT_RANGE

a numeric vector, difference between maximum and minimum donation amounts from the individual

LIFETIME_MAX_GIFT_AMT

a numeric vector

LIFETIME_MIN_GIFT_AMT

a numeric vector

LAST_GIFT_AMT

a numeric vector

CARD_PROM_12

a numeric vector, number of card promotions sent to the individual by the charitable organization in the last 12 months

NUMBER_PROM_12

a numeric vector, number of promotions (card or other) sent to the individual by the charitable organization in the last 12 months

MONTHS_SINCE_LAST_GIFT

a numeric vector

MONTHS_SINCE_FIRST_GIFT

a numeric vector

FILE_AVG_GIFT

a numeric vector, same as LIFETIME_AVG_GIFT_AMT

FILE_CARD_GIFT

a numeric vector, lifetime average donation from the individual in response to all card solicitations from the charitable organization
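
Several of the coding notes above suggest some cleanup before modelling. The following is a sketch, not part of the package: `clean_donor` and `check_derived` are hypothetical helpers, treating the `?` and `.` levels as missing-value placeholders is an assumption the help page does not state, and the toy rows are made up rather than taken from DONOR.

```r
# Hypothetical helper: recode placeholder levels and group-coded numerics.
clean_donor <- function(d, placeholders = c("?", ".")) {
  # Assumption: "?" and "." mark unknown values in these factors.
  for (v in intersect(c("URBANICITY", "SES", "CLUSTER_CODE"), names(d))) {
    x <- as.character(d[[v]])
    x[x %in% placeholders] <- NA
    d[[v]] <- factor(x)
  }
  # INCOME_GROUP and WEALTH_RATING are group codes stored as numbers.
  # ordered = TRUE assumes the codes are ranked; drop it if they are not.
  for (v in intersect(c("INCOME_GROUP", "WEALTH_RATING"), names(d))) {
    d[[v]] <- factor(d[[v]], ordered = TRUE)
  }
  d
}

# Hypothetical helper: check two relationships stated in the descriptions.
check_derived <- function(d) {
  c(
    range_is_max_minus_min = isTRUE(all.equal(
      d$LIFETIME_GIFT_RANGE,
      d$LIFETIME_MAX_GIFT_AMT - d$LIFETIME_MIN_GIFT_AMT
    )),
    file_avg_equals_lifetime_avg = isTRUE(all.equal(
      d$FILE_AVG_GIFT, d$LIFETIME_AVG_GIFT_AMT
    ))
  )
}

# Made-up rows for illustration only.
toy <- data.frame(
  SES = factor(c("?", "1", "2")),
  INCOME_GROUP = c(3, 1, 7),
  LIFETIME_MAX_GIFT_AMT = c(20, 15, 50),
  LIFETIME_MIN_GIFT_AMT = c(5, 15, 10),
  LIFETIME_GIFT_RANGE = c(15, 0, 40),
  LIFETIME_AVG_GIFT_AMT = c(10, 15, 25),
  FILE_AVG_GIFT = c(10, 15, 25)
)
cleaned <- clean_donor(toy)
print(levels(cleaned$SES))
print(check_derived(toy))
```

Both helpers take any data frame with the relevant column names, so they can be applied to `DONOR` itself once it is loaded. Whether the stated relationships hold exactly in the real data is something the check is meant to reveal, not something assumed here.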

Details

Originally, these data were used in the 1998 KDD competition (https://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html). This particular version has been adapted from the version available in SAS Enterprise Miner (see Appendix 2 of http://support.sas.com/documentation/cdl/en/emgsj/61207/PDF/default/emgsj.pdf for descriptions of variable names). One goal is to determine whether a past donor donated in response to the 97NK mail solicitation and, if so, how much, based on predictors such as age, gender, most recent donation amount, and total gift amount.
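
The modelling goal described above can be sketched as a two-stage fit: a logistic regression for whether a person donated, followed by a linear regression for the amount among those who did. This is only one possible approach and is not taken from the package or the competition. `fit_donor_models` is a hypothetical helper, and the four predictors are simply the examples named in the text.

```r
# Hypothetical helper: two-stage sketch of the goal described above.
fit_donor_models <- function(d) {
  # Stage 1: logistic regression for whether the person donated.
  # With levels No/Yes, glm() models the probability of "Yes".
  response <- glm(
    Donate ~ DONOR_AGE + DONOR_GENDER + LAST_GIFT_AMT + LIFETIME_GIFT_AMOUNT,
    data = d, family = binomial
  )
  # Stage 2: linear regression for the amount, among donors only.
  donors <- d[which(d$Donate == "Yes"), ]
  amount <- lm(
    Donation.Amount ~ DONOR_AGE + DONOR_GENDER + LAST_GIFT_AMT +
      LIFETIME_GIFT_AMOUNT,
    data = donors
  )
  list(response = response, amount = amount)
}

# Fit on DONOR only if the regclass package is available.
if (requireNamespace("regclass", quietly = TRUE)) {
  data("DONOR", package = "regclass")
  fits <- fit_donor_models(DONOR)
  print(summary(fits$response))
}
```

Rows with missing predictor values are dropped by the default `na.action` of `glm()` and `lm()`, so the two models may be fitted on fewer rows than the data frame contains.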


regclass documentation built on May 29, 2017, 11:15 p.m.