DONOR: DONOR dataset

Description Usage Format Details

Description

Adapted from the KDD-CUP-98 data set concerning data regarding donations made to a national veterans organization.

Usage

1
data("DONOR")

Format

A data frame with 19372 observations on the following 50 variables.

Donate

a factor with levels No Yes

Donation.Amount

a numeric vector

ID

a numeric vector

MONTHS_SINCE_ORIGIN

a numeric vector, number of months donor has been in the database

DONOR_AGE

a numeric vector

IN_HOUSE

a numeric vector, 1 if person has donated to the charity's “In House" program

URBANICITY

a factor with levels ? C R S T U

SES

a factor with levels ? 1 2 3 4, one of five possible codes indicating socioeconomic status

CLUSTER_CODE

a factor with levels . 01 02 ... 53, one of 54 possible cluster codes, which are unique in terms of socioeconomic status, urbanicity, ethnicity, and other demographic characteristics

HOME_OWNER

a factor with levels H U

DONOR_GENDER

a factor with levels A F M U

INCOME_GROUP

a numeric vector, but in reality one of 7 possible income groups inferred from demographics

PUBLISHED_PHONE

a numeric vector, listed (1) vs not listed (0)

OVERLAY_SOURCE

a factor with levels B M N P, source from which the donor was match; B is both sources and N is neither

MOR_HIT_RATE

a numeric vector, number of known times donor has responded to a mailed solicitation from a group other than the charity

WEALTH_RATING

a numeric vector, but in reality one of 10 groups based on demographics

MEDIAN_HOME_VALUE

a numeric vector, inferred from other variables

MEDIAN_HOUSEHOLD_INCOME

a numeric vector, inferred from other variables

PCT_OWNER_OCCUPIED

a numeric vector, percent of owner-occupied housing near where person lives

PER_CAPITA_INCOME

a numeric vector, of neighborhood in which person lives

PCT_ATTRIBUTE1

a numeric vector, percent of residents in person's neighborhood that are male and active military

PCT_ATTRIBUTE2

a numeric vector, percent of residents in person's neighborhood that are male and veterans

PCT_ATTRIBUTE3

a numeric vector, percent of residents in person's neighborhood that are Vietnam veterans

PCT_ATTRIBUTE4

a numeric vector, percent of residents in person's neighborhood that are WW2 veterans

PEP_STAR

a numeric vector, 1 if has achieved STAR donor status and 0 otherwise

RECENT_STAR_STATUS

a numeric vector, 1 if achieved STAR within last 4 years

RECENCY_STATUS_96NK

a factor with levels A (active) E (inactive) F (first time) L (lapsing)N (new) S (star donor) as of 1996.

FREQUENCY_STATUS_97NK

a numeric vector indicating number of times donated in last period (but period is determined by RECENCY STATUS 96NK)

RECENT_RESPONSE_PROP

a numeric vector, proportion of responses to the individual to the number of (card or other) solicitations from the charitable organization since four years ago

RECENT_AVG_GIFT_AMT

a numeric vector, average donation from the individual to the charitable organization since four years ago

RECENT_CARD_RESPONSE_PROP

a numeric vector, number of times the individual has responded to a card solicitation from the charitable organization since four years ago

RECENT_AVG_CARD_GIFT_AMT

a numeric vector, average donation from the individual in response to a card solicitation from the charitable organization since four years ago

RECENT_RESPONSE_COUNT

a numeric vector, number of times the individual has responded to a promotion (card or other) from the charitable organization since four years ago

RECENT_CARD_RESPONSE_COUNT

a numeric vector, number of times the individual has responded to a card solicitation from the charitable organization since four years ago

MONTHS_SINCE_LAST_PROM_RESP

a numeric vector, number of months since the individual has responded to a promotion by the charitable organization

LIFETIME_CARD_PROM

a numeric vector, total number of card promotions sent to the individual by the charitable organization

LIFETIME_PROM

a numeric vector, total number of promotions sent to the individual by the charitable organization

LIFETIME_GIFT_AMOUNT

a numeric vector, total lifetime donation amount from the individual to the charitable organization

LIFETIME_GIFT_COUNT

a numeric vector, total number of donations from the individual to the charitable organization

LIFETIME_AVG_GIFT_AMT

a numeric vector, lifetime average donation from the individual to the charitable organization

LIFETIME_GIFT_RANGE

a numeric vector, difference between maximum and minimum donation amounts from the individual

LIFETIME_MAX_GIFT_AMT

a numeric vector

LIFETIME_MIN_GIFT_AMT

a numeric vector

LAST_GIFT_AMT

a numeric vector

CARD_PROM_12

a numeric vector, number of card promotions sent to the individual by the charitable organization in the last 12 months

NUMBER_PROM_12

a numeric vector, number of promotions (card or other) sent to the individual by the charitable organization in the last 12 months

MONTHS_SINCE_LAST_GIFT

a numeric vector

MONTHS_SINCE_FIRST_GIFT

a numeric vector

FILE_AVG_GIFT

a numeric vector, same as LIFETIME_AVG_GIFT_AMT

FILE_CARD_GIFT

a numeric vector, lifetime average donation from the individual in response to all card solicitations from the charitable organization

Details

Originally, this data was used with the 1998 KDD competition (https://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html). This particular version has been adapted from the version available in SAS Enterprise Miner (http://support.sas.com/documentation/cdl/en/emgsj/61207/PDF/default/emgsj.pdf Appendix 2 for descriptions of variable names). One goal is to determine whether a past donor donated in response to the 97NK mail solicitation and (if so), how much, based on age, gender, most recent donation amount, total gift amount, etc.


profpetrie/regclass documentation built on May 26, 2019, 8:33 a.m.