Description Usage Format Details Source
This dataset is related with direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was required, in order to access if the product (bank term deposit) would be ('yes') or not ('no') subscribed.
The full dataset was described and analyzed in [Moro et al., 2011]., see source
section.
1 |
bank
is a tbl data frame with 45211 observations on 20 variables.
The data is ordered by call id (and also by date, from May 2008 to November 2010).
The 20 variables are organized as follows.
Bank client data:
id
phone call id (integer: from 1 to 45211);
age
age (integer);
job
type of job (factor with 12 levels: admin., unknown, unemployed, management, housemaid, entrepreneur, student, blue-collar, self-employed, retired, technician, services);
marital
marital status (factor with 3 levels: married, divorced, single). divorced means divorced or widowed;
education
education (factor with 4 levels: unknown, secondary, primary, tertiary);
default
has credit in default? (factor with 2 levels: yes, no);
balance
average yearly balance, in euros (integer);
housing
has housing loan? (factor with 2 levels: yes, no);
loan
has personal loan? (factor with 2 levels: yes, no).
Related with the last contact of the current campaign:
contact
contact communication type (factor with 3 levels: unknown, telephone, cellular);
day
last contact day of the month (integer);
month
last contact month of year (factor with 12 levels: jan, feb, mar, ..., nov, dec);
year
last contact year (integer: 2008, 2009, 2010);
date
last contact date (POSIXct
date);
duration
last contact duration, in seconds (integer);
y
has the client subscribed a term deposit? (factor with 2 levels: yes, no).
Other attributes:
campaign
number of contacts performed during this campaign and for this client (integer: it includes last contact);
pdays
number of days that passed by after the client was last contacted from a previous campaign (integer). -1 means client who was not previously contacted;
previous
number of contacts performed before this campaign and for this client (integer);
poutcome
outcome of the previous marketing campaign (factor with 4 levels: "unknown", "other", "failure", "success").
This dataset is publicly available for research at UCI Machine Learning repository. The details are described in [Moro et al., 2011].
Please include this citation if you plan to use this dataset:
[Moro et al., 2011] S. Moro, R. Laureano and P. Cortez. Using Data Mining for Bank Direct Marketing: An Application of the CRISP-DM Methodology.
In P. Novais et al. (Eds.), Proceedings of the European Simulation and Modelling Conference - ESM'2011, pp. 117-121, GuimarĂ£es, Portugal, October, 2011. EUROSIS.
Available at http://hdl.handle.net/1822/14838.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.