2. INTRODUCTION
The data is related with direct marketing campaigns of a Portuguese banking institution.
The marketing campaigns were based on phone calls.
2
Sources- Created by: Paulo Cortez (Univ. Minho) and Sérgio Moro (ISCTE-IUL) @ 2012
4. Exploratory Data Analysis
Concat
• Train
• Test
Missing Value
• Not Present At First
Instance
• (unknown ‘, ‘-1’ )
Duplica
te
•Present
Statistica
l
Summary
Outlier
•Present
Skewness
•Log X
transformatio
n
Campaign
•Campaign 1
•Campaign 2
4
Feature
Engineering
Pattern Detecting
Via
Visualization
5. Inspection of DATASET
5
•Number of features in dataset : 17
•Number of Observations in dataset : 49730
Data Type Number
Numerical 6
Categorical 11
Age Job Marital Education Default Balance Housing Loan Contact Day Month Duration Campaign Pdays Previous Poutcome
Y
0 58
manage
ment
married tertiary no 2143 yes no unknown 5 may 261 1 -1 0 unknown no
1 44
technicia
n
single secondary no 29 yes no unknown 5 may 151 1 -1 0 unknown no
2 33
entrepre
neur
married secondary no 2 yes yes unknown 5 may 76 1 -1 0 unknown no
3 47
blue-
collar
married unknown no 1506 yes no unknown 5 may 92 1 -1 0 unknown no
4 33 unknown single unknown no 1 no no unknown 5 may 198 1 -1 0 unknown no
17. Is loan going to affect Subscription of Deposit ?
Housing Loan Personal Loan
20XX
PRESENTATION TITLE
17
18. SUMMARY
18
SEGMENT 1
• HH, Entrepreneur , Unemployed , Student
• secondary education
• 8th and 14th day
• Feb , Nov
SEGMENT 2
• Service ,Retired , Admin
• 7th day
• Aug ,Jan
• No Housing loan
• Yes Personal Loan