Predictive Analysis For Retail Banking

Logistic regression was chosen as the best predictive model for a retail banking marketing campaign dataset, with an accuracy of 80.9%. The analysis included exploratory data analysis with a correlation heatmap and min-max scaling of the features before model selection. Logistic regression outperformed k-nearest neighbors, decision trees, random forests, and support vector machines on this imbalanced classification problem, which is to predict whether a customer subscribes to a bank term deposit.

Uploaded by

Mai Nguyễn Thị

Predictive Analytics for Retail Banking

Done by
vamsi
RETAIL BANKING ??!

▪ Typical mass-market banking in which individual customers use local branches of larger commercial banks. Services offered include savings and checking accounts, mortgages, personal loans, and debit/credit cards. The focus is on the customer.
▪ The main challenges in this sector are:
• Which product is suitable to recommend to a customer?
• What is the best time to market the product?
• Which is the most effective channel to contact a customer?
PROBLEM STATEMENT

▪ The data relates to the direct marketing campaigns of a banking institution. The campaigns were based on phone calls. Often, more than one contact with the same client was required in order to assess whether the product (a bank term deposit) would be subscribed ('yes') or not ('no'). The goal is to predict whether the client will subscribe to a term deposit.
ABOUT DATASET

▪ This is the classic bank marketing dataset, originally uploaded to the UCI Machine Learning Repository. It describes a marketing campaign run by a financial institution, which you analyse in order to find strategies for improving the bank's future marketing campaigns.
Here is what the columns in the data set represent:
▪ Age : Age of the client - (numeric)
▪ Job : Client’s occupation - (categorical) (admin, blue-collar, entrepreneur, housemaid, management, retired, self-employed, services, student, technician, unemployed, unknown)
▪ Marital : Client’s marital status - (categorical) (divorced, married, single, unknown; note: divorced means divorced or widowed)
▪ Education : Client’s education level - (categorical)
▪ Default : Does the client have credit in default? - (categorical) (no, yes)
▪ Balance : Average yearly balance, in euros - (numeric)
▪ Housing : Does the client have a housing loan? - (categorical) (no, yes)
▪ Loan : Does the client have a personal loan? - (categorical) (no, yes)
▪ Contact : Type of communication contact - (categorical) (unknown, cellular, telephone)
▪ Day : Day of last contact with client
▪ Month : Month of last contact with client - (categorical) (Jan - Dec)
▪ Duration : Duration of last contact with client, in seconds - (numeric). For benchmark purposes only; not reliable for predictive modelling.
▪ Campaign : Number of contacts performed during this campaign for this client - (numeric, includes last contact)
▪ Pdays : Number of days since the client was last contacted - (numeric) (-1 means the client was not previously contacted)
▪ Previous : Number of contacts with this client performed before this campaign - (numeric)
▪ Poutcome : Outcome of the previous marketing campaign - (categorical)
▪ Deposit : Whether the client subscribed to a term deposit - (output)
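The column notes above suggest a few preparation steps before modelling. The following is a minimal sketch under stated assumptions: the helper name `prepare_record`, the derived field `previously_contacted`, and the sample values are hypothetical and do not come from the slides.

```python
# Hypothetical helper; column names follow the list above in lower case.
YES_NO_COLUMNS = ("default", "housing", "loan", "deposit")

def prepare_record(raw):
    """Return a cleaned copy of one client record (a dict)."""
    rec = dict(raw)
    # Duration is flagged as "for benchmark purposes only", so drop it.
    rec.pop("duration", None)
    # Map yes/no answers to 1/0.
    for col in YES_NO_COLUMNS:
        if col in rec:
            rec[col] = 1 if rec[col] == "yes" else 0
    # pdays == -1 is a sentinel meaning "not previously contacted".
    rec["previously_contacted"] = 0 if rec.get("pdays", -1) == -1 else 1
    return rec

# Made-up sample record (assumption).
sample = {"age": 41, "housing": "yes", "loan": "no", "default": "no",
          "duration": 180, "pdays": -1, "deposit": "yes"}
cleaned = prepare_record(sample)
print(cleaned)
```

Keeping the -1 sentinel separate from genuine day counts avoids treating "never contacted" as a very recent contact.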

EXPLORATORY DATA ANALYSIS (EDA)
CORRELATION USING HEATMAP
Marital status encoding: 0 = divorced, 1 = married, 2 = single
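A heatmap of this kind colours a matrix of pairwise correlation coefficients. The sketch below builds such a matrix with a hand-written Pearson correlation, using the marital status codes listed above. The rows are made-up values for illustration only, and the function name `pearson` is an assumption, not the deck's own code.

```python
from math import sqrt

# Integer codes for marital status as listed on the slide.
MARITAL_CODES = {"divorced": 0, "married": 1, "single": 2}

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Made-up rows (assumption): age, marital status, deposit (1 = yes).
rows = [(25, "single", 1), (40, "married", 0), (58, "divorced", 0),
        (33, "single", 1), (47, "married", 0), (61, "divorced", 1)]
columns = {
    "age": [r[0] for r in rows],
    "marital": [MARITAL_CODES[r[1]] for r in rows],
    "deposit": [r[2] for r in rows],
}
names = list(columns)
# The matrix a heatmap would colour: one coefficient per pair of columns.
corr = {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}
for a in names:
    print(a, [round(corr[a][b], 2) for b in names])
```

Integer codes impose an arbitrary order on a nominal category such as marital status, so correlations involving that column should be read with caution.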
Model Selection
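Why Min Max Scaler?

▪ The feature variables are on very different scales (for example, balance in euros versus age in years), and models such as KNN, SVM and logistic regression are sensitive to feature scale. We therefore scale the feature variables down to the range 0 to 1, the same range as the 0/1 output variable.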
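Min-max scaling maps each value x of a feature to (x - min) / (max - min), so the smallest value becomes 0 and the largest becomes 1. A minimal pure-Python sketch follows; the function name `min_max_scale` and the sample values are assumptions for illustration. scikit-learn's `MinMaxScaler` applies the same formula per column.

```python
def min_max_scale(values):
    """Rescale a list of numbers linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column has no spread; map every value to 0.0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

balances = [-200, 0, 1500, 3300]  # made-up yearly balances in euros
ages = [23, 35, 41, 60]           # made-up ages in years
scaled_balances = min_max_scale(balances)
scaled_ages = min_max_scale(ages)
print(scaled_balances)
print(scaled_ages)
```

In practice the minimum and maximum are usually taken from the training split only and then reused on the test split, so that no information from the test data leaks into training.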
TEST SIZE

80-20 *Recommended for banking sector
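An 80-20 split holds out 20% of the records for testing and trains on the remaining 80%. The sketch below shows one way to do this reproducibly in plain Python. The function name `train_test_split_80_20`, the seed, and the stand-in data are assumptions, not the deck's own code.

```python
import random

def train_test_split_80_20(rows, seed=42):
    """Shuffle rows reproducibly and return (train, test) in an 80/20 ratio."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * 0.8)
    return shuffled[:cut], shuffled[cut:]

rows = list(range(100))  # stand-in for 100 client records
train, test = train_test_split_80_20(rows)
print(len(train), len(test))
```

On an imbalanced target it can also help to split each class separately (a stratified split) so that both parts keep the same share of 'yes' and 'no' records.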

Accuracies compared:

• K-Nearest Neighbours: 75.3%
• Logistic Regression: 80.9%
• Decision Tree: 78.2%
• Random Forest Classifier: 78%
• Support Vector Machine: 53%
Confusion Matrices

[Figure: confusion matrices for KNN, Logistic Regression, Decision Tree, Random Forest and SVM]
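A confusion matrix counts true negatives, false positives, false negatives and true positives, and accuracy is the share of correct predictions among all predictions. The sketch below computes both for a binary target. The labels are made up for illustration and are not the deck's results; the function name `confusion_matrix` is defined locally.

```python
def confusion_matrix(y_true, y_pred):
    """Return (tn, fp, fn, tp) for binary labels where 1 = subscribed."""
    tn = fp = fn = tp = 0
    for actual, predicted in zip(y_true, y_pred):
        if actual == 1 and predicted == 1:
            tp += 1
        elif actual == 1 and predicted == 0:
            fn += 1
        elif actual == 0 and predicted == 1:
            fp += 1
        else:
            tn += 1
    return tn, fp, fn, tp

# Made-up labels (assumption): 1 = deposit 'yes', 0 = deposit 'no'.
y_true = [1, 0, 0, 1, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 0, 0, 1, 1, 0, 1, 0]
tn, fp, fn, tp = confusion_matrix(y_true, y_pred)
accuracy = (tp + tn) / len(y_true)
recall = tp / (tp + fn)  # share of actual subscribers that were found
print(tn, fp, fn, tp, accuracy, recall)
```

On imbalanced data, accuracy alone can look good even when few subscribers are found, which is why recall on the 'yes' class is worth checking next to it.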

GRAPHS
WE CHOOSE

LOGISTIC REGRESSION
Accuracy = 80.9%
CONCLUSION

▪ Most classification problems in the real world are imbalanced, and data sets almost always have missing values. In this post, we covered strategies to deal with both missing values and imbalanced data sets. We also explored different ways of building ensembles in sklearn. Below are some takeaway points:
▪ Sometimes we may be willing to give up some improvement to the model if achieving it would increase complexity by much more than the percentage improvement in the evaluation metrics.
▪ When building ensemble models, try to use good models that are as different as possible to reduce correlation between the base learners. We could’ve enhanced our stacked ensemble model by adding a dense neural network and other kinds of base learners, as well as by adding more layers to the stacked model.
▪ Easy Ensemble usually performs better than other resampling methods.
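As commonly described, Easy Ensemble draws several random subsets of the majority class, pairs each with all minority-class rows, trains one learner per balanced subset, and combines their predictions. The sketch below covers only the subset-building step. The function name `balanced_subsets` and the data are hypothetical, and no model is trained here.

```python
import random

def balanced_subsets(majority, minority, n_subsets=3, seed=0):
    """Build balanced training sets by undersampling the majority class.

    Assumes len(majority) >= len(minority).
    """
    rng = random.Random(seed)
    subsets = []
    for _ in range(n_subsets):
        # Draw as many majority rows as there are minority rows.
        sampled = rng.sample(majority, len(minority))
        subsets.append(sampled + list(minority))
    return subsets

# Made-up data (assumption): 90 'no' records and 10 'yes' records.
majority = [("no", i) for i in range(90)]
minority = [("yes", i) for i in range(10)]
subsets = balanced_subsets(majority, minority)
for subset in subsets:
    n_yes = sum(1 for label, _ in subset if label == "yes")
    print(len(subset), n_yes)
```

Each subset sees a different slice of the majority class, so the combined learners use more of the data than a single undersampled set would.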
