SlideShare a Scribd company logo
1 of 4
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7101
Improving Prediction of Potential Clients for Bank Term Deposits using
Machine Learning Approaches
Prabodh Wankhede1, Rohit Singh2, Rutesh Rathod3, Jayesh Patil4, T.D. Khadtare5
1,2,3,4BE Student, SITS, Pune
5Prof., Dept. of Computer Engineering, SITS, Pune, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract - Novel results for bank telemarketing have been
obtained to enhance bank term deposits by employing
machine learning approaches. Acquiring term deposits in
banking is always an essential business for them and it isgood
marketing campaign always plays an essential role in
financial selling. More often than not, it is always a challenge
to financial institutions to identify the group of customers for
this purpose. Since a very few customers respond positively to
direct telemarketing campaigns, there is a class imbalance in
the data. This inherent problem of class imbalanceinresponse
modeling brings some additional difficulties into response
prediction. As a result, the prediction models are generally
biased towards non-respondent customers and this adds a
further challenge. In this paper, the focus is on critical feature
engineering and investigating thepredictive machinelearning
approaches like Logistic Regression, Support Vector Machine,
Random Forest, Neural Networks, and XGBoost. The best
performing predictivemodel havebeen identified whichcanbe
deployed in bank telemarketing campaigns.
Key Words: Bank, term deposit, telemarketing,
imbalance, prediction, classification
1. INTRODUCTION
While banks do not need the deposits to offer loans, they do
need it to balance it by attracting client deposits. Banks can
borrow funds from the government treasuryatveryminimal
interest rate, but it’s still more expensive than borrowing
from the bank’s own depositors. How does the deposit have
an effect on stability in banking has become a crucial issuein
the growth of banking institutions. In banking business, the
question of attracting the customers for term deposits has
been around for some time and has arisen again after the
global economic crisis. In response to the crisis, a number of
countries considerably extended the coverage in their
protection nets on the way to repair marketplaceconfidence
and to ward off potential contagious runs on their banking
sectors. Through a marketing campaign about contacting
clients on telephone directly, the bank intends to select the
favourable set of clients that are highly inclined to deposit.It
is beneficial for narrowing the range of potential customers,
elevating the rate of success as well as reducing the cost of
marketing process. The remaining paper is organized as
follows. In Section 2, we review the related literature. We
describe the dataset in Section 3 and data pre-processing in
Section 4. Traditional machine learning approaches are
discussed in Section 5. This is followed by implementation
and their results. Finally, we conclude in Section 6.
2. LITERATURE REVIEW
We surveyed papers relevant to bank telemarketing
campaigns based on machine learning approaches and
identified four distinct approaches the researchers worked
on. The following is an exploration of literature review in
each category. The problem of predicting target customers
using bank telemarketing has been addressed using neural
networks [1]. But since the data used by the author [1] is
unbalanced (because 11.07% of customers subscribe to a
certificate of deposit) and the methodologies attributed to
neural networks, they had to deal with the data in either of
the three ways — ignoring the problem, undersampling the
majority class, and oversampling the minority class. The
disadvantage of undersampling is that wemayriskremoving
the majority class instances which is more representative,
thus discarding useful information. The disadvantage of
oversampling is that, since it simply adds replicated
observations in original dataset, it ends up adding multiple
observations of several types, thus leading to overfitting.
Logistic Regression tend to be easily understood by humans,
have the advantage of fitting models and providing good
predictions in classification task [1]. Logistic Regression is
suitable for a small amount of feature space, which implies
such model is generally ill-fitting with multiplevariables.For
logistic regression models, unbalanced training data affects
only the estimate of model intercept — it skews all the
predicted probabilities, which in turn compromises the
predictions. Consequently, the authors clearly implant the
suggestion forimprovement[2].Inanotherapproachfocused
on sensitivity analysis, the authors have shown with rminer
package [3] how a model is influenced by each of its input
attributes in percentageoftheremaining.Theperformanceof
support vector machine (SVM) on the same dataset [1] has
also been tested [1] showing better results compared to
logistic regression. While SVMsdonotperformwellonhighly
skewed/imbalanced data sets, authors have made no
comment on class imbalance in the paper. For unbalanced
datasets, the separating hyperplane found by the SVM canbe
biased in a manner to favor predictions of the majority class
on the test samples. Better results are portrayed by Random
Forest Classifier with an accuracy of 87% and it is the most
promising classifier with respect to predictive ability [4]. To
handle class imbalance, ‘Spread-subsample’ was used to
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7102
balance the class values. The class values were suitably
balanced as 5255 instances being ‘No’ and 5,255 instances
being ‘Yes’ totaling 10,510 instances. Although the training
accuracy of such model is high, accuracy on unseen data will
be worse. Another caveat is that Random forests are built on
decision trees, and decision trees are sensitive to class
imbalance because each tree tends to be biased in the same
direction and magnitude (onaverage) byclassimbalance[4].
Gradient Boosting classifier performed slightly worse than
other classifiers [5]. Though gradient based methods
generally give better results, gradient boosted trees are
harder to fit than random forests. Gradient boosting
algorithms generally have three parameters which can be
fine-tuned — shrinkage parameter, depth of the tree,andthe
number of trees. The proper training of these parameters is
required for good fit, failing which may result in overfitting
[5]. Bagging and boosting areensemble-basedmeta-learning
algorithms that pool decisions from multiple classifiers [5].
They have their caveats. First, the sample size of the training
data affects the performance of both bagging and boostingin
the ensemble design. Second, boosting was designed for an
inherently poor classification model given a large amount of
data while bagging tends to be more useful for classification
problems with a limited amount of available data. Third, the
degree of class imbalance may play a role in affecting the
model performance [5]. Rotation Forest is another method
for generating classifier ensembles based on feature
extraction [6]. Rotation Forest ensemble was executed on a
random selection of 33 benchmark datasets from the UCI
repository and compared it with Bagging, AdaBoost, and
Random Forest.TheresultswerefavorabletoRotationForest
to a great extent. The accuracy level that wasachievedforthe
dataset in the first run was in the order of approximately
85%. The literature review shows that there is scope for
further improvement of the models by improving further
feature engineering and hyperparameter tuning. Since
XGBoost is popular in handling class imbalance and existing
data is inherently imbalanced, we are encouraged to
investigate the results with XGBoost.
3. DATASET
The dataset was downloaded from UCI Machine Learning
Repository [1]. The data is related with direct marketing
campaigns of a Portuguese banking institution. The
marketing campaigns were based on phone calls. It consists
of 41,188 customers’ data. Total 21 featuresarerecorded for
each customer. These features include:
• Customer features - age, job, marital status, education,
housing, default and loan
• Phone call features - contact, month,dayoftheweek and
phone call duration
• Social and economicfactors -employmentvariationrate,
consumer price index, consumer confidence index, 3 month
Euribor rate and number of employees
• Other attributes - campaign, pdays, previous and
poutcome
4. DATA PREPROCESSING
The dataset is semi-colon separated (;). We converted it into
comma separated values (csv). The output variable named
"y" is string which has values either "yes" or "no". Therefore,
weconverted it into Booleanvalues - Truefor"yes"andFalse
for "no". 6 out of 21 features have missing values default,
education,housing,loan,job,andmarital.Missingvalueshave
been imputed by using multivariate imputation by chained
equation. This method is based on fully conditional
specification, where each incompletevariableisimputedbya
separatemodel. "Age" variable is binnedinto4bins—Teens,
Young Adults, Adults,andSeniorCitizens.Featurebinningisa
method of turning continuous variables into categorical
values. Three features are removed to increase model
interpretability. "euribor3m" is removed because it is highly
correlated with other variables leading to severe mutli-
collinearity. "duration" is removed despite being a strong
predictor to make the model realistic becauseaftertheendof
call, "y" is obviously known. "pdays" is removed because it
had constant values. Synthetic Minority Oversampling
TEchnique (SMOTE) is applied to modify imbalanced data
into balanced distribution. SMOTE generates a random
sample of minority class observations using bootstrapping
and k-nearest neighbors to shift the classifier learning
towards minority bias [7].
5.MODELING,IMPLEMENTATIONANDEVALUATION
In data preprocessing, we found that there is class
imbalance. Class imbalance is encountered in most of the
real world classifications. In some of the cases, class
imbalance is not just common, it is expected. To handleclass
imbalance, different models’ researchershaveworkedon[6]
[2] [3] [1] [5], have their own approaches. In the following
sub-sections, we explore the theoretical aspects and
implementation issues of the models.
A. Logistic Regression Analysis
During analysis of data, we found that customer depositrate
was highly dependent on social and economic factors. We
applied logistic regression model for predicting decision
based on social and economic factors andcustomerfeatures.
Fig.-1: ROC Curve for Logistic Regression
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7103
Optimal cut-off point was found for ROC curve (Fig. 2) to
include few false positives because banks can afford to
contact few customers who are unlikely to deposit rather
than losing potential customers who have proclivity to
deposit. The scores above optimal cut-off pointareclassified
as positive, and those below as negative. Thearea under ROC
curve is obtained with optimal cut-off point as 0.6162.
Fig -2: ROC Curve for Logistic Regression with Cut-off
point
B. Support Vector Machine
Analysis Support Vector Machine is implemented to check if
we could get better accuracy than logistic regression. Since
SVM is a computationally demanding algorithm, weused the
smaller and stratified version of original dataset [1]. The
dataset is preprocessed in the same way as done for logistic
regression.
Fig.3: ROC Curve for SVM
Optimal cut-off point was found using cost function. The
value of area under ROC curve (Fig. 4)cameouttobe0.7182.
Fig.4: ROC Curve for SVM with Cut-off point
C. Random Forest
Analysis Another approach that we tested is to execute
random forest on the dataset. Random forests do not have
the issue of overfitting as decision trees, and they are an
effective tool for prediction. The area under ROC curve (Fig.
5) is better than that of SVM.
Fig.5: ROC Curve for Random Forest
Optimal cut-off point is obtained by means of cost function.
This cut-off point is "optimal" in thesensethatitweighsboth
sensitivity and specificity equally. After cut-off point, the
area under ROC curve (Fig. 6) is 0.6989, which isslightlyless
than that of SVM.
Fig.6: ROC Curve for Random Forest with Cut-off point
D. Extreme Gradient Boosting (XGBoost) Analysis
Unlike gradient boosted machine, where tree pruning stops
once a negative loss is encountered [8], XGBoost grows the
tree up to max depth and then prune backward until the
improvement in loss function is below a threshold. The data
preprocessing for XGBoost is little bit different from other
models. Age is binned into 4 bins - Teens, Young Adults,
Adults, Senior Citizens. Missing values are not required to
impute, hence, not imputed. One hot encoding and label
encoding is performed on categorical variables and target
variables respectively. Unlike other algorithms, XGBoost
provides special data structure called DMatrix to create
training matrix and test matrix. A dictionary of default
parameters is created which needs to be passed for training.
Using default parameters and cross validation, the best
iteration is found in order to avoid overfitting. The area
under ROC curve (Fig. 7) obtained is 0.79.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7104
Fig.7: ROC Curve for XGBoost
The optimal cut-off value is found using cost function and
scores are classified above it as positive, those below as
negative. This optimized model is executed again on test
matrix. The new area under ROC curve (Fig. 8) is 0.7368.
Fig.8: ROC Curve for XGBoost with Cut-off point
E. Results and Discussion
Test accuracy is not a very good indicator of model
performance in case of class imbalance. We are more
concerned about the customers who say “yes” rather than
those who say “no”. We decided to choose ROC curve as
performance metric. Like precision and recall, accuracy is
divided into sensitivity and specificity and models are
chosen based on the balance thresholdsofthesevalues.After
comparing results of algorithms with three aforementioned
performance metrics, we observed that XGBoost is the top
performing algorithm. XGBoost gives better performance
than all other algorithms as is shown by Area under curve
(AUC).
Logistic
Regression
Random
Forest
Support Vector
Machine
XGBoost
AUC 0.6162 0.6989 0.7182 0.7368
F1 Score 0.9295 0.9306 0.9206 0.9291
Test Accuracy 0.8726 0.8771 0.8610 0.8351
Table -1: Summary of Results
We have also predicted realistic results with theexclusionof
“duration” variable whereas other researchers have clearly
ignored it despite of it being a verygoodindicator.According
to Hosmer and Lemeshow goodness of fit (GOF) test, logistic
regression model doesn’t fit the data well. The results of
Hosmer and Lemeshow goodness of fit (GOF) test are: X-
squared = 219.7, df = 8, p-value < 2.2e-16 The p-valueisvery
low (<0.05), so logistic regression model should be rejected.
6. CONCLUSION
The problem of identifying best performing predictive
model that predicts potential clients for term deposits have
been tackled. With the help of critical literature review and
data preprocessing, we characterized the dataset and
accordingly implemented the different predictive models
and compared their results with AUC of ROC asperformance
metric. The challenge was to tackle class imbalance and
severe multicollinearity in the dataset. We found that
XGBoost (AUC = 0.7368) performs better than rest of the
algorithms. Though AUC of ROC curve is not very high, but
the predictions are reliable owing to the novelty of data
preprocessing. The value of AUC can be improved with a
larger dataset having more instances of minority class.
REFERENCES
[1] Sergio Moro, Paulo Cortez, Paulo Rita, “A data-driven
approach to predict the success of bank telemarketing”,
Elsevier, June 2014.
[2] Yiyan Jiang, “Using Logistic Regression Model to Predict
the Success of Bank Telemarketing”, International
Journal on Data Science and Technology, June 21, 2018.
[3] Sergio Moro Paulo Cortez Raul M. S. Laureano, 2013. "A
data mining approach for bank telemarketing using the
rminer package and r tool", Working PapersSeries213-
06, ISCTE-IUL, Business Research Unit (BRU-IUL).
[4] Justice Asare-Frempong, Manoj Jaybalan, “Predicting
Customer Response to Bank Direct Telemarketing
Campaign”, 2017 International Conference on
Engineering Technology and Technopreneurship
(ICE2T), 18-20 Sept. 2017
[5] Y. Pan, Z. Tang, “Ensemble methods in bank direct
marketing”, 2014 11th International Conference on
Service Systems and Service Management(ICSSSM),25-
27 June 2014.
[6] J.J. Rodrguez, L.I. Kuncheva, C.J. Alonso, “Rotation
Forest: A New Classifier Ensemble Method”, IEEE
Transactions on Pattern Analysis and Machine
Intelligence, Volume 28, Issue 10, October 2006.
[7] Nitesh V. Chawla, Kevin W. Bowyer, Lawrence O. Hall,
W. Philip Kegelmeyer, “SMOTE: Synthetic Minority
Oversampling Technique”, Journal of Artificial
Intelligence Research, Volume 16, June 2002.
[8] J. Friedman. “Greedy function approximation:a gradient
boosting machine. Annals of Statistics”, 29(5):1189–
1232, 2001.R. Nicole, “Title of paperwithonly firstword
capitalized,” J. Name Stand. Abbrev., in press.

More Related Content

What's hot

Campaign response modeling
Campaign response modelingCampaign response modeling
Campaign response modelingEsteban Ribero
 
Applying Convolutional-GRU for Term Deposit Likelihood Prediction
Applying Convolutional-GRU for Term Deposit Likelihood PredictionApplying Convolutional-GRU for Term Deposit Likelihood Prediction
Applying Convolutional-GRU for Term Deposit Likelihood PredictionVandanaSharma356
 
IRJET - Ensembling Reinforcement Learning for Portfolio Management
IRJET -  	  Ensembling Reinforcement Learning for Portfolio ManagementIRJET -  	  Ensembling Reinforcement Learning for Portfolio Management
IRJET - Ensembling Reinforcement Learning for Portfolio ManagementIRJET Journal
 
IRJET- Analyzing Voting Results using Influence Matrix
IRJET- Analyzing Voting Results using Influence MatrixIRJET- Analyzing Voting Results using Influence Matrix
IRJET- Analyzing Voting Results using Influence MatrixIRJET Journal
 
Selection of Equipment by Using Saw and Vikor Methods
Selection of Equipment by Using Saw and Vikor Methods Selection of Equipment by Using Saw and Vikor Methods
Selection of Equipment by Using Saw and Vikor Methods IJERA Editor
 
Prediction of potential customers for term deposit
Prediction of potential customers for term depositPrediction of potential customers for term deposit
Prediction of potential customers for term depositPranov Mishra
 
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...Sagar Deogirkar
 
Customer Churn Analytics using Microsoft R Open
Customer Churn Analytics using Microsoft R OpenCustomer Churn Analytics using Microsoft R Open
Customer Churn Analytics using Microsoft R OpenPoo Kuan Hoong
 
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative FilteringIRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative FilteringIRJET Journal
 
Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...
Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...
Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...IRJET Journal
 
Data mining and analysis of customer churn dataset
Data mining and analysis of customer churn datasetData mining and analysis of customer churn dataset
Data mining and analysis of customer churn datasetRohan Choksi
 
IRJET - Customer Churn Analysis in Telecom Industry
IRJET - Customer Churn Analysis in Telecom IndustryIRJET - Customer Churn Analysis in Telecom Industry
IRJET - Customer Churn Analysis in Telecom IndustryIRJET Journal
 
Churn Modeling For Mobile Telecommunications
Churn Modeling For Mobile TelecommunicationsChurn Modeling For Mobile Telecommunications
Churn Modeling For Mobile TelecommunicationsSalford Systems
 
IRJET- Fatigue Analysis of Offshore Steel Structures
IRJET- Fatigue Analysis of Offshore Steel StructuresIRJET- Fatigue Analysis of Offshore Steel Structures
IRJET- Fatigue Analysis of Offshore Steel StructuresIRJET Journal
 
Predicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationPredicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationVishva Abeyrathne
 
A Two Stage Classification Model for Call Center Purchase Prediction
A Two Stage Classification Model for Call Center Purchase PredictionA Two Stage Classification Model for Call Center Purchase Prediction
A Two Stage Classification Model for Call Center Purchase PredictionTELKOMNIKA JOURNAL
 
churn prediction in telecom
churn prediction in telecom churn prediction in telecom
churn prediction in telecom Hong Bui Van
 
Risk assessment of information production using extended risk matrix approach
Risk assessment of information production using extended risk matrix approachRisk assessment of information production using extended risk matrix approach
Risk assessment of information production using extended risk matrix approachTELKOMNIKA JOURNAL
 
Telecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analyticsTelecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analyticssheetal sharma
 
Automated Feature Selection and Churn Prediction using Deep Learning Models
Automated Feature Selection and Churn Prediction using Deep Learning ModelsAutomated Feature Selection and Churn Prediction using Deep Learning Models
Automated Feature Selection and Churn Prediction using Deep Learning ModelsIRJET Journal
 

What's hot (20)

Campaign response modeling
Campaign response modelingCampaign response modeling
Campaign response modeling
 
Applying Convolutional-GRU for Term Deposit Likelihood Prediction
Applying Convolutional-GRU for Term Deposit Likelihood PredictionApplying Convolutional-GRU for Term Deposit Likelihood Prediction
Applying Convolutional-GRU for Term Deposit Likelihood Prediction
 
IRJET - Ensembling Reinforcement Learning for Portfolio Management
IRJET -  	  Ensembling Reinforcement Learning for Portfolio ManagementIRJET -  	  Ensembling Reinforcement Learning for Portfolio Management
IRJET - Ensembling Reinforcement Learning for Portfolio Management
 
IRJET- Analyzing Voting Results using Influence Matrix
IRJET- Analyzing Voting Results using Influence MatrixIRJET- Analyzing Voting Results using Influence Matrix
IRJET- Analyzing Voting Results using Influence Matrix
 
Selection of Equipment by Using Saw and Vikor Methods
Selection of Equipment by Using Saw and Vikor Methods Selection of Equipment by Using Saw and Vikor Methods
Selection of Equipment by Using Saw and Vikor Methods
 
Prediction of potential customers for term deposit
Prediction of potential customers for term depositPrediction of potential customers for term deposit
Prediction of potential customers for term deposit
 
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...
Comparative Study of Machine Learning Algorithms for Sentiment Analysis with ...
 
Customer Churn Analytics using Microsoft R Open
Customer Churn Analytics using Microsoft R OpenCustomer Churn Analytics using Microsoft R Open
Customer Churn Analytics using Microsoft R Open
 
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative FilteringIRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
 
Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...
Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...
Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...
 
Data mining and analysis of customer churn dataset
Data mining and analysis of customer churn datasetData mining and analysis of customer churn dataset
Data mining and analysis of customer churn dataset
 
IRJET - Customer Churn Analysis in Telecom Industry
IRJET - Customer Churn Analysis in Telecom IndustryIRJET - Customer Churn Analysis in Telecom Industry
IRJET - Customer Churn Analysis in Telecom Industry
 
Churn Modeling For Mobile Telecommunications
Churn Modeling For Mobile TelecommunicationsChurn Modeling For Mobile Telecommunications
Churn Modeling For Mobile Telecommunications
 
IRJET- Fatigue Analysis of Offshore Steel Structures
IRJET- Fatigue Analysis of Offshore Steel StructuresIRJET- Fatigue Analysis of Offshore Steel Structures
IRJET- Fatigue Analysis of Offshore Steel Structures
 
Predicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationPredicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using Classification
 
A Two Stage Classification Model for Call Center Purchase Prediction
A Two Stage Classification Model for Call Center Purchase PredictionA Two Stage Classification Model for Call Center Purchase Prediction
A Two Stage Classification Model for Call Center Purchase Prediction
 
churn prediction in telecom
churn prediction in telecom churn prediction in telecom
churn prediction in telecom
 
Risk assessment of information production using extended risk matrix approach
Risk assessment of information production using extended risk matrix approachRisk assessment of information production using extended risk matrix approach
Risk assessment of information production using extended risk matrix approach
 
Telecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analyticsTelecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analytics
 
Automated Feature Selection and Churn Prediction using Deep Learning Models
Automated Feature Selection and Churn Prediction using Deep Learning ModelsAutomated Feature Selection and Churn Prediction using Deep Learning Models
Automated Feature Selection and Churn Prediction using Deep Learning Models
 

Similar to IRJET- Improving Prediction of Potential Clients for Bank Term Deposits using Machine Learning Approaches

MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONSMUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONSIRJET Journal
 
Bank Customer Segmentation & Insurance Claim Prediction
Bank Customer Segmentation & Insurance Claim PredictionBank Customer Segmentation & Insurance Claim Prediction
Bank Customer Segmentation & Insurance Claim PredictionIRJET Journal
 
Data Mining on Customer Churn Classification
Data Mining on Customer Churn ClassificationData Mining on Customer Churn Classification
Data Mining on Customer Churn ClassificationKaushik Rajan
 
Customer churn classification using machine learning techniques
Customer churn classification using machine learning techniquesCustomer churn classification using machine learning techniques
Customer churn classification using machine learning techniquesSindhujanDhayalan
 
EVALUTION OF CHURN PREDICTING PROCESS USING CUSTOMER BEHAVIOUR PATTERN
EVALUTION OF CHURN PREDICTING PROCESS USING CUSTOMER BEHAVIOUR PATTERNEVALUTION OF CHURN PREDICTING PROCESS USING CUSTOMER BEHAVIOUR PATTERN
EVALUTION OF CHURN PREDICTING PROCESS USING CUSTOMER BEHAVIOUR PATTERNIRJET Journal
 
Loan Analysis Predicting Defaulters
Loan Analysis Predicting DefaultersLoan Analysis Predicting Defaulters
Loan Analysis Predicting DefaultersIRJET Journal
 
Post Graduate Admission Prediction System
Post Graduate Admission Prediction SystemPost Graduate Admission Prediction System
Post Graduate Admission Prediction SystemIRJET Journal
 
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRY
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRYPROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRY
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRYIJITCA Journal
 
IRJET- Financial Analysis using Data Mining
IRJET- Financial Analysis using Data MiningIRJET- Financial Analysis using Data Mining
IRJET- Financial Analysis using Data MiningIRJET Journal
 
AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...
AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...
AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...ijsc
 
CUSTOMER SEGMENTATION IN SHOPPING MALL USING CLUSTERING IN MACHINE LEARNING
CUSTOMER SEGMENTATION IN SHOPPING MALL USING CLUSTERING IN MACHINE LEARNINGCUSTOMER SEGMENTATION IN SHOPPING MALL USING CLUSTERING IN MACHINE LEARNING
CUSTOMER SEGMENTATION IN SHOPPING MALL USING CLUSTERING IN MACHINE LEARNINGIRJET Journal
 
A Compendium of Various Applications of Machine Learning
A Compendium of Various Applications of Machine LearningA Compendium of Various Applications of Machine Learning
A Compendium of Various Applications of Machine LearningIRJET Journal
 
Machine Learning Approaches to Predict Customer Churn in Telecommunications I...
Machine Learning Approaches to Predict Customer Churn in Telecommunications I...Machine Learning Approaches to Predict Customer Churn in Telecommunications I...
Machine Learning Approaches to Predict Customer Churn in Telecommunications I...IRJET Journal
 
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A SurveyIRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A SurveyIRJET Journal
 
Unfolding the Credit Card Fraud Detection Technique by Implementing SVM Algor...
Unfolding the Credit Card Fraud Detection Technique by Implementing SVM Algor...Unfolding the Credit Card Fraud Detection Technique by Implementing SVM Algor...
Unfolding the Credit Card Fraud Detection Technique by Implementing SVM Algor...IRJET Journal
 
Predicting reaction based on customer's transaction using machine learning a...
Predicting reaction based on customer's transaction using  machine learning a...Predicting reaction based on customer's transaction using  machine learning a...
Predicting reaction based on customer's transaction using machine learning a...IJECEIAES
 
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug TriageSurvey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug TriageIRJET Journal
 

Similar to IRJET- Improving Prediction of Potential Clients for Bank Term Deposits using Machine Learning Approaches (20)

MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONSMUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
 
Bank Customer Segmentation & Insurance Claim Prediction
Bank Customer Segmentation & Insurance Claim PredictionBank Customer Segmentation & Insurance Claim Prediction
Bank Customer Segmentation & Insurance Claim Prediction
 
Data Mining on Customer Churn Classification
Data Mining on Customer Churn ClassificationData Mining on Customer Churn Classification
Data Mining on Customer Churn Classification
 
Customer churn classification using machine learning techniques
Customer churn classification using machine learning techniquesCustomer churn classification using machine learning techniques
Customer churn classification using machine learning techniques
 
EVALUTION OF CHURN PREDICTING PROCESS USING CUSTOMER BEHAVIOUR PATTERN
EVALUTION OF CHURN PREDICTING PROCESS USING CUSTOMER BEHAVIOUR PATTERNEVALUTION OF CHURN PREDICTING PROCESS USING CUSTOMER BEHAVIOUR PATTERN
EVALUTION OF CHURN PREDICTING PROCESS USING CUSTOMER BEHAVIOUR PATTERN
 
Loan Analysis Predicting Defaulters
Loan Analysis Predicting DefaultersLoan Analysis Predicting Defaulters
Loan Analysis Predicting Defaulters
 
Post Graduate Admission Prediction System
Post Graduate Admission Prediction SystemPost Graduate Admission Prediction System
Post Graduate Admission Prediction System
 
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRY
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRYPROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRY
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRY
 
IRJET- Financial Analysis using Data Mining
IRJET- Financial Analysis using Data MiningIRJET- Financial Analysis using Data Mining
IRJET- Financial Analysis using Data Mining
 
AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...
AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...
AI TESTING: ENSURING A GOOD DATA SPLIT BETWEEN DATA SETS (TRAINING AND TEST) ...
 
CUSTOMER SEGMENTATION IN SHOPPING MALL USING CLUSTERING IN MACHINE LEARNING
CUSTOMER SEGMENTATION IN SHOPPING MALL USING CLUSTERING IN MACHINE LEARNINGCUSTOMER SEGMENTATION IN SHOPPING MALL USING CLUSTERING IN MACHINE LEARNING
CUSTOMER SEGMENTATION IN SHOPPING MALL USING CLUSTERING IN MACHINE LEARNING
 
B05840510
B05840510B05840510
B05840510
 
B05840510
B05840510B05840510
B05840510
 
A Compendium of Various Applications of Machine Learning
A Compendium of Various Applications of Machine LearningA Compendium of Various Applications of Machine Learning
A Compendium of Various Applications of Machine Learning
 
Machine Learning Approaches to Predict Customer Churn in Telecommunications I...
Machine Learning Approaches to Predict Customer Churn in Telecommunications I...Machine Learning Approaches to Predict Customer Churn in Telecommunications I...
Machine Learning Approaches to Predict Customer Churn in Telecommunications I...
 
Manuscript dss
Manuscript dssManuscript dss
Manuscript dss
 
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A SurveyIRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
 
Unfolding the Credit Card Fraud Detection Technique by Implementing SVM Algor...
Unfolding the Credit Card Fraud Detection Technique by Implementing SVM Algor...Unfolding the Credit Card Fraud Detection Technique by Implementing SVM Algor...
Unfolding the Credit Card Fraud Detection Technique by Implementing SVM Algor...
 
Predicting reaction based on customer's transaction using machine learning a...
Predicting reaction based on customer's transaction using  machine learning a...Predicting reaction based on customer's transaction using  machine learning a...
Predicting reaction based on customer's transaction using machine learning a...
 
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug TriageSurvey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 

Recently uploaded (20)

main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 

IRJET- Improving Prediction of Potential Clients for Bank Term Deposits using Machine Learning Approaches

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7101 Improving Prediction of Potential Clients for Bank Term Deposits using Machine Learning Approaches Prabodh Wankhede1, Rohit Singh2, Rutesh Rathod3, Jayesh Patil4, T.D. Khadtare5 1,2,3,4BE Student, SITS, Pune 5Prof., Dept. of Computer Engineering, SITS, Pune, India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract - Novel results for bank telemarketing have been obtained to enhance bank term deposits by employing machine learning approaches. Acquiring term deposits in banking is always an essential business for them and it isgood marketing campaign always plays an essential role in financial selling. More often than not, it is always a challenge to financial institutions to identify the group of customers for this purpose. Since a very few customers respond positively to direct telemarketing campaigns, there is a class imbalance in the data. This inherent problem of class imbalanceinresponse modeling brings some additional difficulties into response prediction. As a result, the prediction models are generally biased towards non-respondent customers and this adds a further challenge. In this paper, the focus is on critical feature engineering and investigating thepredictive machinelearning approaches like Logistic Regression, Support Vector Machine, Random Forest, Neural Networks, and XGBoost. The best performing predictivemodel havebeen identified whichcanbe deployed in bank telemarketing campaigns. Key Words: Bank, term deposit, telemarketing, imbalance, prediction, classification 1. INTRODUCTION While banks do not need the deposits to offer loans, they do need it to balance it by attracting client deposits. Banks can borrow funds from the government treasuryatveryminimal interest rate, but it’s still more expensive than borrowing from the bank’s own depositors. How does the deposit have an effect on stability in banking has become a crucial issuein the growth of banking institutions. In banking business, the question of attracting the customers for term deposits has been around for some time and has arisen again after the global economic crisis. In response to the crisis, a number of countries considerably extended the coverage in their protection nets on the way to repair marketplaceconfidence and to ward off potential contagious runs on their banking sectors. Through a marketing campaign about contacting clients on telephone directly, the bank intends to select the favourable set of clients that are highly inclined to deposit.It is beneficial for narrowing the range of potential customers, elevating the rate of success as well as reducing the cost of marketing process. The remaining paper is organized as follows. In Section 2, we review the related literature. We describe the dataset in Section 3 and data pre-processing in Section 4. Traditional machine learning approaches are discussed in Section 5. This is followed by implementation and their results. Finally, we conclude in Section 6. 2. LITERATURE REVIEW We surveyed papers relevant to bank telemarketing campaigns based on machine learning approaches and identified four distinct approaches the researchers worked on. The following is an exploration of literature review in each category. The problem of predicting target customers using bank telemarketing has been addressed using neural networks [1]. But since the data used by the author [1] is unbalanced (because 11.07% of customers subscribe to a certificate of deposit) and the methodologies attributed to neural networks, they had to deal with the data in either of the three ways — ignoring the problem, undersampling the majority class, and oversampling the minority class. The disadvantage of undersampling is that wemayriskremoving the majority class instances which is more representative, thus discarding useful information. The disadvantage of oversampling is that, since it simply adds replicated observations in original dataset, it ends up adding multiple observations of several types, thus leading to overfitting. Logistic Regression tend to be easily understood by humans, have the advantage of fitting models and providing good predictions in classification task [1]. Logistic Regression is suitable for a small amount of feature space, which implies such model is generally ill-fitting with multiplevariables.For logistic regression models, unbalanced training data affects only the estimate of model intercept — it skews all the predicted probabilities, which in turn compromises the predictions. Consequently, the authors clearly implant the suggestion forimprovement[2].Inanotherapproachfocused on sensitivity analysis, the authors have shown with rminer package [3] how a model is influenced by each of its input attributes in percentageoftheremaining.Theperformanceof support vector machine (SVM) on the same dataset [1] has also been tested [1] showing better results compared to logistic regression. While SVMsdonotperformwellonhighly skewed/imbalanced data sets, authors have made no comment on class imbalance in the paper. For unbalanced datasets, the separating hyperplane found by the SVM canbe biased in a manner to favor predictions of the majority class on the test samples. Better results are portrayed by Random Forest Classifier with an accuracy of 87% and it is the most promising classifier with respect to predictive ability [4]. To handle class imbalance, ‘Spread-subsample’ was used to
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7102 balance the class values. The class values were suitably balanced as 5255 instances being ‘No’ and 5,255 instances being ‘Yes’ totaling 10,510 instances. Although the training accuracy of such model is high, accuracy on unseen data will be worse. Another caveat is that Random forests are built on decision trees, and decision trees are sensitive to class imbalance because each tree tends to be biased in the same direction and magnitude (onaverage) byclassimbalance[4]. Gradient Boosting classifier performed slightly worse than other classifiers [5]. Though gradient based methods generally give better results, gradient boosted trees are harder to fit than random forests. Gradient boosting algorithms generally have three parameters which can be fine-tuned — shrinkage parameter, depth of the tree,andthe number of trees. The proper training of these parameters is required for good fit, failing which may result in overfitting [5]. Bagging and boosting areensemble-basedmeta-learning algorithms that pool decisions from multiple classifiers [5]. They have their caveats. First, the sample size of the training data affects the performance of both bagging and boostingin the ensemble design. Second, boosting was designed for an inherently poor classification model given a large amount of data while bagging tends to be more useful for classification problems with a limited amount of available data. Third, the degree of class imbalance may play a role in affecting the model performance [5]. Rotation Forest is another method for generating classifier ensembles based on feature extraction [6]. Rotation Forest ensemble was executed on a random selection of 33 benchmark datasets from the UCI repository and compared it with Bagging, AdaBoost, and Random Forest.TheresultswerefavorabletoRotationForest to a great extent. The accuracy level that wasachievedforthe dataset in the first run was in the order of approximately 85%. The literature review shows that there is scope for further improvement of the models by improving further feature engineering and hyperparameter tuning. Since XGBoost is popular in handling class imbalance and existing data is inherently imbalanced, we are encouraged to investigate the results with XGBoost. 3. DATASET The dataset was downloaded from UCI Machine Learning Repository [1]. The data is related with direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. It consists of 41,188 customers’ data. Total 21 featuresarerecorded for each customer. These features include: • Customer features - age, job, marital status, education, housing, default and loan • Phone call features - contact, month,dayoftheweek and phone call duration • Social and economicfactors -employmentvariationrate, consumer price index, consumer confidence index, 3 month Euribor rate and number of employees • Other attributes - campaign, pdays, previous and poutcome 4. DATA PREPROCESSING The dataset is semi-colon separated (;). We converted it into comma separated values (csv). The output variable named "y" is string which has values either "yes" or "no". Therefore, weconverted it into Booleanvalues - Truefor"yes"andFalse for "no". 6 out of 21 features have missing values default, education,housing,loan,job,andmarital.Missingvalueshave been imputed by using multivariate imputation by chained equation. This method is based on fully conditional specification, where each incompletevariableisimputedbya separatemodel. "Age" variable is binnedinto4bins—Teens, Young Adults, Adults,andSeniorCitizens.Featurebinningisa method of turning continuous variables into categorical values. Three features are removed to increase model interpretability. "euribor3m" is removed because it is highly correlated with other variables leading to severe mutli- collinearity. "duration" is removed despite being a strong predictor to make the model realistic becauseaftertheendof call, "y" is obviously known. "pdays" is removed because it had constant values. Synthetic Minority Oversampling TEchnique (SMOTE) is applied to modify imbalanced data into balanced distribution. SMOTE generates a random sample of minority class observations using bootstrapping and k-nearest neighbors to shift the classifier learning towards minority bias [7]. 5.MODELING,IMPLEMENTATIONANDEVALUATION In data preprocessing, we found that there is class imbalance. Class imbalance is encountered in most of the real world classifications. In some of the cases, class imbalance is not just common, it is expected. To handleclass imbalance, different models’ researchershaveworkedon[6] [2] [3] [1] [5], have their own approaches. In the following sub-sections, we explore the theoretical aspects and implementation issues of the models. A. Logistic Regression Analysis During analysis of data, we found that customer depositrate was highly dependent on social and economic factors. We applied logistic regression model for predicting decision based on social and economic factors andcustomerfeatures. Fig.-1: ROC Curve for Logistic Regression
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7103 Optimal cut-off point was found for ROC curve (Fig. 2) to include few false positives because banks can afford to contact few customers who are unlikely to deposit rather than losing potential customers who have proclivity to deposit. The scores above optimal cut-off pointareclassified as positive, and those below as negative. Thearea under ROC curve is obtained with optimal cut-off point as 0.6162. Fig -2: ROC Curve for Logistic Regression with Cut-off point B. Support Vector Machine Analysis Support Vector Machine is implemented to check if we could get better accuracy than logistic regression. Since SVM is a computationally demanding algorithm, weused the smaller and stratified version of original dataset [1]. The dataset is preprocessed in the same way as done for logistic regression. Fig.3: ROC Curve for SVM Optimal cut-off point was found using cost function. The value of area under ROC curve (Fig. 4)cameouttobe0.7182. Fig.4: ROC Curve for SVM with Cut-off point C. Random Forest Analysis Another approach that we tested is to execute random forest on the dataset. Random forests do not have the issue of overfitting as decision trees, and they are an effective tool for prediction. The area under ROC curve (Fig. 5) is better than that of SVM. Fig.5: ROC Curve for Random Forest Optimal cut-off point is obtained by means of cost function. This cut-off point is "optimal" in thesensethatitweighsboth sensitivity and specificity equally. After cut-off point, the area under ROC curve (Fig. 6) is 0.6989, which isslightlyless than that of SVM. Fig.6: ROC Curve for Random Forest with Cut-off point D. Extreme Gradient Boosting (XGBoost) Analysis Unlike gradient boosted machine, where tree pruning stops once a negative loss is encountered [8], XGBoost grows the tree up to max depth and then prune backward until the improvement in loss function is below a threshold. The data preprocessing for XGBoost is little bit different from other models. Age is binned into 4 bins - Teens, Young Adults, Adults, Senior Citizens. Missing values are not required to impute, hence, not imputed. One hot encoding and label encoding is performed on categorical variables and target variables respectively. Unlike other algorithms, XGBoost provides special data structure called DMatrix to create training matrix and test matrix. A dictionary of default parameters is created which needs to be passed for training. Using default parameters and cross validation, the best iteration is found in order to avoid overfitting. The area under ROC curve (Fig. 7) obtained is 0.79.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 7104 Fig.7: ROC Curve for XGBoost The optimal cut-off value is found using cost function and scores are classified above it as positive, those below as negative. This optimized model is executed again on test matrix. The new area under ROC curve (Fig. 8) is 0.7368. Fig.8: ROC Curve for XGBoost with Cut-off point E. Results and Discussion Test accuracy is not a very good indicator of model performance in case of class imbalance. We are more concerned about the customers who say “yes” rather than those who say “no”. We decided to choose ROC curve as performance metric. Like precision and recall, accuracy is divided into sensitivity and specificity and models are chosen based on the balance thresholdsofthesevalues.After comparing results of algorithms with three aforementioned performance metrics, we observed that XGBoost is the top performing algorithm. XGBoost gives better performance than all other algorithms as is shown by Area under curve (AUC). Logistic Regression Random Forest Support Vector Machine XGBoost AUC 0.6162 0.6989 0.7182 0.7368 F1 Score 0.9295 0.9306 0.9206 0.9291 Test Accuracy 0.8726 0.8771 0.8610 0.8351 Table -1: Summary of Results We have also predicted realistic results with theexclusionof “duration” variable whereas other researchers have clearly ignored it despite of it being a verygoodindicator.According to Hosmer and Lemeshow goodness of fit (GOF) test, logistic regression model doesn’t fit the data well. The results of Hosmer and Lemeshow goodness of fit (GOF) test are: X- squared = 219.7, df = 8, p-value < 2.2e-16 The p-valueisvery low (<0.05), so logistic regression model should be rejected. 6. CONCLUSION The problem of identifying best performing predictive model that predicts potential clients for term deposits have been tackled. With the help of critical literature review and data preprocessing, we characterized the dataset and accordingly implemented the different predictive models and compared their results with AUC of ROC asperformance metric. The challenge was to tackle class imbalance and severe multicollinearity in the dataset. We found that XGBoost (AUC = 0.7368) performs better than rest of the algorithms. Though AUC of ROC curve is not very high, but the predictions are reliable owing to the novelty of data preprocessing. The value of AUC can be improved with a larger dataset having more instances of minority class. REFERENCES [1] Sergio Moro, Paulo Cortez, Paulo Rita, “A data-driven approach to predict the success of bank telemarketing”, Elsevier, June 2014. [2] Yiyan Jiang, “Using Logistic Regression Model to Predict the Success of Bank Telemarketing”, International Journal on Data Science and Technology, June 21, 2018. [3] Sergio Moro Paulo Cortez Raul M. S. Laureano, 2013. "A data mining approach for bank telemarketing using the rminer package and r tool", Working PapersSeries213- 06, ISCTE-IUL, Business Research Unit (BRU-IUL). [4] Justice Asare-Frempong, Manoj Jaybalan, “Predicting Customer Response to Bank Direct Telemarketing Campaign”, 2017 International Conference on Engineering Technology and Technopreneurship (ICE2T), 18-20 Sept. 2017 [5] Y. Pan, Z. Tang, “Ensemble methods in bank direct marketing”, 2014 11th International Conference on Service Systems and Service Management(ICSSSM),25- 27 June 2014. [6] J.J. Rodrguez, L.I. Kuncheva, C.J. Alonso, “Rotation Forest: A New Classifier Ensemble Method”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 28, Issue 10, October 2006. [7] Nitesh V. Chawla, Kevin W. Bowyer, Lawrence O. Hall, W. Philip Kegelmeyer, “SMOTE: Synthetic Minority Oversampling Technique”, Journal of Artificial Intelligence Research, Volume 16, June 2002. [8] J. Friedman. “Greedy function approximation:a gradient boosting machine. Annals of Statistics”, 29(5):1189– 1232, 2001.R. Nicole, “Title of paperwithonly firstword capitalized,” J. Name Stand. Abbrev., in press.