SlideShare a Scribd company logo
1 of 28
Data Analytics for Financial Services
Bank Of America
Bhavya Kothari
Introduction
Background
 Reviewed consumer complaints data from the Consumer Financial Protection
Bureau (CFPB), an independent agency of the U.S. government, responsible
for consumer financial protection.
 Analyzed complaints for Bank of America between 2012 to 2015 to better
understand if the data had any actionable insights that an organization could
use.
About Bank of America
 Bank of America is an American multinational banking and financial services
corporation headquartered in Charlotte.
 It is the second largest bank holding company in U. S. by assets.
 Products- consumer banking, corporate banking, finance and insurance,
investment banking, mortgage loans, private banking, private equity, wealth
management, credit cards.
Analysis Techniques
Business Objective 1: Predict and forecast Responses to
Consumer Complaints to better manage resources (monetary
and people)
 Build a Decision Tree to predict the Company's response to
consumer complaints
 Build a Forecasting model to forecast Complaints/timing of
Expenses
Business Objective 2: Manage Brand image from Online Reviews
 Text Analytics on the complaint descriptions from Twitter
Exploratory Data Analysis
About the Dataset
 The data contains information regarding complaints
received for different financial organizations.
 The dataset contains the fields - Complaint ID,
Product, Sub-product, Issue, Sub-Issue, State, ZIP
Code, Submitted via, Date received, Date sent to
company, Company response, Timely response,
Consumer disputed.
 We focus on complaints with Date Received between
1/1/2012 and 12/31/2015. There is a total of
516696 complaints in this time period.
Complaints by Product
 Highest number of complaints are for Mortgage product (176,975), followed by Debt Collection
(92,446), Credit Reporting (82,598) and Credit card (62,326).
Top 10 Companies by Number of
Complaints
 Bank of America receives the highest number of complaints (53,876) followed by Wells Fargo
and JPMorgan Chase.
Company Responses to Credit Card
Complaints
 Closed with Explanation
(35,412) and Closed with
Monetary Relief (12,823) are
the top two company responses
for Credit Card Complaints.
Exploratory data analysis: Bank of America
 Bank of America is the fourth most complained company .
 Total number of complaints for bank of America is 6846
Top 10 Most Complained about
Credit Card Companies
Bank of America: Complaints by Product
 Majority of the complaints are for Mortgage (35,284) followed by Bank Account or service
(9,809)
Bank of America: Issues by Product
 The top issues for Bank of America Bank account or service are Account opening, closing, or
management (4,120) and Deposits and Withdrawals (2,684).
Bank of America: Issues by Product
 The top issues for Bank of America credit cards are Billing disputes (1,106) and Other(671)
followed by closing/cancelling account (633).
Bank of America: Complaints across
States
Highest number of complaints for Bank of America were received from
California(10,271), followed by Florida (6,128)followed by New
York.
Bank of America: Complaints across
States for Credit Card Product
Highest number of complaints for Bank of America Credit Card
product were received from California (966), Florida (685) and New
York(610)
Analysis Techniques &
Results
Analysis Techniques
Business Objective 1: Predict and forecast Responses to
Consumer Complaints to better manage resources (monetary
and people)
 Build a Decision Tree to predict the Company's response to
consumer complaints
 Build a Forecasting model to forecast Complaints/timing of
Expenses
Business Objective 2: Manage Brand image from Online Reviews
 Text Analytics on the complaint descriptions from twitter
Predictive Modeling: Decision Tree
Decision Tree to predict the Company's response to consumer
complaints
 Target variable - Company response
 Input variables - Product, Issue, Submitted via, Timely response,
Consumer disputed,
 Built using rpart package
 We divide data into training(900 observation), test datasets (304
observation). Training data is used for building model and test
data is used to test the selected model.
Predictive Modeling: Decision Tree
Predictive Modeling: Decision Tree after
Pruning
Predictive Modeling: Decision Tree
Validation
 Test the decision tree model accuracy.
 While classifying in closed with explanation the
accuracy is 90%.
Time-series Forecasting model
Build a Forecasting model to forecast Complaints/timing of
Expenses
 Analyzed seasonality in complaints (specific quarter of times of the
year, peaks in complaints ) and interesting trends.
 Built using R forecast package and ARIMA forecasting models
 Input variables - Number of Complaints against Date Received
 Time interval – quarter
Time-series Forecasting model
 The default model selected by the system is ARIMA Model with
lower MAPE 9.39. This means that, on average, there is a 9.39%
difference between the forecast and actual values. The forecast
model is usually the model with the best fit (lowest MAPE) among
all of the models considered for the series
Time-series Forecasting model
 There are spikes in complaints that were closed with explanation in 2012
APR.
Time-series Forecasting model
 Long term trend shows that complaints are reducing over time.
 It also shows some seasonality in complaints in APR Month.
Text Analytics on Online Reviews
Text Analytics on the complaint descriptions from Twitter
 Used text clustering technique to understand natural groupings
and capture any interesting insights from the complaints
description text
 Extracted 64 reviews for Bank Of America from Twitter using
R(TwitterR package) software
 Built using R.
 Sentiment analysis of twitter data to align customer reviews.
 Cluster algorithm - Number of clusters – 3
Text Analytics on Online Reviews
Text Analytics on Online Reviews
 Sentiment analysis on the data and scores assigned and classify
them in to positive, negative ,neutral.
Text Analytics on Online Reviews
 Cluster 1 looks like Customer Service experience as the reviews
talk about help ,bad,good,dispute,cancel,agree etc.
 Cluster 2 looks like Billing, fee and payment issues as the reviews
talk about start ,late ,close ,pay ,look ,payment etc.
 Cluster 3 looks like Account and fund processing as the reviews
contain terms such as fund, check, deposit, process, etc.

More Related Content

What's hot

Derak Presentation
Derak Presentation Derak Presentation
Derak Presentation Eric Gero
 
Customer Segmentation Project
Customer Segmentation ProjectCustomer Segmentation Project
Customer Segmentation ProjectAditya Ekawade
 
Data Mining in Life Insurance Business
Data Mining in Life Insurance BusinessData Mining in Life Insurance Business
Data Mining in Life Insurance BusinessAnkur Khanna
 
Predicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationPredicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationVishva Abeyrathne
 
Uses of analytics in the field of Banking
Uses of analytics in the field of BankingUses of analytics in the field of Banking
Uses of analytics in the field of BankingNiveditasri N
 
Bankruptcy prediction final report by sanapathi kiran kumar
Bankruptcy prediction final report by sanapathi kiran kumarBankruptcy prediction final report by sanapathi kiran kumar
Bankruptcy prediction final report by sanapathi kiran kumarKiranKumarSanapathi1
 
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A SurveyIRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A SurveyIRJET Journal
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicagoKM Chicago
 
Bank churn with Data Science
Bank churn with Data ScienceBank churn with Data Science
Bank churn with Data ScienceCarolyn Knight
 
Data Mining Techniques for CRM
Data Mining Techniques for CRMData Mining Techniques for CRM
Data Mining Techniques for CRMShyaamini Balu
 
E-commerce online review for detecting influencing factors users perception
E-commerce online review for detecting influencing factors users perceptionE-commerce online review for detecting influencing factors users perception
E-commerce online review for detecting influencing factors users perceptionjournalBEEI
 
Personal customized recommendation system reflecting purchase criteria and pr...
Personal customized recommendation system reflecting purchase criteria and pr...Personal customized recommendation system reflecting purchase criteria and pr...
Personal customized recommendation system reflecting purchase criteria and pr...IJECEIAES
 
Democratization of Analytics
Democratization of AnalyticsDemocratization of Analytics
Democratization of AnalyticsPrajakta Vaidya
 
Big data alchemy - how can banks maximize the value of their customer data
Big data alchemy - how can banks maximize the value of their customer dataBig data alchemy - how can banks maximize the value of their customer data
Big data alchemy - how can banks maximize the value of their customer dataRick Bouter
 
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?VIRGOkonsult
 

What's hot (20)

Derak Presentation
Derak Presentation Derak Presentation
Derak Presentation
 
B05840510
B05840510B05840510
B05840510
 
Customer Segmentation Project
Customer Segmentation ProjectCustomer Segmentation Project
Customer Segmentation Project
 
Data Mining in Life Insurance Business
Data Mining in Life Insurance BusinessData Mining in Life Insurance Business
Data Mining in Life Insurance Business
 
Predicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationPredicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using Classification
 
Uses of analytics in the field of Banking
Uses of analytics in the field of BankingUses of analytics in the field of Banking
Uses of analytics in the field of Banking
 
Bankruptcy prediction final report by sanapathi kiran kumar
Bankruptcy prediction final report by sanapathi kiran kumarBankruptcy prediction final report by sanapathi kiran kumar
Bankruptcy prediction final report by sanapathi kiran kumar
 
Bank marketing mini-project
Bank marketing mini-projectBank marketing mini-project
Bank marketing mini-project
 
Unit 4 Advanced Data Analytics
Unit 4 Advanced Data AnalyticsUnit 4 Advanced Data Analytics
Unit 4 Advanced Data Analytics
 
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A SurveyIRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
IRJET- Customer Buying Prediction using Machine-Learning Techniques: A Survey
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicago
 
Bank churn with Data Science
Bank churn with Data ScienceBank churn with Data Science
Bank churn with Data Science
 
Unit ii data analytics
Unit ii data analytics Unit ii data analytics
Unit ii data analytics
 
Data Mining Techniques for CRM
Data Mining Techniques for CRMData Mining Techniques for CRM
Data Mining Techniques for CRM
 
E-commerce online review for detecting influencing factors users perception
E-commerce online review for detecting influencing factors users perceptionE-commerce online review for detecting influencing factors users perception
E-commerce online review for detecting influencing factors users perception
 
Personal customized recommendation system reflecting purchase criteria and pr...
Personal customized recommendation system reflecting purchase criteria and pr...Personal customized recommendation system reflecting purchase criteria and pr...
Personal customized recommendation system reflecting purchase criteria and pr...
 
Classes of Model
Classes of ModelClasses of Model
Classes of Model
 
Democratization of Analytics
Democratization of AnalyticsDemocratization of Analytics
Democratization of Analytics
 
Big data alchemy - how can banks maximize the value of their customer data
Big data alchemy - how can banks maximize the value of their customer dataBig data alchemy - how can banks maximize the value of their customer data
Big data alchemy - how can banks maximize the value of their customer data
 
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
 

Similar to Syntelli_BankOfAmerica_FinalPresentation

Proposal_review_consumer_complaints
Proposal_review_consumer_complaintsProposal_review_consumer_complaints
Proposal_review_consumer_complaintsAakash Parwani
 
CFPB Consumer Complaints Report - Tableau
CFPB Consumer Complaints Report - TableauCFPB Consumer Complaints Report - Tableau
CFPB Consumer Complaints Report - TableauNisheet Mahajan
 
Tools and Techniques for Quality Management
Tools and Techniques for Quality ManagementTools and Techniques for Quality Management
Tools and Techniques for Quality ManagementNazrul Islam
 
Chapter 10 Tools and Techniques for Quality Management.ppt
Chapter 10 Tools and Techniques for Quality Management.pptChapter 10 Tools and Techniques for Quality Management.ppt
Chapter 10 Tools and Techniques for Quality Management.pptDr. Nazrul Islam
 
ConsumerComplaints_VisualReport
ConsumerComplaints_VisualReportConsumerComplaints_VisualReport
ConsumerComplaints_VisualReportAakash Parwani
 
Data Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship ManagementData Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship ManagementIJERA Editor
 
Data MiningData MiningData MiningData Mining
Data MiningData MiningData MiningData MiningData MiningData MiningData MiningData Mining
Data MiningData MiningData MiningData Miningabdulraqeebalareqi1
 
Barclays - Case Study Competition | ISB | National Finalist
Barclays - Case Study Competition | ISB | National FinalistBarclays - Case Study Competition | ISB | National Finalist
Barclays - Case Study Competition | ISB | National FinalistNaveen Kumar
 
Analytics & Insurance. Serene Zawaydeh
Analytics & Insurance. Serene Zawaydeh Analytics & Insurance. Serene Zawaydeh
Analytics & Insurance. Serene Zawaydeh Serene Zawaydeh
 
FDA_SAKEC2018.pptx
FDA_SAKEC2018.pptxFDA_SAKEC2018.pptx
FDA_SAKEC2018.pptxmviji
 
Data analytics in finance broucher
Data analytics in finance broucher Data analytics in finance broucher
Data analytics in finance broucher Daniel Thomas
 
Gmid associates services portfolio bank
Gmid associates  services portfolio bankGmid associates  services portfolio bank
Gmid associates services portfolio bankPankaj Jha
 
Chapter8 - Beyond Classification
Chapter8 - Beyond ClassificationChapter8 - Beyond Classification
Chapter8 - Beyond ClassificationAnna Olecka
 
Data Integrity White Paper
Data Integrity White PaperData Integrity White Paper
Data Integrity White PaperExperian
 
Application of predictive analytics
Application of predictive analyticsApplication of predictive analytics
Application of predictive analyticsPrasad Narasimhan
 
Data Science Use Cases in Retail & Healthcare Industries.pdf
Data Science Use Cases in Retail & Healthcare Industries.pdfData Science Use Cases in Retail & Healthcare Industries.pdf
Data Science Use Cases in Retail & Healthcare Industries.pdfKaty Slemon
 
Business Statistics | Understanding business for Alliance data
Business Statistics | Understanding business for Alliance dataBusiness Statistics | Understanding business for Alliance data
Business Statistics | Understanding business for Alliance dataHarender Singh
 
Big data: What's the big deal?
Big data: What's the big deal?Big data: What's the big deal?
Big data: What's the big deal?Penser
 

Similar to Syntelli_BankOfAmerica_FinalPresentation (20)

Proposal_review_consumer_complaints
Proposal_review_consumer_complaintsProposal_review_consumer_complaints
Proposal_review_consumer_complaints
 
CFPB Consumer Complaints Report - Tableau
CFPB Consumer Complaints Report - TableauCFPB Consumer Complaints Report - Tableau
CFPB Consumer Complaints Report - Tableau
 
Tools and Techniques for Quality Management
Tools and Techniques for Quality ManagementTools and Techniques for Quality Management
Tools and Techniques for Quality Management
 
Chapter 10 Tools and Techniques for Quality Management.ppt
Chapter 10 Tools and Techniques for Quality Management.pptChapter 10 Tools and Techniques for Quality Management.ppt
Chapter 10 Tools and Techniques for Quality Management.ppt
 
ConsumerComplaints_VisualReport
ConsumerComplaints_VisualReportConsumerComplaints_VisualReport
ConsumerComplaints_VisualReport
 
Data Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship ManagementData Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship Management
 
Data MiningData MiningData MiningData Mining
Data MiningData MiningData MiningData MiningData MiningData MiningData MiningData Mining
Data MiningData MiningData MiningData Mining
 
Data Mining Lec1.pptx
Data Mining Lec1.pptxData Mining Lec1.pptx
Data Mining Lec1.pptx
 
Barclays - Case Study Competition | ISB | National Finalist
Barclays - Case Study Competition | ISB | National FinalistBarclays - Case Study Competition | ISB | National Finalist
Barclays - Case Study Competition | ISB | National Finalist
 
Analytics & Insurance. Serene Zawaydeh
Analytics & Insurance. Serene Zawaydeh Analytics & Insurance. Serene Zawaydeh
Analytics & Insurance. Serene Zawaydeh
 
FDA_SAKEC2018.pptx
FDA_SAKEC2018.pptxFDA_SAKEC2018.pptx
FDA_SAKEC2018.pptx
 
Data analytics in finance broucher
Data analytics in finance broucher Data analytics in finance broucher
Data analytics in finance broucher
 
Gmid associates services portfolio bank
Gmid associates  services portfolio bankGmid associates  services portfolio bank
Gmid associates services portfolio bank
 
Chapter8 - Beyond Classification
Chapter8 - Beyond ClassificationChapter8 - Beyond Classification
Chapter8 - Beyond Classification
 
Data Integrity White Paper
Data Integrity White PaperData Integrity White Paper
Data Integrity White Paper
 
Application of predictive analytics
Application of predictive analyticsApplication of predictive analytics
Application of predictive analytics
 
Data Science Use Cases in Retail & Healthcare Industries.pdf
Data Science Use Cases in Retail & Healthcare Industries.pdfData Science Use Cases in Retail & Healthcare Industries.pdf
Data Science Use Cases in Retail & Healthcare Industries.pdf
 
Business Statistics | Understanding business for Alliance data
Business Statistics | Understanding business for Alliance dataBusiness Statistics | Understanding business for Alliance data
Business Statistics | Understanding business for Alliance data
 
20 ccp using logistic
20 ccp using logistic20 ccp using logistic
20 ccp using logistic
 
Big data: What's the big deal?
Big data: What's the big deal?Big data: What's the big deal?
Big data: What's the big deal?
 

Syntelli_BankOfAmerica_FinalPresentation

  • 1. Data Analytics for Financial Services Bank Of America Bhavya Kothari
  • 2. Introduction Background  Reviewed consumer complaints data from the Consumer Financial Protection Bureau (CFPB), an independent agency of the U.S. government, responsible for consumer financial protection.  Analyzed complaints for Bank of America between 2012 to 2015 to better understand if the data had any actionable insights that an organization could use. About Bank of America  Bank of America is an American multinational banking and financial services corporation headquartered in Charlotte.  It is the second largest bank holding company in U. S. by assets.  Products- consumer banking, corporate banking, finance and insurance, investment banking, mortgage loans, private banking, private equity, wealth management, credit cards.
  • 3. Analysis Techniques Business Objective 1: Predict and forecast Responses to Consumer Complaints to better manage resources (monetary and people)  Build a Decision Tree to predict the Company's response to consumer complaints  Build a Forecasting model to forecast Complaints/timing of Expenses Business Objective 2: Manage Brand image from Online Reviews  Text Analytics on the complaint descriptions from Twitter
  • 5. About the Dataset  The data contains information regarding complaints received for different financial organizations.  The dataset contains the fields - Complaint ID, Product, Sub-product, Issue, Sub-Issue, State, ZIP Code, Submitted via, Date received, Date sent to company, Company response, Timely response, Consumer disputed.  We focus on complaints with Date Received between 1/1/2012 and 12/31/2015. There is a total of 516696 complaints in this time period.
  • 6. Complaints by Product  Highest number of complaints are for Mortgage product (176,975), followed by Debt Collection (92,446), Credit Reporting (82,598) and Credit card (62,326).
  • 7. Top 10 Companies by Number of Complaints  Bank of America receives the highest number of complaints (53,876) followed by Wells Fargo and JPMorgan Chase.
  • 8. Company Responses to Credit Card Complaints  Closed with Explanation (35,412) and Closed with Monetary Relief (12,823) are the top two company responses for Credit Card Complaints.
  • 9. Exploratory data analysis: Bank of America  Bank of America is the fourth most complained company .  Total number of complaints for bank of America is 6846 Top 10 Most Complained about Credit Card Companies
  • 10. Bank of America: Complaints by Product  Majority of the complaints are for Mortgage (35,284) followed by Bank Account or service (9,809)
  • 11. Bank of America: Issues by Product  The top issues for Bank of America Bank account or service are Account opening, closing, or management (4,120) and Deposits and Withdrawals (2,684).
  • 12. Bank of America: Issues by Product  The top issues for Bank of America credit cards are Billing disputes (1,106) and Other(671) followed by closing/cancelling account (633).
  • 13. Bank of America: Complaints across States Highest number of complaints for Bank of America were received from California(10,271), followed by Florida (6,128)followed by New York.
  • 14. Bank of America: Complaints across States for Credit Card Product Highest number of complaints for Bank of America Credit Card product were received from California (966), Florida (685) and New York(610)
  • 16. Analysis Techniques Business Objective 1: Predict and forecast Responses to Consumer Complaints to better manage resources (monetary and people)  Build a Decision Tree to predict the Company's response to consumer complaints  Build a Forecasting model to forecast Complaints/timing of Expenses Business Objective 2: Manage Brand image from Online Reviews  Text Analytics on the complaint descriptions from twitter
  • 17. Predictive Modeling: Decision Tree Decision Tree to predict the Company's response to consumer complaints  Target variable - Company response  Input variables - Product, Issue, Submitted via, Timely response, Consumer disputed,  Built using rpart package  We divide data into training(900 observation), test datasets (304 observation). Training data is used for building model and test data is used to test the selected model.
  • 19. Predictive Modeling: Decision Tree after Pruning
  • 20. Predictive Modeling: Decision Tree Validation  Test the decision tree model accuracy.  While classifying in closed with explanation the accuracy is 90%.
  • 21. Time-series Forecasting model Build a Forecasting model to forecast Complaints/timing of Expenses  Analyzed seasonality in complaints (specific quarter of times of the year, peaks in complaints ) and interesting trends.  Built using R forecast package and ARIMA forecasting models  Input variables - Number of Complaints against Date Received  Time interval – quarter
  • 22. Time-series Forecasting model  The default model selected by the system is ARIMA Model with lower MAPE 9.39. This means that, on average, there is a 9.39% difference between the forecast and actual values. The forecast model is usually the model with the best fit (lowest MAPE) among all of the models considered for the series
  • 23. Time-series Forecasting model  There are spikes in complaints that were closed with explanation in 2012 APR.
  • 24. Time-series Forecasting model  Long term trend shows that complaints are reducing over time.  It also shows some seasonality in complaints in APR Month.
  • 25. Text Analytics on Online Reviews Text Analytics on the complaint descriptions from Twitter  Used text clustering technique to understand natural groupings and capture any interesting insights from the complaints description text  Extracted 64 reviews for Bank Of America from Twitter using R(TwitterR package) software  Built using R.  Sentiment analysis of twitter data to align customer reviews.  Cluster algorithm - Number of clusters – 3
  • 26. Text Analytics on Online Reviews
  • 27. Text Analytics on Online Reviews  Sentiment analysis on the data and scores assigned and classify them in to positive, negative ,neutral.
  • 28. Text Analytics on Online Reviews  Cluster 1 looks like Customer Service experience as the reviews talk about help ,bad,good,dispute,cancel,agree etc.  Cluster 2 looks like Billing, fee and payment issues as the reviews talk about start ,late ,close ,pay ,look ,payment etc.  Cluster 3 looks like Account and fund processing as the reviews contain terms such as fund, check, deposit, process, etc.