SlideShare a Scribd company logo
1 of 24
Download to read offline
ExploratoryDataAnalysis
CONSUMERFINANCE
COMPANY
Decisionson Loan
applications basedon
available data by
simple EDA
ShreyaSolanki
BUSINESS
OBJECTIVE
Toapprove the loan applications of the clients who are capableof
repaying the loans.
In other words, the company wants to understand the driving
factors (or driver variables) behind loan default, i.e. the variables
which are strong indicators of default. The company can utilise this
knowledge for its portfolio and risk assessment.
EDA on the available data to understand how customer
attributes and loan attributes influence the tendency of
default.
Identifying the patterns which indicate if a client has
difficulty paying their installments, to take different actions
like denying the loan, reducing loan amount, lending at
higher interest rates to riskyclients.
TYPESOF
DECISIONON
LOAN
APPLICATIONS
Approved: TheCompany hasapproved loan Application
Cancelled: Theclient cancelled the application sometime
during approval. Either the client changed her/his mind
about the loan or in some casesdue to ahigher risk ofthe
client he received worse pricing which he did notwant.
Refused: Thecompany had rejected the loan (because the
client does not meet their requirementsetc.).
Unused offer: Loan hasbeen cancelled by the client but on
different stagesof the process.
Risks
Associated
with decision
If the applicant is not likely to repay the
loan, i.e. he/she is likely to default,
then approving the loan may lead to a
financial lossfor thecompany.
If the applicant is likely to repay the
loan, then not approving the loan
theresults in a loss of business to
company.
Business
understanding
Theloan providing companies find it hard
to give loans to the people due to their
insufficient or non-existent credit history.
Becauseof that, some consumers useit
astheir advantage by becoming a
defaulter.
Using EDAto analyse the patterns
present in the data to ensure that the
applicants capable of repaying the loan
are not rejected and those who are
unlikely to pay are notapproved.
Dataset
Application Dataset: 122
columns and records of 307511
unique Application ID
PreviousApplication Dataset:
37 columns and 1670214
PreviousApplicant ID
Resultsof analysis– SignificantCATEGORICALVariables– Forrejecting or approvingthe client
By the EDA so far and results discussed above, theseare the Categoricalvariablesto be consideredfor making the decision on a new client
Application DataVariables
NAME_CONTRACT_TYPE
NAME_EDUCATION_TYPE
NAME_HOUSING_TYPE
NAME_TYPE_SUITE
FAMILY_STATUS
OCUUPATION_TYPE
NAME_INCOME_TYPE
FLAG_OWN_REALTY
FLAG_OWN_CAR
Merged data variables:The previousdata also has a lot to influence on the decision, Although this is not a completeanalysis but, thesevariablesare more significant than others.
NAME_PORTFOLIO
NAME_TYPE_SUITE_PREV
NAME_CLIENT_TYPE
NAME_PRODUCT_TYPE
NAME_SELLER_INDUSTRY
NAME_INCOME_TYPE
Theinsights and results of the analysisof each variable are mentioned along with the plots in the next pages.
Univariate AnalysisSegmentedover TARGETvariable 0 forno payment difficulties and 1 for
defaults in:
-Application Data
-Application and Previous Merged Data
Imbalance of Data for TARGETvariable 0 and 1 in ApplicationData Imbalance of Data for TARGETvariable 0 and 1 in PreviousData
Univariate AnalysisSegmentedover TARGET0 AND1
CONTRACTTYPEConsumer,Cashand Revolving Loans:
Application Data: PERCENTAGEOF
DEFAULTERS(TARGET=1)
IN CASHLOANS– 8.35%
IN REVOLVINGLOANS–5.48%
Contract Typeof previousapplications:
PERCENTAGEOFDEFAULTERS(TARGET=1)
IN CASHLOANS– 9.12%
IN REVOLVINGLOANS-10.46
IN ConsumerLoans-7.70%
Onmerging the two datasets:
Revolving Loansare slightly better in terms of numberof
defaulters turningup
Univariate AnalysisSegmentedover TARGET0 AND1
Applicant OwnCaror not Applicants own Realty ornot
RESULTSOFANALYSIS:
FLAG_OWN_CAR:
Thereare moreapplicants
who do not havecar in
the TARGET1 applicants
who havepayment
difficulties
FLAG_OWN_REALTY:
Applicants who own
realty
Not amajor differencein
ratio
Univariate AnalysisSegmentedover Target0 and1
Applicant’s SuiteType Applicant’s FamilyStatus
RESULTSOFANALYSIS:
Type_Suite:
Unaccompaniedshowa
higherDefaulter
Family_Status:
Married on the other
hand lower defaultsare
found
Univariate AnalysisSegmentedover Target0 and 1 Applicant’s HousingType
Applicant’s EducationType
RESULTSOFANALYSIS:
Applicant’s Housing
Type:
Thosewho own House
turn lesserintoDefaulters
Thosewho are with
parents are more of arisk
of beingdefaulters
Applicant’s Education
Type:
Secondaryeducationare
at higher risk of turning
into defaulters asper
current data
Higher education
applicants couldbe
potentially better
repayers
Univariate AnalysisSegmentedover Target0 and1
Applicant’s OccupationType
RESULTSOFANALYSIS:
As The Laborers , Sales Staff, Core
Staff, Managers and Drivers are the
highest number ofapplicants.
TheLaborers turn out to be even
higher defaulters aswell,
Sales Staff is also higher in count of
defaulters.
Managers on the other hand are
slightly low in count in being
defaulter thanrepayers.
According to this information ,
Managers can be preferred higher
over laborers and Sales Staff, but
this needsto be furtheranalysed
Univariate AnalysisSegmentedover Target0 and1
Applicant’s LoanApplication ProcessStartDay Applicant’s Gender
RESULTSOFANALYSIS:
CODE_GENDER:
Wecannot saymuch about Gender
becauseit showsalmost no Bias
here in the segmentation on
TARGET
WEEKDAY_APPR_PROCESS_START:
Not significant but aslightlyless
Defaulters onTUESDAY
DataImbalance :Applicant’s OccupationTypegrouped on LoanContract TypesegmentedonTARGET
Resultsof analysis– previousand current application mergeddata
NAME_PORTFOLIO:
POSseemsto be betterin
terms of repayers
While XNAand CARDSare
more towardsdefaults
NAME_TYPE_SUITE_PREV:In
the previous data,the same
pattern isseen.
Unaccompaniedand moreof
defaulters
Resultsof analysis– previousand current application mergeddata
NAME_CLIENT_TYPE:
Ascompared to Repeaterand
New type of client , the
Refreshedclient type is a
better type that repaysbut
only aslight difference is
observed
NAME_Product_TYPE:
Walk-in type is observed tobe
high at turning into defaulter
compared to XNAand x-sell
which are more of repayin
type
Resultsof analysis– previousand current application mergeddata
NAME_SELLER_INDUSTRY:
Consumerelectronics is better
at repaying
While XNAis amajor defaulter
turning category.(XNAis
unknown industry here)
NAME_INCOME_TYPE:
Working: This is the most risky
category asdefaulter turn up is
highest
CommercialAssoc.: slightly
lower default turnup.
Pensioner: lower defaults
observed and hencecouldbe
taken into consideration
Stateservant: Thishaving
lesserdefaults
Unemployed:
Student: negligible numberof
applicants but no defaulters.
Couldbe agood sourcefor
businessfor company.And
alsosafeclients
Businessman:safeclients as
no defaulters, but negligible
applicants.
Insightsfrom theCorrelation
Significant NUMERICVariables – Forrejectingor approving the client
Bythe EDAsofar and results discussedabove, these are the Numeric variables which have high correlation In the
Application Data
.
• AMT_INCOME_TOTAL
• AMT_CREDIT
• AMT_ANNUITY
• AMT_GOODS_PRICE
In the previousand current application analysis the following variablesarefound to be highly correlated:
• AMT_ANNUITY_PREV
• AMT_APPLICATION
• AMT_CREDIT_PREV
• AMT_DOWN_PAYMENT
• AMT_GOODS_PRICE_PREV
• CNT_PAYMENT
Theresults of the analysis of each of these variables is mentionedalong
withthe plots in the next pages.
Univariate analysisof MergedPreviousandCurrent Application Dataof few Variables
Bivariate analysisof Numeric Variables– Merged Previousand CurrentApplication Data
TARGET1
TARGET0
Correlation of the most significantnumeric
variables Application data
TARGET0
TARGET1
Correlationof the most significant numeric variableson the MergedData
TARGET0
TARGET1
Insights from correlation
top 3correlations of appdf TARGET0:
• 1.AMT_GOODS_PRICE and AMT_CREDIT :0.912
• 2.AMT_ANNUITY and AMT_CREDIT :0.643
• 3.AMT_ANNUITY and AMT_INCOME_TOTAL :0.345
top 3correlations of appdf TARGET1:
• 1.AMT_GOODS_PRICE and AMT_CREDIT :0.890
• 2.AMT_ANNUITY and AMT_CREDIT :0.621
• 3.AMT_ANNUITY and AMT_INCOME_TOTAL :0.305
top 3correlations of prev_app_dfTARGET0
• 1.AMT_APPLICATION and AMT_GOODS_PRICE_PREV :0.999
• 2.AMT_GOODS_PRICE_PREV and AMT_CREDIT_PREV :0.932
• 3.AMT_CREDIT_PREV andAMT_APPLICATION :0.878
top 3correlations of prev_app_dfTARGET1
• 1. AMT_APPLICATION and AMT_GOODS_PRICE_PREV :0.999
• 2.AMT_GOODS_PRICE_PREV and AMT_CREDIT_PREV : 0.932
• 3.AMT_CREDIT_PREV andAMT_APPLICATION :0.889

More Related Content

What's hot

Bipul customer perception-towards-internet-banking
Bipul   customer perception-towards-internet-bankingBipul   customer perception-towards-internet-banking
Bipul customer perception-towards-internet-banking
Bipul Kumar
 
Model building in credit card and loan approval
Model building in credit card and loan approval Model building in credit card and loan approval
Model building in credit card and loan approval
Venkata Reddy Konasani
 

What's hot (20)

Banking Credit Risk- EDA.pptx
Banking Credit Risk- EDA.pptxBanking Credit Risk- EDA.pptx
Banking Credit Risk- EDA.pptx
 
Credit EDA Case Study : Exploratory Data Analysis on Bank Loan Data
Credit EDA Case Study : Exploratory Data Analysis on Bank Loan DataCredit EDA Case Study : Exploratory Data Analysis on Bank Loan Data
Credit EDA Case Study : Exploratory Data Analysis on Bank Loan Data
 
Exploratory Data Analysis Example - Credit Risk Analysis (Second Attempt)
Exploratory Data Analysis Example - Credit Risk Analysis (Second Attempt)Exploratory Data Analysis Example - Credit Risk Analysis (Second Attempt)
Exploratory Data Analysis Example - Credit Risk Analysis (Second Attempt)
 
Credit risk scoring model final
Credit risk scoring model finalCredit risk scoring model final
Credit risk scoring model final
 
Credit Card Customer Segmentation
Credit Card Customer SegmentationCredit Card Customer Segmentation
Credit Card Customer Segmentation
 
Decision tree for Predictive Modeling
Decision tree for Predictive ModelingDecision tree for Predictive Modeling
Decision tree for Predictive Modeling
 
Predictive modelling
Predictive modellingPredictive modelling
Predictive modelling
 
Predicting the e-commerce churn
Predicting the e-commerce churnPredicting the e-commerce churn
Predicting the e-commerce churn
 
Bipul customer perception-towards-internet-banking
Bipul   customer perception-towards-internet-bankingBipul   customer perception-towards-internet-banking
Bipul customer perception-towards-internet-banking
 
Lead scoring case study
Lead scoring case studyLead scoring case study
Lead scoring case study
 
Case Study: Loan default prediction
Case Study: Loan default predictionCase Study: Loan default prediction
Case Study: Loan default prediction
 
Default Prediction & Analysis on Lending Club Loan Data
Default Prediction & Analysis on Lending Club Loan DataDefault Prediction & Analysis on Lending Club Loan Data
Default Prediction & Analysis on Lending Club Loan Data
 
project-6-bank-loan-case-study.pdf
project-6-bank-loan-case-study.pdfproject-6-bank-loan-case-study.pdf
project-6-bank-loan-case-study.pdf
 
Predicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning AlgorithmsPredicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning Algorithms
 
352735326 rsh-qam11-tif-05-doc
352735326 rsh-qam11-tif-05-doc352735326 rsh-qam11-tif-05-doc
352735326 rsh-qam11-tif-05-doc
 
Model building in credit card and loan approval
Model building in credit card and loan approval Model building in credit card and loan approval
Model building in credit card and loan approval
 
Lead scoring case study presentation
Lead scoring case study presentationLead scoring case study presentation
Lead scoring case study presentation
 
Loan default prediction with machine language
Loan  default  prediction with  machine  language Loan  default  prediction with  machine  language
Loan default prediction with machine language
 
Loan Default Prediction with Machine Learning
Loan Default Prediction with Machine LearningLoan Default Prediction with Machine Learning
Loan Default Prediction with Machine Learning
 
Predicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationPredicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using Classification
 

Similar to Home credit company risk presentation

CFPB Small Dollar Lending Exam Procedures Module 2 ECOA, FCRA, TILA and Othe...
CFPB Small Dollar Lending Exam Procedures  Module 2 ECOA, FCRA, TILA and Othe...CFPB Small Dollar Lending Exam Procedures  Module 2 ECOA, FCRA, TILA and Othe...
CFPB Small Dollar Lending Exam Procedures Module 2 ECOA, FCRA, TILA and Othe...
Justin Hosie
 
LorriAne Turner Resume 4-19-16
LorriAne Turner Resume  4-19-16LorriAne Turner Resume  4-19-16
LorriAne Turner Resume 4-19-16
LorriAne Turner
 

Similar to Home credit company risk presentation (20)

Big east bank
Big east bankBig east bank
Big east bank
 
Decision Techniques For Managers
Decision Techniques For ManagersDecision Techniques For Managers
Decision Techniques For Managers
 
Fair Lending Testing and Analysis - Made Easy
Fair Lending Testing and Analysis - Made EasyFair Lending Testing and Analysis - Made Easy
Fair Lending Testing and Analysis - Made Easy
 
Financing Policy Webinar with Congressman Israel and Matthew Brown - Matthew ...
Financing Policy Webinar with Congressman Israel and Matthew Brown - Matthew ...Financing Policy Webinar with Congressman Israel and Matthew Brown - Matthew ...
Financing Policy Webinar with Congressman Israel and Matthew Brown - Matthew ...
 
EDA_ Bank_Loan_Case_Study_PPT.pdf
EDA_ Bank_Loan_Case_Study_PPT.pdfEDA_ Bank_Loan_Case_Study_PPT.pdf
EDA_ Bank_Loan_Case_Study_PPT.pdf
 
Trust But Verify - Equifax Automotive
Trust But Verify - Equifax AutomotiveTrust But Verify - Equifax Automotive
Trust But Verify - Equifax Automotive
 
CFPB Small Dollar Lending Exam Procedures Module 2 ECOA, FCRA, TILA and Othe...
CFPB Small Dollar Lending Exam Procedures  Module 2 ECOA, FCRA, TILA and Othe...CFPB Small Dollar Lending Exam Procedures  Module 2 ECOA, FCRA, TILA and Othe...
CFPB Small Dollar Lending Exam Procedures Module 2 ECOA, FCRA, TILA and Othe...
 
FairLendingQ&Av10
FairLendingQ&Av10FairLendingQ&Av10
FairLendingQ&Av10
 
Perceptions of the customers towards insurance companies in bangladesh a stud...
Perceptions of the customers towards insurance companies in bangladesh a stud...Perceptions of the customers towards insurance companies in bangladesh a stud...
Perceptions of the customers towards insurance companies in bangladesh a stud...
 
LorriAne Turner Resume 4-19-16
LorriAne Turner Resume  4-19-16LorriAne Turner Resume  4-19-16
LorriAne Turner Resume 4-19-16
 
Data Governance in the age of Social Media
Data Governance in the age of Social MediaData Governance in the age of Social Media
Data Governance in the age of Social Media
 
Credit Audit's Use of Data Analytics in Examining Consumer Loan Portfolios
Credit Audit's Use of Data Analytics in Examining Consumer Loan PortfoliosCredit Audit's Use of Data Analytics in Examining Consumer Loan Portfolios
Credit Audit's Use of Data Analytics in Examining Consumer Loan Portfolios
 
M Henderson Resume
M Henderson ResumeM Henderson Resume
M Henderson Resume
 
Comparity Overview Presentation
Comparity Overview Presentation Comparity Overview Presentation
Comparity Overview Presentation
 
Họ đã làm thế nào
Họ đã làm thế nàoHọ đã làm thế nào
Họ đã làm thế nào
 
Trends in Alternative Financial Services
Trends in Alternative Financial Services Trends in Alternative Financial Services
Trends in Alternative Financial Services
 
Group 1 p53
Group 1 p53Group 1 p53
Group 1 p53
 
5 C's of Credit
5 C's of Credit5 C's of Credit
5 C's of Credit
 
Balance Transfers White Paper
Balance Transfers White PaperBalance Transfers White Paper
Balance Transfers White Paper
 
Workshop Management Consulting Case Central Power
Workshop Management Consulting  Case Central PowerWorkshop Management Consulting  Case Central Power
Workshop Management Consulting Case Central Power
 

Recently uploaded

Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
HyderabadDolls
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
gajnagarg
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 

Recently uploaded (20)

Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...
Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...
Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
 
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
 
Call Girls In GOA North Goa +91-8588052666 Direct Cash Escorts Service
Call Girls In GOA North Goa +91-8588052666 Direct Cash Escorts ServiceCall Girls In GOA North Goa +91-8588052666 Direct Cash Escorts Service
Call Girls In GOA North Goa +91-8588052666 Direct Cash Escorts Service
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Giridih Escorts Service Girl ^ 9332606886, WhatsApp Anytime Giridih
Giridih Escorts Service Girl ^ 9332606886, WhatsApp Anytime GiridihGiridih Escorts Service Girl ^ 9332606886, WhatsApp Anytime Giridih
Giridih Escorts Service Girl ^ 9332606886, WhatsApp Anytime Giridih
 
Vastral Call Girls Book Now 7737669865 Top Class Escort Service Available
Vastral Call Girls Book Now 7737669865 Top Class Escort Service AvailableVastral Call Girls Book Now 7737669865 Top Class Escort Service Available
Vastral Call Girls Book Now 7737669865 Top Class Escort Service Available
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 

Home credit company risk presentation

  • 2. BUSINESS OBJECTIVE Toapprove the loan applications of the clients who are capableof repaying the loans. In other words, the company wants to understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default. The company can utilise this knowledge for its portfolio and risk assessment. EDA on the available data to understand how customer attributes and loan attributes influence the tendency of default. Identifying the patterns which indicate if a client has difficulty paying their installments, to take different actions like denying the loan, reducing loan amount, lending at higher interest rates to riskyclients.
  • 3. TYPESOF DECISIONON LOAN APPLICATIONS Approved: TheCompany hasapproved loan Application Cancelled: Theclient cancelled the application sometime during approval. Either the client changed her/his mind about the loan or in some casesdue to ahigher risk ofthe client he received worse pricing which he did notwant. Refused: Thecompany had rejected the loan (because the client does not meet their requirementsetc.). Unused offer: Loan hasbeen cancelled by the client but on different stagesof the process.
  • 4. Risks Associated with decision If the applicant is not likely to repay the loan, i.e. he/she is likely to default, then approving the loan may lead to a financial lossfor thecompany. If the applicant is likely to repay the loan, then not approving the loan theresults in a loss of business to company.
  • 5. Business understanding Theloan providing companies find it hard to give loans to the people due to their insufficient or non-existent credit history. Becauseof that, some consumers useit astheir advantage by becoming a defaulter. Using EDAto analyse the patterns present in the data to ensure that the applicants capable of repaying the loan are not rejected and those who are unlikely to pay are notapproved.
  • 6. Dataset Application Dataset: 122 columns and records of 307511 unique Application ID PreviousApplication Dataset: 37 columns and 1670214 PreviousApplicant ID
  • 7. Resultsof analysis– SignificantCATEGORICALVariables– Forrejecting or approvingthe client By the EDA so far and results discussed above, theseare the Categoricalvariablesto be consideredfor making the decision on a new client Application DataVariables NAME_CONTRACT_TYPE NAME_EDUCATION_TYPE NAME_HOUSING_TYPE NAME_TYPE_SUITE FAMILY_STATUS OCUUPATION_TYPE NAME_INCOME_TYPE FLAG_OWN_REALTY FLAG_OWN_CAR Merged data variables:The previousdata also has a lot to influence on the decision, Although this is not a completeanalysis but, thesevariablesare more significant than others. NAME_PORTFOLIO NAME_TYPE_SUITE_PREV NAME_CLIENT_TYPE NAME_PRODUCT_TYPE NAME_SELLER_INDUSTRY NAME_INCOME_TYPE Theinsights and results of the analysisof each variable are mentioned along with the plots in the next pages.
  • 8. Univariate AnalysisSegmentedover TARGETvariable 0 forno payment difficulties and 1 for defaults in: -Application Data -Application and Previous Merged Data Imbalance of Data for TARGETvariable 0 and 1 in ApplicationData Imbalance of Data for TARGETvariable 0 and 1 in PreviousData
  • 9. Univariate AnalysisSegmentedover TARGET0 AND1 CONTRACTTYPEConsumer,Cashand Revolving Loans: Application Data: PERCENTAGEOF DEFAULTERS(TARGET=1) IN CASHLOANS– 8.35% IN REVOLVINGLOANS–5.48% Contract Typeof previousapplications: PERCENTAGEOFDEFAULTERS(TARGET=1) IN CASHLOANS– 9.12% IN REVOLVINGLOANS-10.46 IN ConsumerLoans-7.70% Onmerging the two datasets: Revolving Loansare slightly better in terms of numberof defaulters turningup
  • 10. Univariate AnalysisSegmentedover TARGET0 AND1 Applicant OwnCaror not Applicants own Realty ornot RESULTSOFANALYSIS: FLAG_OWN_CAR: Thereare moreapplicants who do not havecar in the TARGET1 applicants who havepayment difficulties FLAG_OWN_REALTY: Applicants who own realty Not amajor differencein ratio
  • 11. Univariate AnalysisSegmentedover Target0 and1 Applicant’s SuiteType Applicant’s FamilyStatus RESULTSOFANALYSIS: Type_Suite: Unaccompaniedshowa higherDefaulter Family_Status: Married on the other hand lower defaultsare found
  • 12. Univariate AnalysisSegmentedover Target0 and 1 Applicant’s HousingType Applicant’s EducationType RESULTSOFANALYSIS: Applicant’s Housing Type: Thosewho own House turn lesserintoDefaulters Thosewho are with parents are more of arisk of beingdefaulters Applicant’s Education Type: Secondaryeducationare at higher risk of turning into defaulters asper current data Higher education applicants couldbe potentially better repayers
  • 13. Univariate AnalysisSegmentedover Target0 and1 Applicant’s OccupationType RESULTSOFANALYSIS: As The Laborers , Sales Staff, Core Staff, Managers and Drivers are the highest number ofapplicants. TheLaborers turn out to be even higher defaulters aswell, Sales Staff is also higher in count of defaulters. Managers on the other hand are slightly low in count in being defaulter thanrepayers. According to this information , Managers can be preferred higher over laborers and Sales Staff, but this needsto be furtheranalysed
  • 14. Univariate AnalysisSegmentedover Target0 and1 Applicant’s LoanApplication ProcessStartDay Applicant’s Gender RESULTSOFANALYSIS: CODE_GENDER: Wecannot saymuch about Gender becauseit showsalmost no Bias here in the segmentation on TARGET WEEKDAY_APPR_PROCESS_START: Not significant but aslightlyless Defaulters onTUESDAY
  • 15. DataImbalance :Applicant’s OccupationTypegrouped on LoanContract TypesegmentedonTARGET
  • 16. Resultsof analysis– previousand current application mergeddata NAME_PORTFOLIO: POSseemsto be betterin terms of repayers While XNAand CARDSare more towardsdefaults NAME_TYPE_SUITE_PREV:In the previous data,the same pattern isseen. Unaccompaniedand moreof defaulters
  • 17. Resultsof analysis– previousand current application mergeddata NAME_CLIENT_TYPE: Ascompared to Repeaterand New type of client , the Refreshedclient type is a better type that repaysbut only aslight difference is observed NAME_Product_TYPE: Walk-in type is observed tobe high at turning into defaulter compared to XNAand x-sell which are more of repayin type
  • 18. Resultsof analysis– previousand current application mergeddata NAME_SELLER_INDUSTRY: Consumerelectronics is better at repaying While XNAis amajor defaulter turning category.(XNAis unknown industry here) NAME_INCOME_TYPE: Working: This is the most risky category asdefaulter turn up is highest CommercialAssoc.: slightly lower default turnup. Pensioner: lower defaults observed and hencecouldbe taken into consideration Stateservant: Thishaving lesserdefaults Unemployed: Student: negligible numberof applicants but no defaulters. Couldbe agood sourcefor businessfor company.And alsosafeclients Businessman:safeclients as no defaulters, but negligible applicants.
  • 19. Insightsfrom theCorrelation Significant NUMERICVariables – Forrejectingor approving the client Bythe EDAsofar and results discussedabove, these are the Numeric variables which have high correlation In the Application Data . • AMT_INCOME_TOTAL • AMT_CREDIT • AMT_ANNUITY • AMT_GOODS_PRICE In the previousand current application analysis the following variablesarefound to be highly correlated: • AMT_ANNUITY_PREV • AMT_APPLICATION • AMT_CREDIT_PREV • AMT_DOWN_PAYMENT • AMT_GOODS_PRICE_PREV • CNT_PAYMENT Theresults of the analysis of each of these variables is mentionedalong withthe plots in the next pages.
  • 20. Univariate analysisof MergedPreviousandCurrent Application Dataof few Variables
  • 21. Bivariate analysisof Numeric Variables– Merged Previousand CurrentApplication Data TARGET1 TARGET0
  • 22. Correlation of the most significantnumeric variables Application data TARGET0 TARGET1
  • 23. Correlationof the most significant numeric variableson the MergedData TARGET0 TARGET1
  • 24. Insights from correlation top 3correlations of appdf TARGET0: • 1.AMT_GOODS_PRICE and AMT_CREDIT :0.912 • 2.AMT_ANNUITY and AMT_CREDIT :0.643 • 3.AMT_ANNUITY and AMT_INCOME_TOTAL :0.345 top 3correlations of appdf TARGET1: • 1.AMT_GOODS_PRICE and AMT_CREDIT :0.890 • 2.AMT_ANNUITY and AMT_CREDIT :0.621 • 3.AMT_ANNUITY and AMT_INCOME_TOTAL :0.305 top 3correlations of prev_app_dfTARGET0 • 1.AMT_APPLICATION and AMT_GOODS_PRICE_PREV :0.999 • 2.AMT_GOODS_PRICE_PREV and AMT_CREDIT_PREV :0.932 • 3.AMT_CREDIT_PREV andAMT_APPLICATION :0.878 top 3correlations of prev_app_dfTARGET1 • 1. AMT_APPLICATION and AMT_GOODS_PRICE_PREV :0.999 • 2.AMT_GOODS_PRICE_PREV and AMT_CREDIT_PREV : 0.932 • 3.AMT_CREDIT_PREV andAMT_APPLICATION :0.889