SlideShare a Scribd company logo
1 of 10
SmartCube
Data Analytics Case Study
Team: Mafias In Training
Team Members:
Apoorv Parmar
Kushaang Deswal
Hemal Aurora
Mudith
Executive Summary
• The basis of identifying NPA stems
from the age of the applicant.
• The dataset has been divided into
4 clusters based on age of the
applicant.
– 20-41 Year
• Low Monthly Income
• Less Dependents
– 42-62
• Increased Monthly Income
• Increase In Dependents
– 63-96
• Retirement age
• Giving Loan not Preferred
• Only Credit Card Lines, no Real
Estate Loans
– Greater than 96
• Loan attached to spouse/Children in
eligibility age of dependent is <63
• Thus we were able to obtain 16
clusters in which the data was
divided.
• New Features added to the
dataset.
– Credit Card Lines = Total open Credit
Lines – Total Real Estate lines
– Utilization = Credit card lines * Revolving
Utilization of unsecured Lines
– Per Capita Income of Family = Monthly
Income/(Number of Dependents + 1)
• The other major contributing
factor is the Debt ratio. The
second level of clustering is done
based on debt ratio.
– Debt Ratio >=0.5
• Greater amount of credit lines
• Less monthly income
– Debt Ratio <0.5
• Less Credit lines
• Greater Monthly income
• The third level of clustering is
done on the basis of
NumberOfTimes90DaysLate.
– If the applicant has defaulted in 90
days, the probability of it to go NPA in
high.
– If the applicant has not defaulted in
90 days, probability of it to go NPA is
less
CLUSTERING
Dataset
Age : [21,41]
Debt Ratio<=0.50
+90days past
default =0
+90days past
default !=90
Debt Ratio > 0.50
+90days past
default =0
+90days past
default !=90
Age : [42,62]
Debt Ratio<=0.50
+90days past
default =0
+90days past
default !=90
Debt Ratio > 0.50
+90days past
default =0
+90days past
default !=90
Age : [63,96]
Debt Ratio<=0.50
+90days past
default =0
+90days past
default !=90
Debt Ratio > 0.50
+90days past
default =0
+90days past
default !=90
Age : [97,109]
Debt Ratio<=0.50
+90days past
default =0
+90days past
default !=90
Debt Ratio > 0.50
+90days past
default =0
+90days past
default !=90
Cluster 1
Cluster 2
Cluster 3
Cluster 4
Cluster 5
Cluster 6
Cluster 7
Cluster 8
Cluster 9
Cluster 10
Cluster 11
Cluster 12
Cluster 13
Cluster 14
Cluster 15
Cluster 16
INSIGHTS
• Age, Debt Ratio & Payment History have considerable impact on tendency of a customer to convert into
NPA.
• If borrowers cross the age of 65years then:
• They can only avail credit cards as a loan independently.
• Only Real Estate loan can be obtained by them in partnership with their Spouse / Children of eligibility
age < 65 years.
OPINIONS
• Loans to customers with age > 65years should be given real estate loan only if they have dependents
due to low mortality rate.
• Revolving Utilization must not be allowed to hop over 100%.
Technique Used for filling missing data
MICE : Multiple Imputation by Chained Equations, principle method of dealing with missing data.
Removal of Outliers
Outliers have been conservatively removed since Test data was
kept in consideration. On applying our technique for outliers
identification on Test data, significant number of outliers were
identified and removing them could have caused trimming of test
data by 15-20%.
Hence, outliers for below mentioned factors have only been
removed.
BASIC OUTLIERS WERE CLEARED ON THE BASIS OF A BOX PLOT FOR
EACH OF THE CLUSTERS FORMED.
Revolving Utilization Age DebtRatio NumberOfTimes90DaysLate
Upper Bound 200% 63 7000 17
Lower Bound 20
Exploratory Analysis
Our analysis began with data visualization considering different dimensions.
Count Of NPA vs Age Count of NPA on Credit Lines vs Age
The above pattern showed us a distribution similar to normal distribution. Hence, we got a
clustering dimension as Age. Similarly, visualization was done on 13 combinations to have a
clear analysis of relationship amongst factors.
DECISION MAKING FACTORS
Debt Ratio : Normalized form of Debt Ratio given in each cluster.
Normalized Debt Ratio : Debt Ratio / Max(Debt Ratio in each cluster)
Past Default : Weighted Normalized average of past defaults.
Formula : (0.17*No. of defaults in 30-59days) + ( 0.33*No. of defaults in 60-89days)+(0.50*No. of defaults
+90days)
Utilization of Unsecured Credit Lines : Check the utilization of number of open credit lines which are
other than real estate / long term
Formula : Normalized value of( (No. of open credit lines - No. of Real Estate Loans )*Revolving Utilization of
unsecured lines)
Number of Dependents : Normalized value of number of dependents for a borrower in a cluster
FACTORS IMPACTING CLUSTERS
Debt
Ratio
Past
Default
Utilization of
unsecured lines
Number of
Dependents Final Score
Cluster 1 0.25 0.5 0.15 0.10
Weighted Sum of all Factors
for each Cluster.
If Final Score >0.10, Then
Borrower qualifies for NPA
Else Borrower does not
qualify for NPA.
(94.12% Accuracy)
Cluster 2 0.25 0.5 0.15 0.10
Cluster 3 0.25 0.5 0.15 0.10
Cluster 4 0.25 0.5 0.15 0.10
Cluster 5 0.25 0.45 0.15 0.15
Cluster 6 0.25 0.45 0.15 0.15
Cluster 7 0.25 0.45 0.15 0.15
Cluster 8 0.25 0.45 0.15 0.15
Cluster 9 0.2 0.35 0.15 0.20
Cluster 10 0.2 0.35 0.15 0.20
Cluster 11 0.2 0.35 0.15 0.20
Cluster 12 0.2 0.35 0.15 0.20
Cluster 13 0.2 0.3 0.25 0.25
Cluster 14 0.2 0.3 0.25 0.25
Cluster 15 0.2 0.3 0.25 0.25
Cluster 16 0.2 0.3 0.25 0.25
Team Members and CGPA
Apoorv Parmar 6.63
Kushaang Deswal 7.96
Hemal Aurora 7.43
Mudith 7.92
THANK YOU

More Related Content

Similar to Mafias in training- Resolvr, The SmartCube

Credit Its A Brand New Day
Credit Its A Brand New DayCredit Its A Brand New Day
Credit Its A Brand New Daypeglover
 
Introduction to predictive modeling v1
Introduction to predictive modeling v1Introduction to predictive modeling v1
Introduction to predictive modeling v1Venkata Reddy Konasani
 
Getting out of_debt_presentation(1)
Getting out of_debt_presentation(1)Getting out of_debt_presentation(1)
Getting out of_debt_presentation(1)Shannon Gilliland
 
Getting out of_debt_presentation
Getting out of_debt_presentationGetting out of_debt_presentation
Getting out of_debt_presentationShannon Gilliland
 
Estimation of the probability of default : Credit Rish
Estimation of the probability of default : Credit RishEstimation of the probability of default : Credit Rish
Estimation of the probability of default : Credit RishArsalan Qadri
 
Exploratory Data Analysis For Credit Risk Assesment
Exploratory Data Analysis For Credit Risk AssesmentExploratory Data Analysis For Credit Risk Assesment
Exploratory Data Analysis For Credit Risk AssesmentVishalPatil527
 
Non-Interest Income and Future Business Models
Non-Interest Income and Future Business Models Non-Interest Income and Future Business Models
Non-Interest Income and Future Business Models NAFCU Services Corporation
 
Credit risk scoring model final
Credit risk scoring model finalCredit risk scoring model final
Credit risk scoring model finalRitu Sarkar
 
Ultimate Credit Score Guide
Ultimate Credit Score GuideUltimate Credit Score Guide
Ultimate Credit Score GuideEric Stephenson
 
How your Business Credit Scores are Calculated
How your Business Credit Scores are CalculatedHow your Business Credit Scores are Calculated
How your Business Credit Scores are CalculatedJanet Gershen-Siegel
 
Stanford CS 007-06 (2021): Personal Finance for Engineers / Debt
Stanford CS 007-06 (2021): Personal Finance for Engineers / DebtStanford CS 007-06 (2021): Personal Finance for Engineers / Debt
Stanford CS 007-06 (2021): Personal Finance for Engineers / DebtAdam Nash
 
Credit Its A Brand New Day
Credit Its A Brand New DayCredit Its A Brand New Day
Credit Its A Brand New DayAndre Williams
 
Understanding Your Credit Score
Understanding Your Credit ScoreUnderstanding Your Credit Score
Understanding Your Credit ScoreLexington Law Firm
 
Financing yoursmallbusiness lb-class
Financing yoursmallbusiness lb-classFinancing yoursmallbusiness lb-class
Financing yoursmallbusiness lb-classLady Bizness
 
Credit Risk Evaluation Model
Credit Risk Evaluation ModelCredit Risk Evaluation Model
Credit Risk Evaluation ModelMihai Enescu
 

Similar to Mafias in training- Resolvr, The SmartCube (20)

Credit Its A Brand New Day
Credit Its A Brand New DayCredit Its A Brand New Day
Credit Its A Brand New Day
 
Introduction to predictive modeling v1
Introduction to predictive modeling v1Introduction to predictive modeling v1
Introduction to predictive modeling v1
 
Getting out of_debt_presentation(1)
Getting out of_debt_presentation(1)Getting out of_debt_presentation(1)
Getting out of_debt_presentation(1)
 
Getting out of_debt_presentation
Getting out of_debt_presentationGetting out of_debt_presentation
Getting out of_debt_presentation
 
Estimation of the probability of default : Credit Rish
Estimation of the probability of default : Credit RishEstimation of the probability of default : Credit Rish
Estimation of the probability of default : Credit Rish
 
Exploratory Data Analysis For Credit Risk Assesment
Exploratory Data Analysis For Credit Risk AssesmentExploratory Data Analysis For Credit Risk Assesment
Exploratory Data Analysis For Credit Risk Assesment
 
Non-Interest Income and Future Business Models
Non-Interest Income and Future Business Models Non-Interest Income and Future Business Models
Non-Interest Income and Future Business Models
 
Credit risk scoring model final
Credit risk scoring model finalCredit risk scoring model final
Credit risk scoring model final
 
The Power of Credit
The Power of CreditThe Power of Credit
The Power of Credit
 
Ultimate Credit Score Guide
Ultimate Credit Score GuideUltimate Credit Score Guide
Ultimate Credit Score Guide
 
retailing-credit card
retailing-credit cardretailing-credit card
retailing-credit card
 
Nustart Presentation
Nustart PresentationNustart Presentation
Nustart Presentation
 
How your Business Credit Scores are Calculated
How your Business Credit Scores are CalculatedHow your Business Credit Scores are Calculated
How your Business Credit Scores are Calculated
 
Stanford CS 007-06 (2021): Personal Finance for Engineers / Debt
Stanford CS 007-06 (2021): Personal Finance for Engineers / DebtStanford CS 007-06 (2021): Personal Finance for Engineers / Debt
Stanford CS 007-06 (2021): Personal Finance for Engineers / Debt
 
Credit Its A Brand New Day
Credit Its A Brand New DayCredit Its A Brand New Day
Credit Its A Brand New Day
 
Understanding Your Credit Score
Understanding Your Credit ScoreUnderstanding Your Credit Score
Understanding Your Credit Score
 
Credit Score in Canada
Credit Score in CanadaCredit Score in Canada
Credit Score in Canada
 
Financing yoursmallbusiness lb-class
Financing yoursmallbusiness lb-classFinancing yoursmallbusiness lb-class
Financing yoursmallbusiness lb-class
 
Credit 101
Credit 101Credit 101
Credit 101
 
Credit Risk Evaluation Model
Credit Risk Evaluation ModelCredit Risk Evaluation Model
Credit Risk Evaluation Model
 

More from Apoorv Parmar

Summer Internship Project - Triad Technologies
Summer Internship Project - Triad TechnologiesSummer Internship Project - Triad Technologies
Summer Internship Project - Triad TechnologiesApoorv Parmar
 
Mafias in training - Vishleshan
Mafias in training - VishleshanMafias in training - Vishleshan
Mafias in training - VishleshanApoorv Parmar
 
Mafias in training - ACCS Round 2
Mafias in training - ACCS Round 2Mafias in training - ACCS Round 2
Mafias in training - ACCS Round 2Apoorv Parmar
 
Mafias in training Team - Kudwi Dataset Analysis
Mafias in training Team - Kudwi Dataset AnalysisMafias in training Team - Kudwi Dataset Analysis
Mafias in training Team - Kudwi Dataset AnalysisApoorv Parmar
 
Business Applications of Virtual Reality
Business Applications of Virtual RealityBusiness Applications of Virtual Reality
Business Applications of Virtual RealityApoorv Parmar
 
Business Plan - Hibiscus Iced Tea
Business Plan - Hibiscus Iced TeaBusiness Plan - Hibiscus Iced Tea
Business Plan - Hibiscus Iced TeaApoorv Parmar
 
Termination at Indian Steel Plant.
Termination at Indian Steel Plant.Termination at Indian Steel Plant.
Termination at Indian Steel Plant.Apoorv Parmar
 

More from Apoorv Parmar (8)

Summer Internship Project - Triad Technologies
Summer Internship Project - Triad TechnologiesSummer Internship Project - Triad Technologies
Summer Internship Project - Triad Technologies
 
Mafias in training - Vishleshan
Mafias in training - VishleshanMafias in training - Vishleshan
Mafias in training - Vishleshan
 
Mafias in training - ACCS Round 2
Mafias in training - ACCS Round 2Mafias in training - ACCS Round 2
Mafias in training - ACCS Round 2
 
Mafias in training Team - Kudwi Dataset Analysis
Mafias in training Team - Kudwi Dataset AnalysisMafias in training Team - Kudwi Dataset Analysis
Mafias in training Team - Kudwi Dataset Analysis
 
Business Applications of Virtual Reality
Business Applications of Virtual RealityBusiness Applications of Virtual Reality
Business Applications of Virtual Reality
 
Business Plan - Hibiscus Iced Tea
Business Plan - Hibiscus Iced TeaBusiness Plan - Hibiscus Iced Tea
Business Plan - Hibiscus Iced Tea
 
Termination at Indian Steel Plant.
Termination at Indian Steel Plant.Termination at Indian Steel Plant.
Termination at Indian Steel Plant.
 
Marketing myopia
Marketing myopiaMarketing myopia
Marketing myopia
 

Recently uploaded

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 

Recently uploaded (20)

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 

Mafias in training- Resolvr, The SmartCube

  • 1. SmartCube Data Analytics Case Study Team: Mafias In Training Team Members: Apoorv Parmar Kushaang Deswal Hemal Aurora Mudith
  • 2. Executive Summary • The basis of identifying NPA stems from the age of the applicant. • The dataset has been divided into 4 clusters based on age of the applicant. – 20-41 Year • Low Monthly Income • Less Dependents – 42-62 • Increased Monthly Income • Increase In Dependents – 63-96 • Retirement age • Giving Loan not Preferred • Only Credit Card Lines, no Real Estate Loans – Greater than 96 • Loan attached to spouse/Children in eligibility age of dependent is <63 • Thus we were able to obtain 16 clusters in which the data was divided. • New Features added to the dataset. – Credit Card Lines = Total open Credit Lines – Total Real Estate lines – Utilization = Credit card lines * Revolving Utilization of unsecured Lines – Per Capita Income of Family = Monthly Income/(Number of Dependents + 1) • The other major contributing factor is the Debt ratio. The second level of clustering is done based on debt ratio. – Debt Ratio >=0.5 • Greater amount of credit lines • Less monthly income – Debt Ratio <0.5 • Less Credit lines • Greater Monthly income • The third level of clustering is done on the basis of NumberOfTimes90DaysLate. – If the applicant has defaulted in 90 days, the probability of it to go NPA in high. – If the applicant has not defaulted in 90 days, probability of it to go NPA is less
  • 3. CLUSTERING Dataset Age : [21,41] Debt Ratio<=0.50 +90days past default =0 +90days past default !=90 Debt Ratio > 0.50 +90days past default =0 +90days past default !=90 Age : [42,62] Debt Ratio<=0.50 +90days past default =0 +90days past default !=90 Debt Ratio > 0.50 +90days past default =0 +90days past default !=90 Age : [63,96] Debt Ratio<=0.50 +90days past default =0 +90days past default !=90 Debt Ratio > 0.50 +90days past default =0 +90days past default !=90 Age : [97,109] Debt Ratio<=0.50 +90days past default =0 +90days past default !=90 Debt Ratio > 0.50 +90days past default =0 +90days past default !=90 Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10 Cluster 11 Cluster 12 Cluster 13 Cluster 14 Cluster 15 Cluster 16
  • 4. INSIGHTS • Age, Debt Ratio & Payment History have considerable impact on tendency of a customer to convert into NPA. • If borrowers cross the age of 65years then: • They can only avail credit cards as a loan independently. • Only Real Estate loan can be obtained by them in partnership with their Spouse / Children of eligibility age < 65 years. OPINIONS • Loans to customers with age > 65years should be given real estate loan only if they have dependents due to low mortality rate. • Revolving Utilization must not be allowed to hop over 100%.
  • 5. Technique Used for filling missing data MICE : Multiple Imputation by Chained Equations, principle method of dealing with missing data. Removal of Outliers Outliers have been conservatively removed since Test data was kept in consideration. On applying our technique for outliers identification on Test data, significant number of outliers were identified and removing them could have caused trimming of test data by 15-20%. Hence, outliers for below mentioned factors have only been removed. BASIC OUTLIERS WERE CLEARED ON THE BASIS OF A BOX PLOT FOR EACH OF THE CLUSTERS FORMED. Revolving Utilization Age DebtRatio NumberOfTimes90DaysLate Upper Bound 200% 63 7000 17 Lower Bound 20
  • 6. Exploratory Analysis Our analysis began with data visualization considering different dimensions. Count Of NPA vs Age Count of NPA on Credit Lines vs Age The above pattern showed us a distribution similar to normal distribution. Hence, we got a clustering dimension as Age. Similarly, visualization was done on 13 combinations to have a clear analysis of relationship amongst factors.
  • 7. DECISION MAKING FACTORS Debt Ratio : Normalized form of Debt Ratio given in each cluster. Normalized Debt Ratio : Debt Ratio / Max(Debt Ratio in each cluster) Past Default : Weighted Normalized average of past defaults. Formula : (0.17*No. of defaults in 30-59days) + ( 0.33*No. of defaults in 60-89days)+(0.50*No. of defaults +90days) Utilization of Unsecured Credit Lines : Check the utilization of number of open credit lines which are other than real estate / long term Formula : Normalized value of( (No. of open credit lines - No. of Real Estate Loans )*Revolving Utilization of unsecured lines) Number of Dependents : Normalized value of number of dependents for a borrower in a cluster
  • 8. FACTORS IMPACTING CLUSTERS Debt Ratio Past Default Utilization of unsecured lines Number of Dependents Final Score Cluster 1 0.25 0.5 0.15 0.10 Weighted Sum of all Factors for each Cluster. If Final Score >0.10, Then Borrower qualifies for NPA Else Borrower does not qualify for NPA. (94.12% Accuracy) Cluster 2 0.25 0.5 0.15 0.10 Cluster 3 0.25 0.5 0.15 0.10 Cluster 4 0.25 0.5 0.15 0.10 Cluster 5 0.25 0.45 0.15 0.15 Cluster 6 0.25 0.45 0.15 0.15 Cluster 7 0.25 0.45 0.15 0.15 Cluster 8 0.25 0.45 0.15 0.15 Cluster 9 0.2 0.35 0.15 0.20 Cluster 10 0.2 0.35 0.15 0.20 Cluster 11 0.2 0.35 0.15 0.20 Cluster 12 0.2 0.35 0.15 0.20 Cluster 13 0.2 0.3 0.25 0.25 Cluster 14 0.2 0.3 0.25 0.25 Cluster 15 0.2 0.3 0.25 0.25 Cluster 16 0.2 0.3 0.25 0.25
  • 9. Team Members and CGPA Apoorv Parmar 6.63 Kushaang Deswal 7.96 Hemal Aurora 7.43 Mudith 7.92