SlideShare a Scribd company logo
1 of 10
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
Title: Decreased FICO score is associated with increased interest rate 
Introduction: 
Lending Club is an online financial community that brings together creditworthy 
borrowers and savvy investors so that both can benefit financially [1]. It allows its 
members to directly invest in and borrow from each other and so avoid the cost and 
complexity of the banking system. 
On the Lending Club site there are several files that contain complete loan data, including 
the current loan status and latest payment information. [2] The data used in this analysis 
represents a sample of 2,500 peer-to-peer loans issued by the Lending Club explained 
through 14 variables such as: monthly income, amount requested, FICO range (a range 
indicating the applicants FICO score) [3], inquiries in the last six months etc. The goal of 
this analysis is to establish if there is any correlation between the outcome variable – the 
interest rate of the loans – and the other variables especially considering the FICO score, 
which is a measure of the creditworthiness of the applicant. 
In this project we performed an analysis to determine if there was a significant association 
between the interest rate and the FICO score. Using exploratory analysis and standard 
multiple regression techniques we show that there is a significant negative relationship 
between the interest rate and the FICO score, even after adjusting for important 
confounders such as the length of the loan, the amount funded by the investors and the 
amount requested by the borrowers. 
Our analysis suggests that there is a significant, negative association between Interest 
Rate and FICO score. Our analysis estimates the relationship using a linear model relating 
one percent of interest rate to one unit of FICO score. There appears to be a strong inverse 
relationship between the two variables. 
Our results suggest that there are other variables such as loan length, amount requested by 
the borrower and amount funded by the investors which are associated with both interest 
rate and FICO score. Including these variables in the regression model relating interest 
rate to FICO score improves the model fit, but does not remove the significant positive 
relationship between the variables. 
Methods: 
Data Collection 
For our analysis we used the data loans from the Lending Club site from 2007 to 2011. 
The data were downloaded from lendingclub.com on November 16, 2013 using the R 
programming language [3]. 
Exploratory Analysis 
Exploratory analysis was performed by examining tables and plots of the observed data. 
We identified transformations to perform on the raw data on the basis of plots and 
knowledge of the scale of measured variables. Exploratory analysis was used to (1) 
identify missing values, (2) verify the quality of the data, and (3) determine the terms 
used in the regression model relating interest rate to FICO score. 
Statistical Modeling 
1 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
To relate interest rate to FICO score we performed a standard multivariate linear 
regression model [4]. Model selection was performed on the basis of our exploratory 
analysis and prior knowledge of the relationship between interest rate and FICO score, 
amount of the loan requested and the length in time of the loan. Coefficients were 
estimated with ordinary least squares and standard errors were calculated using standard 
asymptotic approximations [5]. 
Reproducibility 
All analyses performed in this manuscript are reproduced in the R markdown file 
loansdata.Rmd [6]. To reproduce the exact results presented in this manuscript the cached 
version of the analysis must be performed. 
Results: 
The loans data used in this analysis contains information on the amount requested by the 
borrower (Amount.Requested), the amount funded by the investors 
(Amount.Funded.By.Investors), the lending interest rate (Interest.rate), the length in time 
(in months) of the loan (Loan.Length), the purpose of the loan as stated by the applicant 
(Loan.Purpose), the percentage of consumer’s gross income that goes toward paying 
debts (Debt.To.Income.Ratio), the U.S. state of residence of the loan applicant (State), the 
ownership type of the home (Home.Ownership), the monthly income of the applicant (in 
dollars) (Monthly.income), a range indicating the applicants FICO score (FICO.range), 
the number of open lines of credit the applicant had at the time of application 
(Open.CREDIT.Lines), the total amount outstanding all lines of credit 
(Revolving.CREDIT.Balance), the number of authorized queries about the 
creditworthiness of the applicant in the 6 months before the loan was issued 
(Inquiries.in.the.Last.6.Months), the length of time employed at current job 
(Employment.Length). [5]. 
We identified 77 missing values in the data set we collected for the variable Employment 
Length, one missing value for the variable Monthly Income, 2 missing values each for the 
variables the number of open lines of credit the applicant had at the time of application 
(Open.CREDIT.Lines), the total amount outstanding all lines of credit 
(Revolving.CREDIT.Balance), the number of authorized queries about the 
creditworthiness of the applicant in the 6 months before the loan was issued 
(Inquiries.in.the.Last.6.Months). 
Three measured variables were outside the standard ranges: for the variable Home 
Ownership there are five options (none, other, owns, rents or has a mortgage), although 
there must have been only three: owns, rents or has a mortgage and for the variable 
Amount Funded by the Investors there are 2 negative values and 4 values of 0; for the 
variable the percentage of consumer’s gross income that goes toward paying debts 
(Debt.To.Income.Ratio) there are 8 values of 0% which we consider that must be 
removed because it represents the percentage of consumer’s gross income that goes 
toward paying the loans that were approved. 
2 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
After removing the missing values and the observations that were outside the standard 
ranges, the data now has 2403 observations and 14 variables. 
From the barplot of the variable FICO range we can see that the distribution is positively 
skewed with a long right tail (figure 1). 
Figure 1. Histogram of FICO Range 
The histogram of the interest rate shows a relatively normal distribution with mean 13 
(figure 2). The majority of the loans granted had an interest rate between 10,2% and 
15,8%. 
3 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
Figure 2. Histogram of Interest rate 
We performed some exploratory analysis and from the boxplots of the interest rate 
variable and the factor variables we observed that the monthly income of the borrower, 
the employment length, the type of the home ownership and the state from which was the 
borrower don’t have any impact on the size of the interest rate of the loan granted. The 
variables Loan Purpose, Open Credit Lines, Revolving Credit Balance, Inquiries in the 
last 6 months and Debt to income ratio have little correlation with the interest rate 
variable. The potential confounders are: the length of the loan, the amount founded by the 
investors and the amount requested by the borrowers. 
We decided to transform the variable FICO range into the variable FICO score which 
represent the average of the low number and the upper number of a FICO range for each 
observed loan granted. Subsequent analyses focus on this transformed FICO score 
variable. From the boxplot of the FICO range and interest rate we can observe a strongly 
negative association between the two (figure 3). The correlation coefficient between the 
interest rate and FICO score is -71%. 
4 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
Figure 3. The Boxplot between the Interest Rate and FICO Range 
We first fit a regression model relating interest rate to FICO score (figure 4). Taking into 
consideration that the multiple R squared is 50,3% which is not equal to the correlation 
coefficient of 71%, it means that there are confounders that explain the rest of 49,7% of 
the variation of the variable interest rate. 
Figure 4. The relationship between the Interest Rate and FICO score 
5 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
The correlation coefficient between the amount funded by the investors and the interest 
rate is 33%. The same coefficient is for the amount requested by the borrowers and the 
association between the interest rate and the loan length is 42%. The mean of the residuals 
is approx. 0, the variance is 8,6 and they follow a normal distribution positively skewed 
(figure 5). 
Figure 5. Residuals distribution for the linear model 
Residuals show patterns of non-random variation (figure 6). We attempted to explain 
those patterns by fitting models including potential confounders. 
6 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
Figure 6. The variation of residuals 
Our final regression model was: Interest.Rate = b0 + b1*FICO.score + 
b2*Amount.Funded.By.Investors + b3*Amount.Requested + f(Length.Loan) + e, 
where b0 is an intercept term and b1 represents the change in Interest rate associated with a 
change of one unit in FICO score at the same amount funded by investors, amount 
requested by borrowers and the same loan length of time. The term f(Length.Loan) 
represents a factor model with two different levels. This model explains 75% of the 
variation by one percent in the interest rate variable. The P-values show that all the 
coefficients are statistically significant. 
The error term e represents all sources of unmeasured and unmodeled random variation in 
interest rate. Our final regression model appeared to remove most of the non-random 
patterns of variation in the residuals. We observe that the residuals for the multivariate 
linear model follow a normal distribution with mean 0 and variation 4,38 (figure 7). 
7 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
Figure 7. Residuals distribution for multivariate linear regression 
From figure 8 we notice that the residuals’ variation for the multivariate linear model is 
smaller and that we can say it follows a White Noise frequency. 
Figure 8. Variation of residuals for multivariate linear model 
8 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
We observed a highly statistically significant (P = 2e-16) association between interest rate 
and FICO score. A change of one percent in Interest Rate corresponded to a change of b1 
= -0.08 FICO score (95% Confidence Interval: -0.088, -0.081). 
For example, for two loans at the same loan length, amount requested by the borrower, 
amount funded by the investors, we would expect an interest rate to increase by 1% at 
every 0.08 decrease in the FICO score. 
Conclusions: 
Our analysis suggests that there is a significant, negative association between Interest 
Rate and FICO score. Our analysis estimates the relationship using a linear model relating 
one percent of interest rate to one unit of FICO score. There appears to be a strong inverse 
relationship between the two variables. 
We also observed that other variables such as loan length, amount requested by the 
borrower and amount funded by the investors are associated with both interest rate and 
FICO score. Including these variables in the regression model relating interest rate to 
FICO score improves the model fit, but does not remove the significant positive 
relationship between the variables. 
Our analysis may be of interest to both investors and borrowers. Investors are interested 
in selecting the potential borrowers on the financial market at a low cost, to establish a 
fair interest rate and, in consequence, to build an efficient portfolio with a high return rate. 
Borrowers are also concerned in obtaining better interest rates at low costs. It could also 
be of interest to the Lending Club to support its members in selecting the proper partners. 
References 
1. LendingClub Corporation. URL: https://www.lendingclub.com/public/about-us.action 
Accessed 09/16/2014. 
2. LendingClub Corporation. URL: https://www.lendingclub.com/info/download-data. 
action, Accessed 09/16/2014 
3. http://en.wikipedia.org/wiki/Credit_score_in_the_United_States 
4. LendingClub Corporation. URL: https://spark-public. 
s3.amazonaws.com/dataanalysis/loansData.csv Accessed 09/16/2014 
5. https://spark-public.s3.amazonaws.com/dataanalysis/loansCodebook.pdf 
6. R Markdown Page. URL:http://www.rstudio.com/ide/docs/authoring/using_markdown. 
Accessed 09/16/2014 
9 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
We observed a highly statistically significant (P = 2e-16) association between interest rate 
and FICO score. A change of one percent in Interest Rate corresponded to a change of b1 
= -0.08 FICO score (95% Confidence Interval: -0.088, -0.081). 
For example, for two loans at the same loan length, amount requested by the borrower, 
amount funded by the investors, we would expect an interest rate to increase by 1% at 
every 0.08 decrease in the FICO score. 
Conclusions: 
Our analysis suggests that there is a significant, negative association between Interest 
Rate and FICO score. Our analysis estimates the relationship using a linear model relating 
one percent of interest rate to one unit of FICO score. There appears to be a strong inverse 
relationship between the two variables. 
We also observed that other variables such as loan length, amount requested by the 
borrower and amount funded by the investors are associated with both interest rate and 
FICO score. Including these variables in the regression model relating interest rate to 
FICO score improves the model fit, but does not remove the significant positive 
relationship between the variables. 
Our analysis may be of interest to both investors and borrowers. Investors are interested 
in selecting the potential borrowers on the financial market at a low cost, to establish a 
fair interest rate and, in consequence, to build an efficient portfolio with a high return rate. 
Borrowers are also concerned in obtaining better interest rates at low costs. It could also 
be of interest to the Lending Club to support its members in selecting the proper partners. 
References 
1. LendingClub Corporation. URL: https://www.lendingclub.com/public/about-us.action 
Accessed 09/16/2014. 
2. LendingClub Corporation. URL: https://www.lendingclub.com/info/download-data. 
action, Accessed 09/16/2014 
3. http://en.wikipedia.org/wiki/Credit_score_in_the_United_States 
4. LendingClub Corporation. URL: https://spark-public. 
s3.amazonaws.com/dataanalysis/loansData.csv Accessed 09/16/2014 
5. https://spark-public.s3.amazonaws.com/dataanalysis/loansCodebook.pdf 
6. R Markdown Page. URL:http://www.rstudio.com/ide/docs/authoring/using_markdown. 
Accessed 09/16/2014 
9 /9

More Related Content

What's hot

Predicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning AlgorithmsPredicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning AlgorithmsSagar Tupkar
 
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...inventionjournals
 
Benchmark the Actual Bond Prices
Benchmark the Actual Bond PricesBenchmark the Actual Bond Prices
Benchmark the Actual Bond PricesRan Zhang
 
Investment liquidity net income
Investment liquidity net incomeInvestment liquidity net income
Investment liquidity net incomeswajalacharya
 
ART1197.DOC
ART1197.DOCART1197.DOC
ART1197.DOCbutest
 
CECL Project Overview
CECL Project OverviewCECL Project Overview
CECL Project OverviewRohit Khurana
 
The ability of previous quarterly earnings, net interest margin, and average ...
The ability of previous quarterly earnings, net interest margin, and average ...The ability of previous quarterly earnings, net interest margin, and average ...
The ability of previous quarterly earnings, net interest margin, and average ...RyanMHolcomb
 
The relationship and effect of credit and liquidity risk on bank default
The relationship and effect of credit and liquidity risk on bank defaultThe relationship and effect of credit and liquidity risk on bank default
The relationship and effect of credit and liquidity risk on bank defaultAlexander Decker
 
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...SYRTO Project
 
Best Keyword Cover Search
Best Keyword Cover SearchBest Keyword Cover Search
Best Keyword Cover Search1crore projects
 
Game Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime DetectionGame Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime DetectionIOSR Journals
 

What's hot (17)

Predicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning AlgorithmsPredicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning Algorithms
 
Icbelsh150811
Icbelsh150811Icbelsh150811
Icbelsh150811
 
Forecasting peer-to-peer lending risk
Forecasting peer-to-peer lending riskForecasting peer-to-peer lending risk
Forecasting peer-to-peer lending risk
 
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
 
Lemna wp 2015-14
Lemna wp 2015-14Lemna wp 2015-14
Lemna wp 2015-14
 
Benchmark the Actual Bond Prices
Benchmark the Actual Bond PricesBenchmark the Actual Bond Prices
Benchmark the Actual Bond Prices
 
Investment liquidity net income
Investment liquidity net incomeInvestment liquidity net income
Investment liquidity net income
 
Credit defaulter analysis
Credit defaulter analysisCredit defaulter analysis
Credit defaulter analysis
 
Amano getahun
Amano getahunAmano getahun
Amano getahun
 
ART1197.DOC
ART1197.DOCART1197.DOC
ART1197.DOC
 
Investment int rate
Investment int rateInvestment int rate
Investment int rate
 
CECL Project Overview
CECL Project OverviewCECL Project Overview
CECL Project Overview
 
The ability of previous quarterly earnings, net interest margin, and average ...
The ability of previous quarterly earnings, net interest margin, and average ...The ability of previous quarterly earnings, net interest margin, and average ...
The ability of previous quarterly earnings, net interest margin, and average ...
 
The relationship and effect of credit and liquidity risk on bank default
The relationship and effect of credit and liquidity risk on bank defaultThe relationship and effect of credit and liquidity risk on bank default
The relationship and effect of credit and liquidity risk on bank default
 
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
 
Best Keyword Cover Search
Best Keyword Cover SearchBest Keyword Cover Search
Best Keyword Cover Search
 
Game Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime DetectionGame Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime Detection
 

Viewers also liked

Data analysis and statistical inference project
Data analysis and statistical inference projectData analysis and statistical inference project
Data analysis and statistical inference projectMaruşa Pescu (Beca)
 
Data_Processing_Program
Data_Processing_ProgramData_Processing_Program
Data_Processing_ProgramNeil Dahlqvist
 
Indirect tax (1)
Indirect tax (1)Indirect tax (1)
Indirect tax (1)9867097496
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testingSohail Patel
 
Property Tax Assessment Services
Property Tax Assessment ServicesProperty Tax Assessment Services
Property Tax Assessment Servicescutmytaxes
 
Research project report sumit b
Research project report sumit bResearch project report sumit b
Research project report sumit bsumit saxena
 
Research project on packaged drinking water industry
Research project on packaged drinking water industryResearch project on packaged drinking water industry
Research project on packaged drinking water industryPallav Tyagi
 
Standards of Auditing - Introduction and Application in the Indian Context
Standards of Auditing - Introduction and Application in the Indian ContextStandards of Auditing - Introduction and Application in the Indian Context
Standards of Auditing - Introduction and Application in the Indian ContextBharath Rao
 
Auditing Standards- IndusInd Bank
Auditing Standards- IndusInd BankAuditing Standards- IndusInd Bank
Auditing Standards- IndusInd BankNikita Jangid
 
Project-Student Financial Service System
Project-Student Financial Service SystemProject-Student Financial Service System
Project-Student Financial Service Systemchezhiang
 
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...Oghenovo Egbegbedia
 
STANDARDS ON AUDIT
STANDARDS  ON AUDITSTANDARDS  ON AUDIT
STANDARDS ON AUDITKinjal Gada
 
Company audit & accounts
Company audit & accounts  Company audit & accounts
Company audit & accounts Vivek Mahajan
 
Demonetization of Indian Currency
Demonetization of Indian CurrencyDemonetization of Indian Currency
Demonetization of Indian CurrencyJithin Scaria
 

Viewers also liked (20)

Data analysis and statistical inference project
Data analysis and statistical inference projectData analysis and statistical inference project
Data analysis and statistical inference project
 
Problems statistics 1
Problems statistics 1Problems statistics 1
Problems statistics 1
 
Exercises statistics
Exercises statisticsExercises statistics
Exercises statistics
 
Data_Processing_Program
Data_Processing_ProgramData_Processing_Program
Data_Processing_Program
 
Amar 38 final
Amar 38 finalAmar 38 final
Amar 38 final
 
Indirect tax (1)
Indirect tax (1)Indirect tax (1)
Indirect tax (1)
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Final goods & service tax
Final goods & service taxFinal goods & service tax
Final goods & service tax
 
Property Tax Assessment Services
Property Tax Assessment ServicesProperty Tax Assessment Services
Property Tax Assessment Services
 
Research project report sumit b
Research project report sumit bResearch project report sumit b
Research project report sumit b
 
Research project on packaged drinking water industry
Research project on packaged drinking water industryResearch project on packaged drinking water industry
Research project on packaged drinking water industry
 
Standards of Auditing - Introduction and Application in the Indian Context
Standards of Auditing - Introduction and Application in the Indian ContextStandards of Auditing - Introduction and Application in the Indian Context
Standards of Auditing - Introduction and Application in the Indian Context
 
Auditing Standards- IndusInd Bank
Auditing Standards- IndusInd BankAuditing Standards- IndusInd Bank
Auditing Standards- IndusInd Bank
 
Project-Student Financial Service System
Project-Student Financial Service SystemProject-Student Financial Service System
Project-Student Financial Service System
 
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...
 
STANDARDS ON AUDIT
STANDARDS  ON AUDITSTANDARDS  ON AUDIT
STANDARDS ON AUDIT
 
Project Report on e banking
Project Report on e bankingProject Report on e banking
Project Report on e banking
 
Company audit & accounts
Company audit & accounts  Company audit & accounts
Company audit & accounts
 
E-banking project
E-banking projectE-banking project
E-banking project
 
Demonetization of Indian Currency
Demonetization of Indian CurrencyDemonetization of Indian Currency
Demonetization of Indian Currency
 

Similar to Project data analysis

Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101Heather Lamoureux
 
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...inventionjournals
 
Investigation of mortgage lending an overview of nigerian practice
Investigation of mortgage lending an overview of nigerian practiceInvestigation of mortgage lending an overview of nigerian practice
Investigation of mortgage lending an overview of nigerian practiceAlexander Decker
 
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?Rusman Mukhlis
 
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...Alexander Decker
 
STRESS TESTING IN BANKING SECTOR FRAMEWORK
STRESS TESTING IN BANKING SECTOR FRAMEWORKSTRESS TESTING IN BANKING SECTOR FRAMEWORK
STRESS TESTING IN BANKING SECTOR FRAMEWORKDinabandhu Bag
 
An empirical analysis of the loan default rate of microfinance institutions
An empirical analysis of the loan default rate of microfinance institutionsAn empirical analysis of the loan default rate of microfinance institutions
An empirical analysis of the loan default rate of microfinance institutionsAlexander Decker
 
Thanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docxThanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docxtodd191
 
Thanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docxThanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docxarnoldmeredith47041
 
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.Fawaz Khaled
 
Rudebusch Lopez Christensen
Rudebusch Lopez ChristensenRudebusch Lopez Christensen
Rudebusch Lopez ChristensenPeter Ho
 
Apanps5210 - final presentation
Apanps5210 - final presentationApanps5210 - final presentation
Apanps5210 - final presentationManasa Damera
 
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...iosrjce
 
Mergers and acquisitions and bank performance in Europe the role of strategic...
Mergers and acquisitions and bank performance in Europe the role of strategic...Mergers and acquisitions and bank performance in Europe the role of strategic...
Mergers and acquisitions and bank performance in Europe the role of strategic...- -
 
Financial Markets - Interest Rates.pdf
Financial Markets - Interest Rates.pdfFinancial Markets - Interest Rates.pdf
Financial Markets - Interest Rates.pdfJeremiahEdisonCaraga
 

Similar to Project data analysis (20)

Programming for big data
Programming for big dataProgramming for big data
Programming for big data
 
Loan Characteristics as Predictors of Default in Commercial Mortgage Portfolios
Loan Characteristics as Predictors of Default in Commercial Mortgage PortfoliosLoan Characteristics as Predictors of Default in Commercial Mortgage Portfolios
Loan Characteristics as Predictors of Default in Commercial Mortgage Portfolios
 
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101
 
loan.docx
loan.docxloan.docx
loan.docx
 
Ppt m& b
Ppt m& bPpt m& b
Ppt m& b
 
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
 
Investigation of mortgage lending an overview of nigerian practice
Investigation of mortgage lending an overview of nigerian practiceInvestigation of mortgage lending an overview of nigerian practice
Investigation of mortgage lending an overview of nigerian practice
 
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?
 
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
 
STRESS TESTING IN BANKING SECTOR FRAMEWORK
STRESS TESTING IN BANKING SECTOR FRAMEWORKSTRESS TESTING IN BANKING SECTOR FRAMEWORK
STRESS TESTING IN BANKING SECTOR FRAMEWORK
 
An empirical analysis of the loan default rate of microfinance institutions
An empirical analysis of the loan default rate of microfinance institutionsAn empirical analysis of the loan default rate of microfinance institutions
An empirical analysis of the loan default rate of microfinance institutions
 
Thanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docxThanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docx
 
Thanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docxThanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docx
 
Credit Scoring of Turkey with Semiparametric Logit Models
Credit Scoring of Turkey with Semiparametric Logit ModelsCredit Scoring of Turkey with Semiparametric Logit Models
Credit Scoring of Turkey with Semiparametric Logit Models
 
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
 
Rudebusch Lopez Christensen
Rudebusch Lopez ChristensenRudebusch Lopez Christensen
Rudebusch Lopez Christensen
 
Apanps5210 - final presentation
Apanps5210 - final presentationApanps5210 - final presentation
Apanps5210 - final presentation
 
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
 
Mergers and acquisitions and bank performance in Europe the role of strategic...
Mergers and acquisitions and bank performance in Europe the role of strategic...Mergers and acquisitions and bank performance in Europe the role of strategic...
Mergers and acquisitions and bank performance in Europe the role of strategic...
 
Financial Markets - Interest Rates.pdf
Financial Markets - Interest Rates.pdfFinancial Markets - Interest Rates.pdf
Financial Markets - Interest Rates.pdf
 

Recently uploaded

RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookvip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookmanojkuma9823
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 

Recently uploaded (20)

RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookvip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 

Project data analysis

  • 1. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa Title: Decreased FICO score is associated with increased interest rate Introduction: Lending Club is an online financial community that brings together creditworthy borrowers and savvy investors so that both can benefit financially [1]. It allows its members to directly invest in and borrow from each other and so avoid the cost and complexity of the banking system. On the Lending Club site there are several files that contain complete loan data, including the current loan status and latest payment information. [2] The data used in this analysis represents a sample of 2,500 peer-to-peer loans issued by the Lending Club explained through 14 variables such as: monthly income, amount requested, FICO range (a range indicating the applicants FICO score) [3], inquiries in the last six months etc. The goal of this analysis is to establish if there is any correlation between the outcome variable – the interest rate of the loans – and the other variables especially considering the FICO score, which is a measure of the creditworthiness of the applicant. In this project we performed an analysis to determine if there was a significant association between the interest rate and the FICO score. Using exploratory analysis and standard multiple regression techniques we show that there is a significant negative relationship between the interest rate and the FICO score, even after adjusting for important confounders such as the length of the loan, the amount funded by the investors and the amount requested by the borrowers. Our analysis suggests that there is a significant, negative association between Interest Rate and FICO score. Our analysis estimates the relationship using a linear model relating one percent of interest rate to one unit of FICO score. There appears to be a strong inverse relationship between the two variables. Our results suggest that there are other variables such as loan length, amount requested by the borrower and amount funded by the investors which are associated with both interest rate and FICO score. Including these variables in the regression model relating interest rate to FICO score improves the model fit, but does not remove the significant positive relationship between the variables. Methods: Data Collection For our analysis we used the data loans from the Lending Club site from 2007 to 2011. The data were downloaded from lendingclub.com on November 16, 2013 using the R programming language [3]. Exploratory Analysis Exploratory analysis was performed by examining tables and plots of the observed data. We identified transformations to perform on the raw data on the basis of plots and knowledge of the scale of measured variables. Exploratory analysis was used to (1) identify missing values, (2) verify the quality of the data, and (3) determine the terms used in the regression model relating interest rate to FICO score. Statistical Modeling 1 /9
  • 2. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa To relate interest rate to FICO score we performed a standard multivariate linear regression model [4]. Model selection was performed on the basis of our exploratory analysis and prior knowledge of the relationship between interest rate and FICO score, amount of the loan requested and the length in time of the loan. Coefficients were estimated with ordinary least squares and standard errors were calculated using standard asymptotic approximations [5]. Reproducibility All analyses performed in this manuscript are reproduced in the R markdown file loansdata.Rmd [6]. To reproduce the exact results presented in this manuscript the cached version of the analysis must be performed. Results: The loans data used in this analysis contains information on the amount requested by the borrower (Amount.Requested), the amount funded by the investors (Amount.Funded.By.Investors), the lending interest rate (Interest.rate), the length in time (in months) of the loan (Loan.Length), the purpose of the loan as stated by the applicant (Loan.Purpose), the percentage of consumer’s gross income that goes toward paying debts (Debt.To.Income.Ratio), the U.S. state of residence of the loan applicant (State), the ownership type of the home (Home.Ownership), the monthly income of the applicant (in dollars) (Monthly.income), a range indicating the applicants FICO score (FICO.range), the number of open lines of credit the applicant had at the time of application (Open.CREDIT.Lines), the total amount outstanding all lines of credit (Revolving.CREDIT.Balance), the number of authorized queries about the creditworthiness of the applicant in the 6 months before the loan was issued (Inquiries.in.the.Last.6.Months), the length of time employed at current job (Employment.Length). [5]. We identified 77 missing values in the data set we collected for the variable Employment Length, one missing value for the variable Monthly Income, 2 missing values each for the variables the number of open lines of credit the applicant had at the time of application (Open.CREDIT.Lines), the total amount outstanding all lines of credit (Revolving.CREDIT.Balance), the number of authorized queries about the creditworthiness of the applicant in the 6 months before the loan was issued (Inquiries.in.the.Last.6.Months). Three measured variables were outside the standard ranges: for the variable Home Ownership there are five options (none, other, owns, rents or has a mortgage), although there must have been only three: owns, rents or has a mortgage and for the variable Amount Funded by the Investors there are 2 negative values and 4 values of 0; for the variable the percentage of consumer’s gross income that goes toward paying debts (Debt.To.Income.Ratio) there are 8 values of 0% which we consider that must be removed because it represents the percentage of consumer’s gross income that goes toward paying the loans that were approved. 2 /9
  • 3. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa After removing the missing values and the observations that were outside the standard ranges, the data now has 2403 observations and 14 variables. From the barplot of the variable FICO range we can see that the distribution is positively skewed with a long right tail (figure 1). Figure 1. Histogram of FICO Range The histogram of the interest rate shows a relatively normal distribution with mean 13 (figure 2). The majority of the loans granted had an interest rate between 10,2% and 15,8%. 3 /9
  • 4. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa Figure 2. Histogram of Interest rate We performed some exploratory analysis and from the boxplots of the interest rate variable and the factor variables we observed that the monthly income of the borrower, the employment length, the type of the home ownership and the state from which was the borrower don’t have any impact on the size of the interest rate of the loan granted. The variables Loan Purpose, Open Credit Lines, Revolving Credit Balance, Inquiries in the last 6 months and Debt to income ratio have little correlation with the interest rate variable. The potential confounders are: the length of the loan, the amount founded by the investors and the amount requested by the borrowers. We decided to transform the variable FICO range into the variable FICO score which represent the average of the low number and the upper number of a FICO range for each observed loan granted. Subsequent analyses focus on this transformed FICO score variable. From the boxplot of the FICO range and interest rate we can observe a strongly negative association between the two (figure 3). The correlation coefficient between the interest rate and FICO score is -71%. 4 /9
  • 5. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa Figure 3. The Boxplot between the Interest Rate and FICO Range We first fit a regression model relating interest rate to FICO score (figure 4). Taking into consideration that the multiple R squared is 50,3% which is not equal to the correlation coefficient of 71%, it means that there are confounders that explain the rest of 49,7% of the variation of the variable interest rate. Figure 4. The relationship between the Interest Rate and FICO score 5 /9
  • 6. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa The correlation coefficient between the amount funded by the investors and the interest rate is 33%. The same coefficient is for the amount requested by the borrowers and the association between the interest rate and the loan length is 42%. The mean of the residuals is approx. 0, the variance is 8,6 and they follow a normal distribution positively skewed (figure 5). Figure 5. Residuals distribution for the linear model Residuals show patterns of non-random variation (figure 6). We attempted to explain those patterns by fitting models including potential confounders. 6 /9
  • 7. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa Figure 6. The variation of residuals Our final regression model was: Interest.Rate = b0 + b1*FICO.score + b2*Amount.Funded.By.Investors + b3*Amount.Requested + f(Length.Loan) + e, where b0 is an intercept term and b1 represents the change in Interest rate associated with a change of one unit in FICO score at the same amount funded by investors, amount requested by borrowers and the same loan length of time. The term f(Length.Loan) represents a factor model with two different levels. This model explains 75% of the variation by one percent in the interest rate variable. The P-values show that all the coefficients are statistically significant. The error term e represents all sources of unmeasured and unmodeled random variation in interest rate. Our final regression model appeared to remove most of the non-random patterns of variation in the residuals. We observe that the residuals for the multivariate linear model follow a normal distribution with mean 0 and variation 4,38 (figure 7). 7 /9
  • 8. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa Figure 7. Residuals distribution for multivariate linear regression From figure 8 we notice that the residuals’ variation for the multivariate linear model is smaller and that we can say it follows a White Noise frequency. Figure 8. Variation of residuals for multivariate linear model 8 /9
  • 9. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa We observed a highly statistically significant (P = 2e-16) association between interest rate and FICO score. A change of one percent in Interest Rate corresponded to a change of b1 = -0.08 FICO score (95% Confidence Interval: -0.088, -0.081). For example, for two loans at the same loan length, amount requested by the borrower, amount funded by the investors, we would expect an interest rate to increase by 1% at every 0.08 decrease in the FICO score. Conclusions: Our analysis suggests that there is a significant, negative association between Interest Rate and FICO score. Our analysis estimates the relationship using a linear model relating one percent of interest rate to one unit of FICO score. There appears to be a strong inverse relationship between the two variables. We also observed that other variables such as loan length, amount requested by the borrower and amount funded by the investors are associated with both interest rate and FICO score. Including these variables in the regression model relating interest rate to FICO score improves the model fit, but does not remove the significant positive relationship between the variables. Our analysis may be of interest to both investors and borrowers. Investors are interested in selecting the potential borrowers on the financial market at a low cost, to establish a fair interest rate and, in consequence, to build an efficient portfolio with a high return rate. Borrowers are also concerned in obtaining better interest rates at low costs. It could also be of interest to the Lending Club to support its members in selecting the proper partners. References 1. LendingClub Corporation. URL: https://www.lendingclub.com/public/about-us.action Accessed 09/16/2014. 2. LendingClub Corporation. URL: https://www.lendingclub.com/info/download-data. action, Accessed 09/16/2014 3. http://en.wikipedia.org/wiki/Credit_score_in_the_United_States 4. LendingClub Corporation. URL: https://spark-public. s3.amazonaws.com/dataanalysis/loansData.csv Accessed 09/16/2014 5. https://spark-public.s3.amazonaws.com/dataanalysis/loansCodebook.pdf 6. R Markdown Page. URL:http://www.rstudio.com/ide/docs/authoring/using_markdown. Accessed 09/16/2014 9 /9
  • 10. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa We observed a highly statistically significant (P = 2e-16) association between interest rate and FICO score. A change of one percent in Interest Rate corresponded to a change of b1 = -0.08 FICO score (95% Confidence Interval: -0.088, -0.081). For example, for two loans at the same loan length, amount requested by the borrower, amount funded by the investors, we would expect an interest rate to increase by 1% at every 0.08 decrease in the FICO score. Conclusions: Our analysis suggests that there is a significant, negative association between Interest Rate and FICO score. Our analysis estimates the relationship using a linear model relating one percent of interest rate to one unit of FICO score. There appears to be a strong inverse relationship between the two variables. We also observed that other variables such as loan length, amount requested by the borrower and amount funded by the investors are associated with both interest rate and FICO score. Including these variables in the regression model relating interest rate to FICO score improves the model fit, but does not remove the significant positive relationship between the variables. Our analysis may be of interest to both investors and borrowers. Investors are interested in selecting the potential borrowers on the financial market at a low cost, to establish a fair interest rate and, in consequence, to build an efficient portfolio with a high return rate. Borrowers are also concerned in obtaining better interest rates at low costs. It could also be of interest to the Lending Club to support its members in selecting the proper partners. References 1. LendingClub Corporation. URL: https://www.lendingclub.com/public/about-us.action Accessed 09/16/2014. 2. LendingClub Corporation. URL: https://www.lendingclub.com/info/download-data. action, Accessed 09/16/2014 3. http://en.wikipedia.org/wiki/Credit_score_in_the_United_States 4. LendingClub Corporation. URL: https://spark-public. s3.amazonaws.com/dataanalysis/loansData.csv Accessed 09/16/2014 5. https://spark-public.s3.amazonaws.com/dataanalysis/loansCodebook.pdf 6. R Markdown Page. URL:http://www.rstudio.com/ide/docs/authoring/using_markdown. Accessed 09/16/2014 9 /9