SlideShare a Scribd company logo
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
Title: Decreased FICO score is associated with increased interest rate 
Introduction: 
Lending Club is an online financial community that brings together creditworthy 
borrowers and savvy investors so that both can benefit financially [1]. It allows its 
members to directly invest in and borrow from each other and so avoid the cost and 
complexity of the banking system. 
On the Lending Club site there are several files that contain complete loan data, including 
the current loan status and latest payment information. [2] The data used in this analysis 
represents a sample of 2,500 peer-to-peer loans issued by the Lending Club explained 
through 14 variables such as: monthly income, amount requested, FICO range (a range 
indicating the applicants FICO score) [3], inquiries in the last six months etc. The goal of 
this analysis is to establish if there is any correlation between the outcome variable – the 
interest rate of the loans – and the other variables especially considering the FICO score, 
which is a measure of the creditworthiness of the applicant. 
In this project we performed an analysis to determine if there was a significant association 
between the interest rate and the FICO score. Using exploratory analysis and standard 
multiple regression techniques we show that there is a significant negative relationship 
between the interest rate and the FICO score, even after adjusting for important 
confounders such as the length of the loan, the amount funded by the investors and the 
amount requested by the borrowers. 
Our analysis suggests that there is a significant, negative association between Interest 
Rate and FICO score. Our analysis estimates the relationship using a linear model relating 
one percent of interest rate to one unit of FICO score. There appears to be a strong inverse 
relationship between the two variables. 
Our results suggest that there are other variables such as loan length, amount requested by 
the borrower and amount funded by the investors which are associated with both interest 
rate and FICO score. Including these variables in the regression model relating interest 
rate to FICO score improves the model fit, but does not remove the significant positive 
relationship between the variables. 
Methods: 
Data Collection 
For our analysis we used the data loans from the Lending Club site from 2007 to 2011. 
The data were downloaded from lendingclub.com on November 16, 2013 using the R 
programming language [3]. 
Exploratory Analysis 
Exploratory analysis was performed by examining tables and plots of the observed data. 
We identified transformations to perform on the raw data on the basis of plots and 
knowledge of the scale of measured variables. Exploratory analysis was used to (1) 
identify missing values, (2) verify the quality of the data, and (3) determine the terms 
used in the regression model relating interest rate to FICO score. 
Statistical Modeling 
1 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
To relate interest rate to FICO score we performed a standard multivariate linear 
regression model [4]. Model selection was performed on the basis of our exploratory 
analysis and prior knowledge of the relationship between interest rate and FICO score, 
amount of the loan requested and the length in time of the loan. Coefficients were 
estimated with ordinary least squares and standard errors were calculated using standard 
asymptotic approximations [5]. 
Reproducibility 
All analyses performed in this manuscript are reproduced in the R markdown file 
loansdata.Rmd [6]. To reproduce the exact results presented in this manuscript the cached 
version of the analysis must be performed. 
Results: 
The loans data used in this analysis contains information on the amount requested by the 
borrower (Amount.Requested), the amount funded by the investors 
(Amount.Funded.By.Investors), the lending interest rate (Interest.rate), the length in time 
(in months) of the loan (Loan.Length), the purpose of the loan as stated by the applicant 
(Loan.Purpose), the percentage of consumer’s gross income that goes toward paying 
debts (Debt.To.Income.Ratio), the U.S. state of residence of the loan applicant (State), the 
ownership type of the home (Home.Ownership), the monthly income of the applicant (in 
dollars) (Monthly.income), a range indicating the applicants FICO score (FICO.range), 
the number of open lines of credit the applicant had at the time of application 
(Open.CREDIT.Lines), the total amount outstanding all lines of credit 
(Revolving.CREDIT.Balance), the number of authorized queries about the 
creditworthiness of the applicant in the 6 months before the loan was issued 
(Inquiries.in.the.Last.6.Months), the length of time employed at current job 
(Employment.Length). [5]. 
We identified 77 missing values in the data set we collected for the variable Employment 
Length, one missing value for the variable Monthly Income, 2 missing values each for the 
variables the number of open lines of credit the applicant had at the time of application 
(Open.CREDIT.Lines), the total amount outstanding all lines of credit 
(Revolving.CREDIT.Balance), the number of authorized queries about the 
creditworthiness of the applicant in the 6 months before the loan was issued 
(Inquiries.in.the.Last.6.Months). 
Three measured variables were outside the standard ranges: for the variable Home 
Ownership there are five options (none, other, owns, rents or has a mortgage), although 
there must have been only three: owns, rents or has a mortgage and for the variable 
Amount Funded by the Investors there are 2 negative values and 4 values of 0; for the 
variable the percentage of consumer’s gross income that goes toward paying debts 
(Debt.To.Income.Ratio) there are 8 values of 0% which we consider that must be 
removed because it represents the percentage of consumer’s gross income that goes 
toward paying the loans that were approved. 
2 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
After removing the missing values and the observations that were outside the standard 
ranges, the data now has 2403 observations and 14 variables. 
From the barplot of the variable FICO range we can see that the distribution is positively 
skewed with a long right tail (figure 1). 
Figure 1. Histogram of FICO Range 
The histogram of the interest rate shows a relatively normal distribution with mean 13 
(figure 2). The majority of the loans granted had an interest rate between 10,2% and 
15,8%. 
3 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
Figure 2. Histogram of Interest rate 
We performed some exploratory analysis and from the boxplots of the interest rate 
variable and the factor variables we observed that the monthly income of the borrower, 
the employment length, the type of the home ownership and the state from which was the 
borrower don’t have any impact on the size of the interest rate of the loan granted. The 
variables Loan Purpose, Open Credit Lines, Revolving Credit Balance, Inquiries in the 
last 6 months and Debt to income ratio have little correlation with the interest rate 
variable. The potential confounders are: the length of the loan, the amount founded by the 
investors and the amount requested by the borrowers. 
We decided to transform the variable FICO range into the variable FICO score which 
represent the average of the low number and the upper number of a FICO range for each 
observed loan granted. Subsequent analyses focus on this transformed FICO score 
variable. From the boxplot of the FICO range and interest rate we can observe a strongly 
negative association between the two (figure 3). The correlation coefficient between the 
interest rate and FICO score is -71%. 
4 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
Figure 3. The Boxplot between the Interest Rate and FICO Range 
We first fit a regression model relating interest rate to FICO score (figure 4). Taking into 
consideration that the multiple R squared is 50,3% which is not equal to the correlation 
coefficient of 71%, it means that there are confounders that explain the rest of 49,7% of 
the variation of the variable interest rate. 
Figure 4. The relationship between the Interest Rate and FICO score 
5 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
The correlation coefficient between the amount funded by the investors and the interest 
rate is 33%. The same coefficient is for the amount requested by the borrowers and the 
association between the interest rate and the loan length is 42%. The mean of the residuals 
is approx. 0, the variance is 8,6 and they follow a normal distribution positively skewed 
(figure 5). 
Figure 5. Residuals distribution for the linear model 
Residuals show patterns of non-random variation (figure 6). We attempted to explain 
those patterns by fitting models including potential confounders. 
6 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
Figure 6. The variation of residuals 
Our final regression model was: Interest.Rate = b0 + b1*FICO.score + 
b2*Amount.Funded.By.Investors + b3*Amount.Requested + f(Length.Loan) + e, 
where b0 is an intercept term and b1 represents the change in Interest rate associated with a 
change of one unit in FICO score at the same amount funded by investors, amount 
requested by borrowers and the same loan length of time. The term f(Length.Loan) 
represents a factor model with two different levels. This model explains 75% of the 
variation by one percent in the interest rate variable. The P-values show that all the 
coefficients are statistically significant. 
The error term e represents all sources of unmeasured and unmodeled random variation in 
interest rate. Our final regression model appeared to remove most of the non-random 
patterns of variation in the residuals. We observe that the residuals for the multivariate 
linear model follow a normal distribution with mean 0 and variation 4,38 (figure 7). 
7 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
Figure 7. Residuals distribution for multivariate linear regression 
From figure 8 we notice that the residuals’ variation for the multivariate linear model is 
smaller and that we can say it follows a White Noise frequency. 
Figure 8. Variation of residuals for multivariate linear model 
8 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
We observed a highly statistically significant (P = 2e-16) association between interest rate 
and FICO score. A change of one percent in Interest Rate corresponded to a change of b1 
= -0.08 FICO score (95% Confidence Interval: -0.088, -0.081). 
For example, for two loans at the same loan length, amount requested by the borrower, 
amount funded by the investors, we would expect an interest rate to increase by 1% at 
every 0.08 decrease in the FICO score. 
Conclusions: 
Our analysis suggests that there is a significant, negative association between Interest 
Rate and FICO score. Our analysis estimates the relationship using a linear model relating 
one percent of interest rate to one unit of FICO score. There appears to be a strong inverse 
relationship between the two variables. 
We also observed that other variables such as loan length, amount requested by the 
borrower and amount funded by the investors are associated with both interest rate and 
FICO score. Including these variables in the regression model relating interest rate to 
FICO score improves the model fit, but does not remove the significant positive 
relationship between the variables. 
Our analysis may be of interest to both investors and borrowers. Investors are interested 
in selecting the potential borrowers on the financial market at a low cost, to establish a 
fair interest rate and, in consequence, to build an efficient portfolio with a high return rate. 
Borrowers are also concerned in obtaining better interest rates at low costs. It could also 
be of interest to the Lending Club to support its members in selecting the proper partners. 
References 
1. LendingClub Corporation. URL: https://www.lendingclub.com/public/about-us.action 
Accessed 09/16/2014. 
2. LendingClub Corporation. URL: https://www.lendingclub.com/info/download-data. 
action, Accessed 09/16/2014 
3. http://en.wikipedia.org/wiki/Credit_score_in_the_United_States 
4. LendingClub Corporation. URL: https://spark-public. 
s3.amazonaws.com/dataanalysis/loansData.csv Accessed 09/16/2014 
5. https://spark-public.s3.amazonaws.com/dataanalysis/loansCodebook.pdf 
6. R Markdown Page. URL:http://www.rstudio.com/ide/docs/authoring/using_markdown. 
Accessed 09/16/2014 
9 /9
Title: Increased earthquake depth is associated with increased magnitude 
Beca Marușa 
We observed a highly statistically significant (P = 2e-16) association between interest rate 
and FICO score. A change of one percent in Interest Rate corresponded to a change of b1 
= -0.08 FICO score (95% Confidence Interval: -0.088, -0.081). 
For example, for two loans at the same loan length, amount requested by the borrower, 
amount funded by the investors, we would expect an interest rate to increase by 1% at 
every 0.08 decrease in the FICO score. 
Conclusions: 
Our analysis suggests that there is a significant, negative association between Interest 
Rate and FICO score. Our analysis estimates the relationship using a linear model relating 
one percent of interest rate to one unit of FICO score. There appears to be a strong inverse 
relationship between the two variables. 
We also observed that other variables such as loan length, amount requested by the 
borrower and amount funded by the investors are associated with both interest rate and 
FICO score. Including these variables in the regression model relating interest rate to 
FICO score improves the model fit, but does not remove the significant positive 
relationship between the variables. 
Our analysis may be of interest to both investors and borrowers. Investors are interested 
in selecting the potential borrowers on the financial market at a low cost, to establish a 
fair interest rate and, in consequence, to build an efficient portfolio with a high return rate. 
Borrowers are also concerned in obtaining better interest rates at low costs. It could also 
be of interest to the Lending Club to support its members in selecting the proper partners. 
References 
1. LendingClub Corporation. URL: https://www.lendingclub.com/public/about-us.action 
Accessed 09/16/2014. 
2. LendingClub Corporation. URL: https://www.lendingclub.com/info/download-data. 
action, Accessed 09/16/2014 
3. http://en.wikipedia.org/wiki/Credit_score_in_the_United_States 
4. LendingClub Corporation. URL: https://spark-public. 
s3.amazonaws.com/dataanalysis/loansData.csv Accessed 09/16/2014 
5. https://spark-public.s3.amazonaws.com/dataanalysis/loansCodebook.pdf 
6. R Markdown Page. URL:http://www.rstudio.com/ide/docs/authoring/using_markdown. 
Accessed 09/16/2014 
9 /9

More Related Content

What's hot

Predicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning AlgorithmsPredicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning Algorithms
Sagar Tupkar
 
Forecasting peer-to-peer lending risk
Forecasting peer-to-peer lending riskForecasting peer-to-peer lending risk
Forecasting peer-to-peer lending risk
Archange Giscard DESTINE
 
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
inventionjournals
 
Lemna wp 2015-14
Lemna wp 2015-14Lemna wp 2015-14
Lemna wp 2015-14
swajalacharya
 
Benchmark the Actual Bond Prices
Benchmark the Actual Bond PricesBenchmark the Actual Bond Prices
Benchmark the Actual Bond PricesRan Zhang
 
Investment liquidity net income
Investment liquidity net incomeInvestment liquidity net income
Investment liquidity net income
swajalacharya
 
Credit defaulter analysis
Credit defaulter analysisCredit defaulter analysis
Credit defaulter analysis
Nimai Chand Das Adhikari
 
ART1197.DOC
ART1197.DOCART1197.DOC
ART1197.DOCbutest
 
Investment int rate
Investment int rateInvestment int rate
Investment int rate
swajalacharya
 
CECL Project Overview
CECL Project OverviewCECL Project Overview
CECL Project Overview
Rohit Khurana
 
The ability of previous quarterly earnings, net interest margin, and average ...
The ability of previous quarterly earnings, net interest margin, and average ...The ability of previous quarterly earnings, net interest margin, and average ...
The ability of previous quarterly earnings, net interest margin, and average ...RyanMHolcomb
 
The relationship and effect of credit and liquidity risk on bank default
The relationship and effect of credit and liquidity risk on bank defaultThe relationship and effect of credit and liquidity risk on bank default
The relationship and effect of credit and liquidity risk on bank default
Alexander Decker
 
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
SYRTO Project
 
Best Keyword Cover Search
Best Keyword Cover SearchBest Keyword Cover Search
Best Keyword Cover Search
1crore projects
 
Game Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime DetectionGame Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime Detection
IOSR Journals
 

What's hot (17)

Predicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning AlgorithmsPredicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning Algorithms
 
Icbelsh150811
Icbelsh150811Icbelsh150811
Icbelsh150811
 
Forecasting peer-to-peer lending risk
Forecasting peer-to-peer lending riskForecasting peer-to-peer lending risk
Forecasting peer-to-peer lending risk
 
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
The Influence of Solvency Ratio Decision on Rural Bank Dinar Pusaka In The Di...
 
Lemna wp 2015-14
Lemna wp 2015-14Lemna wp 2015-14
Lemna wp 2015-14
 
Benchmark the Actual Bond Prices
Benchmark the Actual Bond PricesBenchmark the Actual Bond Prices
Benchmark the Actual Bond Prices
 
Investment liquidity net income
Investment liquidity net incomeInvestment liquidity net income
Investment liquidity net income
 
Credit defaulter analysis
Credit defaulter analysisCredit defaulter analysis
Credit defaulter analysis
 
Amano getahun
Amano getahunAmano getahun
Amano getahun
 
ART1197.DOC
ART1197.DOCART1197.DOC
ART1197.DOC
 
Investment int rate
Investment int rateInvestment int rate
Investment int rate
 
CECL Project Overview
CECL Project OverviewCECL Project Overview
CECL Project Overview
 
The ability of previous quarterly earnings, net interest margin, and average ...
The ability of previous quarterly earnings, net interest margin, and average ...The ability of previous quarterly earnings, net interest margin, and average ...
The ability of previous quarterly earnings, net interest margin, and average ...
 
The relationship and effect of credit and liquidity risk on bank default
The relationship and effect of credit and liquidity risk on bank defaultThe relationship and effect of credit and liquidity risk on bank default
The relationship and effect of credit and liquidity risk on bank default
 
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
Bank Interconnectedness What determines the links? - Puriya Abbassi, Christia...
 
Best Keyword Cover Search
Best Keyword Cover SearchBest Keyword Cover Search
Best Keyword Cover Search
 
Game Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime DetectionGame Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime Detection
 

Viewers also liked

Data analysis and statistical inference project
Data analysis and statistical inference projectData analysis and statistical inference project
Data analysis and statistical inference project
Maruşa Pescu (Beca)
 
Problems statistics 1
Problems statistics 1Problems statistics 1
Problems statistics 1
Maruşa Pescu (Beca)
 
Exercises statistics
Exercises statisticsExercises statistics
Exercises statistics
Maruşa Pescu (Beca)
 
Data_Processing_Program
Data_Processing_ProgramData_Processing_Program
Data_Processing_ProgramNeil Dahlqvist
 
Amar 38 final
Amar 38 finalAmar 38 final
Amar 38 final
9867097496
 
Indirect tax (1)
Indirect tax (1)Indirect tax (1)
Indirect tax (1)
9867097496
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testingSohail Patel
 
Final goods & service tax
Final goods & service taxFinal goods & service tax
Final goods & service tax
Abhishek Jhunjhunwala
 
Property Tax Assessment Services
Property Tax Assessment ServicesProperty Tax Assessment Services
Property Tax Assessment Services
cutmytaxes
 
Research project report sumit b
Research project report sumit bResearch project report sumit b
Research project report sumit b
sumit saxena
 
Research project on packaged drinking water industry
Research project on packaged drinking water industryResearch project on packaged drinking water industry
Research project on packaged drinking water industry
Pallav Tyagi
 
Standards of Auditing - Introduction and Application in the Indian Context
Standards of Auditing - Introduction and Application in the Indian ContextStandards of Auditing - Introduction and Application in the Indian Context
Standards of Auditing - Introduction and Application in the Indian Context
Bharath Rao
 
Auditing Standards- IndusInd Bank
Auditing Standards- IndusInd BankAuditing Standards- IndusInd Bank
Auditing Standards- IndusInd Bank
Nikita Jangid
 
Project-Student Financial Service System
Project-Student Financial Service SystemProject-Student Financial Service System
Project-Student Financial Service System
chezhiang
 
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...Oghenovo Egbegbedia
 
STANDARDS ON AUDIT
STANDARDS  ON AUDITSTANDARDS  ON AUDIT
STANDARDS ON AUDIT
Kinjal Gada
 
Company audit & accounts
Company audit & accounts  Company audit & accounts
Company audit & accounts
Vivek Mahajan
 
Demonetization of Indian Currency
Demonetization of Indian CurrencyDemonetization of Indian Currency
Demonetization of Indian Currency
Jithin Scaria
 

Viewers also liked (20)

Data analysis and statistical inference project
Data analysis and statistical inference projectData analysis and statistical inference project
Data analysis and statistical inference project
 
Problems statistics 1
Problems statistics 1Problems statistics 1
Problems statistics 1
 
Exercises statistics
Exercises statisticsExercises statistics
Exercises statistics
 
Data_Processing_Program
Data_Processing_ProgramData_Processing_Program
Data_Processing_Program
 
Amar 38 final
Amar 38 finalAmar 38 final
Amar 38 final
 
Indirect tax (1)
Indirect tax (1)Indirect tax (1)
Indirect tax (1)
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Final goods & service tax
Final goods & service taxFinal goods & service tax
Final goods & service tax
 
Property Tax Assessment Services
Property Tax Assessment ServicesProperty Tax Assessment Services
Property Tax Assessment Services
 
Research project report sumit b
Research project report sumit bResearch project report sumit b
Research project report sumit b
 
Research project on packaged drinking water industry
Research project on packaged drinking water industryResearch project on packaged drinking water industry
Research project on packaged drinking water industry
 
Standards of Auditing - Introduction and Application in the Indian Context
Standards of Auditing - Introduction and Application in the Indian ContextStandards of Auditing - Introduction and Application in the Indian Context
Standards of Auditing - Introduction and Application in the Indian Context
 
Auditing Standards- IndusInd Bank
Auditing Standards- IndusInd BankAuditing Standards- IndusInd Bank
Auditing Standards- IndusInd Bank
 
Project-Student Financial Service System
Project-Student Financial Service SystemProject-Student Financial Service System
Project-Student Financial Service System
 
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...
Mobilizing Local Government Tax Revenue for Adequate Service Delivery in Nige...
 
STANDARDS ON AUDIT
STANDARDS  ON AUDITSTANDARDS  ON AUDIT
STANDARDS ON AUDIT
 
Project Report on e banking
Project Report on e bankingProject Report on e banking
Project Report on e banking
 
Company audit & accounts
Company audit & accounts  Company audit & accounts
Company audit & accounts
 
E-banking project
E-banking projectE-banking project
E-banking project
 
Demonetization of Indian Currency
Demonetization of Indian CurrencyDemonetization of Indian Currency
Demonetization of Indian Currency
 

Similar to Project data analysis

Programming for big data
Programming for big dataProgramming for big data
Programming for big data
Alex Papageorgiou
 
Loan Characteristics as Predictors of Default in Commercial Mortgage Portfolios
Loan Characteristics as Predictors of Default in Commercial Mortgage PortfoliosLoan Characteristics as Predictors of Default in Commercial Mortgage Portfolios
Loan Characteristics as Predictors of Default in Commercial Mortgage Portfolios
International Journal of Economics and Financial Research
 
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101Heather Lamoureux
 
loan.docx
loan.docxloan.docx
loan.docx
Aravind Reddy
 
Ppt m& b
Ppt m& bPpt m& b
Ppt m& b
sarazehra
 
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
inventionjournals
 
Investigation of mortgage lending an overview of nigerian practice
Investigation of mortgage lending an overview of nigerian practiceInvestigation of mortgage lending an overview of nigerian practice
Investigation of mortgage lending an overview of nigerian practice
Alexander Decker
 
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?Rusman Mukhlis
 
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
Alexander Decker
 
STRESS TESTING IN BANKING SECTOR FRAMEWORK
STRESS TESTING IN BANKING SECTOR FRAMEWORKSTRESS TESTING IN BANKING SECTOR FRAMEWORK
STRESS TESTING IN BANKING SECTOR FRAMEWORK
Dinabandhu Bag
 
PBA.docx ( Credit Risk Analysis of loans )
PBA.docx ( Credit Risk Analysis of loans )PBA.docx ( Credit Risk Analysis of loans )
PBA.docx ( Credit Risk Analysis of loans )
Supriyasingh459171
 
An empirical analysis of the loan default rate of microfinance institutions
An empirical analysis of the loan default rate of microfinance institutionsAn empirical analysis of the loan default rate of microfinance institutions
An empirical analysis of the loan default rate of microfinance institutions
Alexander Decker
 
Thanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docxThanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docx
todd191
 
Thanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docxThanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docx
arnoldmeredith47041
 
Credit Scoring of Turkey with Semiparametric Logit Models
Credit Scoring of Turkey with Semiparametric Logit ModelsCredit Scoring of Turkey with Semiparametric Logit Models
Credit Scoring of Turkey with Semiparametric Logit Models
International Journal of Economics and Financial Research
 
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
Fawaz Khaled
 
Rudebusch Lopez Christensen
Rudebusch Lopez ChristensenRudebusch Lopez Christensen
Rudebusch Lopez Christensen
Peter Ho
 
Apanps5210 - final presentation
Apanps5210 - final presentationApanps5210 - final presentation
Apanps5210 - final presentation
Manasa Damera
 
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
iosrjce
 
Mergers and acquisitions and bank performance in Europe the role of strategic...
Mergers and acquisitions and bank performance in Europe the role of strategic...Mergers and acquisitions and bank performance in Europe the role of strategic...
Mergers and acquisitions and bank performance in Europe the role of strategic...- -
 

Similar to Project data analysis (20)

Programming for big data
Programming for big dataProgramming for big data
Programming for big data
 
Loan Characteristics as Predictors of Default in Commercial Mortgage Portfolios
Loan Characteristics as Predictors of Default in Commercial Mortgage PortfoliosLoan Characteristics as Predictors of Default in Commercial Mortgage Portfolios
Loan Characteristics as Predictors of Default in Commercial Mortgage Portfolios
 
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101
Report-7-B-Searching-for-Harm-in-Storefront-Payday-Lending-nonPrime101
 
loan.docx
loan.docxloan.docx
loan.docx
 
Ppt m& b
Ppt m& bPpt m& b
Ppt m& b
 
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
Estimation of Net Interest Margin Determinants of the Deposit Banks in Turkey...
 
Investigation of mortgage lending an overview of nigerian practice
Investigation of mortgage lending an overview of nigerian practiceInvestigation of mortgage lending an overview of nigerian practice
Investigation of mortgage lending an overview of nigerian practice
 
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?
Chapter 03_What Do Interest Rates Mean and What Is Their Role in Valuation?
 
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
Analysis of recovery determinants of defaulted mortgages in nigerian lending ...
 
STRESS TESTING IN BANKING SECTOR FRAMEWORK
STRESS TESTING IN BANKING SECTOR FRAMEWORKSTRESS TESTING IN BANKING SECTOR FRAMEWORK
STRESS TESTING IN BANKING SECTOR FRAMEWORK
 
PBA.docx ( Credit Risk Analysis of loans )
PBA.docx ( Credit Risk Analysis of loans )PBA.docx ( Credit Risk Analysis of loans )
PBA.docx ( Credit Risk Analysis of loans )
 
An empirical analysis of the loan default rate of microfinance institutions
An empirical analysis of the loan default rate of microfinance institutionsAn empirical analysis of the loan default rate of microfinance institutions
An empirical analysis of the loan default rate of microfinance institutions
 
Thanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docxThanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docx
 
Thanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docxThanks for your work on this assignment. The biggest challenge i.docx
Thanks for your work on this assignment. The biggest challenge i.docx
 
Credit Scoring of Turkey with Semiparametric Logit Models
Credit Scoring of Turkey with Semiparametric Logit ModelsCredit Scoring of Turkey with Semiparametric Logit Models
Credit Scoring of Turkey with Semiparametric Logit Models
 
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
Mortgage Default, Property Price and Banks’ Lending Behaviour in Hong Kong SAR.
 
Rudebusch Lopez Christensen
Rudebusch Lopez ChristensenRudebusch Lopez Christensen
Rudebusch Lopez Christensen
 
Apanps5210 - final presentation
Apanps5210 - final presentationApanps5210 - final presentation
Apanps5210 - final presentation
 
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
Effect of Credit Risk Management Practices on Profitability of Listed Commerc...
 
Mergers and acquisitions and bank performance in Europe the role of strategic...
Mergers and acquisitions and bank performance in Europe the role of strategic...Mergers and acquisitions and bank performance in Europe the role of strategic...
Mergers and acquisitions and bank performance in Europe the role of strategic...
 

Recently uploaded

一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 

Recently uploaded (20)

一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 

Project data analysis

  • 1. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa Title: Decreased FICO score is associated with increased interest rate Introduction: Lending Club is an online financial community that brings together creditworthy borrowers and savvy investors so that both can benefit financially [1]. It allows its members to directly invest in and borrow from each other and so avoid the cost and complexity of the banking system. On the Lending Club site there are several files that contain complete loan data, including the current loan status and latest payment information. [2] The data used in this analysis represents a sample of 2,500 peer-to-peer loans issued by the Lending Club explained through 14 variables such as: monthly income, amount requested, FICO range (a range indicating the applicants FICO score) [3], inquiries in the last six months etc. The goal of this analysis is to establish if there is any correlation between the outcome variable – the interest rate of the loans – and the other variables especially considering the FICO score, which is a measure of the creditworthiness of the applicant. In this project we performed an analysis to determine if there was a significant association between the interest rate and the FICO score. Using exploratory analysis and standard multiple regression techniques we show that there is a significant negative relationship between the interest rate and the FICO score, even after adjusting for important confounders such as the length of the loan, the amount funded by the investors and the amount requested by the borrowers. Our analysis suggests that there is a significant, negative association between Interest Rate and FICO score. Our analysis estimates the relationship using a linear model relating one percent of interest rate to one unit of FICO score. There appears to be a strong inverse relationship between the two variables. Our results suggest that there are other variables such as loan length, amount requested by the borrower and amount funded by the investors which are associated with both interest rate and FICO score. Including these variables in the regression model relating interest rate to FICO score improves the model fit, but does not remove the significant positive relationship between the variables. Methods: Data Collection For our analysis we used the data loans from the Lending Club site from 2007 to 2011. The data were downloaded from lendingclub.com on November 16, 2013 using the R programming language [3]. Exploratory Analysis Exploratory analysis was performed by examining tables and plots of the observed data. We identified transformations to perform on the raw data on the basis of plots and knowledge of the scale of measured variables. Exploratory analysis was used to (1) identify missing values, (2) verify the quality of the data, and (3) determine the terms used in the regression model relating interest rate to FICO score. Statistical Modeling 1 /9
  • 2. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa To relate interest rate to FICO score we performed a standard multivariate linear regression model [4]. Model selection was performed on the basis of our exploratory analysis and prior knowledge of the relationship between interest rate and FICO score, amount of the loan requested and the length in time of the loan. Coefficients were estimated with ordinary least squares and standard errors were calculated using standard asymptotic approximations [5]. Reproducibility All analyses performed in this manuscript are reproduced in the R markdown file loansdata.Rmd [6]. To reproduce the exact results presented in this manuscript the cached version of the analysis must be performed. Results: The loans data used in this analysis contains information on the amount requested by the borrower (Amount.Requested), the amount funded by the investors (Amount.Funded.By.Investors), the lending interest rate (Interest.rate), the length in time (in months) of the loan (Loan.Length), the purpose of the loan as stated by the applicant (Loan.Purpose), the percentage of consumer’s gross income that goes toward paying debts (Debt.To.Income.Ratio), the U.S. state of residence of the loan applicant (State), the ownership type of the home (Home.Ownership), the monthly income of the applicant (in dollars) (Monthly.income), a range indicating the applicants FICO score (FICO.range), the number of open lines of credit the applicant had at the time of application (Open.CREDIT.Lines), the total amount outstanding all lines of credit (Revolving.CREDIT.Balance), the number of authorized queries about the creditworthiness of the applicant in the 6 months before the loan was issued (Inquiries.in.the.Last.6.Months), the length of time employed at current job (Employment.Length). [5]. We identified 77 missing values in the data set we collected for the variable Employment Length, one missing value for the variable Monthly Income, 2 missing values each for the variables the number of open lines of credit the applicant had at the time of application (Open.CREDIT.Lines), the total amount outstanding all lines of credit (Revolving.CREDIT.Balance), the number of authorized queries about the creditworthiness of the applicant in the 6 months before the loan was issued (Inquiries.in.the.Last.6.Months). Three measured variables were outside the standard ranges: for the variable Home Ownership there are five options (none, other, owns, rents or has a mortgage), although there must have been only three: owns, rents or has a mortgage and for the variable Amount Funded by the Investors there are 2 negative values and 4 values of 0; for the variable the percentage of consumer’s gross income that goes toward paying debts (Debt.To.Income.Ratio) there are 8 values of 0% which we consider that must be removed because it represents the percentage of consumer’s gross income that goes toward paying the loans that were approved. 2 /9
  • 3. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa After removing the missing values and the observations that were outside the standard ranges, the data now has 2403 observations and 14 variables. From the barplot of the variable FICO range we can see that the distribution is positively skewed with a long right tail (figure 1). Figure 1. Histogram of FICO Range The histogram of the interest rate shows a relatively normal distribution with mean 13 (figure 2). The majority of the loans granted had an interest rate between 10,2% and 15,8%. 3 /9
  • 4. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa Figure 2. Histogram of Interest rate We performed some exploratory analysis and from the boxplots of the interest rate variable and the factor variables we observed that the monthly income of the borrower, the employment length, the type of the home ownership and the state from which was the borrower don’t have any impact on the size of the interest rate of the loan granted. The variables Loan Purpose, Open Credit Lines, Revolving Credit Balance, Inquiries in the last 6 months and Debt to income ratio have little correlation with the interest rate variable. The potential confounders are: the length of the loan, the amount founded by the investors and the amount requested by the borrowers. We decided to transform the variable FICO range into the variable FICO score which represent the average of the low number and the upper number of a FICO range for each observed loan granted. Subsequent analyses focus on this transformed FICO score variable. From the boxplot of the FICO range and interest rate we can observe a strongly negative association between the two (figure 3). The correlation coefficient between the interest rate and FICO score is -71%. 4 /9
  • 5. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa Figure 3. The Boxplot between the Interest Rate and FICO Range We first fit a regression model relating interest rate to FICO score (figure 4). Taking into consideration that the multiple R squared is 50,3% which is not equal to the correlation coefficient of 71%, it means that there are confounders that explain the rest of 49,7% of the variation of the variable interest rate. Figure 4. The relationship between the Interest Rate and FICO score 5 /9
  • 6. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa The correlation coefficient between the amount funded by the investors and the interest rate is 33%. The same coefficient is for the amount requested by the borrowers and the association between the interest rate and the loan length is 42%. The mean of the residuals is approx. 0, the variance is 8,6 and they follow a normal distribution positively skewed (figure 5). Figure 5. Residuals distribution for the linear model Residuals show patterns of non-random variation (figure 6). We attempted to explain those patterns by fitting models including potential confounders. 6 /9
  • 7. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa Figure 6. The variation of residuals Our final regression model was: Interest.Rate = b0 + b1*FICO.score + b2*Amount.Funded.By.Investors + b3*Amount.Requested + f(Length.Loan) + e, where b0 is an intercept term and b1 represents the change in Interest rate associated with a change of one unit in FICO score at the same amount funded by investors, amount requested by borrowers and the same loan length of time. The term f(Length.Loan) represents a factor model with two different levels. This model explains 75% of the variation by one percent in the interest rate variable. The P-values show that all the coefficients are statistically significant. The error term e represents all sources of unmeasured and unmodeled random variation in interest rate. Our final regression model appeared to remove most of the non-random patterns of variation in the residuals. We observe that the residuals for the multivariate linear model follow a normal distribution with mean 0 and variation 4,38 (figure 7). 7 /9
  • 8. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa Figure 7. Residuals distribution for multivariate linear regression From figure 8 we notice that the residuals’ variation for the multivariate linear model is smaller and that we can say it follows a White Noise frequency. Figure 8. Variation of residuals for multivariate linear model 8 /9
  • 9. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa We observed a highly statistically significant (P = 2e-16) association between interest rate and FICO score. A change of one percent in Interest Rate corresponded to a change of b1 = -0.08 FICO score (95% Confidence Interval: -0.088, -0.081). For example, for two loans at the same loan length, amount requested by the borrower, amount funded by the investors, we would expect an interest rate to increase by 1% at every 0.08 decrease in the FICO score. Conclusions: Our analysis suggests that there is a significant, negative association between Interest Rate and FICO score. Our analysis estimates the relationship using a linear model relating one percent of interest rate to one unit of FICO score. There appears to be a strong inverse relationship between the two variables. We also observed that other variables such as loan length, amount requested by the borrower and amount funded by the investors are associated with both interest rate and FICO score. Including these variables in the regression model relating interest rate to FICO score improves the model fit, but does not remove the significant positive relationship between the variables. Our analysis may be of interest to both investors and borrowers. Investors are interested in selecting the potential borrowers on the financial market at a low cost, to establish a fair interest rate and, in consequence, to build an efficient portfolio with a high return rate. Borrowers are also concerned in obtaining better interest rates at low costs. It could also be of interest to the Lending Club to support its members in selecting the proper partners. References 1. LendingClub Corporation. URL: https://www.lendingclub.com/public/about-us.action Accessed 09/16/2014. 2. LendingClub Corporation. URL: https://www.lendingclub.com/info/download-data. action, Accessed 09/16/2014 3. http://en.wikipedia.org/wiki/Credit_score_in_the_United_States 4. LendingClub Corporation. URL: https://spark-public. s3.amazonaws.com/dataanalysis/loansData.csv Accessed 09/16/2014 5. https://spark-public.s3.amazonaws.com/dataanalysis/loansCodebook.pdf 6. R Markdown Page. URL:http://www.rstudio.com/ide/docs/authoring/using_markdown. Accessed 09/16/2014 9 /9
  • 10. Title: Increased earthquake depth is associated with increased magnitude Beca Marușa We observed a highly statistically significant (P = 2e-16) association between interest rate and FICO score. A change of one percent in Interest Rate corresponded to a change of b1 = -0.08 FICO score (95% Confidence Interval: -0.088, -0.081). For example, for two loans at the same loan length, amount requested by the borrower, amount funded by the investors, we would expect an interest rate to increase by 1% at every 0.08 decrease in the FICO score. Conclusions: Our analysis suggests that there is a significant, negative association between Interest Rate and FICO score. Our analysis estimates the relationship using a linear model relating one percent of interest rate to one unit of FICO score. There appears to be a strong inverse relationship between the two variables. We also observed that other variables such as loan length, amount requested by the borrower and amount funded by the investors are associated with both interest rate and FICO score. Including these variables in the regression model relating interest rate to FICO score improves the model fit, but does not remove the significant positive relationship between the variables. Our analysis may be of interest to both investors and borrowers. Investors are interested in selecting the potential borrowers on the financial market at a low cost, to establish a fair interest rate and, in consequence, to build an efficient portfolio with a high return rate. Borrowers are also concerned in obtaining better interest rates at low costs. It could also be of interest to the Lending Club to support its members in selecting the proper partners. References 1. LendingClub Corporation. URL: https://www.lendingclub.com/public/about-us.action Accessed 09/16/2014. 2. LendingClub Corporation. URL: https://www.lendingclub.com/info/download-data. action, Accessed 09/16/2014 3. http://en.wikipedia.org/wiki/Credit_score_in_the_United_States 4. LendingClub Corporation. URL: https://spark-public. s3.amazonaws.com/dataanalysis/loansData.csv Accessed 09/16/2014 5. https://spark-public.s3.amazonaws.com/dataanalysis/loansCodebook.pdf 6. R Markdown Page. URL:http://www.rstudio.com/ide/docs/authoring/using_markdown. Accessed 09/16/2014 9 /9