Decoding Loan Approval: Predictive Modeling in Action
Dr Vinod on Subset & Transformation Data Analysis
1. Select only quiz1, gpa & final
and view few top rows
1
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
select = c(…
c is for combine
2. Select only final > 60
and view few top rows
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
2
Mathematical argument
3. Compare box plots of final of all 105
observations and with final>60
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
3
> boxplot(grades$final,
main = "Box Plot of final",
col="red")
4. Compare correlation between final and gpa in all
105 observations and in subset final > 60
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
4
5. Create subset of only WHITES (ethnicity = 4) and view
few top rows and make box plot of final
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
5
6. Create subset of only HISPANICS (ethnicity = 5) and
view few top rows and make box plot of final
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
6
7. Compare box plots of final for WHITES and
HISPANIC
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
7
8. Transform final with square root and
recode as new variable
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
8
c
c
c
9. Compare Histograms of final and
sqrtfinal
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
9
Final: skewness = -0.33 ; kurtosis = -0.42
Sqrtfinal: skewness = -0.48 ; kurtosis = -0.17
10. Convert final into two categories of final
[one, <60 and second >60, 60 will fall in > 60]
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
10
11. Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
11
Convert final into two categories of final
[one, <60 and second >60]
12. Convert final into categories with increment of 5
[40-45=1, 46-50=2, 51-55=3, 56-60=4, 61-65=5, 66-70=6, 71-75=7]
Dr Vinod on Subset & Transformation 8971073111 vinod@inurture.co.in 12
13. Converting ethnicity into two categories
[category 1= 1, 3& 5; category 2 = 2 & 4
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
13
14. Take out 20% observations randomly from the file
grades
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
14
15. All 20% cases [21#]
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
15
16. Case numbers are known by sam command
Dr Vinod on Subset & Transformation
8971073111 vinod@inurture.co.in
16