SlideShare a Scribd company logo
1 of 18
Average Performance Prediction of
Elementary School using Multiple
Regression
Submitted By:
Ankur Khandelwal
Anurag Shandilya
Pullahbhatla Apuroop
Srikanth Mallya
Agenda
1. Introduction
2. Business Objective
3. Factors and Their Influence
4. Final Regression Model and Variables
5. Inferences drawn from Analysis
6. Appendix
Introduction
The Dataset contains the following:
• Performance of 400 elementary schools from the California
Department of Education and factors like class size, parent
education, student performance,etc.
Business Objective
• To find the factors having major influence on the academic
performance.
• To predict academic performance of an school using those
factors.
Factors & their Influence
Factors which has been chosen on the basis of statistical significance:
Factors Impact
English language learner(ELL) Negative
Percentage first year in school (Mobility) Negative
Average Class size k-3 (ACS_k3) Positive
Parent College Grade(col_grad) Positive
Parent Grad School(grad_sch) Positive
Percentage Emergency Credential(emer) Negative
Regression Model & Variables
Regression Equation
API100 = 563.4-2.7*(ell)-
2.46*(mobility)+10.63*(acs_k3)+1.20*(col_grad)+2.84*(grad_sch)-2.8*(emer)
Variable Label Parameter
Intercept Intercept 563.4395
ell english language learners -2.77842
mobility pct 1st year in school -2.46762
acs_k3 avg class size k-3 10.6312
col_grad parent college grad 1.20298
grad_sch parent grad school 2.8468
emer pct emer credential -2.7947
Inferences Drawn from Analysis
• ell: If the count of ell is more it means students are weak in
English and it can affect their performance in other subjects as well.
It may degrade the performance of a student.
• mobility: If the count of mobility is more it means more number of
students are dropping out from school in the first year. Schools
with high mobility rate shows the low API value.
• grad_sch: Students whose parents have graduation as highest
education and having guidance from their parents. This can highly
influence their performance in school.
Conti..
• emer: Part time teachers can highly influence API value because
weak students cannot have “anytime access” to qualified teachers.
So the teachers available in emergency is highly responsible to
affect the school’s performance.
• acs_k3: Higher the size of the class in the school higher will the
performance of so we can see that average class size k-3 has the
positive contribution on the average performance index for the
schools.
• col_grad: Students whose parents have graduation as highest
education and having guidance from their parents. This can highly
influence their performance in school. Higher the graduation of the
parents higher will be the performance of the students in the
schools.
Appendix
• Missing Values and outliers Treatment
• Test For Regression
• Check for Multicolinearity
• Check For Significance of individual Parameter
• Check for Hetroscedasticity
• Check for Normality
• Mean Absolute Percentage Error
• Check for R-Square Value
Missing Value and Outlier Treatment
Before the treatment After the treatment
Test for Regression
Analysis of Variance
Source DF Sum of Mean F Value Pr > F
Squares Square
Model 6 6282718 1E+06 229.78 <.0001
Error 393 1790954 4557.1
Corrected Total 399 8073672
• This is done to check the over all significance of the model:
• H0: Independent variables collectively or individually can’t influence the dependent
variable.
• H1: The independent variables collectively or individually can influence the
dependent variable.
• If P-value>α:H0 can’t be rejected & hence the model is useless.
• If P-value<α: H0 is rejected & hence some independent can influence the dependent
variable.
• In this case the Pvalue<α & hence some independent variables can influence the
dependent variable.
Check for Multicolinearity
Parameter Estimates
Variable Label DF Parameter Standard t Value Pr > |t| Variance
Estimate Error Inflation
Intercept Intercept 1 563.43951 49.84285 11.3 <.0001 0
ell English language learners 1 -2.77842 0.17562 -15.82 <.0001 1.66602
mobility pct 1st year in school 1 -2.46762 0.47464 -5.2 <.0001 1.10217
acs_k3 avg class size k-3 1 10.6312 2.50946 4.24 <.0001 1.02771
col_grad parent college grad 1 1.20298 0.24159 4.98 <.0001 1.3863
grad_sch parent grad school 1 2.8468 0.34247 8.31 <.0001 1.51113
emer pct emer credential 1 -2.7947 0.33022 -8.46 <.0001 1.31735
• This happens when the independent variables are highly interdependent.
• Hence the individual impact on the dependent variables can’t be correctly estimated.
• The extent of multicolinearity is captured by the variance inflation factor(VIF).
• The final model must have only those variables having VIF ranging from 1.5 to 2.
Check For Significance of individual
Parameter
Parameter Estimates
Variable Label DF Parameter Standard t Value Pr > |t| Variance
Estimate Error Inflation
Intercept Intercept 1 563.43951 49.84285 11.3 <.0001 0
ell English language learners 1 -2.77842 0.17562 -15.82 <.0001 1.66602
mobility pct 1st year in school 1 -2.46762 0.47464 -5.2 <.0001 1.10217
acs_k3 avg class size k-3 1 10.6312 2.50946 4.24 <.0001 1.02771
col_grad parent college grad 1 1.20298 0.24159 4.98 <.0001 1.3863
grad_sch parent grad school 1 2.8468 0.34247 8.31 <.0001 1.51113
emer pct emer credential 1 -2.7947 0.33022 -8.46 <.0001 1.31735
• The P values of the variables are checked for the significance
• Variables having P value>α are not important for the model
• The final model must have variables having P value>α & VIF ranging from
1.5 to 2.
Check for Hetroscedasticity
Test of First and Second
Moment Specification
DF Chi-Square Pr > ChiSq
27 53.67 0.0017
• This occurs when the variance of the random error component is not
constant.
• The White’s test used for the check for Heteroscedasticity
• Null Hypothesis: Model is Homoscedastic.
• If P value>α:H0 can’t be rejected & hence the model is Homoscedastic &
vice-versa.
• The VIF SPEC option is used to check for the Heteroscedasticity.
Check for Normality
• Once the model has only the significant variables the o/p file created.
• The o/p file contains the predicted & the residual variables.
• The residual variables saved in the o/p file for normality
• This is done using the proc univariate with normal option
Mean Absolute Percentage Error
The Means Procedure
Analysis Variable : ERROR
Mean
8.7668826
• Mean absolute percentage error or MAPE captures the overall %
error of the model.
• Ideally MAPE should be with in 10%.
Check for R-Square Value
Root MSE 67.507 R-Square 0.7782
Dependent Mean 647.62 Adj R-Sq 0.7748
Coeff Var 10.424
• This captures the proportion variation that can be explained by the linear
regression.
• Higher the value of R-square, better the explanatory power.
• This acts as a measure of goodness of fit of the model.
• R-square value should be at least 65% or .65.
Average performance prediction of elementary school using multiple regression

More Related Content

What's hot (20)

Correlation Research Design
Correlation Research DesignCorrelation Research Design
Correlation Research Design
 
Item Analysis
Item AnalysisItem Analysis
Item Analysis
 
Item analysis with spss software
Item analysis with spss softwareItem analysis with spss software
Item analysis with spss software
 
Lecture slides stats1.13.l20.air
Lecture slides stats1.13.l20.airLecture slides stats1.13.l20.air
Lecture slides stats1.13.l20.air
 
Psychometrics 101: Know What Your Assessment Data is Telling You
Psychometrics 101: Know What Your Assessment Data is Telling YouPsychometrics 101: Know What Your Assessment Data is Telling You
Psychometrics 101: Know What Your Assessment Data is Telling You
 
Kruskal Wallis Test in R
Kruskal Wallis Test in RKruskal Wallis Test in R
Kruskal Wallis Test in R
 
Presentation non parametric
Presentation non parametricPresentation non parametric
Presentation non parametric
 
12
1212
12
 
Workshop QCI- regression_analysis
Workshop QCI- regression_analysis Workshop QCI- regression_analysis
Workshop QCI- regression_analysis
 
Two-way Repeated Measures ANOVA
Two-way Repeated Measures ANOVATwo-way Repeated Measures ANOVA
Two-way Repeated Measures ANOVA
 
Item Analysis
Item AnalysisItem Analysis
Item Analysis
 
EXAMINING DISTRACTORS AND EFFECTIVENESS
EXAMINING DISTRACTORS AND  EFFECTIVENESSEXAMINING DISTRACTORS AND  EFFECTIVENESS
EXAMINING DISTRACTORS AND EFFECTIVENESS
 
Error analysis
Error analysisError analysis
Error analysis
 
Ad test
Ad testAd test
Ad test
 
Logistic regression (blyth 2006) (simplified)
Logistic regression (blyth 2006) (simplified)Logistic regression (blyth 2006) (simplified)
Logistic regression (blyth 2006) (simplified)
 
Item analysis report
Item analysis report Item analysis report
Item analysis report
 
Antigen antibody reactions
Antigen antibody reactionsAntigen antibody reactions
Antigen antibody reactions
 
Aca 22-407
Aca 22-407Aca 22-407
Aca 22-407
 
Chapter36a
Chapter36aChapter36a
Chapter36a
 
Data Analyst - Interview Guide
Data Analyst - Interview GuideData Analyst - Interview Guide
Data Analyst - Interview Guide
 

Viewers also liked

House Sale Price Prediction
House Sale Price PredictionHouse Sale Price Prediction
House Sale Price Predictionsriram30691
 
T16 multiple regression
T16 multiple regressionT16 multiple regression
T16 multiple regressionkompellark
 
Prediction of house price using multiple regression
Prediction of house price using multiple regressionPrediction of house price using multiple regression
Prediction of house price using multiple regressionvinovk
 
Prediction of House Sales Price
Prediction of House Sales PricePrediction of House Sales Price
Prediction of House Sales PriceAnirvan Ghosh
 
Simple Linier Regression
Simple Linier RegressionSimple Linier Regression
Simple Linier Regressiondessybudiyanti
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysisnadiazaheer
 

Viewers also liked (7)

House Sale Price Prediction
House Sale Price PredictionHouse Sale Price Prediction
House Sale Price Prediction
 
T16 multiple regression
T16 multiple regressionT16 multiple regression
T16 multiple regression
 
Prediction of house price using multiple regression
Prediction of house price using multiple regressionPrediction of house price using multiple regression
Prediction of house price using multiple regression
 
Prediction of House Sales Price
Prediction of House Sales PricePrediction of House Sales Price
Prediction of House Sales Price
 
Simple Linier Regression
Simple Linier RegressionSimple Linier Regression
Simple Linier Regression
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Slideshare ppt
Slideshare pptSlideshare ppt
Slideshare ppt
 

Similar to Average performance prediction of elementary school using multiple regression

Multiple regression
Multiple regressionMultiple regression
Multiple regressionSomdeep Sen
 
Teacher evaluation and goal setting connecticut
Teacher evaluation and goal setting   connecticutTeacher evaluation and goal setting   connecticut
Teacher evaluation and goal setting connecticutJohn Cronin
 
Regression techniques to study the student performance in post graduate exam...
Regression techniques to study the student performance in post  graduate exam...Regression techniques to study the student performance in post  graduate exam...
Regression techniques to study the student performance in post graduate exam...IJMER
 
Using Nursing Exam Data Effectively in Preparing Nursing Accreditation
Using Nursing Exam Data Effectively in Preparing Nursing AccreditationUsing Nursing Exam Data Effectively in Preparing Nursing Accreditation
Using Nursing Exam Data Effectively in Preparing Nursing AccreditationExamSoft
 
Test standardization
Test standardizationTest standardization
Test standardizationKaye Batica
 
Using ExamSoft Data to Prepare For and Ease the Accreditation Process
Using ExamSoft Data to Prepare For and Ease the Accreditation ProcessUsing ExamSoft Data to Prepare For and Ease the Accreditation Process
Using ExamSoft Data to Prepare For and Ease the Accreditation ProcessExamSoft
 
What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?Smarten Augmented Analytics
 
Effects of e instruction in Teaching Intro to IT
Effects of e instruction in Teaching Intro to ITEffects of e instruction in Teaching Intro to IT
Effects of e instruction in Teaching Intro to ITAmelita Martinez
 
Grading Your Assessments: How to Evaluate the Quality of Your Exams
Grading Your Assessments: How to Evaluate the Quality of Your ExamsGrading Your Assessments: How to Evaluate the Quality of Your Exams
Grading Your Assessments: How to Evaluate the Quality of Your ExamsExamSoft
 
ESE444/544 - Types of Assessment
ESE444/544 - Types of AssessmentESE444/544 - Types of Assessment
ESE444/544 - Types of Assessmentamacargel
 
Quality Control for Quantitative Tests by Prof Aamir Ijaz (Pakistan)
Quality Control for Quantitative Tests by Prof Aamir Ijaz (Pakistan)Quality Control for Quantitative Tests by Prof Aamir Ijaz (Pakistan)
Quality Control for Quantitative Tests by Prof Aamir Ijaz (Pakistan)Aamir Ijaz Brig
 
Item and Distracter Analysis
Item and Distracter AnalysisItem and Distracter Analysis
Item and Distracter AnalysisSue Quirante
 
Colorado assessment summit_teacher_eval
Colorado assessment summit_teacher_evalColorado assessment summit_teacher_eval
Colorado assessment summit_teacher_evalJohn Cronin
 
Aligning benchmarks with high stakes assessments 2010
Aligning benchmarks with high stakes assessments 2010Aligning benchmarks with high stakes assessments 2010
Aligning benchmarks with high stakes assessments 2010dvodicka
 
Aligning Benchmarks With High Stakes Assessments 2009
Aligning Benchmarks With High Stakes Assessments 2009Aligning Benchmarks With High Stakes Assessments 2009
Aligning Benchmarks With High Stakes Assessments 2009dvodicka
 

Similar to Average performance prediction of elementary school using multiple regression (20)

Multiple regression
Multiple regressionMultiple regression
Multiple regression
 
Teacher evaluation and goal setting connecticut
Teacher evaluation and goal setting   connecticutTeacher evaluation and goal setting   connecticut
Teacher evaluation and goal setting connecticut
 
Grading policy
Grading policyGrading policy
Grading policy
 
Regression techniques to study the student performance in post graduate exam...
Regression techniques to study the student performance in post  graduate exam...Regression techniques to study the student performance in post  graduate exam...
Regression techniques to study the student performance in post graduate exam...
 
Using Nursing Exam Data Effectively in Preparing Nursing Accreditation
Using Nursing Exam Data Effectively in Preparing Nursing AccreditationUsing Nursing Exam Data Effectively in Preparing Nursing Accreditation
Using Nursing Exam Data Effectively in Preparing Nursing Accreditation
 
Test standardization
Test standardizationTest standardization
Test standardization
 
Using ExamSoft Data to Prepare For and Ease the Accreditation Process
Using ExamSoft Data to Prepare For and Ease the Accreditation ProcessUsing ExamSoft Data to Prepare For and Ease the Accreditation Process
Using ExamSoft Data to Prepare For and Ease the Accreditation Process
 
New item analysis
New item analysisNew item analysis
New item analysis
 
Item analysis
Item analysisItem analysis
Item analysis
 
Csrde discriminant analysis final
Csrde discriminant analysis finalCsrde discriminant analysis final
Csrde discriminant analysis final
 
What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?
 
Effects of e instruction in Teaching Intro to IT
Effects of e instruction in Teaching Intro to ITEffects of e instruction in Teaching Intro to IT
Effects of e instruction in Teaching Intro to IT
 
Grading Your Assessments: How to Evaluate the Quality of Your Exams
Grading Your Assessments: How to Evaluate the Quality of Your ExamsGrading Your Assessments: How to Evaluate the Quality of Your Exams
Grading Your Assessments: How to Evaluate the Quality of Your Exams
 
ESE444/544 - Types of Assessment
ESE444/544 - Types of AssessmentESE444/544 - Types of Assessment
ESE444/544 - Types of Assessment
 
Quality Control for Quantitative Tests by Prof Aamir Ijaz (Pakistan)
Quality Control for Quantitative Tests by Prof Aamir Ijaz (Pakistan)Quality Control for Quantitative Tests by Prof Aamir Ijaz (Pakistan)
Quality Control for Quantitative Tests by Prof Aamir Ijaz (Pakistan)
 
DepEd Item Analysis
DepEd Item AnalysisDepEd Item Analysis
DepEd Item Analysis
 
Item and Distracter Analysis
Item and Distracter AnalysisItem and Distracter Analysis
Item and Distracter Analysis
 
Colorado assessment summit_teacher_eval
Colorado assessment summit_teacher_evalColorado assessment summit_teacher_eval
Colorado assessment summit_teacher_eval
 
Aligning benchmarks with high stakes assessments 2010
Aligning benchmarks with high stakes assessments 2010Aligning benchmarks with high stakes assessments 2010
Aligning benchmarks with high stakes assessments 2010
 
Aligning Benchmarks With High Stakes Assessments 2009
Aligning Benchmarks With High Stakes Assessments 2009Aligning Benchmarks With High Stakes Assessments 2009
Aligning Benchmarks With High Stakes Assessments 2009
 

Recently uploaded

Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/managementakshesh doshi
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computationsit20ad004
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Recently uploaded (20)

Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/management
 
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computation
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 

Average performance prediction of elementary school using multiple regression

  • 1. Average Performance Prediction of Elementary School using Multiple Regression Submitted By: Ankur Khandelwal Anurag Shandilya Pullahbhatla Apuroop Srikanth Mallya
  • 2. Agenda 1. Introduction 2. Business Objective 3. Factors and Their Influence 4. Final Regression Model and Variables 5. Inferences drawn from Analysis 6. Appendix
  • 3. Introduction The Dataset contains the following: • Performance of 400 elementary schools from the California Department of Education and factors like class size, parent education, student performance,etc.
  • 4. Business Objective • To find the factors having major influence on the academic performance. • To predict academic performance of an school using those factors.
  • 5. Factors & their Influence Factors which has been chosen on the basis of statistical significance: Factors Impact English language learner(ELL) Negative Percentage first year in school (Mobility) Negative Average Class size k-3 (ACS_k3) Positive Parent College Grade(col_grad) Positive Parent Grad School(grad_sch) Positive Percentage Emergency Credential(emer) Negative
  • 6. Regression Model & Variables Regression Equation API100 = 563.4-2.7*(ell)- 2.46*(mobility)+10.63*(acs_k3)+1.20*(col_grad)+2.84*(grad_sch)-2.8*(emer) Variable Label Parameter Intercept Intercept 563.4395 ell english language learners -2.77842 mobility pct 1st year in school -2.46762 acs_k3 avg class size k-3 10.6312 col_grad parent college grad 1.20298 grad_sch parent grad school 2.8468 emer pct emer credential -2.7947
  • 7. Inferences Drawn from Analysis • ell: If the count of ell is more it means students are weak in English and it can affect their performance in other subjects as well. It may degrade the performance of a student. • mobility: If the count of mobility is more it means more number of students are dropping out from school in the first year. Schools with high mobility rate shows the low API value. • grad_sch: Students whose parents have graduation as highest education and having guidance from their parents. This can highly influence their performance in school.
  • 8. Conti.. • emer: Part time teachers can highly influence API value because weak students cannot have “anytime access” to qualified teachers. So the teachers available in emergency is highly responsible to affect the school’s performance. • acs_k3: Higher the size of the class in the school higher will the performance of so we can see that average class size k-3 has the positive contribution on the average performance index for the schools. • col_grad: Students whose parents have graduation as highest education and having guidance from their parents. This can highly influence their performance in school. Higher the graduation of the parents higher will be the performance of the students in the schools.
  • 9. Appendix • Missing Values and outliers Treatment • Test For Regression • Check for Multicolinearity • Check For Significance of individual Parameter • Check for Hetroscedasticity • Check for Normality • Mean Absolute Percentage Error • Check for R-Square Value
  • 10. Missing Value and Outlier Treatment Before the treatment After the treatment
  • 11. Test for Regression Analysis of Variance Source DF Sum of Mean F Value Pr > F Squares Square Model 6 6282718 1E+06 229.78 <.0001 Error 393 1790954 4557.1 Corrected Total 399 8073672 • This is done to check the over all significance of the model: • H0: Independent variables collectively or individually can’t influence the dependent variable. • H1: The independent variables collectively or individually can influence the dependent variable. • If P-value>α:H0 can’t be rejected & hence the model is useless. • If P-value<α: H0 is rejected & hence some independent can influence the dependent variable. • In this case the Pvalue<α & hence some independent variables can influence the dependent variable.
  • 12. Check for Multicolinearity Parameter Estimates Variable Label DF Parameter Standard t Value Pr > |t| Variance Estimate Error Inflation Intercept Intercept 1 563.43951 49.84285 11.3 <.0001 0 ell English language learners 1 -2.77842 0.17562 -15.82 <.0001 1.66602 mobility pct 1st year in school 1 -2.46762 0.47464 -5.2 <.0001 1.10217 acs_k3 avg class size k-3 1 10.6312 2.50946 4.24 <.0001 1.02771 col_grad parent college grad 1 1.20298 0.24159 4.98 <.0001 1.3863 grad_sch parent grad school 1 2.8468 0.34247 8.31 <.0001 1.51113 emer pct emer credential 1 -2.7947 0.33022 -8.46 <.0001 1.31735 • This happens when the independent variables are highly interdependent. • Hence the individual impact on the dependent variables can’t be correctly estimated. • The extent of multicolinearity is captured by the variance inflation factor(VIF). • The final model must have only those variables having VIF ranging from 1.5 to 2.
  • 13. Check For Significance of individual Parameter Parameter Estimates Variable Label DF Parameter Standard t Value Pr > |t| Variance Estimate Error Inflation Intercept Intercept 1 563.43951 49.84285 11.3 <.0001 0 ell English language learners 1 -2.77842 0.17562 -15.82 <.0001 1.66602 mobility pct 1st year in school 1 -2.46762 0.47464 -5.2 <.0001 1.10217 acs_k3 avg class size k-3 1 10.6312 2.50946 4.24 <.0001 1.02771 col_grad parent college grad 1 1.20298 0.24159 4.98 <.0001 1.3863 grad_sch parent grad school 1 2.8468 0.34247 8.31 <.0001 1.51113 emer pct emer credential 1 -2.7947 0.33022 -8.46 <.0001 1.31735 • The P values of the variables are checked for the significance • Variables having P value>α are not important for the model • The final model must have variables having P value>α & VIF ranging from 1.5 to 2.
  • 14. Check for Hetroscedasticity Test of First and Second Moment Specification DF Chi-Square Pr > ChiSq 27 53.67 0.0017 • This occurs when the variance of the random error component is not constant. • The White’s test used for the check for Heteroscedasticity • Null Hypothesis: Model is Homoscedastic. • If P value>α:H0 can’t be rejected & hence the model is Homoscedastic & vice-versa. • The VIF SPEC option is used to check for the Heteroscedasticity.
  • 15. Check for Normality • Once the model has only the significant variables the o/p file created. • The o/p file contains the predicted & the residual variables. • The residual variables saved in the o/p file for normality • This is done using the proc univariate with normal option
  • 16. Mean Absolute Percentage Error The Means Procedure Analysis Variable : ERROR Mean 8.7668826 • Mean absolute percentage error or MAPE captures the overall % error of the model. • Ideally MAPE should be with in 10%.
  • 17. Check for R-Square Value Root MSE 67.507 R-Square 0.7782 Dependent Mean 647.62 Adj R-Sq 0.7748 Coeff Var 10.424 • This captures the proportion variation that can be explained by the linear regression. • Higher the value of R-square, better the explanatory power. • This acts as a measure of goodness of fit of the model. • R-square value should be at least 65% or .65.