SlideShare a Scribd company logo
1 of 28
AIR QUALITY
Alan, Chandni, Vincent, Ceci, Sharon, Mao
May 2018, SAMSI Workshop
What is PM2.5?
May 2018, SAMSI Workshop
Bypass nose/throat penetrate deep into lungs, circulatory system.
➢ Particulate Matter
➢ Diameter < 2.5 micrometers
➢ 3% the diameter of human
hair
May 2018, SAMSI Workshop
PM2.5 Monitoring Systems in the
US
➢Monitoring stations are sparse
➢Need predictions for locations
without a monitoring station
What is CMAQ?
May 2018, SAMSI Workshop
CMAQ - Community Multi-scale Air
Quality is a numerical air quality
model
To predict the concentration of air
pollutants
CMAQ Inaccuracies
● High topographical regions
contained greatest degrees of
error
● Areas with more monitoring stations
had best predictions
The Big Question/Goal
What is the best statistical
model that predicts PM 2.5
concentration level for the
entire U.S. using numerical
model outputs and other
available covariates?
May 2018, SAMSI Workshop
Hierarchical Clustering Analysis
Hierarchical Clustering Analysis
Hierarchical Clustering Analysis
Hierarchical Clustering Analysis
Variable Selection & Transformation
31 Plots: Covariate v.s. PM 2.5 (Response Variable)
? Each covariate related to PM 2.5
Residual Plot: Residuals of PM2.5 v.s. each covariate
Adjusted R-squared: not enough
Variable Selection & Transformation
Counting Covariates
31 -> 28 (same measurement)
28 -> 25 (cor plot & R-square)
Correlation Plot
Threshold for Decision: 0.8
Correlated pair
…...
…...
May 2018, SAMSI Workshop
Variable Selection & Transformation
31 Plots: Covariate v.s. PM 2.5 (Response Variable)
Residual Plot: Residuals of PM2.5 v.s. each covariate
Adjusted R-squared
Used to decide which covariate to exclude when two are highly correlated.
Variable Selection & Transformation
Residual Plot
➢ Do regression PM2.5 ~ CMAQ
➢ Plot the residuals against the other covariates
Finally, 15 covariates are selected
Boundary layer height
residuals
May 2018, SAMSI Workshop
Random Forest
No. of trees:
500
No. of variables tried at each split:
5
Mean of squared residuals (log
scale): 0.1075135
% Variance explained:
72.25
Some fun math behind the models…
May 2018, SAMSI Workshop
Spatial Model
Covariance
Matrix
Conditional
Normality
Some fun math behind the models…
The Kriging Concept
“The basic idea of kriging is to predict the value of a function at a given
point by computing a weighted average of the known values of the
function in the neighborhood of the point.”
———Wikipedia
May 2018, SAMSI Workshop
January 1st Measurements
May 2018, SAMSI Workshop
May 2018, SAMSI Workshop
Prediction Maps
for Jan 1st , 2011
August 1st Measurements
May 2018, SAMSI Workshop
May 2018, SAMSI Workshop
Prediction Maps
for Aug 1st , 2011
5 Fold Cross-Validation
➢ Divide the whole dataset into 5 folds
➢ Train the model using 4 of them and leave out the fifth one
➢ Make predictions on the fifth fold and obtain the MSE and MAD
Model MSE MAD
CMAQ 51.734 4.681
Simple LR 23.220 3.103
Random forest 13.254 2.177
Spatial analysis 9.734 1.718
May 2018, SAMSI Workshop
Model Comparison based on
cross-validation
May 2018, SAMSI Workshop
Prediction Maps
for Jan 1st , 2011
MSE of CMAQ = 51.734, MSE of LR = 23.220, MSE of RF = 13.254, MSE of Spatial Analysis = 9.734
May 2018, SAMSI Workshop
Prediction Maps
for Aug 1st , 2011
MSE of CMAQ = 51.734, MSE of LR = 23.220, MSE of RF = 13.254, MSE of Spatial Analysis = 9.734
Summary
➢ Spatial analysis makes the BEST predictions
➢ Potential Improvements:
○ Look at the interactions between covariates
○ Other machine learning methods like neural network
○ Seasonal analysis
○ Mid-west?
May 2018, SAMSI Workshop
Special thanks to Yawen, Amanda, Suman, and Doug
Undergraduate Modeling Workshop - Air Quality Working Group Final Presentation, May 25, 2018

More Related Content

What's hot

What's hot (20)

Bar graphs
Bar graphsBar graphs
Bar graphs
 
Bar graphs
Bar graphsBar graphs
Bar graphs
 
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
 
Spatial presentation of prognosis models in plant protection
Spatial presentation of prognosis models in plant protectionSpatial presentation of prognosis models in plant protection
Spatial presentation of prognosis models in plant protection
 
Calculus
CalculusCalculus
Calculus
 
Deep learning for multi year enso forecasts fnl
Deep learning for multi year enso forecasts fnlDeep learning for multi year enso forecasts fnl
Deep learning for multi year enso forecasts fnl
 
2012 CRL Recruiting Memo
2012 CRL Recruiting Memo2012 CRL Recruiting Memo
2012 CRL Recruiting Memo
 
2nd Test - Scatterplots
2nd Test - Scatterplots2nd Test - Scatterplots
2nd Test - Scatterplots
 
Data science lab project
Data science lab projectData science lab project
Data science lab project
 
VECTOR CALCULUS
VECTOR CALCULUSVECTOR CALCULUS
VECTOR CALCULUS
 
Math in the News: 8/29/11
Math in the News: 8/29/11Math in the News: 8/29/11
Math in the News: 8/29/11
 
How to train your mind to think like the ai machine you are training
How to train your mind to think like the ai machine you are trainingHow to train your mind to think like the ai machine you are training
How to train your mind to think like the ai machine you are training
 
QCL-14-v3_PARETO DIAGRAM_BANASTHALI UNIVERSITY_TANYA RATHORE
QCL-14-v3_PARETO DIAGRAM_BANASTHALI UNIVERSITY_TANYA RATHOREQCL-14-v3_PARETO DIAGRAM_BANASTHALI UNIVERSITY_TANYA RATHORE
QCL-14-v3_PARETO DIAGRAM_BANASTHALI UNIVERSITY_TANYA RATHORE
 
2.6b scatter plots and lines of best fit
2.6b scatter plots and lines of best fit2.6b scatter plots and lines of best fit
2.6b scatter plots and lines of best fit
 
Day 6 examples
Day 6 examplesDay 6 examples
Day 6 examples
 
Air Pollution in Sofia - Solution through Data Science by Kiwi team
Air Pollution in Sofia - Solution through Data Science by Kiwi teamAir Pollution in Sofia - Solution through Data Science by Kiwi team
Air Pollution in Sofia - Solution through Data Science by Kiwi team
 
MPI 794 (week-1 & 2)
MPI 794 (week-1 & 2)MPI 794 (week-1 & 2)
MPI 794 (week-1 & 2)
 
Global warming graphs
Global warming graphsGlobal warming graphs
Global warming graphs
 
Mathematical modelling and its application in weather forecasting
Mathematical modelling and its application in weather forecastingMathematical modelling and its application in weather forecasting
Mathematical modelling and its application in weather forecasting
 
Application of differential and integral
Application of differential and integralApplication of differential and integral
Application of differential and integral
 

Similar to Undergraduate Modeling Workshop - Air Quality Working Group Final Presentation, May 25, 2018

MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...
MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...
MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...
The Statistical and Applied Mathematical Sciences Institute
 
Can we predict the quality of spectrum-based fault localization?
Can we predict the quality of spectrum-based fault localization?Can we predict the quality of spectrum-based fault localization?
Can we predict the quality of spectrum-based fault localization?
Lionel Briand
 
AnnualAutomobileSalesPredictionusingARIMAModel (2).pdf
AnnualAutomobileSalesPredictionusingARIMAModel (2).pdfAnnualAutomobileSalesPredictionusingARIMAModel (2).pdf
AnnualAutomobileSalesPredictionusingARIMAModel (2).pdf
Farhad Sagor
 
ThesisDefensePresentation_KyleIngersoll
ThesisDefensePresentation_KyleIngersollThesisDefensePresentation_KyleIngersoll
ThesisDefensePresentation_KyleIngersoll
Kyle Ingersoll
 
COSMOS1_Scitech_2014_Ali
COSMOS1_Scitech_2014_AliCOSMOS1_Scitech_2014_Ali
COSMOS1_Scitech_2014_Ali
MDO_Lab
 

Similar to Undergraduate Modeling Workshop - Air Quality Working Group Final Presentation, May 25, 2018 (20)

Automatic algorithms for time series forecasting
Automatic algorithms for time series forecastingAutomatic algorithms for time series forecasting
Automatic algorithms for time series forecasting
 
MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...
MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...
MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...
 
Can we predict the quality of spectrum-based fault localization?
Can we predict the quality of spectrum-based fault localization?Can we predict the quality of spectrum-based fault localization?
Can we predict the quality of spectrum-based fault localization?
 
AnnualAutomobileSalesPredictionusingARIMAModel (2).pdf
AnnualAutomobileSalesPredictionusingARIMAModel (2).pdfAnnualAutomobileSalesPredictionusingARIMAModel (2).pdf
AnnualAutomobileSalesPredictionusingARIMAModel (2).pdf
 
A data science observatory based on RAMP - rapid analytics and model prototyping
A data science observatory based on RAMP - rapid analytics and model prototypingA data science observatory based on RAMP - rapid analytics and model prototyping
A data science observatory based on RAMP - rapid analytics and model prototyping
 
ThesisDefensePresentation_KyleIngersoll
ThesisDefensePresentation_KyleIngersollThesisDefensePresentation_KyleIngersoll
ThesisDefensePresentation_KyleIngersoll
 
Srikanta Mishra
Srikanta MishraSrikanta Mishra
Srikanta Mishra
 
Prim and Genetic Algorithms Performance in Determining Optimum Route on Graph
Prim and Genetic Algorithms Performance in Determining Optimum Route on GraphPrim and Genetic Algorithms Performance in Determining Optimum Route on Graph
Prim and Genetic Algorithms Performance in Determining Optimum Route on Graph
 
COSMOS1_Scitech_2014_Ali
COSMOS1_Scitech_2014_AliCOSMOS1_Scitech_2014_Ali
COSMOS1_Scitech_2014_Ali
 
Statsci
StatsciStatsci
Statsci
 
Brussels airport forecast
Brussels airport  forecast Brussels airport  forecast
Brussels airport forecast
 
Dj4201737746
Dj4201737746Dj4201737746
Dj4201737746
 
March 2, 2018 - Machine Learning for Production Forecasting
March 2, 2018 - Machine Learning for Production ForecastingMarch 2, 2018 - Machine Learning for Production Forecasting
March 2, 2018 - Machine Learning for Production Forecasting
 
Estimating Evaporation using Machine Learning Based Ensemble Technique
Estimating Evaporation using Machine Learning Based Ensemble TechniqueEstimating Evaporation using Machine Learning Based Ensemble Technique
Estimating Evaporation using Machine Learning Based Ensemble Technique
 
Kernel based swarm optimization for renewable energy application
Kernel based swarm optimization  for renewable energy applicationKernel based swarm optimization  for renewable energy application
Kernel based swarm optimization for renewable energy application
 
Agile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity management
 
Modelsward 2018 Industrial Track - Alessandra Bagnato
Modelsward 2018 Industrial Track - Alessandra BagnatoModelsward 2018 Industrial Track - Alessandra Bagnato
Modelsward 2018 Industrial Track - Alessandra Bagnato
 
Optimizing Numerical Weather Prediction Model Performance Using Machine Learn...
Optimizing Numerical Weather Prediction Model Performance Using Machine Learn...Optimizing Numerical Weather Prediction Model Performance Using Machine Learn...
Optimizing Numerical Weather Prediction Model Performance Using Machine Learn...
 
Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS
 
Research Proposal
Research ProposalResearch Proposal
Research Proposal
 

More from The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
The Statistical and Applied Mathematical Sciences Institute
 
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
The Statistical and Applied Mathematical Sciences Institute
 
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
The Statistical and Applied Mathematical Sciences Institute
 
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
The Statistical and Applied Mathematical Sciences Institute
 
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
The Statistical and Applied Mathematical Sciences Institute
 
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
The Statistical and Applied Mathematical Sciences Institute
 

More from The Statistical and Applied Mathematical Sciences Institute (20)

Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
 
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
 
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
 
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
 
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
 
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
 
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
 
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
 
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
 
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
 
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
 
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
 
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
 
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
 
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
 
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
 
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
 
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
 
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
 
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
 

Recently uploaded

Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
AnaAcapella
 

Recently uploaded (20)

NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
dusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningdusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learning
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
 
Philosophy of china and it's charactistics
Philosophy of china and it's charactisticsPhilosophy of china and it's charactistics
Philosophy of china and it's charactistics
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 

Undergraduate Modeling Workshop - Air Quality Working Group Final Presentation, May 25, 2018

  • 1. AIR QUALITY Alan, Chandni, Vincent, Ceci, Sharon, Mao May 2018, SAMSI Workshop
  • 2. What is PM2.5? May 2018, SAMSI Workshop Bypass nose/throat penetrate deep into lungs, circulatory system. ➢ Particulate Matter ➢ Diameter < 2.5 micrometers ➢ 3% the diameter of human hair
  • 3. May 2018, SAMSI Workshop PM2.5 Monitoring Systems in the US ➢Monitoring stations are sparse ➢Need predictions for locations without a monitoring station
  • 4. What is CMAQ? May 2018, SAMSI Workshop CMAQ - Community Multi-scale Air Quality is a numerical air quality model To predict the concentration of air pollutants
  • 5. CMAQ Inaccuracies ● High topographical regions contained greatest degrees of error ● Areas with more monitoring stations had best predictions
  • 6. The Big Question/Goal What is the best statistical model that predicts PM 2.5 concentration level for the entire U.S. using numerical model outputs and other available covariates? May 2018, SAMSI Workshop
  • 11. Variable Selection & Transformation 31 Plots: Covariate v.s. PM 2.5 (Response Variable) ? Each covariate related to PM 2.5 Residual Plot: Residuals of PM2.5 v.s. each covariate Adjusted R-squared: not enough
  • 12. Variable Selection & Transformation Counting Covariates 31 -> 28 (same measurement) 28 -> 25 (cor plot & R-square) Correlation Plot Threshold for Decision: 0.8 Correlated pair …... …... May 2018, SAMSI Workshop
  • 13. Variable Selection & Transformation 31 Plots: Covariate v.s. PM 2.5 (Response Variable) Residual Plot: Residuals of PM2.5 v.s. each covariate Adjusted R-squared Used to decide which covariate to exclude when two are highly correlated.
  • 14. Variable Selection & Transformation Residual Plot ➢ Do regression PM2.5 ~ CMAQ ➢ Plot the residuals against the other covariates Finally, 15 covariates are selected Boundary layer height residuals
  • 15. May 2018, SAMSI Workshop Random Forest No. of trees: 500 No. of variables tried at each split: 5 Mean of squared residuals (log scale): 0.1075135 % Variance explained: 72.25 Some fun math behind the models…
  • 16. May 2018, SAMSI Workshop Spatial Model Covariance Matrix Conditional Normality Some fun math behind the models…
  • 17. The Kriging Concept “The basic idea of kriging is to predict the value of a function at a given point by computing a weighted average of the known values of the function in the neighborhood of the point.” ———Wikipedia May 2018, SAMSI Workshop
  • 18. January 1st Measurements May 2018, SAMSI Workshop
  • 19. May 2018, SAMSI Workshop Prediction Maps for Jan 1st , 2011
  • 20. August 1st Measurements May 2018, SAMSI Workshop
  • 21. May 2018, SAMSI Workshop Prediction Maps for Aug 1st , 2011
  • 22. 5 Fold Cross-Validation ➢ Divide the whole dataset into 5 folds ➢ Train the model using 4 of them and leave out the fifth one ➢ Make predictions on the fifth fold and obtain the MSE and MAD
  • 23. Model MSE MAD CMAQ 51.734 4.681 Simple LR 23.220 3.103 Random forest 13.254 2.177 Spatial analysis 9.734 1.718 May 2018, SAMSI Workshop Model Comparison based on cross-validation
  • 24. May 2018, SAMSI Workshop Prediction Maps for Jan 1st , 2011 MSE of CMAQ = 51.734, MSE of LR = 23.220, MSE of RF = 13.254, MSE of Spatial Analysis = 9.734
  • 25. May 2018, SAMSI Workshop Prediction Maps for Aug 1st , 2011 MSE of CMAQ = 51.734, MSE of LR = 23.220, MSE of RF = 13.254, MSE of Spatial Analysis = 9.734
  • 26. Summary ➢ Spatial analysis makes the BEST predictions ➢ Potential Improvements: ○ Look at the interactions between covariates ○ Other machine learning methods like neural network ○ Seasonal analysis ○ Mid-west?
  • 27. May 2018, SAMSI Workshop Special thanks to Yawen, Amanda, Suman, and Doug