AIR QUALITY
Alan, Chandni, Vincent, Ceci, Sharon, Mao
May 2018, SAMSI Workshop
What is PM2.5?
These particles bypass the nose and throat and penetrate deep into the lungs and circulatory system.
➢ Particulate Matter
➢ Diameter < 2.5 micrometers
➢ 3% of the diameter of a human hair
PM2.5 Monitoring Systems in the US
➢ Monitoring stations are sparse
➢ Need predictions for locations without a monitoring station
What is CMAQ?
CMAQ (Community Multiscale Air Quality) is a numerical air quality model used to predict the concentration of air pollutants.
CMAQ Inaccuracies
● Regions of high topography showed the largest errors
● Areas with more monitoring stations had the best predictions
The Big Question/Goal
What is the best statistical model for predicting PM2.5 concentration levels across the entire U.S. using numerical model outputs and other available covariates?
Hierarchical Clustering Analysis
Variable Selection & Transformation
31 plots: each covariate vs. PM2.5 (response variable)
➢ How is each covariate related to PM2.5?
Residual plot: residuals of PM2.5 vs. each covariate
Adjusted R-squared: not enough
Variable Selection & Transformation
Counting Covariates
31 -> 28 (dropped covariates that record the same measurement)
28 -> 25 (correlation plot & adjusted R-squared)
Correlation Plot
Threshold for Decision: 0.8
Correlated pair
…...
…...
Adjusted R-squared
Used to decide which covariate to exclude when two are highly correlated.
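
The correlation filter can be sketched in a few lines of Python. This is only an illustration under an assumed rule: when two covariates have an absolute Pearson correlation above 0.8, drop the one whose single-covariate regression on PM2.5 has the lower adjusted R-squared. The covariate names and toy values are hypothetical, and the exact procedure used in the workshop may have differed.

```python
from statistics import mean

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def adjusted_r2_simple(x, y):
    """Adjusted R-squared of the one-covariate regression y ~ x."""
    n = len(y)
    r2 = pearson(x, y) ** 2  # for a single predictor, R^2 equals r^2
    return 1.0 - (1.0 - r2) * (n - 1) / (n - 2)

def drop_correlated(covariates, response, threshold=0.8):
    """Return covariate names kept after removing one of each correlated pair."""
    kept = dict(covariates)
    names = list(covariates)
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if a not in kept or b not in kept:
                continue  # one member of the pair was already dropped
            if abs(pearson(covariates[a], covariates[b])) > threshold:
                ra = adjusted_r2_simple(covariates[a], response)
                rb = adjusted_r2_simple(covariates[b], response)
                del kept[a if ra < rb else b]  # keep the better predictor
    return list(kept)

# Hypothetical toy data: temp and temp_dup are nearly identical.
toy = {
    "temp": [1, 2, 3, 4, 5, 6],
    "temp_dup": [1.1, 2.0, 3.2, 3.9, 5.1, 6.0],
    "wind": [2, 5, 1, 4, 2, 5],
}
pm25 = [2, 4, 6, 8, 10, 12]
kept_names = drop_correlated(toy, pm25)
print(kept_names)
```

In this toy case only the near-duplicate covariate is removed, because it is the weaker predictor of the response within its correlated pair.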
Variable Selection & Transformation
Residual Plot
➢ Regress PM2.5 on CMAQ (PM2.5 ~ CMAQ)
➢ Plot the residuals against the other covariates
In the end, 15 covariates were selected
[Figure: residuals plotted against boundary layer height]
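
A minimal Python sketch of the residual check, assuming an ordinary least-squares fit of PM2.5 on CMAQ with an intercept. The toy numbers and the boundary layer height values are hypothetical and stand in for the real station data; the pairs it produces are what one would put on a residual plot.

```python
from statistics import mean

def fit_line(x, y):
    """Least-squares intercept and slope for y ~ x."""
    mx, my = mean(x), mean(y)
    slope = (sum((a - mx) * (b - my) for a, b in zip(x, y))
             / sum((a - mx) ** 2 for a in x))
    return my - slope * mx, slope

def residuals(x, y):
    """Observed y minus the fitted line at each x."""
    intercept, slope = fit_line(x, y)
    return [b - (intercept + slope * a) for a, b in zip(x, y)]

# Hypothetical toy values, one entry per monitoring station.
cmaq = [5.0, 8.0, 11.0, 14.0]
pm25 = [6.0, 7.0, 12.0, 13.0]
boundary_layer_height = [300.0, 900.0, 450.0, 1200.0]

res = residuals(cmaq, pm25)

# Points for a residual plot: covariate on the x-axis, residual on the y-axis.
plot_points = sorted(zip(boundary_layer_height, res))
for height, r in plot_points:
    print(f"{height:7.1f}  {r:+.3f}")
```

A visible trend in such a plot suggests the covariate explains variation in PM2.5 that CMAQ alone misses.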
Random Forest
No. of trees: 500
No. of variables tried at each split: 5
Mean of squared residuals (log scale): 0.1075135
% Variance explained: 72.25
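
The two summary numbers can be related with a short calculation. The sketch below assumes the "% variance explained" figure is the pseudo R-squared, 1 - MSE / Var(y), which is how R's randomForest package defines it; the slides do not state which software or definition was used, so the implied variance is only a consistency check, not a reported result.

```python
def pct_variance_explained(mse, variance_y):
    """Pseudo R-squared as a percentage: 100 * (1 - MSE / Var(y))."""
    return 100.0 * (1.0 - mse / variance_y)

def implied_variance(mse, pct_explained):
    """Invert the formula above to recover Var(y) from the two summaries."""
    return mse / (1.0 - pct_explained / 100.0)

# Values taken from the slide; the variance is derived, not reported.
implied = implied_variance(0.1075135, 72.25)
print(round(implied, 4))
```

Under that assumption, the response on the log scale would have a variance of roughly 0.39.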
Some fun math behind the models…
Spatial Model
Covariance Matrix
Conditional Normality
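
For reference, the standard conditional-normality result that spatial prediction relies on can be written as follows. The notation is an assumption introduced here, not taken from the slides: Y_0 is the unobserved value at a new location, Y the vector of observed values, c the covariances between them, and Sigma the covariance matrix of the observations.

```latex
\begin{aligned}
\begin{pmatrix} Y_0 \\ \mathbf{Y} \end{pmatrix}
&\sim \mathcal{N}\!\left(
\begin{pmatrix} \mu_0 \\ \boldsymbol{\mu} \end{pmatrix},
\begin{pmatrix} \sigma_{00} & \mathbf{c}^\top \\ \mathbf{c} & \Sigma \end{pmatrix}
\right), \\
Y_0 \mid \mathbf{Y} = \mathbf{y}
&\sim \mathcal{N}\!\left(
\mu_0 + \mathbf{c}^\top \Sigma^{-1} (\mathbf{y} - \boldsymbol{\mu}),\;
\sigma_{00} - \mathbf{c}^\top \Sigma^{-1} \mathbf{c}
\right).
\end{aligned}
```

The conditional mean is the prediction, and the conditional variance quantifies its uncertainty.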
Some fun math behind the models…
The Kriging Concept
“The basic idea of kriging is to predict the value of a function at a given
point by computing a weighted average of the known values of the
function in the neighborhood of the point.”
———Wikipedia
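
The weighted-average idea in the quote can be illustrated with a small Python sketch. It uses simple kriging with a known mean and an exponential covariance function, both of which are assumptions chosen for brevity; the slides do not say which kriging variant or covariance model was fitted. The site coordinates and values are hypothetical.

```python
import math

def exp_cov(p, q, sill=1.0, range_=1.0):
    """Exponential covariance between two 2-D points (assumed model)."""
    return sill * math.exp(-math.dist(p, q) / range_)

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

def simple_kriging(sites, values, target, mean_value):
    """Predict at target as mean + weighted sum of deviations from the mean."""
    cov = [[exp_cov(p, q) for q in sites] for p in sites]
    c0 = [exp_cov(p, target) for p in sites]
    weights = solve(cov, c0)  # weights = cov^{-1} c0
    pred = mean_value + sum(w * (v - mean_value)
                            for w, v in zip(weights, values))
    return pred, weights

# Hypothetical monitoring sites and PM2.5 readings.
sites = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
values = [10.0, 12.0, 8.0]

pred_at_site, _ = simple_kriging(sites, values, (0.0, 0.0), mean_value=10.0)
pred_near, weights_near = simple_kriging(sites, values, (0.1, 0.0), mean_value=10.0)
print(pred_at_site, pred_near, weights_near)
```

Two properties are visible in the toy run: predicting at an observed site reproduces the observed value, and the site closest to the target receives the largest weight.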
January 1st Measurements
Prediction Maps for Jan 1st, 2011
August 1st Measurements
Prediction Maps for Aug 1st, 2011
5-Fold Cross-Validation
➢ Divide the whole dataset into 5 folds
➢ Train the model using 4 of them and leave out the fifth one
➢ Make predictions on the fifth fold and obtain the MSE and MAD
Model              MSE      MAD
CMAQ               51.734   4.681
Simple LR          23.220   3.103
Random forest      13.254   2.177
Spatial analysis    9.734   1.718
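
The cross-validation steps can be sketched as follows. This is a minimal illustration, assuming MAD means the mean absolute deviation of the prediction errors and that errors are pooled over all five held-out folds; the slides do not spell out either choice. The straight-line model and toy data are hypothetical placeholders for the real models.

```python
import random
from statistics import mean

def k_fold_indices(n, k=5, seed=0):
    """Shuffle 0..n-1 and split the indices into k folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def cross_validate(x, y, fit, predict, k=5, seed=0):
    """Return (MSE, MAD) of held-out predictions pooled over all folds."""
    sq_err, abs_err = [], []
    for fold in k_fold_indices(len(y), k, seed):
        held_out = set(fold)
        train = [i for i in range(len(y)) if i not in held_out]
        model = fit([x[i] for i in train], [y[i] for i in train])
        for i in fold:
            err = y[i] - predict(model, x[i])
            sq_err.append(err ** 2)
            abs_err.append(abs(err))
    return mean(sq_err), mean(abs_err)

def fit_line(xs, ys):
    """Placeholder model: least-squares line, returns (intercept, slope)."""
    mx, my = mean(xs), mean(ys)
    slope = (sum((a - mx) * (b - my) for a, b in zip(xs, ys))
             / sum((a - mx) ** 2 for a in xs))
    return my - slope * mx, slope

def predict_line(model, xi):
    intercept, slope = model
    return intercept + slope * xi

# Hypothetical toy data that is exactly linear, so errors should vanish.
x = list(range(10))
y = [2 * v + 1 for v in x]
folds = k_fold_indices(len(y))
cv_mse, cv_mad = cross_validate(x, y, fit_line, predict_line)
print(cv_mse, cv_mad)
```

Any model with a fit and a predict function can be passed in the same way, which keeps the comparison between models on identical folds.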
Model Comparison Based on Cross-Validation
Prediction Maps for Jan 1st, 2011
MSE of CMAQ = 51.734, MSE of LR = 23.220, MSE of RF = 13.254, MSE of Spatial Analysis = 9.734
Prediction Maps for Aug 1st, 2011
Summary
➢ Spatial analysis makes the BEST predictions
➢ Potential Improvements:
○ Look at the interactions between covariates
○ Other machine learning methods, such as neural networks
○ Seasonal analysis
○ Midwest?
Special thanks to Yawen, Amanda, Suman, and Doug
Undergraduate Modeling Workshop - Air Quality Working Group Final Presentation, May 25, 2018

BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
Nguyen Thanh Tu Collection
 
How to Fix [Errno 98] address already in use
How to Fix [Errno 98] address already in useHow to Fix [Errno 98] address already in use
How to Fix [Errno 98] address already in use
Celine George
 
Bonku-Babus-Friend by Sathyajith Ray (9)
Bonku-Babus-Friend by Sathyajith Ray  (9)Bonku-Babus-Friend by Sathyajith Ray  (9)
Bonku-Babus-Friend by Sathyajith Ray (9)
nitinpv4ai
 
Pharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brubPharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brub
danielkiash986
 
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
Nguyen Thanh Tu Collection
 
How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17
Celine George
 
MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025
khuleseema60
 
Data Structure using C by Dr. K Adisesha .ppsx
Data Structure using C by Dr. K Adisesha .ppsxData Structure using C by Dr. K Adisesha .ppsx
Data Structure using C by Dr. K Adisesha .ppsx
Prof. Dr. K. Adisesha
 
Electric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger HuntElectric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger Hunt
RamseyBerglund
 
Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"
National Information Standards Organization (NISO)
 
How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17
Celine George
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
TechSoup
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
Himanshu Rai
 
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
TechSoup
 
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdfREASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
giancarloi8888
 
Accounting for Restricted Grants When and How To Record Properly
Accounting for Restricted Grants  When and How To Record ProperlyAccounting for Restricted Grants  When and How To Record Properly
Accounting for Restricted Grants When and How To Record Properly
TechSoup
 

Recently uploaded (20)

HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
 
CIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdfCIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdf
 
Skimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S EliotSkimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S Eliot
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
 
How to Fix [Errno 98] address already in use
How to Fix [Errno 98] address already in useHow to Fix [Errno 98] address already in use
How to Fix [Errno 98] address already in use
 
Bonku-Babus-Friend by Sathyajith Ray (9)
Bonku-Babus-Friend by Sathyajith Ray  (9)Bonku-Babus-Friend by Sathyajith Ray  (9)
Bonku-Babus-Friend by Sathyajith Ray (9)
 
Pharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brubPharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brub
 
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
 
How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17
 
MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025
 
Data Structure using C by Dr. K Adisesha .ppsx
Data Structure using C by Dr. K Adisesha .ppsxData Structure using C by Dr. K Adisesha .ppsx
Data Structure using C by Dr. K Adisesha .ppsx
 
Electric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger HuntElectric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger Hunt
 
Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"
 
How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
 
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
 
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdfREASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
 
Accounting for Restricted Grants When and How To Record Properly
Accounting for Restricted Grants  When and How To Record ProperlyAccounting for Restricted Grants  When and How To Record Properly
Accounting for Restricted Grants When and How To Record Properly
 

Undergraduate Modeling Workshop - Air Quality Working Group Final Presentation, May 25, 2018

  • 1. AIR QUALITY Alan, Chandni, Vincent, Ceci, Sharon, Mao May 2018, SAMSI Workshop
  • 2. What is PM2.5? ➢ Particulate matter ➢ Diameter < 2.5 micrometers ➢ About 3% of the diameter of a human hair ➢ Bypasses the nose and throat and penetrates deep into the lungs and circulatory system
  • 3. PM2.5 Monitoring Systems in the US ➢ Monitoring stations are sparse ➢ Predictions are needed for locations without a monitoring station
  • 4. What is CMAQ? CMAQ (Community Multi-scale Air Quality) is a numerical air quality model used to predict the concentration of air pollutants.
  • 5. CMAQ Inaccuracies ● High-topography regions had the largest errors ● Areas with more monitoring stations had the best predictions
  • 6. The Big Question/Goal: What is the best statistical model for predicting PM2.5 concentration across the entire U.S. using numerical model outputs and other available covariates?
  • 11. Variable Selection & Transformation ➢ 31 plots: each covariate vs. PM2.5 (response variable). Is each covariate related to PM2.5? ➢ Residual plot: residuals of PM2.5 vs. each covariate ➢ Adjusted R-squared: not enough
  • 12. Variable Selection & Transformation: Counting Covariates ➢ 31 -> 28 (same measurement) ➢ 28 -> 25 (correlation plot & R-squared) ➢ Correlation plot, threshold for decision: 0.8 ➢ Correlated pairs: …
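
The 0.8 correlation threshold on slide 12 can be illustrated with a minimal sketch. It flags covariate pairs whose absolute Pearson correlation exceeds the threshold. The covariate names and values below are hypothetical toy data, not the workshop dataset, and the follow-up step (using adjusted R-squared to choose which member of a flagged pair to drop) is not implemented here.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlated_pairs(columns, threshold=0.8):
    """Return (name_a, name_b, r) for every pair with |r| above the threshold."""
    pairs = []
    for a, b in combinations(sorted(columns), 2):
        r = pearson(columns[a], columns[b])
        if abs(r) > threshold:
            pairs.append((a, b, r))
    return pairs


# Hypothetical toy covariates (names and values are assumptions).
toy = {
    "temp_2m": [1.0, 2.0, 3.0, 4.0, 5.0],
    "temp_surface": [1.1, 2.0, 3.2, 3.9, 5.1],
    "wind_speed": [2.0, -1.0, 0.5, 3.0, 0.0],
}
flagged = correlated_pairs(toy)
for a, b, r in flagged:
    print(f"{a} and {b} are highly correlated (r = {r:.3f})")
```

Only one covariate from each flagged pair would then be kept.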
  • 13. Variable Selection & Transformation ➢ 31 plots: each covariate vs. PM2.5 (response variable) ➢ Residual plot: residuals of PM2.5 vs. each covariate ➢ Adjusted R-squared: used to decide which covariate to exclude when two are highly correlated
  • 14. Variable Selection & Transformation: Residual Plot ➢ Regress PM2.5 on CMAQ (PM2.5 ~ CMAQ) ➢ Plot the residuals against the other covariates (example plot: residuals vs. boundary layer height) ➢ In the end, 15 covariates were selected
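
The residual-plot step on slide 14 can be sketched as follows: fit PM2.5 ~ CMAQ by ordinary least squares, then pair the residuals with another covariate so they can be plotted against it. All numbers are made-up illustrations, and `boundary_layer_height` is only a placeholder for the covariate shown on the slide.

```python
def fit_line(x, y):
    """Ordinary least squares for y = b0 + b1 * x; returns (b0, b1)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum(
        (a - mx) ** 2 for a in x
    )
    return my - slope * mx, slope


def residuals(x, y):
    """Residuals y - fitted after regressing y on x."""
    b0, b1 = fit_line(x, y)
    return [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]


# Hypothetical toy values (assumptions, not workshop data).
cmaq = [4.0, 6.0, 8.0, 10.0, 12.0]
pm25 = [5.0, 6.5, 7.0, 9.5, 10.0]
boundary_layer_height = [300.0, 550.0, 420.0, 800.0, 650.0]

resid = residuals(cmaq, pm25)
# Points for a residual-vs-covariate scatter plot.
points = list(zip(boundary_layer_height, resid))
```

A visible trend in such a plot suggests the covariate explains variation in PM2.5 that CMAQ alone misses.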
  • 15. Random Forest ➢ No. of trees: 500 ➢ No. of variables tried at each split: 5 ➢ Mean of squared residuals (log scale): 0.1075135 ➢ % variance explained: 72.25 Some fun math behind the models…
  • 16. Spatial Model ➢ Covariance matrix ➢ Conditional normality
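
Slide 16 names the covariance matrix and conditional normality, but its formulas did not survive in this transcript. The standard multivariate normal result these terms usually refer to is given here as a reminder, not as a reconstruction of the slide. The notation is an assumption: $Y_0$ is the value at an unmonitored location, $\mathbf{Y}$ is the vector of monitored values, $\Sigma$ is their covariance matrix, and $\mathbf{c}$ holds the covariances between $Y_0$ and $\mathbf{Y}$.

```latex
\begin{pmatrix} Y_0 \\ \mathbf{Y} \end{pmatrix}
\sim N\!\left(
\begin{pmatrix} \mu_0 \\ \boldsymbol{\mu} \end{pmatrix},
\begin{pmatrix} \sigma_0^2 & \mathbf{c}^\top \\ \mathbf{c} & \Sigma \end{pmatrix}
\right)
\quad\Longrightarrow\quad
Y_0 \mid \mathbf{Y} = \mathbf{y}
\sim N\!\left(
\mu_0 + \mathbf{c}^\top \Sigma^{-1} (\mathbf{y} - \boldsymbol{\mu}),\;
\sigma_0^2 - \mathbf{c}^\top \Sigma^{-1} \mathbf{c}
\right)
```

The conditional mean is a weighted combination of the observed values, with weights $\Sigma^{-1}\mathbf{c}$.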
  • 17. The Kriging Concept “The basic idea of kriging is to predict the value of a function at a given point by computing a weighted average of the known values of the function in the neighborhood of the point.” (Wikipedia)
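
The weighted-average idea in the quote above can be sketched in a few lines. This is simple kriging with a known mean and an exponential covariance function. Both are assumptions chosen for brevity, since the slides do not state which kriging variant or covariance model the group used, and the site coordinates, values, and parameters are hypothetical.

```python
import math


def exp_cov(d, sill=1.0, range_=1.0):
    """Exponential covariance as a function of distance d."""
    return sill * math.exp(-d / range_)


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def simple_kriging(sites, values, target, mean, sill=1.0, range_=1.0):
    """Predict the value at `target`; returns (prediction, weights)."""
    C = [[exp_cov(math.dist(a, b), sill, range_) for b in sites] for a in sites]
    c0 = [exp_cov(math.dist(a, target), sill, range_) for a in sites]
    w = solve(C, c0)  # kriging weights
    pred = mean + sum(wi * (v - mean) for wi, v in zip(w, values))
    return pred, w


# Hypothetical monitoring sites (x, y) and PM2.5 readings.
sites = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
values = [10.0, 12.0, 8.0]
prediction, weights = simple_kriging(sites, values, (0.5, 0.5), mean=10.0)
```

Nearby sites receive larger weights, and predicting at a monitored site reproduces its observed value.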
  • 18. January 1st Measurements
  • 19. Prediction Maps for Jan 1st, 2011
  • 20. August 1st Measurements
  • 21. Prediction Maps for Aug 1st, 2011
  • 22. 5-Fold Cross-Validation ➢ Divide the whole dataset into 5 folds ➢ Train the model on 4 of them and hold out the fifth ➢ Make predictions on the held-out fold and compute the MSE and MAD
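
The cross-validation procedure on slide 22 can be sketched as below, with a one-variable linear regression standing in for the real models. Two assumptions are made: MAD is read as mean absolute deviation of the prediction errors, and errors are pooled across all five held-out folds before averaging. The slides do not spell out either choice, and the data are synthetic.

```python
import random


def k_fold_indices(n, k=5, seed=0):
    """Shuffle indices 0..n-1 and deal them into k roughly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def fit_line(x, y):
    """Ordinary least squares for y = b0 + b1 * x; returns (b0, b1)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum(
        (a - mx) ** 2 for a in x
    )
    return my - slope * mx, slope


def cross_validate(x, y, k=5, seed=0):
    """Return (MSE, MAD) of held-out predictions pooled over all k folds."""
    sq_err, abs_err = [], []
    for held_out in k_fold_indices(len(x), k, seed):
        held = set(held_out)
        train = [i for i in range(len(x)) if i not in held]
        b0, b1 = fit_line([x[i] for i in train], [y[i] for i in train])
        for i in held_out:
            err = y[i] - (b0 + b1 * x[i])
            sq_err.append(err ** 2)
            abs_err.append(abs(err))
    return sum(sq_err) / len(sq_err), sum(abs_err) / len(abs_err)


# Synthetic, exactly linear data: held-out error should be essentially zero.
x = [float(i) for i in range(20)]
y = [2.0 + 0.5 * xi for xi in x]
mse, mad = cross_validate(x, y)
```

Each observation is predicted exactly once, by a model that never saw it during training.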
  • 23. Model Comparison Based on Cross-Validation
        Model              MSE      MAD
        CMAQ               51.734   4.681
        Simple LR          23.220   3.103
        Random forest      13.254   2.177
        Spatial analysis    9.734   1.718
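
To make the comparison on slide 23 easier to read, the short sketch below expresses each model's cross-validated MSE as a percentage reduction relative to raw CMAQ output. The MSE values are copied from the slide, and the percentage framing is an added illustration rather than something the presenters reported.

```python
# Cross-validated MSE values as reported on slide 23.
mse_by_model = {
    "CMAQ": 51.734,
    "Simple LR": 23.220,
    "Random forest": 13.254,
    "Spatial analysis": 9.734,
}

baseline = mse_by_model["CMAQ"]
# Percent reduction in MSE relative to the CMAQ baseline.
reduction = {
    model: 100.0 * (1.0 - value / baseline)
    for model, value in mse_by_model.items()
}

for model, pct in reduction.items():
    print(f"{model}: {pct:.1f}% lower MSE than CMAQ")
```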
  • 24. Prediction Maps for Jan 1st, 2011 (MSE of CMAQ = 51.734, MSE of LR = 23.220, MSE of RF = 13.254, MSE of Spatial Analysis = 9.734)
  • 25. Prediction Maps for Aug 1st, 2011 (MSE of CMAQ = 51.734, MSE of LR = 23.220, MSE of RF = 13.254, MSE of Spatial Analysis = 9.734)
  • 26. Summary ➢ Spatial analysis makes the best predictions ➢ Potential improvements: ○ Look at the interactions between covariates ○ Try other machine learning methods, such as neural networks ○ Seasonal analysis ○ Mid-west?
  • 27. Special thanks to Yawen, Amanda, Suman, and Doug