SlideShare a Scribd company logo
Dimensionality Reduction Using Sliced Inverse
Regression In Modeling Large Climate Data
UMBC REU Site: Interdisciplinary Program in High Performance Computing
Ross Flieger-Allison1
, Lois Miller2
, Danielle Sykes3
, Pablo Valle4
RAs: Sai K. Popuri3
, Nadeesri Wijekoon3
Faculty: Nagaraj K. Neerchal3
Client: Amita Mehta5
1
Williams College 2
DePauw University 3
UMBC 4
Kean University 5
JCET
• The Missouri River Basin (MRB) is a
significant agricultural region that is
not irrigated and thus dependent on
rainfall
• Precipitation predictions are histor-
ically inaccurate due to the semi-
continuous nature of observed precip-
itation data
Figure 1: The Missouri Rivier Basin
Background
• Precipitation data is semi-continuous,
meaning many observations are equal
to 0 and all other observations are
positive and follow a continuous dis-
tribution
• Predictions are calculated using ob-
served data and data from MIROC5,
a Global Climate Model (GCM),
across 57 years (1949-2005) from over
270,000 locations
• Precipitation at any given location s
depends on a large number of co-
variates: current and past values of
monthly precipitation, sea-level pres-
sure, relative humidity, and max-
imum/minimum temperatures at s
and its neighboring locations
Data
• Sliced Inverse Regression (SIR) is a
data-analytic tool that can be used to
reduce the dimensionality of a large
set of covariates
• Nadaraya-Watson Estimator (NWE)
is a simple non-paramentric smooth-
ing regression method:
ˆmx =
n
i=1 Kh(x−xi)yi
n
i=1 Kh(x−xi)
• The NWE is adapted to account for
the semi-continous nature of predic-
tions
Methodology
• The following figures show a comparison
between observed precipitation data, and
their predictions
Figure 2: Observed and predicted monthly precipitation, including
the proportion of 0 values for both
Figure 3: Mean Squared Error from positive predictions for months
with positive rainfall (true positives)
Results
Table 1: Performance Study Results
Processes Subregion∗
MRB
1 01:14:24 N/A
4 00:19:07 09:54:22
8 00:11:04 05:20:31
16 00:05:58 02:49:49
32 00:03:13 04:25:51
64 N/A 03:41:26
∗ denotes a region from latitutude -101 to -97 and longitude 39.25 to 42.95
Figure 4: Observed speedup (Sp = (T1/Tp)) of parallelization
Parallelization
• We have successfully demonstrated
that SIR and NWE methods can
be implemented to work on a large
dataset
• Parallelization of SIR and NWE code
greatly increases computational effi-
ciency on the subregion, and also im-
proves efficiency for the entire MRB
region, up to 16 processes
Conclusions
[1] National Integrated Drought Information
Systems. https://www.drought.gov/
drought/dews/missouri-river-basin/
about.
[2] S. K. Popuri, K. P. Adragni, et al., “Spatio-
Temporal Analysis of Daily Precipitation via a
Sufficient Dimension Reduction.” (In Progress).
[3] Full Technical Report: HPCF-2016-13 hpcf.
umbc.edu > Publications
REU Site: hpcreu.umbc.edu
NSF, NSA, DOD, UMBC, HPCF, CIRC
References & Acknowledgments

More Related Content

What's hot

vectorrasterQGIS
vectorrasterQGISvectorrasterQGIS
vectorrasterQGIS
Stefani Lukashov, GIT
 
Chris Spry
Chris SpryChris Spry
Chris Spry
Wuzzy13
 
Chris Spry
Chris SpryChris Spry
Chris Spry
Wuzzy13
 
Chris Spry
Chris SpryChris Spry
Chris Spry
Wuzzy13
 
FR4.L10.2: A MICROWAVE SCATTERING MODEL OF VEGETATED SURFACES BASED ON BOR/DD...
FR4.L10.2: A MICROWAVE SCATTERING MODEL OF VEGETATED SURFACES BASED ON BOR/DD...FR4.L10.2: A MICROWAVE SCATTERING MODEL OF VEGETATED SURFACES BASED ON BOR/DD...
FR4.L10.2: A MICROWAVE SCATTERING MODEL OF VEGETATED SURFACES BASED ON BOR/DD...
grssieee
 
3D Analyst - Lake Lorelindu by GRASS
3D Analyst - Lake Lorelindu by GRASS3D Analyst - Lake Lorelindu by GRASS
3D Analyst - Lake Lorelindu by GRASS
Hartanto Sanjaya
 
CSpryFinal5
CSpryFinal5CSpryFinal5
CSpryFinal5
Wuzzy13
 
3D Analyst - Watershed Lorelindu
3D Analyst - Watershed Lorelindu3D Analyst - Watershed Lorelindu
3D Analyst - Watershed Lorelindu
Hartanto Sanjaya
 
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
The Statistical and Applied Mathematical Sciences Institute
 
IGARSS2011(OkiKazuo110726).ppt
IGARSS2011(OkiKazuo110726).pptIGARSS2011(OkiKazuo110726).ppt
IGARSS2011(OkiKazuo110726).ppt
grssieee
 
3D Analyst - Lake, Jatiluhur
3D Analyst - Lake, Jatiluhur3D Analyst - Lake, Jatiluhur
3D Analyst - Lake, Jatiluhur
Hartanto Sanjaya
 
Change detection of forest fire in los angeles
Change detection of forest fire in los angelesChange detection of forest fire in los angeles
Change detection of forest fire in los angeles
Parthipan S
 
ÖNCEL AKADEMİ: INTRODUCTION TO SEISMOLOGY
ÖNCEL AKADEMİ: INTRODUCTION TO SEISMOLOGYÖNCEL AKADEMİ: INTRODUCTION TO SEISMOLOGY
ÖNCEL AKADEMİ: INTRODUCTION TO SEISMOLOGY
Ali Osman Öncel
 
Rainfall Erosivity in China
Rainfall Erosivity in ChinaRainfall Erosivity in China
Rainfall Erosivity in China
ExternalEvents
 
Nichols, Meredith_Validation of Precipitation and Lightning Observations
Nichols, Meredith_Validation of Precipitation and Lightning ObservationsNichols, Meredith_Validation of Precipitation and Lightning Observations
Nichols, Meredith_Validation of Precipitation and Lightning Observations
Meredith Nichols
 
Use of the German Weather Services KLAM Model to Investigate the Cold Air Dra...
Use of the German Weather Services KLAM Model to Investigate the Cold Air Dra...Use of the German Weather Services KLAM Model to Investigate the Cold Air Dra...
Use of the German Weather Services KLAM Model to Investigate the Cold Air Dra...
IES / IAQM
 
Fire Poster
Fire PosterFire Poster
Fire Poster
Anders Kipp
 
Gutierrez_E_Lab2
Gutierrez_E_Lab2Gutierrez_E_Lab2
Gutierrez_E_Lab2
Enrique Gutierrez
 
Flood Risk Zonation for Karaj, Ezatollah GHANAVATI
Flood Risk Zonation for Karaj, Ezatollah GHANAVATIFlood Risk Zonation for Karaj, Ezatollah GHANAVATI
Flood Risk Zonation for Karaj, Ezatollah GHANAVATI
Global Risk Forum GRFDavos
 
TREE BIOMASS ESTIMATION USING AIRBORNE LASER SCANNING
TREE BIOMASS ESTIMATION USING AIRBORNE LASER SCANNINGTREE BIOMASS ESTIMATION USING AIRBORNE LASER SCANNING
TREE BIOMASS ESTIMATION USING AIRBORNE LASER SCANNING
National Cheng Kung University
 

What's hot (20)

vectorrasterQGIS
vectorrasterQGISvectorrasterQGIS
vectorrasterQGIS
 
Chris Spry
Chris SpryChris Spry
Chris Spry
 
Chris Spry
Chris SpryChris Spry
Chris Spry
 
Chris Spry
Chris SpryChris Spry
Chris Spry
 
FR4.L10.2: A MICROWAVE SCATTERING MODEL OF VEGETATED SURFACES BASED ON BOR/DD...
FR4.L10.2: A MICROWAVE SCATTERING MODEL OF VEGETATED SURFACES BASED ON BOR/DD...FR4.L10.2: A MICROWAVE SCATTERING MODEL OF VEGETATED SURFACES BASED ON BOR/DD...
FR4.L10.2: A MICROWAVE SCATTERING MODEL OF VEGETATED SURFACES BASED ON BOR/DD...
 
3D Analyst - Lake Lorelindu by GRASS
3D Analyst - Lake Lorelindu by GRASS3D Analyst - Lake Lorelindu by GRASS
3D Analyst - Lake Lorelindu by GRASS
 
CSpryFinal5
CSpryFinal5CSpryFinal5
CSpryFinal5
 
3D Analyst - Watershed Lorelindu
3D Analyst - Watershed Lorelindu3D Analyst - Watershed Lorelindu
3D Analyst - Watershed Lorelindu
 
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
 
IGARSS2011(OkiKazuo110726).ppt
IGARSS2011(OkiKazuo110726).pptIGARSS2011(OkiKazuo110726).ppt
IGARSS2011(OkiKazuo110726).ppt
 
3D Analyst - Lake, Jatiluhur
3D Analyst - Lake, Jatiluhur3D Analyst - Lake, Jatiluhur
3D Analyst - Lake, Jatiluhur
 
Change detection of forest fire in los angeles
Change detection of forest fire in los angelesChange detection of forest fire in los angeles
Change detection of forest fire in los angeles
 
ÖNCEL AKADEMİ: INTRODUCTION TO SEISMOLOGY
ÖNCEL AKADEMİ: INTRODUCTION TO SEISMOLOGYÖNCEL AKADEMİ: INTRODUCTION TO SEISMOLOGY
ÖNCEL AKADEMİ: INTRODUCTION TO SEISMOLOGY
 
Rainfall Erosivity in China
Rainfall Erosivity in ChinaRainfall Erosivity in China
Rainfall Erosivity in China
 
Nichols, Meredith_Validation of Precipitation and Lightning Observations
Nichols, Meredith_Validation of Precipitation and Lightning ObservationsNichols, Meredith_Validation of Precipitation and Lightning Observations
Nichols, Meredith_Validation of Precipitation and Lightning Observations
 
Use of the German Weather Services KLAM Model to Investigate the Cold Air Dra...
Use of the German Weather Services KLAM Model to Investigate the Cold Air Dra...Use of the German Weather Services KLAM Model to Investigate the Cold Air Dra...
Use of the German Weather Services KLAM Model to Investigate the Cold Air Dra...
 
Fire Poster
Fire PosterFire Poster
Fire Poster
 
Gutierrez_E_Lab2
Gutierrez_E_Lab2Gutierrez_E_Lab2
Gutierrez_E_Lab2
 
Flood Risk Zonation for Karaj, Ezatollah GHANAVATI
Flood Risk Zonation for Karaj, Ezatollah GHANAVATIFlood Risk Zonation for Karaj, Ezatollah GHANAVATI
Flood Risk Zonation for Karaj, Ezatollah GHANAVATI
 
TREE BIOMASS ESTIMATION USING AIRBORNE LASER SCANNING
TREE BIOMASS ESTIMATION USING AIRBORNE LASER SCANNINGTREE BIOMASS ESTIMATION USING AIRBORNE LASER SCANNING
TREE BIOMASS ESTIMATION USING AIRBORNE LASER SCANNING
 

Similar to SURF_Poster_REU2016Team3

Comparison of Spatial Interpolation Methods for Precipitation in Ningxia, China
Comparison of Spatial Interpolation Methods for Precipitation in Ningxia, ChinaComparison of Spatial Interpolation Methods for Precipitation in Ningxia, China
Comparison of Spatial Interpolation Methods for Precipitation in Ningxia, China
International Journal of Science and Research (IJSR)
 
Coweeta ppt cd_ms
Coweeta ppt cd_msCoweeta ppt cd_ms
Coweeta ppt cd_ms
questRCN
 
Probalistic assessment of agriculture
Probalistic assessment of agricultureProbalistic assessment of agriculture
Probalistic assessment of agriculture
Soil and Water Conservation Society
 
Soil erosion
Soil erosionSoil erosion
Soil erosion
Hanamantray sk
 
MCP_ES_2012_Jie
MCP_ES_2012_JieMCP_ES_2012_Jie
MCP_ES_2012_Jie
MDO_Lab
 
Application of Bayesian Regularized Neural Networks for Groundwater Level Mo...
Application of Bayesian Regularized Neural  Networks for Groundwater Level Mo...Application of Bayesian Regularized Neural  Networks for Groundwater Level Mo...
Application of Bayesian Regularized Neural Networks for Groundwater Level Mo...
Dr. Amir Mosavi, PhD., P.Eng.
 
Determining reservoir outflow using machine learning techniques.pptx
Determining reservoir outflow using machine learning techniques.pptxDetermining reservoir outflow using machine learning techniques.pptx
Determining reservoir outflow using machine learning techniques.pptx
Prakhar Verma
 
Air quality challenges and business opportunities in China: Fusion of environ...
Air quality challenges and business opportunities in China: Fusion of environ...Air quality challenges and business opportunities in China: Fusion of environ...
Air quality challenges and business opportunities in China: Fusion of environ...
CLIC Innovation Ltd
 
Aquifer recharge from flash floods in the arid environment: A mass balance ap...
Aquifer recharge from flash floods in the arid environment: A mass balance ap...Aquifer recharge from flash floods in the arid environment: A mass balance ap...
Aquifer recharge from flash floods in the arid environment: A mass balance ap...
Amro Elfeki
 
Mauro Sulis
Mauro SulisMauro Sulis
urpl969-group2-paper-03May06
urpl969-group2-paper-03May06urpl969-group2-paper-03May06
urpl969-group2-paper-03May06
Wintford Thornton
 
Determination of homogenous regions in the Tensift basin (Morocco).
Determination of homogenous regions in the Tensift basin (Morocco).Determination of homogenous regions in the Tensift basin (Morocco).
Determination of homogenous regions in the Tensift basin (Morocco).
IJERA Editor
 
Monsoon Rainfall Forecast+.ppt
Monsoon Rainfall Forecast+.pptMonsoon Rainfall Forecast+.ppt
Monsoon Rainfall Forecast+.ppt
AryanNetkar1
 
APPLICATION OF KRIGING IN GROUND WATER STUDIES
APPLICATION OF KRIGING IN GROUND WATER STUDIESAPPLICATION OF KRIGING IN GROUND WATER STUDIES
APPLICATION OF KRIGING IN GROUND WATER STUDIES
Abhiram Kanigolla
 
Remote sensing in agriculture
Remote sensing in agricultureRemote sensing in agriculture
Remote sensing in agriculture
Chitra Nair
 
Remote sensing in agriculture
Remote sensing in agricultureRemote sensing in agriculture
Remote sensing in agriculture
Chitra Nair
 
CSU_Poster
CSU_PosterCSU_Poster
CSU_Poster
Jason A. Sulskis
 
September 1 - 1116 - Tassia Brighenti and Phillip Gassman
September 1 - 1116 - Tassia Brighenti and Phillip GassmanSeptember 1 - 1116 - Tassia Brighenti and Phillip Gassman
September 1 - 1116 - Tassia Brighenti and Phillip Gassman
Soil and Water Conservation Society
 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis Presentation
Jo Elver-Evans
 
Master's course defense presentation in Water Resource Management and GIS
Master's course defense presentation in Water Resource Management and GIS  Master's course defense presentation in Water Resource Management and GIS
Master's course defense presentation in Water Resource Management and GIS
Tooryalay Ayoubi
 

Similar to SURF_Poster_REU2016Team3 (20)

Comparison of Spatial Interpolation Methods for Precipitation in Ningxia, China
Comparison of Spatial Interpolation Methods for Precipitation in Ningxia, ChinaComparison of Spatial Interpolation Methods for Precipitation in Ningxia, China
Comparison of Spatial Interpolation Methods for Precipitation in Ningxia, China
 
Coweeta ppt cd_ms
Coweeta ppt cd_msCoweeta ppt cd_ms
Coweeta ppt cd_ms
 
Probalistic assessment of agriculture
Probalistic assessment of agricultureProbalistic assessment of agriculture
Probalistic assessment of agriculture
 
Soil erosion
Soil erosionSoil erosion
Soil erosion
 
MCP_ES_2012_Jie
MCP_ES_2012_JieMCP_ES_2012_Jie
MCP_ES_2012_Jie
 
Application of Bayesian Regularized Neural Networks for Groundwater Level Mo...
Application of Bayesian Regularized Neural  Networks for Groundwater Level Mo...Application of Bayesian Regularized Neural  Networks for Groundwater Level Mo...
Application of Bayesian Regularized Neural Networks for Groundwater Level Mo...
 
Determining reservoir outflow using machine learning techniques.pptx
Determining reservoir outflow using machine learning techniques.pptxDetermining reservoir outflow using machine learning techniques.pptx
Determining reservoir outflow using machine learning techniques.pptx
 
Air quality challenges and business opportunities in China: Fusion of environ...
Air quality challenges and business opportunities in China: Fusion of environ...Air quality challenges and business opportunities in China: Fusion of environ...
Air quality challenges and business opportunities in China: Fusion of environ...
 
Aquifer recharge from flash floods in the arid environment: A mass balance ap...
Aquifer recharge from flash floods in the arid environment: A mass balance ap...Aquifer recharge from flash floods in the arid environment: A mass balance ap...
Aquifer recharge from flash floods in the arid environment: A mass balance ap...
 
Mauro Sulis
Mauro SulisMauro Sulis
Mauro Sulis
 
urpl969-group2-paper-03May06
urpl969-group2-paper-03May06urpl969-group2-paper-03May06
urpl969-group2-paper-03May06
 
Determination of homogenous regions in the Tensift basin (Morocco).
Determination of homogenous regions in the Tensift basin (Morocco).Determination of homogenous regions in the Tensift basin (Morocco).
Determination of homogenous regions in the Tensift basin (Morocco).
 
Monsoon Rainfall Forecast+.ppt
Monsoon Rainfall Forecast+.pptMonsoon Rainfall Forecast+.ppt
Monsoon Rainfall Forecast+.ppt
 
APPLICATION OF KRIGING IN GROUND WATER STUDIES
APPLICATION OF KRIGING IN GROUND WATER STUDIESAPPLICATION OF KRIGING IN GROUND WATER STUDIES
APPLICATION OF KRIGING IN GROUND WATER STUDIES
 
Remote sensing in agriculture
Remote sensing in agricultureRemote sensing in agriculture
Remote sensing in agriculture
 
Remote sensing in agriculture
Remote sensing in agricultureRemote sensing in agriculture
Remote sensing in agriculture
 
CSU_Poster
CSU_PosterCSU_Poster
CSU_Poster
 
September 1 - 1116 - Tassia Brighenti and Phillip Gassman
September 1 - 1116 - Tassia Brighenti and Phillip GassmanSeptember 1 - 1116 - Tassia Brighenti and Phillip Gassman
September 1 - 1116 - Tassia Brighenti and Phillip Gassman
 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis Presentation
 
Master's course defense presentation in Water Resource Management and GIS
Master's course defense presentation in Water Resource Management and GIS  Master's course defense presentation in Water Resource Management and GIS
Master's course defense presentation in Water Resource Management and GIS
 

SURF_Poster_REU2016Team3

  • 1. Dimensionality Reduction Using Sliced Inverse Regression In Modeling Large Climate Data UMBC REU Site: Interdisciplinary Program in High Performance Computing Ross Flieger-Allison1 , Lois Miller2 , Danielle Sykes3 , Pablo Valle4 RAs: Sai K. Popuri3 , Nadeesri Wijekoon3 Faculty: Nagaraj K. Neerchal3 Client: Amita Mehta5 1 Williams College 2 DePauw University 3 UMBC 4 Kean University 5 JCET • The Missouri River Basin (MRB) is a significant agricultural region that is not irrigated and thus dependent on rainfall • Precipitation predictions are histor- ically inaccurate due to the semi- continuous nature of observed precip- itation data Figure 1: The Missouri Rivier Basin Background • Precipitation data is semi-continuous, meaning many observations are equal to 0 and all other observations are positive and follow a continuous dis- tribution • Predictions are calculated using ob- served data and data from MIROC5, a Global Climate Model (GCM), across 57 years (1949-2005) from over 270,000 locations • Precipitation at any given location s depends on a large number of co- variates: current and past values of monthly precipitation, sea-level pres- sure, relative humidity, and max- imum/minimum temperatures at s and its neighboring locations Data • Sliced Inverse Regression (SIR) is a data-analytic tool that can be used to reduce the dimensionality of a large set of covariates • Nadaraya-Watson Estimator (NWE) is a simple non-paramentric smooth- ing regression method: ˆmx = n i=1 Kh(x−xi)yi n i=1 Kh(x−xi) • The NWE is adapted to account for the semi-continous nature of predic- tions Methodology • The following figures show a comparison between observed precipitation data, and their predictions Figure 2: Observed and predicted monthly precipitation, including the proportion of 0 values for both Figure 3: Mean Squared Error from positive predictions for months with positive rainfall (true positives) Results Table 1: Performance Study Results Processes Subregion∗ MRB 1 01:14:24 N/A 4 00:19:07 09:54:22 8 00:11:04 05:20:31 16 00:05:58 02:49:49 32 00:03:13 04:25:51 64 N/A 03:41:26 ∗ denotes a region from latitutude -101 to -97 and longitude 39.25 to 42.95 Figure 4: Observed speedup (Sp = (T1/Tp)) of parallelization Parallelization • We have successfully demonstrated that SIR and NWE methods can be implemented to work on a large dataset • Parallelization of SIR and NWE code greatly increases computational effi- ciency on the subregion, and also im- proves efficiency for the entire MRB region, up to 16 processes Conclusions [1] National Integrated Drought Information Systems. https://www.drought.gov/ drought/dews/missouri-river-basin/ about. [2] S. K. Popuri, K. P. Adragni, et al., “Spatio- Temporal Analysis of Daily Precipitation via a Sufficient Dimension Reduction.” (In Progress). [3] Full Technical Report: HPCF-2016-13 hpcf. umbc.edu > Publications REU Site: hpcreu.umbc.edu NSF, NSA, DOD, UMBC, HPCF, CIRC References & Acknowledgments