Automating regional descriptive statistic computations for environmental modeling
Satoshi Hirabayashi
Environmental Resources Engineering
SUNY College of Environmental Science and Forestry, Syracuse, NY USA
Outline
Low Streamflow Regional Regression (Kroll et al., 2004) Background
[Chart: % standard error (axis 0 to 700) of regression models built with three sets of explanatory variables: USGS; USGS and Digital; USGS, Digital, and Hydrogeology. Entire US, 29 regions, 930 HCDN sites. Callout: Focus!]
Low Streamflow Regional Regression Background
Q7,10 : 7-day 10-year streamflow statistic
βi : Model parameter to be estimated
Xi : Watershed characteristics
Model Construction Process
Zonal Statistics Tool Background
Problems with Zonal Statistics Tool Introduction to Problems SAS
Problems with Multiple Raster Datasets Introduction to Problems
Research                       # of Raster Datasets   # of Watersheds (# of Layers)   # of Tables
Hirabayashi, 2005              1,466                  35 (3)                          4,398
Kroll, 2007                    28                     144 (4)                         112
Hirabayashi and Kroll, 2007    1,466                  35 (1)                          1,466
Hirabayashi and Kroll, 2008    54                     106 (3)                         162
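
The table counts on this slide are consistent with one output table per raster dataset per watershed layer. That relationship is inferred from the numbers, not stated on the slide, so the short check below is only a sketch under that assumption. The function name `table_count` is hypothetical.

```python
# (study, number of raster datasets, number of watershed layers), from the slide
studies = [
    ("Hirabayashi, 2005", 1466, 3),
    ("Kroll, 2007", 28, 4),
    ("Hirabayashi and Kroll, 2007", 1466, 1),
    ("Hirabayashi and Kroll, 2008", 54, 3),
]


def table_count(n_rasters, n_layers):
    # Assumption: one zonal statistics run, and so one table,
    # per raster dataset per watershed layer.
    return n_rasters * n_layers


counts = {name: table_count(n_rasters, n_layers)
          for name, n_rasters, n_layers in studies}

for name, n_tables in counts.items():
    print(f"{name}: {n_tables} tables")
```

Each table produced this way still has to be merged by hand, which is why the counts matter.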
Problems in Manual Operations Introduction to Problems
Develop a custom ArcGIS toolset.
Automated Explanatory Variables Extraction Methods
[Diagram: the developed tool, with the steps Batch Descriptive Statistic Calculation and Batch Output Table Creation. Labels: Watershed Boundaries; Weather, Soil, Elevation, etc.; Parameter File (twice); Log File; Output Table. File formats shown: ESRI GRID/TIFF/IMAGINE raster; geodatabase/shapefile; ASCII text file; dBASE table.]
User Interface (ArcWC) Methods
Watershed Characteristic Extraction Case Study
Raster Dataset    # Data
Hydro1K DEM       1
Slope             1
PRISM             13
STATSGO           12
NLCD              1
Watershed characteristics database
Conclusions
Questions?
Gauging Site Relocation Methods
Unnested Watershed Identification Methods
Batch STATSGO Processing Methods


Editor's Notes

  1. I am going to talk a little bit about my past research, titled "Automating regional descriptive statistic computations for environmental modeling."
  2. This is the same chart as in Chuck's talk, showing a comparison of low streamflow regression models constructed with three different sets of explanatory variables. My talk focuses on these digitally derived watershed characteristics.
  3. Low streamflow regression models generally take this form. Q7,10 is a 7-day 10-year streamflow statistic, the betas are model parameters, and the X's are watershed characteristics, such as topography, climate, and soil information. The models are constructed by first deriving the Xi's from raster datasets using the ArcGIS zonal statistics tool, and then inputting Q7,10 and the potential X's into statistical software, SAS. We input a large number of Xi's as potential explanatory variables, and SAS picks the Xi's that best estimate Q7,10.
  4. This is how this tool works. Here is a watershed layer; each polygon here represents a watershed boundary.
  5. Then, overlay this layer on top of a raster dataset.
  6. The tool takes the cells that fall within each watershed, calculates descriptive statistics of these cell values, and stores the results in a table. In this table, each row represents a watershed boundary, and the columns hold the descriptive statistics for this raster dataset. When you process another raster dataset, ideally the results would be appended to the same table, because eventually we want one table to input to SAS. But the zonal statistics tool can't do that. Instead, a separate table is created for each raster dataset.
  7. This is a problem with the zonal statistics tool. What you need to do is merge the tables created for the multiple raster datasets into one table. This can be done by just copying and pasting columns, but there is another problem. Columns in these tables have the same names, such as mean or standard deviation, but in the merged table the column names should identify each raster dataset, like mean of elevation, standard deviation of precipitation, and so on. So you also need to change the column names. When there are only 10 raster datasets, those tables can be merged manually relatively easily.
  8. But in our studies, we employ many more raster datasets. Here, in my master's thesis, more than fourteen hundred raster datasets were used, and I had three different watershed layers, each with 35 watersheds, so the number of tables I needed to merge was more than four thousand. In Chuck's talk today, 28 raster datasets were used and 112 tables needed to be merged. In my paper here, again more than fourteen hundred raster datasets and as many tables, and in this paper, 162 tables. So, for the first one, you need to manually copy and paste columns more than 4,000 times, and change the column names more than 4,000 times.
  9. So manual operation is very tedious, time-consuming, and prone to human error. Motivated by these problems, we decided to develop a custom ArcGIS toolset.
  10. Here is the user interface of that tool. Actually, that tool is just one tool in the GIS toolset we developed, named Arc Watershed Classification (ArcWC). In this toolset, most of the GIS operations for our research are customized and integrated. I only show this one tool today. Using this window, you can specify parameter files and other inputs to the tool. Then press OK, and everything is done automatically.
  11. Here is a case study. It covers the same study region as Chuck's talk, with 144 watersheds.
  12. We used the Hydro1K DEM,
  13. slope derived from the DEM,
  14. 13 raster datasets from a dataset called PRISM, representing monthly and yearly precipitation,
  15. 12 raster datasets of soil classification from a dataset called STATSGO,
  16. and land cover from the National Land Cover Dataset (NLCD).
  17. With these raster datasets, we used the developed tool to create a watershed characteristics database. This table can be input to SAS to construct regression equations.
  18. The developed tool saved at least 95% of the manual labor time. The GIS toolset is versatile and can aid in a wide variety of environmental studies: the polygons don't need to be watershed boundaries but can be any boundaries, such as state, county, or town, and any raster dataset can be processed.
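
The workflow in notes 4 to 7 (summarize the cells inside each watershed, then merge the per-raster results into one table with identifiable column names) can be sketched in plain Python. This is a minimal illustration, not the ArcWC implementation and not the ArcGIS API. The small grids, the raster names `ELEV` and `PRECIP`, the function names, and the choice of population standard deviation are all assumptions made for the example.

```python
from statistics import mean, pstdev


def zonal_stats(zones, raster):
    """Group raster cell values by zone ID and summarize each group."""
    cells = {}
    for zone_row, value_row in zip(zones, raster):
        for zone_id, value in zip(zone_row, value_row):
            if zone_id is None:  # cell lies outside every polygon
                continue
            cells.setdefault(zone_id, []).append(value)
    return {
        zone_id: {"MEAN": mean(values), "STD": pstdev(values)}
        for zone_id, values in cells.items()
    }


def build_table(zones, rasters):
    """Run zonal_stats for every raster and merge the results into one table.

    Column names are prefixed with the raster name so that, for example,
    the mean of elevation and the mean of precipitation stay distinct.
    """
    table = {}
    for name, raster in rasters.items():
        for zone_id, stats in zonal_stats(zones, raster).items():
            row = table.setdefault(zone_id, {})
            for stat_name, value in stats.items():
                row[f"{name}_{stat_name}"] = value
    return table


# Hypothetical 3x4 grids: zone IDs (None = no watershed) and two rasters.
zones = [
    [1, 1, 2, 2],
    [1, 1, 2, 2],
    [None, 3, 3, 3],
]
rasters = {
    "ELEV": [[100, 110, 200, 210], [120, 130, 220, 230], [0, 300, 310, 320]],
    "PRECIP": [[10, 10, 20, 20], [10, 10, 20, 20], [0, 30, 30, 30]],
}

table = build_table(zones, rasters)
for zone_id in sorted(table):
    print(zone_id, table[zone_id])
```

Each row of `table` corresponds to one zone and each column to one statistic of one raster, which is the single-table layout the notes describe as the input for SAS. Because the zones are just IDs on a grid, the same sketch applies to any boundaries, not only watersheds.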