The document outlines methods for automating the extraction of watershed characteristics from raster datasets for use in regional environmental modeling. It describes developing custom ArcGIS tools that calculate descriptive statistics for multiple raster layers across thousands of watersheds in a batch process, avoiding manual, error-prone operations. A case study demonstrates extracting characteristics from raster datasets representing topography, climate, soil, and land use across 1,466 watersheds. The automated process is estimated to save at least 95% of the labor time compared to manual methods, making regional environmental studies more efficient.
1. Automating regional descriptive statistic computations for environmental modeling Satoshi Hirabayashi Environmental Resources Engineering SUNY College of Environmental Science and Forestry, Syracuse, NY USA
2.
3. Low Streamflow Regional Regression (Kroll et al., 2004): Background. [Chart: % standard error (0 to 700) of regression models built with three variable sets: USGS; USGS and digital; and USGS, digital, and hydrogeology. Entire US, 29 regions, 930 HCDN sites. The digital variables are the focus of this talk.]
I am going to talk a little bit about my past research, titled "Automating regional descriptive statistic computations for environmental modeling."
This is the same chart as in Chuck's talk, showing a comparison of low streamflow regression models constructed with three different sets of explanatory variables. My talk focuses on the digitally derived watershed characteristics.
Low streamflow regression models generally take this form: Q7,10 is the 7-day, 10-year low streamflow statistic, the betas are model parameters, and the X's are watershed characteristics, like topography, climate, and soil information. The models are constructed by first deriving the Xi's from raster datasets using the ArcGIS zonal statistics tool, and then inputting Q7,10 and the potential X's into a statistical software package, SAS. We input a large number of Xi's as potential explanatory variables, and SAS selects the Xi's that best estimate Q7,10.
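The general form described above can be sketched as follows (a hedged reconstruction from the narration; the exact specification in Kroll et al., 2004, for example whether the variables are log-transformed, may differ):

```latex
Q_{7,10} = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \cdots + \beta_p X_p + \varepsilon
```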
This is how the tool works. Here is a watershed layer; each polygon represents a watershed boundary.
Then, we overlay this layer on top of a raster dataset.
The tool takes the cells that fall within each watershed, calculates descriptive statistics of those cell values, and stores the results in a table. In this table, each row represents a watershed boundary, and the columns hold the descriptive statistics for this raster dataset. When you process another raster dataset, ideally the results would be appended to the same table, because eventually we want one table to input to SAS. But the zonal statistics tool can't do that; instead, separate tables are created for the multiple raster datasets.
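The per-watershed computation just described can be sketched in plain NumPy. This is an illustration of what a zonal statistics operation does, not the ArcGIS implementation; the function name, the zone/value arrays, and the nodata convention are all assumptions for the example.

```python
import numpy as np

def zonal_statistics(zones, values, nodata=-9999):
    """Compute descriptive statistics of `values` cells within each zone.

    zones  : 2-D integer array; each cell holds a watershed ID (0 = outside).
    values : 2-D float array of the raster being summarized (same shape).
    Returns {zone_id: {"mean": ..., "std": ..., "min": ..., "max": ...}}.
    """
    stats = {}
    for zone_id in np.unique(zones):
        if zone_id == 0:          # skip cells outside every watershed
            continue
        cells = values[(zones == zone_id) & (values != nodata)]
        stats[zone_id] = {
            "mean": float(cells.mean()),
            "std": float(cells.std()),
            "min": float(cells.min()),
            "max": float(cells.max()),
        }
    return stats

# Two toy watersheds over a 2x4 elevation grid (made-up values)
zones = np.array([[1, 1, 2, 2],
                  [1, 1, 2, 2]])
elev = np.array([[10., 20., 100., 200.],
                 [30., 40., 300., 400.]])
result = zonal_statistics(zones, elev)
print(result[1]["mean"])  # 25.0
print(result[2]["max"])   # 400.0
```

The real tool works on georeferenced rasters, but the core idea is the same: mask the value grid by each watershed's cells, then summarize.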
This is a shortcoming of the zonal statistics tool. So what you need to do is merge these tables, created for multiple raster datasets, into one table. This can be done by just copying and pasting columns, but there is another problem: the columns in these tables all have the same names, like mean or standard deviation, whereas in the merged table the column names should be identifiable per raster dataset, like mean of elevation, standard deviation of precipitation, and so on. So you also need to change the column names. When there are only 10 raster datasets, the tables can be merged manually with relative ease.
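The merge-and-rename step being described can be sketched in pandas. This is an illustrative sketch of the bookkeeping the custom tool automates, not its actual code; the function name and the ELEV/PRECIP table names are hypothetical.

```python
import pandas as pd

def merge_zonal_tables(tables):
    """Merge per-raster zonal statistics tables into one wide table.

    tables : {raster_name: DataFrame indexed by watershed ID, with
              generic columns such as MEAN and STD}.
    Each table's columns are prefixed with its raster name, so MEAN
    becomes ELEV_MEAN, PRECIP_MEAN, etc., then joined on watershed ID.
    """
    renamed = [df.add_prefix(f"{name}_") for name, df in tables.items()]
    merged = renamed[0]
    for df in renamed[1:]:
        merged = merged.join(df)    # join on the shared watershed index
    return merged

# Two per-raster tables with identical generic column names (toy values)
elev = pd.DataFrame({"MEAN": [120.0, 95.0], "STD": [12.0, 8.0]},
                    index=pd.Index([1, 2], name="WATERSHED"))
precip = pd.DataFrame({"MEAN": [1100.0, 900.0], "STD": [50.0, 40.0]},
                      index=pd.Index([1, 2], name="WATERSHED"))
table = merge_zonal_tables({"ELEV": elev, "PRECIP": precip})
print(list(table.columns))  # ['ELEV_MEAN', 'ELEV_STD', 'PRECIP_MEAN', 'PRECIP_STD']
```

Done by hand in a GIS table view, this renaming and pasting is exactly what must be repeated thousands of times.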
But in our studies, we employ many more raster datasets. Here, in my master's thesis, fourteen hundred raster datasets were used, and I had three different watershed layers, each with 35 watersheds, so the number of tables I needed to merge was more than four thousand. In Chuck's talk today, 28 rasters and 112 tables needed to be merged. In my paper here, again fourteen hundred raster tables, and in this paper, 162 tables. So, for the first case, you would need to manually copy and paste columns 4,000 times, and change the column names 4,000 times.
So manual operation is very tedious, time-consuming, and prone to human error. Motivated by these problems, we decided to develop a custom ArcGIS toolset.
Here is the user interface of that tool. Actually, it is just one tool in the GIS toolset we developed, named Arc Watershed Classification. In this toolset, most of the GIS operations for our research are customized and integrated; I will only show this one tool today. Using this window, you specify parameter files and other inputs to the tool. Then press OK, and everything is done automatically.
Here is a case study, in the same study region as Chuck's talk, with 144 watersheds.
We used the HYDRO1k DEM,
slope derived from the DEM,
13 raster datasets from PRISM, representing monthly and yearly precipitation,
12 raster datasets of soil classification from STATSGO,
and land cover from the National Land Cover Dataset.
Using these raster datasets, we ran the developed tool and created a watershed characteristics database. This table can be input to SAS to construct the regression equations.
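Once the merged characteristics table exists, the final regression step can be sketched with ordinary least squares in NumPy (in the study this step was done in SAS with variable selection; the numbers below are made up purely for illustration):

```python
import numpy as np

# Hypothetical merged table: each row is a watershed; the columns are
# two watershed characteristics (e.g. mean elevation, mean precipitation)
# and a response value such as ln(Q7,10). All values are invented.
X = np.array([[120.0, 1100.0],
              [ 95.0,  900.0],
              [140.0, 1250.0],
              [ 80.0,  850.0]])
y = np.array([2.1, 1.4, 2.6, 1.1])   # illustrative response per watershed

# Add an intercept column and solve for the betas by least squares.
A = np.column_stack([np.ones(len(X)), X])
beta, residuals, rank, _ = np.linalg.lstsq(A, y, rcond=None)
print(beta.shape)  # (3,): intercept plus one coefficient per characteristic
```

SAS additionally searches over the large pool of candidate Xi's to pick the best subset; this sketch only shows the fit for one fixed set of characteristics.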
The developed tool saved at least 95% of the manual labor time. The GIS toolset is versatile and can aid a wide variety of environmental studies: the polygons don't need to be watershed boundaries; they can be any boundaries, like states, counties, or towns, and any raster dataset can be processed.