This document discusses using cloud computing to enable data-intensive hydrologic modeling. The goals are to rapidly prototype watershed models anywhere using large national datasets, perform real-time forecasting and analysis, and make models and software accessible via web services. Challenges include the computational intensity of modeling large datasets and distributing complex workflows. The proposed strategy is to develop a PIHM (Penn State Integrated Hydrologic Model) cloud prototype that distributes model components and workflows over cloud resources for research and education. This would allow processing large datasets, calibrating models by running hundreds of simulations in parallel, and making models and results accessible online.
3. Issues
• Data- and computationally intensive!
• 100s of terabytes of Essential Terrestrial Variable (ETV) data to model watersheds anywhere in the USA
• 1000s of terabytes of data to model watersheds around the world
• Federal data servers are slow
• No central data store for our ETV data needs
• Complex workflows to automate data processing and model development
• Computation requirements vary per project
• IT is expensive! We are focused on research only
5. Our definition of “Cloud”
• Dynamically scalable (virtualized) resources, from desktop and HPC cluster to NCSA Blue Waters and grid
• Resources are provided as web-based services (data, software)
• Data-intensive and parallel computing
• A private-cloud-to-private-cloud conduit between PSU and NCSA for hydrologic research
• This is a prototype!
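As a concrete illustration of "resources provided as web-based services," the sketch below builds a query for a data service and parses a CSV time-series response. The endpoint URL, parameter names, and response format are illustrative assumptions, not an actual PIHM cloud API:

```python
# Sketch of consuming a hydrologic data web service.
# BASE_URL and the query parameters are hypothetical, for illustration only.
from urllib.parse import urlencode

BASE_URL = "http://example.org/pihm/data"  # hypothetical data service

def build_query(watershed_id, variable, start, end):
    """Build a REST query URL for one watershed time series."""
    params = {"watershed": watershed_id, "var": variable,
              "start": start, "end": end}
    return BASE_URL + "?" + urlencode(params)

def parse_series(csv_text):
    """Parse a 'time,value' CSV response into (timestamp, float) pairs."""
    series = []
    for line in csv_text.strip().splitlines()[1:]:  # skip header row
        ts, val = line.split(",")
        series.append((ts, float(val)))
    return series

url = build_query("WE-38", "precip_mm", "2009-01-01", "2009-01-31")
sample = "time,value\n2009-01-01,4.2\n2009-01-02,0.0\n"
print(url)
print(parse_series(sample))
```

In a real deployment the response would come from an HTTP GET against the service; the point is that data access reduces to constructing a URL and parsing a structured payload, which any client on the grid can do.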
12. PIHM Cloud Re-Analysis and Forecast
• With NCSA we are developing a PIHM cloud prototype to distribute the PIHM web-service workflow and model components over the cloud for research and education.
• Calibrate models: spawn 100s of dataflow executions across the parameter space to process, compute, analyze, and visualize the transformed results.
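The calibration step above amounts to a parallel parameter sweep: evaluate many candidate parameter sets concurrently and keep the best-scoring one. A minimal sketch follows; `run_simulation` is a stand-in stub, and the parameter names are assumptions, not the real PIHM model interface:

```python
# Sketch of a parallel calibration sweep over a parameter grid.
# run_simulation is a placeholder that would normally launch one PIHM run
# and return a misfit score against observations.
from multiprocessing import Pool
from itertools import product

def run_simulation(params):
    """Placeholder for one model run; returns a mock misfit score."""
    conductivity, porosity = params
    return abs(conductivity - 0.3) + abs(porosity - 0.45)

def calibrate(grid, workers=4):
    """Evaluate every parameter combination in parallel; return (score, params)."""
    combos = list(product(*grid))
    with Pool(workers) as pool:
        scores = pool.map(run_simulation, combos)
    return min(zip(scores, combos))

if __name__ == "__main__":
    grid = [[0.1, 0.2, 0.3, 0.4],   # candidate hydraulic conductivities
            [0.35, 0.45, 0.55]]     # candidate porosities
    print(calibrate(grid))
```

On a cloud back end the `Pool` would be replaced by the workflow engine dispatching each combination to its own node, which is what makes running 100s of simulations at once practical.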
27. ArcPIHM
• PIHM will soon be available as a toolbox for ESRI users
• Development plans include protocols to encourage further modularity, so other developers can plug their own code (e.g., other physics engines, datasets) into the PIHM workflow
• Consume CUAHSI HydroServer and HydroGML resources
30. Conclusion
• Data- and computationally intensive watershed simulations!
• 1000s of terabytes of data required to model any watershed in the USA
• Workflows to automate data processing and distribute the computation on the cloud
• What is needed is fast access to data centers that are close to HPC resources
31. Thank you for listening
Visit http://www.pihm.psu.edu for more information and updates.