CyberGIS Center for Advanced Digital and Spatial Studies
A CyberGIS-Jupyter Framework for
Geospatial Analytics at Scale
Dandong Yin [1,2], Yan Liu [1,2,3], Anand Padmanabhan [1,2,3],
Jeff Terstriep [1,3], Johnathan Rush [1,3], and Shaowen Wang [1,2,3]
[1] CyberGIS Center for Advanced Digital and Spatial Studies
[2] Department of Geography and Geographic Information Science
[3] National Center for Supercomputing Applications (NCSA)
University of Illinois at Urbana-Champaign
Practice & Experience in Advanced Research Computing (PEARC17)
July 11, 2017, New Orleans, Louisiana, USA
Background
• Gateways facilitate and accelerate scientific research and education enabled by advanced cyberinfrastructure (CI)
• End-user-oriented development approaches often consume significant resources and time to achieve these goals
• To fully leverage CI, it is necessary not only to lower the barrier to accessing CI via gateways, but also to lower the barrier to developing gateway applications, so that ordinary researchers can efficiently deliver their own applications to their communities
[Figure labels: Advanced Cyberinfrastructure; Gateway (x3)]
End-user-oriented model
• Final products need to be of high quality in order to serve diverse users
• Usually implemented with web-application frameworks
  • Frontend UI design
  • Backend computation
  • Database design
• Costly to develop, optimize, and maintain
• Limited to specific purposes
• Difficult to extend and re-use
[Figure: Topolens: A CyberGIS Gateway Application]
A new gateway paradigm
• In many scientific research scenarios, agility, reproducibility and extensibility are increasingly important
• Community-driven gateway development
• A series of building blocks to be re-organized for new applications
• A public platform for domain scientists to develop, reproduce and share their own analytics at scale
• Common and friendly user interfaces for collaborators, researchers, and gateway developers
[Figure labels: Advanced Cyberinfrastructure; Gateway (x3)]
CyberGIS
• CyberGIS -- geographic information science and systems (GIS) based on advanced CI
• Innovate new-generation GIS
• Focus on computational and data-intensive geospatial problem-solving within various research and education domains
• Bridge gaps between geospatial big data, software and applications through
  • innovative cyberGIS supercomputer -- ROGER
Wang, S. (2010) A CyberGIS Framework for the Synthesis of Cyberinfrastructure, GIS, and Spatial Analysis. Annals of the Association of American Geographers, 100(3): 535-557
Jupyter Notebook
• An interactive scripting interface
• Declarative UI widgets
• Supports general computation and visualization
• Increasingly popular in data and computational sciences
CyberGIS-Jupyter
• An interactive scripting interface
• Declarative UI widgets
• Supports general computation and visualization
• Increasingly popular in data and computational sciences
CyberGIS-Jupyter: Technologies
• Batch job management
• Cloud-based architecture
• Data/storage synchronization
CyberGIS-Jupyter: Innovation
• Geovisualization
  • Interactive map generation inside notebooks
  • Support for multiple formats of geospatial data
  • Layer management, transparency and styles
We found no existing technology that meets all of the criteria above, so we developed a library named Floret for this purpose (shown in the demo).
Features
• Adaptive interfaces for CI access embedded in notebooks
• Programming or GUI at choice
• Manage batch jobs at fingertips
• Cloud-based environment
• Reproducible computing environments with Docker
• Elastic resource-provisioning with OpenStack
• Shared storage
• Big data support
• Fault tolerance
• Collaborative sharing
• Generative geovisualization by programming
• Embedded web-mapping services
• Support multiple geographic data formats
Architecture - 1
• Gateway applications are standardized as packages deployed on hybrid HPC
Architecture - 2
User interface
• Highly extensible
• Predefined UI templates
Authentication and authorization
• PAM with LDAP + SSSD
• Whitelist of authorized users
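The two stages named on this slide can be sketched as follows: authentication (PAM backed by LDAP + SSSD on the slide) decides who the user is, and the whitelist then decides whether that user may proceed. This is a minimal illustration, not the deployment's actual code; the one-username-per-line file format and the function names `load_whitelist` and `may_start_server` are assumptions.

```python
from typing import Iterable, Set


def load_whitelist(lines: Iterable[str]) -> Set[str]:
    """Parse one username per line, ignoring blank lines and '#' comments.

    The file format is an assumption made for this sketch.
    """
    users = set()
    for line in lines:
        name = line.split("#", 1)[0].strip()
        if name:
            users.add(name)
    return users


def may_start_server(username: str, authenticated: bool, whitelist: Set[str]) -> bool:
    """Authorize only users who passed authentication AND are whitelisted."""
    return authenticated and username in whitelist


# Example whitelist content (hypothetical usernames).
whitelist = load_whitelist(["alice", "# instructors", "bob  # TA", ""])
```

Keeping the whitelist check separate from authentication means a valid campus account alone is not enough to consume gateway resources.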
Container management
• JupyterHub with a swarm spawner
• OpenStack VMs join/leave the swarm as needed
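The slide does not describe the policy that decides when VMs join or leave the swarm. One plausible rule, sketched here purely as an assumption, sizes the swarm from the number of running notebook containers; `containers_per_vm`, `min_vms` and `max_vms` are hypothetical tuning parameters.

```python
import math
from typing import Tuple


def target_vm_count(active_containers: int, containers_per_vm: int,
                    min_vms: int = 1, max_vms: int = 10) -> int:
    """Number of VMs needed to host the containers, clamped to [min_vms, max_vms]."""
    if containers_per_vm <= 0:
        raise ValueError("containers_per_vm must be positive")
    needed = math.ceil(active_containers / containers_per_vm)
    return max(min_vms, min(max_vms, needed))


def scaling_action(current_vms: int, active_containers: int, containers_per_vm: int,
                   min_vms: int = 1, max_vms: int = 10) -> Tuple[str, int]:
    """Return ("join", n), ("leave", n) or ("hold", 0) for the swarm."""
    target = target_vm_count(active_containers, containers_per_vm, min_vms, max_vms)
    delta = target - current_vms
    if delta > 0:
        return ("join", delta)
    if delta < 0:
        return ("leave", -delta)
    return ("hold", 0)
```

A real deployment would also have to drain containers from a VM before it leaves the swarm; that step is omitted here.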
Computation management
• Widgets for composing job scripts (PBS files)
• Submit jobs and retrieve job information (via qsub and qstat)
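What a job-composition widget might do behind the scenes can be sketched in a few lines: turn form values into a PBS script, hand it to `qsub`, and poll `qstat`. The function names, parameters and the example command are assumptions for illustration; only the `#PBS` directives and the `qsub`/`qstat` commands themselves come from standard PBS/Torque usage.

```python
import subprocess
from typing import List, Optional


def compose_pbs_script(job_name: str, nodes: int, ppn: int, walltime: str,
                       commands: List[str], queue: Optional[str] = None) -> str:
    """Build the text of a PBS job script from widget-style inputs."""
    lines = [
        "#!/bin/bash",
        f"#PBS -N {job_name}",
        f"#PBS -l nodes={nodes}:ppn={ppn}",
        f"#PBS -l walltime={walltime}",
    ]
    if queue:
        lines.append(f"#PBS -q {queue}")
    lines.append("cd $PBS_O_WORKDIR")  # run from the submission directory
    lines.extend(commands)
    return "\n".join(lines) + "\n"


def submit(script_path: str) -> str:
    """Submit a script with qsub and return the job ID it prints."""
    result = subprocess.run(["qsub", script_path], check=True,
                            capture_output=True, text=True)
    return result.stdout.strip()


def status(job_id: str) -> str:
    """Return the raw qstat output for one job."""
    result = subprocess.run(["qstat", job_id], check=True,
                            capture_output=True, text=True)
    return result.stdout


# Hypothetical job: the executable name is a placeholder.
script = compose_pbs_script("hand", 2, 20, "01:00:00",
                            ["mpirun -np 40 ./hand_workflow"])
```

`submit` and `status` only work on a host where the PBS client tools are installed; composing the script has no such requirement, so it can be previewed in the notebook before submission.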
Data persistence
• GPFS -> NFS -> VM Volume -> Container Volume
• Enables volume expansion, fault tolerance and low-latency feedback
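To make the storage chain concrete, the sketch below traces one file seen inside a container back through each layer. All mount points and the per-user directory layout are hypothetical; the slide names the layers but not their paths.

```python
from pathlib import PurePosixPath
from typing import Dict

# Hypothetical mount points, one per host-side layer named on the slide.
HOST_LAYERS = [
    ("GPFS", PurePosixPath("/gpfs/jupyter/users")),
    ("NFS", PurePosixPath("/export/jupyter/users")),
    ("VM volume", PurePosixPath("/mnt/jupyter/users")),
]
# Hypothetical location of the user's directory inside the container.
CONTAINER_MOUNT = PurePosixPath("/home/jovyan/work")


def trace_path(username: str, container_path: str) -> Dict[str, str]:
    """Map a path inside the container to the same file at every layer.

    Raises ValueError if the path is not under the container mount.
    """
    relative = PurePosixPath(container_path).relative_to(CONTAINER_MOUNT)
    traced = {name: str(root / username / relative) for name, root in HOST_LAYERS}
    traced["Container volume"] = str(CONTAINER_MOUNT / relative)
    return traced


paths = trace_path("alice", "/home/jovyan/work/hand/out.tif")
```

Because every layer exposes the same underlying file, data written in a notebook survives the container and the VM that hosted it.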
Case study
• Height Above Nearest Drainage (HAND) at 10 m for the continental US
  • Flood analysis map derived from the 10 m USGS 3DEP national elevation dataset (180 billion cells) and the National Hydrography Dataset (2.67 million stream reaches)
• Goals
  a. Collaborative methodology development
  b. HPC for data analytics
  c. Deliver methodology and data products to different user communities
    • Collaborators
    • Researchers
    • Decision makers
    • Students
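For readers unfamiliar with HAND, the quantity can be illustrated on a toy grid: each cell's value is its elevation minus the elevation of the stream cell it drains to. The sketch below assumes simple D8 steepest-descent routing in pure Python; the slides do not say which flow-routing method the production workflow uses, and the continental-scale computation relies on HPC rather than code like this.

```python
from typing import List, Optional, Tuple

# The eight neighbours of a cell as (row, column) offsets.
D8_OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]


def steepest_downslope(dem: List[List[float]], r: int, c: int) -> Optional[Tuple[int, int]]:
    """Return the neighbour with the steepest drop, or None if no neighbour is lower."""
    rows, cols = len(dem), len(dem[0])
    best, best_slope = None, 0.0
    for dr, dc in D8_OFFSETS:
        nr, nc = r + dr, c + dc
        if 0 <= nr < rows and 0 <= nc < cols:
            distance = (dr * dr + dc * dc) ** 0.5  # diagonal steps are longer
            slope = (dem[r][c] - dem[nr][nc]) / distance
            if slope > best_slope:
                best, best_slope = (nr, nc), slope
    return best


def hand(dem: List[List[float]], stream: List[List[bool]]) -> List[List[Optional[float]]]:
    """Height of each cell above the stream cell it drains to.

    Cells whose flow path ends in a pit before reaching a stream get None.
    """
    rows, cols = len(dem), len(dem[0])
    result: List[List[Optional[float]]] = [[None] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            cur: Optional[Tuple[int, int]] = (r, c)
            # Each step goes strictly downhill, so the walk always terminates.
            while cur is not None and not stream[cur[0]][cur[1]]:
                cur = steepest_downslope(dem, cur[0], cur[1])
            if cur is not None:
                result[r][c] = dem[r][c] - dem[cur[0]][cur[1]]
    return result


# Toy 3x3 elevation grid sloping toward a single stream cell in the corner.
dem = [[5, 4, 3],
       [4, 3, 2],
       [3, 2, 1]]
stream = [[False, False, False],
          [False, False, False],
          [False, False, True]]
hand_grid = hand(dem, stream)
```

Low HAND values mark cells close to the elevation of their drainage, which is why the map is useful for flood analysis.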
HAND application package
• Input data hosts
  • ROGER GPFS storage
• Docker image
  • GDAL, GEOS, PROJ4, HDF4, SQLite, MPI, etc.
• Sample notebook
  • Explicit computation steps
  • Seamless scaling up
• Output visualization
  • Floret maps
Comparison - 1
Collaboration requirement: Methodology development
Traditional:
● Writeups as shared documents
● Communicated via emails and teleconferences
Enhanced:
● Methodology notebooks
  ○ Math formulas
  ○ Code snippets
  ○ Sample results
Collaboration requirement: Software development
Traditional:
● Source code only; the computing environment needs to be maintained and synchronized manually
Enhanced:
● Function notebooks with both source code and computing environments effectively synchronized between collaborators
Collaboration requirement: Computation
Traditional:
● Conducted by a dedicated person, who becomes a bottleneck
Enhanced:
● Notebook interface to workflow computation on advanced CI
  ○ Everyone can launch
Comparison - 2
Collaboration requirement: Result validation
Traditional:
● Data: direct download or via iRODS
● Validation results: shared document in Google Drive
Enhanced:
● Integrated validation notebooks with reproducible input, statistics, and output
Collaboration requirement: Visualization
Traditional:
● Local: download and use desktop GIS. Not scalable for large outputs.
● Online: Tile Map Service (TMS); web-GIS. Only available for major output data
Enhanced:
● Integrated data, code, and visualization notebooks
  ○ Traditional visualization libraries
  ○ Jupyter IFrame cells
  ○ CyberGIS Floret library
Feedback
• The CyberGIS-Jupyter framework was a major platform supporting the UCGIS Summer School 2017
• It received very positive feedback from summer school attendees and organizers
• According to an on-site survey, over 80% of attendees strongly affirmed the importance of CyberGIS-Jupyter for enhancing cyberGIS-enabled research and education
Conclusions
• A new gateway development framework for facilitating and accelerating CI-enabled research and education
  • Domain scientists can easily adapt and create their own applications
  • Students and collaborators can easily understand, reproduce and extend such gateway capabilities
• Leveraging cutting-edge technologies
  • Jupyter notebooks and associated interactive interfaces
  • Cloud-based elastic architecture
• To boost geospatial analytics and discoveries at desirable computing and community scales
Acknowledgements
• This work is supported in part by the National Science Foundation (NSF) under grant numbers 1047916 and 1443080. The computational work used the NSF-supported ROGER supercomputer (1429699).
• This work is also supported in part by the ECSS program of XSEDE, which is supported by NSF grant number 1053575.
DEMO
https://jupyter.cybergis.tk


Editor's Notes

  1. In this presentation we share our work on using Jupyter as a new type of gateway to cyberinfrastructure and cyberGIS. We deployed JupyterHub on the ROGER supercomputing system, leveraging the OpenStack cloud,
  2. 5 PB storage in container