SlideShare a Scribd company logo
1 of 41
Download to read offline
An introduction to GEOSHARE’s
cyberinfrastructure:
exchange and analysis of geo-spatial
data through HUBzero
Nelson Villoria, GEOSHARE and NSF-DIBBS Co-Principal Investigator
Department of Agricultural Economics
Purdue University
GLP 2nd Open Science Meeting, Berlin March 21, 2014
G E O S H A R E
This presentation is based on work
with the following collaborators:
 HongJun Choi (Purdue U.)
 Delphine Derying (University of East Anglia)
 Joshua Elliot (AgMip)
 Thomas Hertel (Purdue U.)
 Erich Huebner (Purdue U.)
 Rajesh Kalyanam (Purdue U.)
 Danny Plouffe (McGill U.)
 Navin Ramankutty (McGill U.)
 Carol Song (Purdue U.)
 Lan Zhao (Purdue U.)
GEOSHARE’s Mission
GEOSHARE is a decentralized network of scientists in many
disciplines which mission is to provide:
 a freely available, global, spatially explicit database on
agriculture, land use, and the environment
 analysis tools for scientists, decision makers, and
development practitioners
GEOSHARE: Vision of a Network
GEOSHARE envisions a vibrant global network:
 contributing to shared cyberinfrastructure
 enhancing capacity for geospatial analysis
 applying geospatial tools to guide decision making
related to food security, land use, environmental
sustainability and poverty reduction
GEOSHARE’s Network Today (Pilot Project)
Cyberinfrastructure for
GEOSPATIAL Analysis
Carol Song, Purdue University
Jawoo Koo, IFPRI
Why a shared cyberinfrastructure
for GEOSHARE?
Different Worlds
Expert groups in
educational and
international institutions
generate
knowledge, data, and tools
Users elsewhere wish to
use knowledge for
decision making, learning,
etc.
Often, knowledge
stays within disciplinary
silos
Crop Models
What Keeps Them Separate?
Commercial Software
is expensive, requires powerful
Hardware, and specialization
SAS ≠ STATA ≠ ARCGis
Most Research
Codes
are written by one
user
for one user
Multidisciplinary
collaborations to
open the silos are
costly
Hostile
HUBzero technology seeks to break
these barriers
G E O S H A R E
Use the Web to
share data,
code, hardware, and
spread the impact
of research efforts
and funds
No additional
Software to install or learn
How to do this?
 a
 HUBzero is a technology developed at Purdue University
with NSF funds for creating dynamic websites for scientific
research and educational activities:
 It’s fully open source
 Whatever can be compiled in Linux, can be run in the Hub
 Rapid deployment of Graphical User Interfaces
 Access to cluster computer resources
Have this worked elsewhere?
 We have been inspired by nanoHUB.org
 nanoHUB.org’s objective is to transfer the knowledge
generated in extensive nanotechnology facilities to
others in academy and industry
 Created by the NSF-funded Network for Computational
Nanotechnology.
 Next slides from Prof. Gerhard Klimeck, nanoHUB.org
Director
The nanoHUB Proposition and
Emerged (and Busted) Myths
Proposition
 Be Developer Friendly
 Be Accessible
 Be User Friendly
Myth:
 Cannot use research codes for
education
 Must write own code to do
research
 Experimentalists cannot use
research codes
 NO End-to-end Science Cloud
Possible
 Building User Interfaces too Difficult
 Must rewrite code for web
deployment
 There is no incentive to share
codes
nanoHUB.org: Developer Collaboration
Network
Each dot is a Developer
Links are tools
Supplier Network
http://nanoHUB.org
Network members in 172 countries
Over 12,000 simulations/year
Over 300,000 users annually
New Registrations
Simulation Users
Tutorial / Lecture Users
GEOSHARE Example 1: Pegasus
Pegasus 1.0 Predicting Ecosystem
Goods And Services Using Scenarios
 Published by/in:
 Deryng, D., W. J. Sacks, C. C. Barford, and N. Ramankutty.
2011. “Simulating the Effects of Climate and Agricultural
Management Practices on Global Crop Yield.” Global
Biogeochemical Cycles 25 (May): 18 PP.
 A global crop model that integrates climate, the effect
of planting dates and cultivar choices, irrigation, and
fertilizer application on crop yield for maize, soybean,
and spring wheat.
 Useful for studying adaptation to climate change.
Inputs when we started:
 Fortran Code with Model Equations:
 Few megabytes of data in NetCDF format to calibrate
the model.
Now: Pegasus 1.0 at
geoshareproject.org
> No code was rewritten!
> No Web developer was needed!
> No additional extensions for the web server
> For next Pegasus, just update the code!
On-the-fly visualization of results
Download
output for
Further
processing
GEOSHARE Example 2: Processing
AgMIP’s Aggregated Data
The AgMIP GRIDded crop modeling
initiative
 Crop Modeling Teams grouped around
 The Global Gridded Crop Model Intercomparison
(GGCMI):
 Fast track phase:
 5 Crop Models
 5 Global General Circulation Models
 5 Representative Concentration Pathways
 CO2 Fertilization and irrigation
 12 Crops
 All the combinations ran from 1971 to 2099
 + 35,000 0.50x0.50 grids with crop yields
The Problem
 Economic modelers use data aggregated to the country
level
 NetCDF and other spatial data delivery formats are
foreign to non-specialists
 Download of large grids is challenging even with good
bandwith
 Storage is challenging for otherwise very good PCs
 How to make the data available to Users
The Solution
 Good all-around R Code:
 Reads data from Globus Online to the HUB
 Use aggregate() function
 Aggregates using mapping from XY coordinate to country,
AEZs, up to the user.
 Calculate summary statistics (mean, etc.) or weighted
averages using user provided weights (production, area,
population, etc.)
 Wrap the R function around a Graphical User interface
The Result
Aggregation Possibilities
Aggregation Possibilities
Visualization
Visualization
Costs and Impacts
 Costs to the developers:
 Write R function
 Test it
 Develop GUI
 Cost to the User:
 None.
 Software Licenses, Hardware and Other Equipment
 None.
 All the processing on-the-fly and on university owned
hardware
 Impacts:
 Anyone interested in getting aggregated climate
shocks for aggregated analysis can do so.
A peak inside the Hub: A Linux
Workspace
If it can be run on Linux, it can be converted on a web application
Graphical User Interfaces
 HUBzero supports its own language, XML based, called
RAPPTURE toolkit
 But also supports Python, Java, etc.
 User’s knowledge and experience set boundaries.
GEOSHARE Example 3: A
workflow for assessing the value
of improved data
Data
Workflows: From data to analysis
 Gridded source data on
land cover & use, water,
poverty and environment
- flows into -
 Data reconciliation
‘models’, e.g. SPAM which
produce usable data - for
use in -
 Biophysical and economic
models (e.g., DSSAT,
IMPACT, GTAP)
Sensitivity of results at final stage
determines value of improving
quality of source data
Source Data
Land Cover
Data Reconciliation
Process
Water
Poverty
Land Use
Environment
DSSAT IMPACT
Other considerations
GEOSHARE is free for all users and
open to all
 Yes, users pay nothing.
 What about developers?
 This is shared space
 The cost of the Hub is around 50,000 per/year
 Working on GEOSHARE governance cost-sharing model for
self sustainability
 But opportunities for economies of scale are huge.
Space is always available!
Recent Funding For Expanding
HUBzero’s Spatial Capabilities
 A National Science Foundation grant
 Data Infrastructure Building Blocks (DIBBs)
program
 GABBs: 1 of 4 implementation awards in 2013
 $4.5M over 4 years
 Started October 1, 2013
 Collaboration with other awards
Sister geospatial hub projects
 Efforts in developing integrated geospatial
data/modeling capabilities using HUBzero
 Drinet hub (http://drinet.hubzero.org)
 Geoshare hub (http://geoshareproject.org)
 Water hub (http://water-hub.org)
 Useful to Useable (u2u)
http://drinet.hubzero.org/groups/u2u
Overarching goals
 Enabling geospatial modeling and analysis online
 Anyone can create an online geospatial app and
share
 Further support for geospatial data
 Building blocks can be used by other projects
Building for self service (DIY) – Leverage successful
software – Develop building blocks
 Questions
 Feedback?
 Thanks!
 nvillori@purdue.edu

More Related Content

Similar to Introduction to GEOSHARE's Cyberinfrastructure

UK e-Infrastructure: Widening Access, Increasing Participation
UK e-Infrastructure: Widening Access, Increasing ParticipationUK e-Infrastructure: Widening Access, Increasing Participation
UK e-Infrastructure: Widening Access, Increasing ParticipationNeil Chue Hong
 
WSSSPE: Building communities
WSSSPE: Building communitiesWSSSPE: Building communities
WSSSPE: Building communitiesKaren Cranston
 
Building the Next Generation Earth System Grid Federation (ESGF2)
Building the Next Generation Earth System Grid Federation (ESGF2)Building the Next Generation Earth System Grid Federation (ESGF2)
Building the Next Generation Earth System Grid Federation (ESGF2)Globus
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudOla Spjuth
 
Cloud processing close to the Earth data
Cloud processing close to the Earth dataCloud processing close to the Earth data
Cloud processing close to the Earth dataterradue
 
Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...Gergely Sipos
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows Carole Goble
 
Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Peter Löwe
 
Dahlquist bosc 20160709
Dahlquist bosc 20160709Dahlquist bosc 20160709
Dahlquist bosc 20160709GRNsight
 
Paralyzing Bioinformatics Applications Using Conducive Hadoop Cluster
Paralyzing Bioinformatics Applications Using Conducive Hadoop ClusterParalyzing Bioinformatics Applications Using Conducive Hadoop Cluster
Paralyzing Bioinformatics Applications Using Conducive Hadoop ClusterIOSR Journals
 
The BlueBRIDGE approach to collaborative research
The BlueBRIDGE approach to collaborative researchThe BlueBRIDGE approach to collaborative research
The BlueBRIDGE approach to collaborative researchBlue BRIDGE
 
Application of Distributed processing and Big data in agricultural DSS
Application of Distributed processing and Big data in agricultural DSSApplication of Distributed processing and Big data in agricultural DSS
Application of Distributed processing and Big data in agricultural DSSAnusha Basavaraj
 
National scale research computing and beyond pearc panel 2017
National scale research computing and beyond   pearc panel 2017National scale research computing and beyond   pearc panel 2017
National scale research computing and beyond pearc panel 2017Gregory Newby
 
Bridging the gap to facilitate selection and image analysis activities for la...
Bridging the gap to facilitate selection and image analysis activities for la...Bridging the gap to facilitate selection and image analysis activities for la...
Bridging the gap to facilitate selection and image analysis activities for la...Phidias
 
Tds — big science dec 2021
Tds — big science dec 2021Tds — big science dec 2021
Tds — big science dec 2021Gérard Dupont
 

Similar to Introduction to GEOSHARE's Cyberinfrastructure (20)

UK e-Infrastructure: Widening Access, Increasing Participation
UK e-Infrastructure: Widening Access, Increasing ParticipationUK e-Infrastructure: Widening Access, Increasing Participation
UK e-Infrastructure: Widening Access, Increasing Participation
 
WSSSPE: Building communities
WSSSPE: Building communitiesWSSSPE: Building communities
WSSSPE: Building communities
 
Lesson1 esa summer_school_brovelli
Lesson1 esa summer_school_brovelliLesson1 esa summer_school_brovelli
Lesson1 esa summer_school_brovelli
 
Building the Next Generation Earth System Grid Federation (ESGF2)
Building the Next Generation Earth System Grid Federation (ESGF2)Building the Next Generation Earth System Grid Federation (ESGF2)
Building the Next Generation Earth System Grid Federation (ESGF2)
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
C cerin piv2017_c
C cerin piv2017_cC cerin piv2017_c
C cerin piv2017_c
 
Conversatorio: estado de las National Research and Education Networks (NREN)...
 Conversatorio: estado de las National Research and Education Networks (NREN)... Conversatorio: estado de las National Research and Education Networks (NREN)...
Conversatorio: estado de las National Research and Education Networks (NREN)...
 
Cloud processing close to the Earth data
Cloud processing close to the Earth dataCloud processing close to the Earth data
Cloud processing close to the Earth data
 
Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...
 
Dahlquist bosc 20160709
Dahlquist bosc 20160709Dahlquist bosc 20160709
Dahlquist bosc 20160709
 
Paralyzing Bioinformatics Applications Using Conducive Hadoop Cluster
Paralyzing Bioinformatics Applications Using Conducive Hadoop ClusterParalyzing Bioinformatics Applications Using Conducive Hadoop Cluster
Paralyzing Bioinformatics Applications Using Conducive Hadoop Cluster
 
The BlueBRIDGE approach to collaborative research
The BlueBRIDGE approach to collaborative researchThe BlueBRIDGE approach to collaborative research
The BlueBRIDGE approach to collaborative research
 
Session 33 - Production Grids
Session 33 - Production GridsSession 33 - Production Grids
Session 33 - Production Grids
 
Application of Distributed processing and Big data in agricultural DSS
Application of Distributed processing and Big data in agricultural DSSApplication of Distributed processing and Big data in agricultural DSS
Application of Distributed processing and Big data in agricultural DSS
 
National scale research computing and beyond pearc panel 2017
National scale research computing and beyond   pearc panel 2017National scale research computing and beyond   pearc panel 2017
National scale research computing and beyond pearc panel 2017
 
Bridging the gap to facilitate selection and image analysis activities for la...
Bridging the gap to facilitate selection and image analysis activities for la...Bridging the gap to facilitate selection and image analysis activities for la...
Bridging the gap to facilitate selection and image analysis activities for la...
 
Tds — big science dec 2021
Tds — big science dec 2021Tds — big science dec 2021
Tds — big science dec 2021
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 

Recently uploaded

Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAbhinavSharma374939
 
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...Call Girls in Nagpur High Profile
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 

Recently uploaded (20)

Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog Converter
 
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 

Introduction to GEOSHARE's Cyberinfrastructure

  • 1. An introduction to GEOSHARE’s cyberinfrastructure: exchange and analysis of geo-spatial data through HUBzero Nelson Villoria, GEOSHARE and NSF-DIBBS Co-Principal Investigator Department of Agricultural Economics Purdue University GLP 2nd Open Science Meeting, Berlin March 21, 2014 G E O S H A R E
  • 2. This presentation is based on work with the following collaborators:  HongJun Choi (Purdue U.)  Delphine Derying (University of East Anglia)  Joshua Elliot (AgMip)  Thomas Hertel (Purdue U.)  Erich Huebner (Purdue U.)  Rajesh Kalyanam (Purdue U.)  Danny Plouffe (McGill U.)  Navin Ramankutty (McGill U.)  Carol Song (Purdue U.)  Lan Zhao (Purdue U.)
  • 3. GEOSHARE’s Mission GEOSHARE is a decentralized network of scientists in many disciplines which mission is to provide:  a freely available, global, spatially explicit database on agriculture, land use, and the environment  analysis tools for scientists, decision makers, and development practitioners
  • 4. GEOSHARE: Vision of a Network GEOSHARE envisions a vibrant global network:  contributing to shared cyberinfrastructure  enhancing capacity for geospatial analysis  applying geospatial tools to guide decision making related to food security, land use, environmental sustainability and poverty reduction
  • 5. GEOSHARE’s Network Today (Pilot Project) Cyberinfrastructure for GEOSPATIAL Analysis Carol Song, Purdue University Jawoo Koo, IFPRI
  • 6. Why a shared cyberinfrastructure for GEOSHARE?
  • 7. Different Worlds Expert groups in educational and international institutions generate knowledge, data, and tools Users elsewhere wish to use knowledge for decision making, learning, etc. Often, knowledge stays within disciplinary silos Crop Models
  • 8. What Keeps Them Separate? Commercial Software is expensive, requires powerful Hardware, and specialization SAS ≠ STATA ≠ ARCGis Most Research Codes are written by one user for one user Multidisciplinary collaborations to open the silos are costly Hostile
  • 9. HUBzero technology seeks to break these barriers G E O S H A R E Use the Web to share data, code, hardware, and spread the impact of research efforts and funds No additional Software to install or learn
  • 10. How to do this?  a  HUBzero is a technology developed at Purdue University with NSF funds for creating dynamic websites for scientific research and educational activities:  It’s fully open source  Whatever can be compiled in Linux, can be run in the Hub  Rapid deployment of Graphical User Interfaces  Access to cluster computer resources
  • 11. Have this worked elsewhere?  We have been inspired by nanoHUB.org  nanoHUB.org’s objective is to transfer the knowledge generated in extensive nanotechnology facilities to others in academy and industry  Created by the NSF-funded Network for Computational Nanotechnology.  Next slides from Prof. Gerhard Klimeck, nanoHUB.org Director
  • 12. The nanoHUB Proposition and Emerged (and Busted) Myths Proposition  Be Developer Friendly  Be Accessible  Be User Friendly Myth:  Cannot use research codes for education  Must write own code to do research  Experimentalists cannot use research codes  NO End-to-end Science Cloud Possible  Building User Interfaces too Difficult  Must rewrite code for web deployment  There is no incentive to share codes
  • 13. nanoHUB.org: Developer Collaboration Network Each dot is a Developer Links are tools Supplier Network
  • 14. http://nanoHUB.org Network members in 172 countries Over 12,000 simulations/year Over 300,000 users annually New Registrations Simulation Users Tutorial / Lecture Users
  • 16. Pegasus 1.0 Predicting Ecosystem Goods And Services Using Scenarios  Published by/in:  Deryng, D., W. J. Sacks, C. C. Barford, and N. Ramankutty. 2011. “Simulating the Effects of Climate and Agricultural Management Practices on Global Crop Yield.” Global Biogeochemical Cycles 25 (May): 18 PP.  A global crop model that integrates climate, the effect of planting dates and cultivar choices, irrigation, and fertilizer application on crop yield for maize, soybean, and spring wheat.  Useful for studying adaptation to climate change.
  • 17. Inputs when we started:  Fortran Code with Model Equations:  Few megabytes of data in NetCDF format to calibrate the model.
  • 18. Now: Pegasus 1.0 at geoshareproject.org > No code was rewritten! > No Web developer was needed! > No additional extensions for the web server > For next Pegasus, just update the code!
  • 19. On-the-fly visualization of results Download output for Further processing
  • 20. GEOSHARE Example 2: Processing AgMIP’s Aggregated Data
  • 21. The AgMIP GRIDded crop modeling initiative  Crop Modeling Teams grouped around  The Global Gridded Crop Model Intercomparison (GGCMI):  Fast track phase:  5 Crop Models  5 Global General Circulation Models  5 Representative Concentration Pathways  CO2 Fertilization and irrigation  12 Crops  All the combinations ran from 1971 to 2099  + 35,000 0.50x0.50 grids with crop yields
  • 22. The Problem  Economic modelers use data aggregated to the country level  NetCDF and other spatial data delivery formats are foreign to non-specialists  Download of large grids is challenging even with good bandwith  Storage is challenging for otherwise very good PCs  How to make the data available to Users
  • 23. The Solution  Good all-around R Code:  Reads data from Globus Online to the HUB  Use aggregate() function  Aggregates using mapping from XY coordinate to country, AEZs, up to the user.  Calculate summary statistics (mean, etc.) or weighted averages using user provided weights (production, area, population, etc.)  Wrap the R function around a Graphical User interface
  • 25.
  • 30. Costs and Impacts  Costs to the developers:  Write R function  Test it  Develop GUI  Cost to the User:  None.  Software Licenses, Hardware and Other Equipment  None.  All the processing on-the-fly and on university owned hardware  Impacts:  Anyone interested in getting aggregated climate shocks for aggregated analysis can do so.
  • 31. A peak inside the Hub: A Linux Workspace If it can be run on Linux, it can be converted on a web application
  • 32. Graphical User Interfaces  HUBzero supports its own language, XML based, called RAPPTURE toolkit  But also supports Python, Java, etc.  User’s knowledge and experience set boundaries.
  • 33.
  • 34. GEOSHARE Example 3: A workflow for assessing the value of improved data
  • 35. Data Workflows: From data to analysis  Gridded source data on land cover & use, water, poverty and environment - flows into -  Data reconciliation ‘models’, e.g. SPAM which produce usable data - for use in -  Biophysical and economic models (e.g., DSSAT, IMPACT, GTAP) Sensitivity of results at final stage determines value of improving quality of source data Source Data Land Cover Data Reconciliation Process Water Poverty Land Use Environment DSSAT IMPACT
  • 37. GEOSHARE is free for all users and open to all  Yes, users pay nothing.  What about developers?  This is shared space  The cost of the Hub is around 50,000 per/year  Working on GEOSHARE governance cost-sharing model for self sustainability  But opportunities for economies of scale are huge. Space is always available!
  • 38. Recent Funding For Expanding HUBzero’s Spatial Capabilities  A National Science Foundation grant  Data Infrastructure Building Blocks (DIBBs) program  GABBs: 1 of 4 implementation awards in 2013  $4.5M over 4 years  Started October 1, 2013  Collaboration with other awards
  • 39. Sister geospatial hub projects  Efforts in developing integrated geospatial data/modeling capabilities using HUBzero  Drinet hub (http://drinet.hubzero.org)  Geoshare hub (http://geoshareproject.org)  Water hub (http://water-hub.org)  Useful to Useable (u2u) http://drinet.hubzero.org/groups/u2u
  • 40. Overarching goals  Enabling geospatial modeling and analysis online  Anyone can create an online geospatial app and share  Further support for geospatial data  Building blocks can be used by other projects Building for self service (DIY) – Leverage successful software – Develop building blocks
  • 41.  Questions  Feedback?  Thanks!  nvillori@purdue.edu