SlideShare a Scribd company logo
Crescat scientia; vita excolatur
A National Discovery Cloud
Ian Foster
The University of Chicago
Argonne National Laboratory
https://cra.org/ccc/wp-content/uploads/sites/2/2021/04/CCC-Whitepaper-National-Discovery-Cloud-2021.pdf
foster@uchicago.edu, @ianfoster
Tools for augmenting human intellect: 1962
“By ‘augmenting human intellect’
we mean increasing the capability
of a [person] to approach a
complex problem situation, to
gain comprehension to suit [their]
particular needs, and to derive
solutions to problems.” *
* Doug Engelbart, 1962 -- https://www.dougengelbart.org/content/view/138/
Tools for augmenting human intellect: 1962
“By ‘augmenting human intellect’
we mean increasing the capability
of a [person] to approach a
complex problem situation, to
gain comprehension to suit [their]
particular needs, and to derive
solutions to problems.” *
* Doug Engelbart, 1962 -- https://www.dougengelbart.org/content/view/138/
+ https://www.theregister.com/2008/12/11/engelbart_celebration/
"I don't get it - everything you've
shown me today I can do on my
ASR-33.” – prominent prof, as
reported by Andries Van Dam +
Numerical simulation
A scientific method on par
with, and sometimes
exceeding, experiment
Public cloud
A new computing platform
enabling new approaches to
building, delivering services
2022: Three transformative technologies
Sensors, data, ML
Powerful methods for
generating, and extracting
information from, huge data
x x
Sources: servecentric.com, visibleearth.nasa.gov, quantamagazine.org, e3sm.org
Challenge and opportunity in 2022:
Create new tools for augmenting human intellect
• Curated collections of observational, experimental,
and simulated data, plus derived ML models
• A global knowledge graph linking publications,
data, models—updated by computational agents
• Digital twins of complex systems, running on
powerful computers, plus ML surrogates
• Rich set of science services, with infrastructure to
simplify operations and incentives to sustain
See DOI: 10.1126/science.1110411, 2005, substituting “cloud” for “grid”
Challenge and opportunity in 2022:
Create new tools for augmenting human intellect
• Curated collections of observational, experimental,
and simulated data, plus derived ML models
• A global knowledge graph linking publications,
data, models—updated by computational agents
• Digital twins of complex systems, running on
powerful computers, plus ML surrogates
• Rich set of science services, with infrastructure to
simplify operations and incentives to sustain
See DOI: 10.1126/science.1110411, 2005, substituting “cloud” for “grid”
Such tools are
needed across all
of research and
education
The CS community
should be:
• leading design
• creating tools for
CS-specific needs
Operated by UChicago for researchers worldwide
Made possible by the support of 150+ subscribers globus.org
Science services
Hosted
on
Heatmap and clustering of the
occurrence of Corynebacteria
in study mgp128
doi: 10.1093/bib/bbx105
ttps://www.mg-rast.org
A National Discovery Cloud requires new capabilities
• The definition, creation, and curation of large reference datasets to fuel new
data-driven models of the natural world, economy, human physiology, healthcare
system, manufacturing processes, etc.
• A discovery cloud platform to enable the collaborative development of value-
added services that support NDC-powered scholarship and education
• New educational programs and curricula to prepare a generation for whom
programming and using NDC capabilities is second nature
• Substantial computing, storage, and network resources to host and compute
over enormous datasets, and to host and operate discovery cloud services that
enhance the value of datasets
• Innovative integrations of NDC capabilities with high-performance computers,
automated laboratories, and other elements of a 21st century discovery and
innovation ecosystem
• Privacy and security designed in from the beginning, rather than added post
facto, and with integrated assurances and audit capabilities so that the NDC
advances rather than hinders computing in the public interest
Open issues and challenges include …
• Weaving diverse capabilities just listed into a coherent whole that US
R&D enterprise can harness for discovery, innovation, and workforce
• Balancing needs for persistent resources to support R&E communities
vs. supporting innovation by those communities
• Enabling research at lower levels of the ‘stack’ (Touch’s Law: The
lowest level at which research is permitted in a testbed is also the
highest level at which it can occur)
• Privacy and security: Balancing “free and open” vs. “private and
secure” in data and services
• Building an NDC that contributes to environmental sustainability
• Appropriate balance between bespoke and private sector data centers
Summary: Let’s not underestimate public cloud
• An elastic source of computing and storage capacity – sure
• A cheap source of computing and storage capacity – maybe/not?
• A new technology to study and engineer – yes
• An immensely powerful platform for delivering scalable, reliable, and
democratizing digital services – absolutely!
• Our opportunity and challenge is a top-to-bottom rethink of what
computing means for research and education

More Related Content

What's hot

The Academic and R&D Sectors' Current and Future Broadband and Fiber Access N...
The Academic and R&D Sectors' Current and Future Broadband and Fiber Access N...The Academic and R&D Sectors' Current and Future Broadband and Fiber Access N...
The Academic and R&D Sectors' Current and Future Broadband and Fiber Access N...
Larry Smarr
 
An asynchronous and task-based implementation of peridynamics utilizing HPX—t...
An asynchronous and task-based implementation of peridynamics utilizing HPX—t...An asynchronous and task-based implementation of peridynamics utilizing HPX—t...
An asynchronous and task-based implementation of peridynamics utilizing HPX—t...
Patrick Diehl
 
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
Larry Smarr
 
Porting our astrophysics application to Arm64FX and adding Arm64FX support us...
Porting our astrophysics application to Arm64FX and adding Arm64FX support us...Porting our astrophysics application to Arm64FX and adding Arm64FX support us...
Porting our astrophysics application to Arm64FX and adding Arm64FX support us...
Patrick Diehl
 
Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Pro...
Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Pro...Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Pro...
Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Pro...
Larry Smarr
 
Calit2-a Persistent UCSD/UCI Framework for Collaboration
Calit2-a Persistent UCSD/UCI Framework for CollaborationCalit2-a Persistent UCSD/UCI Framework for Collaboration
Calit2-a Persistent UCSD/UCI Framework for Collaboration
Larry Smarr
 
Bioinformatics Data Pipelines built by CSIRO on AWS
Bioinformatics Data Pipelines built by CSIRO on AWSBioinformatics Data Pipelines built by CSIRO on AWS
Bioinformatics Data Pipelines built by CSIRO on AWS
Lynn Langit
 
Ceoa Nov 2005 Final Small
Ceoa Nov 2005 Final SmallCeoa Nov 2005 Final Small
Ceoa Nov 2005 Final Small
Larry Smarr
 
Slide 1
Slide 1Slide 1
Slide 1butest
 
Security Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformSecurity Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research Platform
Larry Smarr
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
Ian Foster
 
OptIPuter: From the End User Lab to Global Digital Assets
OptIPuter: From the End User Lab to Global Digital AssetsOptIPuter: From the End User Lab to Global Digital Assets
OptIPuter: From the End User Lab to Global Digital Assets
Larry Smarr
 
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
Larry Smarr
 
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained WorldInternet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Larry Smarr
 
High Performance Collaboration – The Jump to Light Speed
High Performance Collaboration – The Jump to Light SpeedHigh Performance Collaboration – The Jump to Light Speed
High Performance Collaboration – The Jump to Light Speed
Larry Smarr
 
Berkeley cloud computing meetup may 2020
Berkeley cloud computing meetup may 2020Berkeley cloud computing meetup may 2020
Berkeley cloud computing meetup may 2020
Larry Smarr
 
Cascade Project
Cascade ProjectCascade Project
Cascade Project
JasonCapehart
 
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
Larry Smarr
 
Looking Back, Looking Forward NSF CI Funding 1985-2025
Looking Back, Looking Forward NSF CI Funding 1985-2025Looking Back, Looking Forward NSF CI Funding 1985-2025
Looking Back, Looking Forward NSF CI Funding 1985-2025
Larry Smarr
 
Cycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC RunCycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC Run
inside-BigData.com
 

What's hot (20)

The Academic and R&D Sectors' Current and Future Broadband and Fiber Access N...
The Academic and R&D Sectors' Current and Future Broadband and Fiber Access N...The Academic and R&D Sectors' Current and Future Broadband and Fiber Access N...
The Academic and R&D Sectors' Current and Future Broadband and Fiber Access N...
 
An asynchronous and task-based implementation of peridynamics utilizing HPX—t...
An asynchronous and task-based implementation of peridynamics utilizing HPX—t...An asynchronous and task-based implementation of peridynamics utilizing HPX—t...
An asynchronous and task-based implementation of peridynamics utilizing HPX—t...
 
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
 
Porting our astrophysics application to Arm64FX and adding Arm64FX support us...
Porting our astrophysics application to Arm64FX and adding Arm64FX support us...Porting our astrophysics application to Arm64FX and adding Arm64FX support us...
Porting our astrophysics application to Arm64FX and adding Arm64FX support us...
 
Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Pro...
Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Pro...Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Pro...
Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Pro...
 
Calit2-a Persistent UCSD/UCI Framework for Collaboration
Calit2-a Persistent UCSD/UCI Framework for CollaborationCalit2-a Persistent UCSD/UCI Framework for Collaboration
Calit2-a Persistent UCSD/UCI Framework for Collaboration
 
Bioinformatics Data Pipelines built by CSIRO on AWS
Bioinformatics Data Pipelines built by CSIRO on AWSBioinformatics Data Pipelines built by CSIRO on AWS
Bioinformatics Data Pipelines built by CSIRO on AWS
 
Ceoa Nov 2005 Final Small
Ceoa Nov 2005 Final SmallCeoa Nov 2005 Final Small
Ceoa Nov 2005 Final Small
 
Slide 1
Slide 1Slide 1
Slide 1
 
Security Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformSecurity Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research Platform
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
 
OptIPuter: From the End User Lab to Global Digital Assets
OptIPuter: From the End User Lab to Global Digital AssetsOptIPuter: From the End User Lab to Global Digital Assets
OptIPuter: From the End User Lab to Global Digital Assets
 
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
 
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained WorldInternet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
 
High Performance Collaboration – The Jump to Light Speed
High Performance Collaboration – The Jump to Light SpeedHigh Performance Collaboration – The Jump to Light Speed
High Performance Collaboration – The Jump to Light Speed
 
Berkeley cloud computing meetup may 2020
Berkeley cloud computing meetup may 2020Berkeley cloud computing meetup may 2020
Berkeley cloud computing meetup may 2020
 
Cascade Project
Cascade ProjectCascade Project
Cascade Project
 
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
 
Looking Back, Looking Forward NSF CI Funding 1985-2025
Looking Back, Looking Forward NSF CI Funding 1985-2025Looking Back, Looking Forward NSF CI Funding 1985-2025
Looking Back, Looking Forward NSF CI Funding 1985-2025
 
Cycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC RunCycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC Run
 

Similar to Foster CRA March 2022.pptx

06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014
VinothkumaR Ramu
 
What is eScience, and where does it go from here?
What is eScience, and where does it go from here?What is eScience, and where does it go from here?
What is eScience, and where does it go from here?
Daniel S. Katz
 
Ed Fox on Learning Technologies
Ed Fox on Learning TechnologiesEd Fox on Learning Technologies
Ed Fox on Learning TechnologiesGardner Campbell
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Carole Goble
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
Anita de Waard
 
Democratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish ParasharDemocratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish Parashar
Larry Smarr
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Jisc
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
Lee Dirks
 
Datanauts: Open Innovation & Earth Observation (Proposal)
Datanauts: Open Innovation & Earth Observation (Proposal)Datanauts: Open Innovation & Earth Observation (Proposal)
Datanauts: Open Innovation & Earth Observation (Proposal)
Wolfgang Weicht
 
Big Data for the Social Sciences
Big Data for the Social SciencesBig Data for the Social Sciences
Big Data for the Social Sciences
David De Roure
 
Knowledge Architecture: Graphing Your Knowledge
Knowledge Architecture: Graphing Your KnowledgeKnowledge Architecture: Graphing Your Knowledge
Knowledge Architecture: Graphing Your Knowledge
Neo4j
 
Facilitating the Evolution of our Collective IQ
Facilitating the Evolution of our Collective IQ Facilitating the Evolution of our Collective IQ
Facilitating the Evolution of our Collective IQ
Doug Engelbart Institute
 
The wider environment of open scholarship – Jisc and CNI conference 10 July ...
The wider environment of open scholarship – Jisc and CNI conference 10 July ...The wider environment of open scholarship – Jisc and CNI conference 10 July ...
The wider environment of open scholarship – Jisc and CNI conference 10 July ...
Jisc
 
g-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionalityg-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionality
Nicholas Loulloudes
 
Visualization notes
Visualization notesVisualization notes
Visualization notes
University of South Australlia
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey Boulton
LEARN Project
 
Taming the Big Data Beast - Together
Taming the Big Data Beast - TogetherTaming the Big Data Beast - Together
Taming the Big Data Beast - Together
Kennisalliantie
 
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris
 
The pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an exampleThe pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an example
Enis Afgan
 

Similar to Foster CRA March 2022.pptx (20)

06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014
 
What is eScience, and where does it go from here?
What is eScience, and where does it go from here?What is eScience, and where does it go from here?
What is eScience, and where does it go from here?
 
Ed Fox on Learning Technologies
Ed Fox on Learning TechnologiesEd Fox on Learning Technologies
Ed Fox on Learning Technologies
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
Democratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish ParasharDemocratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish Parashar
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
Datanauts: Open Innovation & Earth Observation (Proposal)
Datanauts: Open Innovation & Earth Observation (Proposal)Datanauts: Open Innovation & Earth Observation (Proposal)
Datanauts: Open Innovation & Earth Observation (Proposal)
 
Big Data for the Social Sciences
Big Data for the Social SciencesBig Data for the Social Sciences
Big Data for the Social Sciences
 
Knowledge Architecture: Graphing Your Knowledge
Knowledge Architecture: Graphing Your KnowledgeKnowledge Architecture: Graphing Your Knowledge
Knowledge Architecture: Graphing Your Knowledge
 
Facilitating the Evolution of our Collective IQ
Facilitating the Evolution of our Collective IQ Facilitating the Evolution of our Collective IQ
Facilitating the Evolution of our Collective IQ
 
The wider environment of open scholarship – Jisc and CNI conference 10 July ...
The wider environment of open scholarship – Jisc and CNI conference 10 July ...The wider environment of open scholarship – Jisc and CNI conference 10 July ...
The wider environment of open scholarship – Jisc and CNI conference 10 July ...
 
g-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionalityg-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionality
 
Visualizing the Digital Humanities, Deic2012
Visualizing the Digital Humanities, Deic2012Visualizing the Digital Humanities, Deic2012
Visualizing the Digital Humanities, Deic2012
 
Visualization notes
Visualization notesVisualization notes
Visualization notes
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey Boulton
 
Taming the Big Data Beast - Together
Taming the Big Data Beast - TogetherTaming the Big Data Beast - Together
Taming the Big Data Beast - Together
 
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
 
The pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an exampleThe pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an example
 

More from Ian Foster

Global Services for Global Science March 2023.pptx
Global Services for Global Science March 2023.pptxGlobal Services for Global Science March 2023.pptx
Global Services for Global Science March 2023.pptx
Ian Foster
 
The Earth System Grid Federation: Origins, Current State, Evolution
The Earth System Grid Federation: Origins, Current State, EvolutionThe Earth System Grid Federation: Origins, Current State, Evolution
The Earth System Grid Federation: Origins, Current State, Evolution
Ian Foster
 
Better Information Faster: Programming the Continuum
Better Information Faster: Programming the ContinuumBetter Information Faster: Programming the Continuum
Better Information Faster: Programming the Continuum
Ian Foster
 
ESnet6 and Smart Instruments
ESnet6 and Smart InstrumentsESnet6 and Smart Instruments
ESnet6 and Smart Instruments
Ian Foster
 
Linking Scientific Instruments and Computation
Linking Scientific Instruments and ComputationLinking Scientific Instruments and Computation
Linking Scientific Instruments and Computation
Ian Foster
 
A Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific DiscoveryA Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific Discovery
Ian Foster
 
AI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryAI at Scale for Materials and Chemistry
AI at Scale for Materials and Chemistry
Ian Foster
 
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the Continuum
Ian Foster
 
Data Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud AutomationData Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud Automation
Ian Foster
 
Research Automation for Data-Driven Discovery
Research Automation for Data-Driven DiscoveryResearch Automation for Data-Driven Discovery
Research Automation for Data-Driven Discovery
Ian Foster
 
Scaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterScaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and Jupyter
Ian Foster
 
Data Automation at Light Sources
Data Automation at Light SourcesData Automation at Light Sources
Data Automation at Light Sources
Ian Foster
 
Team Argon Summary
Team Argon SummaryTeam Argon Summary
Team Argon Summary
Ian Foster
 
Thoughts on interoperability
Thoughts on interoperabilityThoughts on interoperability
Thoughts on interoperability
Ian Foster
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Ian Foster
 
NIH Data Commons Architecture Ideas
NIH Data Commons Architecture IdeasNIH Data Commons Architecture Ideas
NIH Data Commons Architecture Ideas
Ian Foster
 
Going Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFGoing Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCF
Ian Foster
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Ian Foster
 
Software Infrastructure for a National Research Platform
Software Infrastructure for a National Research PlatformSoftware Infrastructure for a National Research Platform
Software Infrastructure for a National Research Platform
Ian Foster
 
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Ian Foster
 

More from Ian Foster (20)

Global Services for Global Science March 2023.pptx
Global Services for Global Science March 2023.pptxGlobal Services for Global Science March 2023.pptx
Global Services for Global Science March 2023.pptx
 
The Earth System Grid Federation: Origins, Current State, Evolution
The Earth System Grid Federation: Origins, Current State, EvolutionThe Earth System Grid Federation: Origins, Current State, Evolution
The Earth System Grid Federation: Origins, Current State, Evolution
 
Better Information Faster: Programming the Continuum
Better Information Faster: Programming the ContinuumBetter Information Faster: Programming the Continuum
Better Information Faster: Programming the Continuum
 
ESnet6 and Smart Instruments
ESnet6 and Smart InstrumentsESnet6 and Smart Instruments
ESnet6 and Smart Instruments
 
Linking Scientific Instruments and Computation
Linking Scientific Instruments and ComputationLinking Scientific Instruments and Computation
Linking Scientific Instruments and Computation
 
A Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific DiscoveryA Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific Discovery
 
AI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryAI at Scale for Materials and Chemistry
AI at Scale for Materials and Chemistry
 
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the Continuum
 
Data Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud AutomationData Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud Automation
 
Research Automation for Data-Driven Discovery
Research Automation for Data-Driven DiscoveryResearch Automation for Data-Driven Discovery
Research Automation for Data-Driven Discovery
 
Scaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterScaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and Jupyter
 
Data Automation at Light Sources
Data Automation at Light SourcesData Automation at Light Sources
Data Automation at Light Sources
 
Team Argon Summary
Team Argon SummaryTeam Argon Summary
Team Argon Summary
 
Thoughts on interoperability
Thoughts on interoperabilityThoughts on interoperability
Thoughts on interoperability
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
NIH Data Commons Architecture Ideas
NIH Data Commons Architecture IdeasNIH Data Commons Architecture Ideas
NIH Data Commons Architecture Ideas
 
Going Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFGoing Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCF
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
Software Infrastructure for a National Research Platform
Software Infrastructure for a National Research PlatformSoftware Infrastructure for a National Research Platform
Software Infrastructure for a National Research Platform
 
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
 

Recently uploaded

Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
Richard Gill
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
kumarmathi863
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
Sérgio Sacani
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 

Recently uploaded (20)

Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 

Foster CRA March 2022.pptx

  • 1. Crescat scientia; vita excolatur A National Discovery Cloud Ian Foster The University of Chicago Argonne National Laboratory https://cra.org/ccc/wp-content/uploads/sites/2/2021/04/CCC-Whitepaper-National-Discovery-Cloud-2021.pdf foster@uchicago.edu, @ianfoster
  • 2. Tools for augmenting human intellect: 1962 “By ‘augmenting human intellect’ we mean increasing the capability of a [person] to approach a complex problem situation, to gain comprehension to suit [their] particular needs, and to derive solutions to problems.” * * Doug Engelbart, 1962 -- https://www.dougengelbart.org/content/view/138/
  • 3. Tools for augmenting human intellect: 1962 “By ‘augmenting human intellect’ we mean increasing the capability of a [person] to approach a complex problem situation, to gain comprehension to suit [their] particular needs, and to derive solutions to problems.” * * Doug Engelbart, 1962 -- https://www.dougengelbart.org/content/view/138/ + https://www.theregister.com/2008/12/11/engelbart_celebration/ "I don't get it - everything you've shown me today I can do on my ASR-33.” – prominent prof, as reported by Andries Van Dam +
  • 4. Numerical simulation A scientific method on par with, and sometimes exceeding, experiment Public cloud A new computing platform enabling new approaches to building, delivering services 2022: Three transformative technologies Sensors, data, ML Powerful methods for generating, and extracting information from, huge data x x Sources: servecentric.com, visibleearth.nasa.gov, quantamagazine.org, e3sm.org
  • 5. Challenge and opportunity in 2022: Create new tools for augmenting human intellect • Curated collections of observational, experimental, and simulated data, plus derived ML models • A global knowledge graph linking publications, data, models—updated by computational agents • Digital twins of complex systems, running on powerful computers, plus ML surrogates • Rich set of science services, with infrastructure to simplify operations and incentives to sustain See DOI: 10.1126/science.1110411, 2005, substituting “cloud” for “grid”
  • 6. Challenge and opportunity in 2022: Create new tools for augmenting human intellect • Curated collections of observational, experimental, and simulated data, plus derived ML models • A global knowledge graph linking publications, data, models—updated by computational agents • Digital twins of complex systems, running on powerful computers, plus ML surrogates • Rich set of science services, with infrastructure to simplify operations and incentives to sustain See DOI: 10.1126/science.1110411, 2005, substituting “cloud” for “grid” Such tools are needed across all of research and education The CS community should be: • leading design • creating tools for CS-specific needs
  • 7. Operated by UChicago for researchers worldwide Made possible by the support of 150+ subscribers globus.org Science services Hosted on
  • 8. Heatmap and clustering of the occurrence of Corynebacteria in study mgp128 doi: 10.1093/bib/bbx105 ttps://www.mg-rast.org
  • 9. A National Discovery Cloud requires new capabilities • The definition, creation, and curation of large reference datasets to fuel new data-driven models of the natural world, economy, human physiology, healthcare system, manufacturing processes, etc. • A discovery cloud platform to enable the collaborative development of value- added services that support NDC-powered scholarship and education • New educational programs and curricula to prepare a generation for whom programming and using NDC capabilities is second nature • Substantial computing, storage, and network resources to host and compute over enormous datasets, and to host and operate discovery cloud services that enhance the value of datasets • Innovative integrations of NDC capabilities with high-performance computers, automated laboratories, and other elements of a 21st century discovery and innovation ecosystem • Privacy and security designed in from the beginning, rather than added post facto, and with integrated assurances and audit capabilities so that the NDC advances rather than hinders computing in the public interest
  • 10. Open issues and challenges include … • Weaving diverse capabilities just listed into a coherent whole that US R&D enterprise can harness for discovery, innovation, and workforce • Balancing needs for persistent resources to support R&E communities vs. supporting innovation by those communities • Enabling research at lower levels of the ‘stack’ (Touch’s Law: The lowest level at which research is permitted in a testbed is also the highest level at which it can occur) • Privacy and security: Balancing “free and open” vs. “private and secure” in data and services • Building an NDC that contributes to environmental sustainability • Appropriate balance between bespoke and private sector data centers
  • 11. Summary: Let’s not underestimate public cloud • An elastic source of computing and storage capacity – sure • A cheap source of computing and storage capacity – maybe/not? • A new technology to study and engineer – yes • An immensely powerful platform for delivering scalable, reliable, and democratizing digital services – absolutely! • Our opportunity and challenge is a top-to-bottom rethink of what computing means for research and education

Editor's Notes

  1. A National Discovery Cloud: Preparing the US for Global Competitiveness in the New Era of 21st Century Digital Transformation Ian Foster, Daniel Lopresti, Bill Gropp, Mark D. Hill, Katie Schuman The nature of computation and its role in our lives have been transformed in the past two decades by three remarkable developments: the emergence of public cloud utilities as a new computing platform; the ability to extract information from enormous quantities of data via machine learning; and the emergence of computational simulation as a research method on par with experimental science. Each development has major implications for how societies function and compete; together, they represent a change in technological foundations of society as profound as the telegraph or electrification. Societies that embrace these changes will lead in the 21st Century; those that do not, will decline in prosperity and influence. Nowhere is this stark choice more evident than in research and education, the two sectors that produce the innovations that power the future and prepare a workforce able to exploit those innovations, respectively. In this article, we introduce these developments and suggest steps that the US government might take to prepare the research and education system for its implications. Our message is: think big. We have a historic opportunity to rethink how computing is organized and applied to empower the research community
  2. Look back 60 years to another historic moment, when Doug Engelbart demonstrated a computer system designed to enhance human abilities to tackle complex problems His demonstration of what he called NLS showed how by integrating emerging technologies of the day (displays and telecommunications) and inventing new ones (mouse, multiple windows) one could provide entirely new capabilities.
  3. While we now see this demo as a defining moment in computing history, at the time 90% of the community thought he was a crackpot.
  4. Today, humanity faces yet bigger problems, but we also have access to technologies hardly imagined by Engelbart. I’d like to highlight three: 1) A new computing platform that makes it possible for anyone to create and deploy powerful digital services for use by tens or tens of millions 2) Sensors that can acquire enormous datasets, and powerful methods, including ML methods, that can extract information from those data 3) Simulation methods that can …
  5. Opportunity is not just to provide increased access to computing power, But to create new digital services that enhance human capabilities
  6. A first example of an advanced digital service. Developed with NSF and DOE support over more than a decade. Links storage resources at more than 1600 institutions. Used by 10,000s of users to manage and share large data. Two key points: -- Leverages public cloud (AWS) to run a powerful, scalable national-scale service -- Sustained by subscriptions from more than 150 institutions worldwide
  7. A second example. Discipline-specific: Used by 10,000s to process metagenomic data from environmental samples. Has transformed this field by allowing biological scientists without informatics resources or expertise to participate in metagenomics research, AND permitting large meta-metagnomics studies. Difficulties: NOT hosted on cloud; no sustainability model
  8. In the CCC white paper, we speak to the issues that I have already mentioned, and emphasize that to realize these new tools for augmenting human intellect, we need a range of new capabilities, including those listed here.
  9. New capabilities and leadership are required if the US research and education enterprise is to effectively harness this new computational fabric for discovery, innovation, and workforce. The challenge is to enable researchers, educators, students, and industrial collaborators to develop and use the value-added services that will underpin the society of tomorrow; aggregate the massive datasets required for AI-driven discoveries and innovation; and construct and run the simulation models used to understand future products and scenarios.
  10. The big opportunity is to transform science processes, much as business and consumer relationships with IT are being transformed Doing this right will require a top-to-bottom rethink of what computing means for science; how it should be delivered; how it should be funded; how contributors should be rewarded