CYVERSE: TRANSFORMING LIFE
SCIENCE RESEARCH VIA
CYBERINFRASTRUCTURE
Matthew Vaughn @mattdotvaughn
Director, Life Sciences Computing, TACC
Co-PI CyVerse, Araport, Jetstream Cloud
9/8/2016 1
OVERVIEW
• WHAT IS CYVERSE?
• HOW IS IT TRANSFORMATIONAL FOR LIFE
SCIENCES RESEARCH?
• HOW DOES IT FIT INTO THE BIGGER SCHEME?
• WHAT DIRECTIONS AND CHALLENGES ARE IN ITS
FUTURE?
CYVERSE IS A CYBERINFRASTRUCTURE
Vision: Transforming science through data-driven
discovery
Mission: To design, develop, deploy, and expand a
national cyberinfrastructure for life
science research, and to train scientists in
its use
SUPPORTED BY THE NSF BIO DIRECTORATE
• Division of Biological
Infrastructure
• $100 Million, 10-year investment
• CyVerse resources are
– Freely available to the
community
– Intended to spur national and
international collaboration for
research and education
iPlant 2008
Empowering a
New Plant Biology
iPlant 2013
Cyberinfrastructure for
Life Science
CyVerse 2016
Transforming
Science Through
Data-Driven
Discovery
DBI-0735191
DBI-1265383
DISRUPTIVE MEASUREMENT TECHNOLOGIES
NOT JUST ONE DATA TSUNAMI BUT THOUSANDS OF THEM
EXPLOSION IN SOFTWARE AND SYSTEMS COMPLEXITY
INCREASED ADOPTION OF COMPUTATIONAL METHODS
RESEARCH TEAMS NEED THIS
• Store, organize, share primary data
• Do basic analysis
• Store, organize, share data products
• Generate and explore hypotheses
• Share analysis code with the scientific public
• Integrate results from new experiments
• Publish data alongside plots, visualizations and
analytical tools
BUT END UP DEALING WITH THIS
• Data lifecycle management
• Fine-grained permission management
• Discoverability
• Version control
• Taming promising new analysis codes (usually
based on immature technology)
• Paying for storage, cycles, and consulting
• Making their science reproducible
THE CYVERSE APPROACH
CYVERSE PRODUCT MATRIX
Atmosphere: User-provisioned, highly configurable cloud computing environment tailored for sciences
Discovery Environment: Web-accessible analysis workbench and gateway to national HPC infrastructure (XSEDE)
Bisque: Software for managing, analyzing and visualizing high throughput imaging data
Data Store: Scalable data storage for managing and sharing data across CyVerse’s CI and external data resources
Science APIs: Automation interfaces to connect data and computation for rapid integration external resources. Also used as a graduate teaching platform.
DNA Subway: Classroom-friendly bioinformatics teaching platform
Powered by CyVerse: Third-party applications built on CyVerse’s foundational services and
Welch et al. 2013
Bioinformatics
Specialist
Computing
Professional
Bench Scientist
EMPOWER USERS AT ALL LEVELS
Help them avoid
data and
operations siloes
Science
applications
Domain-specific
services
Established
software and CI
Physical resources
Federated
Storage
National CI Virtualization
Job
Scheduling
Single
Sign-on
Ease of Use
Ease of Re-use
IMPACTS
• 500+ publications
• >2PB user data stored
• 40k+ registered users
• Millions of compute
hours annually
• Hundreds of trainees
CYVERSE IS A HUB
IN A RICH &
COLLABORATIVE
ECOSYSTEM
• Using
• Collaborating
• Contributing
• Supporting
• Inventing
CURRENT INITIATIVES
Enabling Data-Driven Discovery. Providing Advanced Training to Researchers. Removing
Barriers to Reproducible Science.
CyVerse Data Commons
Portable Science Lab
Intensive Engagement
CYVERSE DATA COMMONS
Make research data discoverable and reusable. Ensure it ends up stored in its natural repository.
CyVerse Data
Store
Staging Area
Data Commons
Portal
Natural
Repositories
Publish in place
simply by sharing
Curate, format,
describe metadata
Published
snapshot with
DOI and open
access
Facilitated deposit
to NCBI-SRA,
Genbank, and
more
PORTABLE SCIENCE LAB
Continue adoption of technologies to describe, encapsulate, and share research code and
data.
Virtual machines, Linux containers, Web Service APIs,
Workflow Standards
Integrated via Interactive, Narrative
Notebooks
INTENSIVE ENGAGEMENT
Extended
Collaborative
Support
Consultation and
Support Forums
Hands-on
Training and
Tutorials
Enhanced
Support Tooling
Empower
Researchers to
Embrace and
Extend CyVerse
SUMMARY
• CyVerse is a reference model for cyberinfrastructure that is already
being extended to other disciplines
• CyVerse provides a vertically integrated, scalable data-to-discovery
cyberinfrastructure that leverages existing federal and state
investments to transform life science research
• CyVerse is driving technological and operational innovation via a
web of interactions and collaborations with other projects,
platforms, and infrastructures.
KEY CHALLENGE - CYVERSE VALUE PROPOSITION
“Are you still going to be around in 3 years?”
“Why did my analysis fail? Don’t you have big computers?”
“Shouldn’t we just go to Amazon Web Services?”
“I don’t want my students spending time learning computing.”
“Why aren’t you working on X?”
DISCUSSION
@mattdotvaughn www.slideshare.net/mattdotvaughn vaughn@tacc.utexas.edu

 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 

CYVERSE: TRANSFORMING LIFE SCIENCE RESEARCH VIA CYBERINFRASTRUCTURE

  • 1. CYVERSE: TRANSFORMING LIFE SCIENCE RESEARCH VIA CYBERINFRASTRUCTURE Matthew Vaughn @mattdotvaughn Director, Life Sciences Computing, TACC Co-PI Cyverse, Araport, Jetstream Cloud 9/8/2016 1
  • 2. OVERVIEW • WHAT IS CYVERSE? • HOW IS IT TRANSFORMATIONAL FOR LIFE SCIENCES RESEARCH? • HOW DOES IT FIT INTO THE BIGGER SCHEME? • WHAT DIRECTIONS AND CHALLENGES ARE IN ITS FUTURE?
  • 3. CYVERSE IS A CYBERINFRASTRUCTURE Vision: Transforming science through data-driven discovery. Mission: To design, develop, deploy, and expand a national cyberinfrastructure for life science research, and to train scientists in its use.
  • 4. SUPPORTED BY THE NSF BIO DIRECTORATE • Division of Biological Infrastructure • $100 million, 10-year investment • CyVerse resources are – Freely available to the community – Intended to spur national and international collaboration for research and education. iPlant 2008: Empowering a New Plant Biology; iPlant 2013: Cyberinfrastructure for Life Science; CyVerse 2016: Transforming Science Through Data-Driven Discovery. DBI-0735191, DBI-1265383
  • 6. NOT JUST ONE DATA TSUNAMI BUT THOUSANDS OF THEM
  • 7. EXPLOSION IN SOFTWARE AND SYSTEMS COMPLEXITY
  • 8. INCREASED ADOPTION OF COMPUTATIONAL METHODS
  • 9. RESEARCH TEAMS NEED THIS • Store, organize, share primary data • Do basic analysis • Store, organize, share data products • Generate and explore hypotheses • Share analysis code with the scientific public • Integrate results from new experiments • Publish data alongside plots, visualizations and analytical tools
  • 10. BUT END UP DEALING WITH THIS • Data lifecycle management • Fine-grained permission management • Discoverability • Version control • Taming promising new analysis codes (usually based on immature technology) • Paying for storage, cycles, and consulting • Making their science reproducible
  • 12. CYVERSE PRODUCT MATRIX Atmosphere: User-provisioned, highly configurable cloud computing environment tailored for sciences. Discovery Environment: Web-accessible analysis workbench and gateway to national HPC infrastructure (XSEDE). Bisque: Software for managing, analyzing and visualizing high-throughput imaging data. Data Store: Scalable data storage for managing and sharing data across CyVerse’s CI and external data resources. Science APIs: Automation interfaces to connect data and computation for rapid integration of external resources. Also used as a graduate teaching platform. DNA Subway: Classroom-friendly bioinformatics teaching platform. Powered by CyVerse: Third-party applications built on CyVerse’s foundational services and
  • 13. EMPOWER USERS AT ALL LEVELS: Bioinformatics Specialist, Computing Professional, Bench Scientist (Welch et al. 2013). Help them avoid data and operations silos.
  • 14. Science applications; Domain-specific services; Established software and CI; Physical resources; Federated Storage; National CI; Virtualization; Job Scheduling; Single Sign-on; Ease of Use; Ease of Re-use
  • 15. IMPACTS • 500+ publications • >2 PB user data stored • 40k+ registered users • Millions of compute hours annually • Hundreds of trainees
  • 16. CYVERSE IS A HUB IN A RICH & COLLABORATIVE ECOSYSTEM • Using • Collaborating • Contributing • Supporting • Inventing
  • 17. CURRENT INITIATIVES Enabling Data-Driven Discovery. Providing Advanced Training to Researchers. Removing Barriers to Reproducible Science. CyVerse Data Commons; Portable Science Lab; Intensive Engagement
  • 18. CYVERSE DATA COMMONS Make research data discoverable and reusable. Ensure it ends up stored in its natural repository. CyVerse Data Store: Publish in place simply by sharing. Staging Area: Curate, format, describe metadata. Data Commons Portal: Published snapshot with DOI and open access. Natural Repositories: Facilitated deposit to NCBI-SRA, GenBank, and more.
  • 19. PORTABLE SCIENCE LAB Continue adoption of technologies to describe, encapsulate, and share research code and data. Virtual machines, Linux containers, Web Service APIs, Workflow Standards. Integrated via Interactive, Narrative Notebooks.
  • 20. INTENSIVE ENGAGEMENT Extended Collaborative Support; Consultation and Support Forums; Hands-on Training and Tutorials; Enhanced Support Tooling. Empower Researchers to Embrace and Extend CyVerse.
  • 21. SUMMARY • CyVerse is a reference model for cyberinfrastructure that is already being extended to other disciplines • CyVerse provides a vertically integrated, scalable data-to-discovery cyberinfrastructure that leverages existing federal and state investments to transform life science research • CyVerse is driving technological and operational innovation via a web of interactions and collaborations with other projects, platforms, and infrastructures.
  • 22. KEY CHALLENGE - CYVERSE VALUE PROPOSITION “Are you still going to be around in 3 years?” “Why did my analysis fail? Don’t you have big computers?” “Shouldn’t we just go to Amazon Web Services?” “I don’t want my students spending time learning computing.” “Why aren’t you working on X?”

Editor's Notes

  1. WHY?
  2. LATCHING ONTO MOORE’S LAW… MRI, PET, Multispectral imaging, Laser scanning, LIDAR, X-ray. Everyone’s generating TERASCALE DATA.
  3. Everyone’s generating TERASCALE DATA. There aren’t thousands of locations capable of computing at this scale. Collaborative teams are geographically dispersed. iPlant can HELP.
  4. But now we need flexibility! Jetstream doesn’t solve hardware issues but is aimed at other challenging aspects.
  5. “Create detailed spatial-temporal molecular atlas (RNA, proteins, metabolites) of the developing lung” Here are their high level requirements. Seems familiar, right?
  6. They’re inevitably bogged down in these kinds of details… all while their NEED for computing is outpacing their resources Ah-ha, you say. They should just move to the cloud!