Opportunities and Challenges for 
International Collaboration Around 
Big Data 
Philip E. Bourne, PhD 
Associate Director for Data Science 
National Institutes of Health 
philip.bourne@nih.gov 
November 12, 2014
A Bottom Up Exemplar
Top Down 
Protein sequence and functional annotation
Cellular models
Pathway and reaction annotation
Protein interaction annotation
Evidence-based proteomics annotation
Gene Ontology annotation
Variants Annotation ClinVar / OMIM
MedThesaurus
[adapted from Ioannis Xenarios]
What Else Can We Do from the Top Down?
The NIH Data 
Science Mission 
Statement 
To foster an ecosystem that enables 
biomedical* research to be 
conducted as a digital enterprise that 
enhances health, lengthens life and 
reduces illness and disability 
* Includes biological, biomedical, behavioral, social, 
environmental, and clinical studies that relate to understanding 
health and disease.
Elements of The Ecosystem 
Community Policy 
Infrastructure 
• Sustainability 
• Collaboration 
• Training
Virtuous 
Research 
Cycle
The Virtuous Cycle 
September 3, 2014 Workshop 
http://goo.gl/fkWjhS
Policies – Now & Forthcoming
▪ Data Sharing
– Genomic data sharing announced
– Data sharing plans on all research awards
– Data sharing plan enforcement
• Machine readable plan
• Repository requirements to include grant numbers
http://www.nih.gov/news/health/aug2014/od-27.htm
Policies – Forthcoming
▪ Data Citation
– Goal: legitimize data as a form of scholarship
– Process:
• Machine readable standard for data citation (done)
• Endorsement of data citation for inclusion in NIH biosketch, grants, reports, etc.
• Example formats for human readable data citations
• Slowly work into NLM/NCBI workflow
Infrastructure - The Commons
[Diagram labels: BD2K Centers (six), DDICC, Software, Standards, Labs]
What is the Commons?
▪ A conceptual framework for sharing, finding, integrating, reusing and attributing digital research objects
– “Each digital object has a UID that must allow it to be found, shared and attributed” – The Commons Document
▪ The Commons is agnostic of computing platform
The Commons:
Framework → Implementation
Digital Objects 
(with UIDs) 
Search 
(indexed metadata) 
Computing 
Platform 
The Commons
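The framework elements listed here (digital objects with UIDs, search over indexed metadata) can be illustrated with a small in-memory sketch. This is not the Commons implementation: the names `DigitalObject` and `CommonsIndex`, the use of random UUIDs as UIDs, the metadata fields, and the keyword index are all assumptions made for illustration only.

```python
import uuid
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass(frozen=True)
class DigitalObject:
    """A research object (dataset, software, workflow) carrying a UID.

    All fields are hypothetical; the slides only require that a UID exists.
    """
    name: str
    creators: tuple
    keywords: tuple
    location: str  # hypothetical pointer to wherever the object is stored
    uid: str = field(default_factory=lambda: str(uuid.uuid4()))

    def attribution(self) -> str:
        # Human-readable credit line that includes the UID.
        return f"{', '.join(self.creators)}. {self.name}. UID: {self.uid}"


class CommonsIndex:
    """Toy metadata index: objects can be found by UID or by keyword."""

    def __init__(self):
        self._by_uid = {}
        self._by_keyword = defaultdict(set)

    def register(self, obj: DigitalObject) -> str:
        # Store the object and index its keywords case-insensitively.
        self._by_uid[obj.uid] = obj
        for keyword in obj.keywords:
            self._by_keyword[keyword.lower()].add(obj.uid)
        return obj.uid

    def resolve(self, uid: str) -> DigitalObject:
        # "Found" and "shared": a UID always leads back to the object.
        return self._by_uid[uid]

    def search(self, keyword: str) -> list:
        # Return matching objects sorted by name; empty list if none match.
        uids = self._by_keyword.get(keyword.lower(), set())
        return sorted((self._by_uid[u] for u in uids), key=lambda o: o.name)
```

In this sketch the index holds only metadata and a location string, so the object itself could live on any computing platform, which is one way to read "agnostic of computing platform".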
The Commons:
Framework → Draft Implementations
The Commons Conceptual Framework
Public Cloud Platforms
▪ Google, AWS (Amazon)
▪ Microsoft (Azure), IBM, other?
▪ Most easily accessed by NIH PIs
Other Platforms ?
▪ In house compute solutions
▪ Private clouds, HPC
– Pharma
– The Broad
– Bionimbus
▪ Low access by NIH PIs
Super Computing (HPC) Platforms
▪ Super Computing 2014
▪ ADDS coordinating meeting with SC centers
▪ NERSC “Commons Pilot”
The Commons:
Framework → Draft Implementation
The Commons Conceptual Framework
Public Cloud Platforms
▪ Digital Objects to populate and test the Commons:
– BD2K centers, NCI Cloud pilots (Google & AWS supported)
– Large Public Data Sets, MODs
▪ Search
– BD2K Data and Software Discovery Indices
– Google Search functions
▪ Use cases
The Commons:
Framework → Draft Implementation
The Commons Conceptual Framework
Public Cloud Platforms
▪ Next Steps
– Determine which BD2K centers are most appropriate for a cloud Commons pilot
– Develop a plan of action with NCI Cloud pilots
– Working with DDIC/SW Discovery Indices (UIDs, Search)
– Working with Google and AWS (Amazon) to determine what is needed computationally
• In kind support (short term pilot)
• Conformant clouds (long term sustainable model)
– Developing Use cases!
A Business Model for The Commons
The Commons: Framework → Draft Implementation
Community – BD2K Awards
Community: BD2K Awards Governance
▪ November 3 Kick-off PI Meeting
– Emphasis on working groups that span centers and begin the work of building the ecosystem
• Common API development (with GA4GH)
• Mobile
• Metadata
• Grand challenges
– Emphasize sharing from day 1
– Incentivized to work in the Commons
Community Short Term Interactions
▪ NSF Workshops and Dear Colleague letter
▪ Workshop with NOAA on public-private partnerships
▪ ELIXIR Workshop
– Standards
– Training
▪ Workshop Inspiring the Game Developer Community to Engage in and Enhance Biomedical Research, Dec 2014
Community: Training
Data Science Training Goals
1) Build a digital framework for data science training: NIH Data Science Workforce Development Center
2) Develop short-term training opportunities: courses, educational resources, etc.
3) Develop the discipline of biomedical data science and support cross-training
Goals expanded from recommendations in the June 2012 DIWG and Aug 2013 Training workshop reports.
Heads Up on What is Coming in FY15
▪ Calls for using the Commons
▪ Calls for a standards framework development
▪ Calls for software development
▪ Calls to stimulate interactions between communities (diversity, rotations, library)
▪ Calls for high risk, high return projects
▪ Your ideas here…
NIH…
philip.bourne@nih.gov
Turning Discovery Into Health

More Related Content

What's hot

Data commons bonazzi bd2 k fundamentals of science feb 2017
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017Vivien Bonazzi
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectivePhilip Bourne
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Vivien Bonazzi
 
Bonazzi data commons nhgri council feb 2017
Bonazzi data commons nhgri council feb 2017Bonazzi data commons nhgri council feb 2017
Bonazzi data commons nhgri council feb 2017Vivien Bonazzi
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data CommonsVivien Bonazzi
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD
 
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and RealityA VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality Paul Courtney
 
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH     Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH Philip Bourne
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands Vivien Bonazzi
 
Workshop intro090314
Workshop intro090314Workshop intro090314
Workshop intro090314Philip Bourne
 
NIH Data Commons - Note: Presentation has animations
NIH Data Commons  - Note:  Presentation has animations NIH Data Commons  - Note:  Presentation has animations
NIH Data Commons - Note: Presentation has animations Vivien Bonazzi
 
Data Commons Garvan - 2016
Data Commons Garvan -  2016 Data Commons Garvan -  2016
Data Commons Garvan - 2016 Vivien Bonazzi
 
The NIH Data Commons - BD2K All Hands Meeting 2015
The NIH Data Commons -  BD2K All Hands Meeting 2015The NIH Data Commons -  BD2K All Hands Meeting 2015
The NIH Data Commons - BD2K All Hands Meeting 2015Vivien Bonazzi
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationHistoric Environment Scotland
 
NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14SEAD
 
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeoLicence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeoEDINA, University of Edinburgh
 

What's hot (20)

Data commons bonazzi bd2 k fundamentals of science feb 2017
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017
 
Elsevier1 vc
Elsevier1 vcElsevier1 vc
Elsevier1 vc
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH Perspective
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2
 
Bonazzi data commons nhgri council feb 2017
Bonazzi data commons nhgri council feb 2017Bonazzi data commons nhgri council feb 2017
Bonazzi data commons nhgri council feb 2017
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)
 
Data cite
Data citeData cite
Data cite
 
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and RealityA VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
 
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH     Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands
 
Workshop intro090314
Workshop intro090314Workshop intro090314
Workshop intro090314
 
NIH Data Commons - Note: Presentation has animations
NIH Data Commons  - Note:  Presentation has animations NIH Data Commons  - Note:  Presentation has animations
NIH Data Commons - Note: Presentation has animations
 
Data Commons Garvan - 2016
Data Commons Garvan -  2016 Data Commons Garvan -  2016
Data Commons Garvan - 2016
 
The NIH Data Commons - BD2K All Hands Meeting 2015
The NIH Data Commons -  BD2K All Hands Meeting 2015The NIH Data Commons -  BD2K All Hands Meeting 2015
The NIH Data Commons - BD2K All Hands Meeting 2015
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant Application
 
NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14
 
Delivering Postgraduate Training - MANTRA
Delivering Postgraduate Training - MANTRADelivering Postgraduate Training - MANTRA
Delivering Postgraduate Training - MANTRA
 
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeoLicence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
 

Viewers also liked

The Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHThe Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHPhilip Bourne
 
Designing an impact curriculum | Phil Bourne, Director, School and Academy Co...
Designing an impact curriculum | Phil Bourne, Director, School and Academy Co...Designing an impact curriculum | Phil Bourne, Director, School and Academy Co...
Designing an impact curriculum | Phil Bourne, Director, School and Academy Co...Wholeeducation
 
Big Data and Population Health: SBM 2015
Big Data and Population Health: SBM 2015Big Data and Population Health: SBM 2015
Big Data and Population Health: SBM 2015Bradford Hesse
 
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...Philip Bourne
 
Big Data in Biomedicine: Where is the NIH Headed
Big Data in Biomedicine: Where is the NIH HeadedBig Data in Biomedicine: Where is the NIH Headed
Big Data in Biomedicine: Where is the NIH HeadedPhilip Bourne
 
Introduction to testing with MSTest, Visual Studio, and Team Foundation Serve...
Introduction to testing with MSTest, Visual Studio, and Team Foundation Serve...Introduction to testing with MSTest, Visual Studio, and Team Foundation Serve...
Introduction to testing with MSTest, Visual Studio, and Team Foundation Serve...Thomas Weller
 
Towards a Platform for Global Health
Towards a Platform for Global HealthTowards a Platform for Global Health
Towards a Platform for Global HealthPhilip Bourne
 
Vital Trends in Digital Experience and Transformation in 2016 | Dreamforce 20...
Vital Trends in Digital Experience and Transformation in 2016 | Dreamforce 20...Vital Trends in Digital Experience and Transformation in 2016 | Dreamforce 20...
Vital Trends in Digital Experience and Transformation in 2016 | Dreamforce 20...Dion Hinchcliffe
 

Viewers also liked (10)

The Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHThe Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIH
 
Designing an impact curriculum | Phil Bourne, Director, School and Academy Co...
Designing an impact curriculum | Phil Bourne, Director, School and Academy Co...Designing an impact curriculum | Phil Bourne, Director, School and Academy Co...
Designing an impact curriculum | Phil Bourne, Director, School and Academy Co...
 
AMIA 2014
AMIA 2014AMIA 2014
AMIA 2014
 
BD2K Update
BD2K UpdateBD2K Update
BD2K Update
 
Big Data and Population Health: SBM 2015
Big Data and Population Health: SBM 2015Big Data and Population Health: SBM 2015
Big Data and Population Health: SBM 2015
 
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...
Big Data and the Promise and Pitfalls when Applied to Disease Prevention and ...
 
Big Data in Biomedicine: Where is the NIH Headed
Big Data in Biomedicine: Where is the NIH HeadedBig Data in Biomedicine: Where is the NIH Headed
Big Data in Biomedicine: Where is the NIH Headed
 
Introduction to testing with MSTest, Visual Studio, and Team Foundation Serve...
Introduction to testing with MSTest, Visual Studio, and Team Foundation Serve...Introduction to testing with MSTest, Visual Studio, and Team Foundation Serve...
Introduction to testing with MSTest, Visual Studio, and Team Foundation Serve...
 
Towards a Platform for Global Health
Towards a Platform for Global HealthTowards a Platform for Global Health
Towards a Platform for Global Health
 
Vital Trends in Digital Experience and Transformation in 2016 | Dreamforce 20...
Vital Trends in Digital Experience and Transformation in 2016 | Dreamforce 20...Vital Trends in Digital Experience and Transformation in 2016 | Dreamforce 20...
Vital Trends in Digital Experience and Transformation in 2016 | Dreamforce 20...
 

Similar to Opportunities and Challenges for International Cooperation Around Big Data

Meeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human HealthMeeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human HealthPhilip Bourne
 
The NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGThe NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGPhilip Bourne
 
NDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) OfficeNDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) OfficePhilip Bourne
 
PSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical ResearchPSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical ResearchPhilip Bourne
 
Foundations for Discovery Informatics
Foundations for Discovery InformaticsFoundations for Discovery Informatics
Foundations for Discovery InformaticsPhilip Bourne
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataMartin Hamilton
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityTERN Australia
 
Martin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP OnlineMartin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP OnlineFuture Perfect 2012
 
From policy to practice with DMP Online
From policy to practice with DMP OnlineFrom policy to practice with DMP Online
From policy to practice with DMP OnlineSarah Jones
 
Research data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineResearch data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineMartin Donnelly
 
ELIXIR . Technical Coordinator
ELIXIR. Technical CoordinatorELIXIR. Technical Coordinator
ELIXIR . Technical CoordinatorRafael C. Jimenez
 
RDM LIASA webinar
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinarSarah Jones
 
EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube
 
Biomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterpriseBiomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterprisePhilip Bourne
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 

Similar to Opportunities and Challenges for International Cooperation Around Big Data (20)

Meeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human HealthMeeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human Health
 
The NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGThe NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAG
 
Yale Day of Data
Yale Day of Data Yale Day of Data
Yale Day of Data
 
NDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) OfficeNDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) Office
 
BD2K Update
BD2K Update BD2K Update
BD2K Update
 
PSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical ResearchPSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical Research
 
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content TypesIlik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
 
Foundations for Discovery Informatics
Foundations for Discovery InformaticsFoundations for Discovery Informatics
Foundations for Discovery Informatics
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research Data
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive Capability
 
Martin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP OnlineMartin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP Online
 
From policy to practice with DMP Online
From policy to practice with DMP OnlineFrom policy to practice with DMP Online
From policy to practice with DMP Online
 
Research data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineResearch data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP Online
 
The Commons
The CommonsThe Commons
The Commons
 
ELIXIR . Technical Coordinator
ELIXIR. Technical CoordinatorELIXIR. Technical Coordinator
ELIXIR . Technical Coordinator
 
RDM LIASA webinar
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinar
 
EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013
 
Biomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterpriseBiomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital Enterprise
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 

More from Philip Bourne

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationPhilip Bourne
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingPhilip Bourne
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityPhilip Bourne
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?Philip Bourne
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug DiscoveryPhilip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AlonePhilip Bourne
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchPhilip Bourne
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data SciencePhilip Bourne
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewPhilip Bourne
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptxPhilip Bourne
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Philip Bourne
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision EducationPhilip Bourne
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Philip Bourne
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Philip Bourne
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance SustainabilityPhilip Bourne
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesPhilip Bourne
 

More from Philip Bourne (20)

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 


Opportunities and Challenges for International Cooperation Around Big Data

  • 1. Opportunities and Challenges for International Collaboration Around Big Data Philip E. Bourne, PhD Associate Director for Data Science National Institutes of Health philip.bourne@nih.gov November 12, 2014
  • 2. A Bottom Up Exemplar
  • 3. Top Down Protein sequence and functional annotation Cellular models Pathway and reaction annotation Protein interaction annotation Evidence-based proteomics annotation Gene Ontology annotation Variants Annotation ClinVar / OMIM MedThesaurus [adapted from Ioannis Xenarios]
  • 4. What Else Can we Do from the Top Down?
  • 5. The NIH Data Science Mission Statement To foster an ecosystem that enables biomedical* research to be conducted as a digital enterprise that enhances health, lengthens life and reduces illness and disability * Includes biological, biomedical, behavioral, social, environmental, and clinical studies that relate to understanding health and disease.
  • 6.
  • 7. Elements of The Ecosystem Community Policy Infrastructure • Sustainability • Collaboration • Training
  • 8. Elements of The Ecosystem Community Policy Infrastructure • Sustainability Collaboration • Training Virtuous Research Cycle
  • 9. The Virtuous Cycle September 3, 2014 Workshop http://goo.gl/fkWjhS
  • 10. Policies – Now & Forthcoming ▪ Data Sharing – Genomic data sharing announced – Data sharing plans on all research awards – Data sharing plan enforcement • Machine-readable plan • Repository requirements to include grant numbers http://www.nih.gov/news/health/aug2014/od-27.htm
  • 11. Policies - Forthcoming ▪ Data Citation – Goal: legitimize data as a form of scholarship – Process: • Machine-readable standard for data citation (done) • Endorsement of data citation for inclusion in NIH biosketch, grants, reports, etc. • Example formats for human-readable data citations • Slowly work into NLM/NCBI workflow
  • 12. Infrastructure - The BD2K Center Commons BD2K Center BD2K Center BD2K Center BD2K Center BD2K Center DDICC Software Standards Labs Labs Labs Labs
  • 13. What is the Commons? ▪ A Conceptual Framework for: ▪ Sharing, finding, integrating, reusing and attributing digital research objects – “Each digital object has a UID that must allow it to be found, shared and attributed” – The Commons Document ▪ The Commons is agnostic of computing platform
  • 14. The Commons: Framework → Implementation Digital Objects (with UIDs) Search (indexed metadata) Computing Platform The Commons
  • 15. The Commons: Framework → Implementation Digital Objects (with UIDs) Search (indexed metadata) Computing Platform The Commons
  • 16. The Commons: Framework → Draft Implementations The Commons Conceptual Framework Public Cloud Platforms Super Computing (HPC) Platforms Other Platforms ? ▪ Google, AWS (Amazon) ▪ Microsoft (Azure), IBM, other? ▪ Most easily accessed by NIH PIs ▪ In house compute solutions ▪ Private clouds, HPC – Pharma – The Broad – Bionimbus ▪ Low access by NIH PIs ▪ Super Computing 2014 ▪ ADDS coordinating meeting with SC centers ▪ NERSC “Commons Pilot”
  • 17. The Commons: Framework → Implementation Digital Objects (with UIDs) Search (indexed metadata) Computing Platform The Commons
  • 18. The Commons: Framework → Draft Implementation The Commons Conceptual Framework ▪ Digital Objects to populate and test the Commons: – BD2K centers, NCI Cloud pilots (Google & AWS supported) – Large Public Data Sets, MODs ▪ Search – BD2K Data and Software Discovery Indices – Google Search functions ▪ Use cases Public Cloud Platforms
  • 19. The Commons: Framework → Draft Implementation The Commons Conceptual Framework ▪ Next Steps – Determine which BD2K centers are most appropriate for a cloud Commons pilot – Develop a plan of action with NCI Cloud pilots – Working with DDIC/SW Discovery Indices (UIDs, Search) – Working with Google and AWS (Amazon) to determine what is needed computationally • In kind support (short term pilot) • Conformant clouds (long term sustainable model) – Developing Use cases! Public Cloud Platforms
  • 20. A Business Model for The Commons The Commons: Framework → Draft Implementation
  • 22. Community: BD2K Awards Governance ▪ November 3 Kick-off PI Meeting – Emphasis on working groups that span centers and begin the work of building the ecosystem • Common API development (with GA4GH) • Mobile • Metadata • Grand challenges – Emphasize sharing from day 1 – Incentivized to work in the Commons
  • 23. Community Short Term Interactions ▪ NSF Workshops and Dear Colleague letter ▪ Workshop with NOAA on public-private partnerships ▪ ELIXIR Workshop – Standards – Training ▪ Workshop Inspiring the Game Developer Community to Engage in and Enhance Biomedical Research, Dec 2014
  • 24. Community: Training Data Science Training Goals 1) Build a digital framework for data science training: NIH Data Science Workforce Development Center 2) Develop short-term training opportunities: Courses, educational resources, etc. 3) Develop the discipline of biomedical data science and support cross-training Goals expanded from recommendations in the June 2012 DIWG and Aug 2013 Training workshop reports.
  • 25. Heads Up on What is Coming in FY15 ▪ Calls for using the Commons ▪ Calls for standards framework development ▪ Calls for software development ▪ Calls to stimulate interactions between communities (diversity, rotations, library) ▪ Calls for high risk, high return projects ▪ Your ideas here…
  • 26. NIH… philip.bourne@nih.gov Turning Discovery Into Health
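
The Commons slides (13 to 19) describe three layers: digital objects with UIDs, search over indexed metadata, and an interchangeable computing platform. A minimal sketch of how those layers might fit together is given here for illustration only. The slides do not specify any implementation, so every name in it (`DigitalObject`, `CommonsIndex`, the fields, the example locations) is hypothetical, and random UUIDs stand in for whatever identifier scheme the Commons would actually adopt.

```python
from collections import defaultdict
from dataclasses import dataclass
import uuid


@dataclass(frozen=True)
class DigitalObject:
    """Hypothetical record for a dataset, tool, or workflow in the Commons."""
    uid: str                  # unique identifier: lets the object be found and shared
    location: str             # opaque pointer; the index does not care which platform
    creators: tuple           # who to credit when the object is attributed
    keywords: frozenset       # metadata terms that get indexed for search


class CommonsIndex:
    """Toy in-memory index: UID lookup plus keyword search over metadata."""

    def __init__(self):
        self._objects = {}                    # uid -> DigitalObject
        self._by_keyword = defaultdict(set)   # keyword -> set of uids

    def register(self, location, creators, keywords):
        # Assumption: a random UUID stands in for the real UID scheme.
        uid = str(uuid.uuid4())
        obj = DigitalObject(uid, location, tuple(creators),
                            frozenset(k.lower() for k in keywords))
        self._objects[uid] = obj
        for keyword in obj.keywords:
            self._by_keyword[keyword].add(uid)
        return obj

    def resolve(self, uid):
        """Find an object by its UID."""
        return self._objects[uid]

    def search(self, *keywords):
        """Return objects whose metadata contains every given keyword."""
        if not keywords:
            return []
        matches = [self._by_keyword.get(k.lower(), set()) for k in keywords]
        hits = set.intersection(*matches)
        return sorted((self._objects[u] for u in hits), key=lambda o: o.uid)

    def cite(self, uid):
        """Produce a simple human-readable attribution string."""
        obj = self.resolve(uid)
        return f"{', '.join(obj.creators)}. uid:{obj.uid}"


commons = CommonsIndex()
vcf = commons.register("s3://example-bucket/variants.vcf",
                       ["A. Researcher"], ["genomics", "variants"])
model = commons.register("hpc:/scratch/example/model.h5",
                         ["B. Scientist"], ["genomics", "model"])
print(commons.cite(vcf.uid))
print([o.location for o in commons.search("genomics")])
```

Because `location` is treated as an opaque string, the same index can point at objects in a public cloud, an HPC center, or an in-house system, which is one way to read the statement that the Commons is agnostic of computing platform.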

Editor's Notes

  1. Swiss-Prot annotation efforts are structured in such a way as to cover various community needs. We move from the basic curation of protein sequences and their individual functions – in individual records – to the representation of higher order assemblies of proteins in complexes and networks, and functional pathways. To do this we maintain curation efforts targeting reactions and pathways, GO functions, protein interactions, proteomics annotations… This means we have a reservoir of prior experience and expertise in this domain. We actively participate in development of standards and protocols for annotation in the context of numerous consortia. We have access to the ChEBI and Rhea submission tools. We can create universal, stable identifiers for new lipid species (and any other small molecules).