SlideShare a Scribd company logo
1 of 30
The NIH as a Digital Enterprise:
Implications for PAG
Philip E. Bourne, PhD
Associate Director for Data Science
National Institutes of Health
PAG San Diego
January 11, 2015
What do we mean by the notion of a
Digital Enterprise?
Start by considering how far we have
come in just one researcher’s
career….
Biomedical Research is Becoming
More Digital and FAIR
 Finding
 Accessing
 Integrating
 Reusing
digital research objects
This move from an observational
science to a more analytical science
is being driven by ever increasing
amounts of digital data
The NIH Fire Hose Slide
And This May Just be the Beginning
 Evidence:
– Google car
– 3D printers
– Waze
– Robotics
From: The Second Machine Age: Work, Progress,
and Prosperity in a Time of Brilliant Technologies
by Erik Brynjolfsson & Andrew McAfee
Further Perturbation:
The Story of Meredith
http://fora.tv/2012/04/20/Congress_Unplugged_
Phil_Bourne
Stephen Friend
47/53 “landmark” publications
could not be replicated
[Begley, Ellis Nature,
483, 2012] [Carole Goble]
ADDS Mission
Statement
To foster an open ecosystem that
enables biomedical* research to be
conducted as a digital enterprise that
enhances health, lengthens life and
reduces illness and disability
* Includes biological, biomedical, behavioral, social,
environmental, and clinical studies that relate to understanding
health and disease.
Some Goals of the Digital Enterprise
 Cost savings through sharing of best
practices
 Sustainability of digital assets
 Collaboration through identification of
collaborators at the point of data collection
not publication
 Improved reproducibility through data and
methods sharing
 Integration of data types and data and
literature to accelerate discovery
Some of Today’s Observations
 Bad News
– We do not yet have a
data sustainability plan
– Global policies define the
why but not the how
– We do not know how all
the data we currently
have are used
– We can’t estimate future
supply and demand
– We need to ramp up
training programs in data
science
 Good news
– Genuine willingness to
address the problem
– Global communities are
emerging
– Efficiencies can be
achieved
– BD2K is the beginnings
of a plan
– We are beginning to
quantify the issues
Sustainability 101
Source Michael Bell http://homepages.cs.ncl.ac.uk/m.j.bell1/blog/?p=830
What is the NIH Doing to Fulfill
That Promise?
Elements of The Digital Enterprise
Community
Policy
Infrastructure
• Sustainability
• Collaboration
• Training
Elements of The Digital Enterprise
Community
Policy
Infrastructure
• Sustainability
Collaboration
• Training
Virtuous
Research
Cycle
Policies – Now & Forthcoming
 Data Sharing
– Genomic data sharing announced
– Data sharing plans on all research awards
– Data sharing plan enforcement
• Machine readable plan
• Repository requirements to include grant numbers
http://www.nih.gov/news/health/aug2014/od-27.htm
Policies - Forthcoming
 Data Citation
– Goal: legitimize data as a form of scholarship
– Process:
• Machine readable standard for data citation (done)
• Endorsement of data citation for inclusion in NIH bib
sketch, grants, reports, etc.
• Example formats for human readable data citations
• Slowly work into NLM/NCBI workflow
BD2K
Center
BD2K
Center
BD2K
Center
BD2K
Center
BD2K
Center
BD2K
Center
DDICC
Software
Standard
s
Infrastructure - The
Commons
Labs
Labs
Labs
Labs
The Commons
Digital Objects
(with UIDs)
Search
(indexed metadata)
Computing
Platform
TheCommons
Vivien Bonazzi
George Komatsoulis
The Commons: Compute Platforms
The Commons
Conceptual Framework
Public Cloud
Platforms
Super Computing
(HPC) Platforms
Other
Platforms ?
 Google, AWS (Amazon)
 Microsoft (Azure), IBM,
other?
 In house compute
solutions
 Private clouds, HPC
– Pharma
– The Broad
– Bionimbus
 Traditionally low access
by NIH
The Commons:
Business Model
[George Komatsoulis]
How Might PAG’s Participate?
 Consider contributing digital research objects into the
Commons – data, software, standards, narrative,
course materials …
 Initiate your own moves from cylinders of excellence
to more integrated and multi-functional data sources
 Work to define new business models for the scientific
enterprise
Accelerate This Kind of Study
Pfenning et al 2014 Science 346 1333
Generic Needs
 Homogenization of disparate large unstructured
datasets
 Deriving structure from unstructured data
 Feature mapping and comparison from image data
 Visualization and analysis of multi-dimensional
phenotypic datasets
 Causal modeling of large scale dynamic networks
and subsequent discovery
 Utilize data that are sparsely and irregularly sampled
and noisy
BD2K will offer reference datasets and points of
domain expertise to explore these questions
1) Build an OPEN digital framework for data
science training:
NIH Data Science Workforce Development Center
1) Develop short-term training opportunities:
Courses, educational resources, etc.
1) Develop the discipline of biomedical data
science and support cross-training – OPEN
courseware
Community: Training
Data Science Training Goals
All goals have a diversity component and manate
Associate Director for Data Science
Commons BD2K Efficiency
Sustainability Education Innovation Process
• Cloud – Data &
Compute
• Search
• Security
• Reproducibility
Standards
• App Store
• Coordinate
• Hands-on
• Syllabus
• MOOCs
• Community
• Centers
• Training Grants
• Catalogs
• Standards
• Analysis
• Data
Resource
Support
• Metrics
• Best
Practices
• Evaluation
• Portfolio
Analysis
The Biomedical Research Digital Enterprise
Partnerships
Collaboration
rogrammatic Theme
Deliverable
Example Features • IC’s
• Researchers
• Federal
Agencies
• International
Partners
• Computer
Scientists
Scientific Data Council External Advisory Board
Training
NIHNIH……
Turning Discovery Into HealthTurning Discovery Into Health
philip.bourne@nih.gov
Potential Outcomes
 Mobility: improve the outcomes of surgeries in
children with cerebral palsy and gait pathology
 Wellness: markers derived from constantly monitored
eHealth/mobile health devices – apply to smoking
cessation, weight loss
 Cancer: further personalization of treatment
 Mental Health: better identify factors that resist and
promote brain disease e.g., schizophrenia, bipolar
disorder, major depression, attention deficit
hyperactivity disorder (ADHD), obsessive compulsive
disorder (OCD), autism
 Addiction: utilizing social media to track and treat
drug use and addiction

More Related Content

What's hot

Poster: Very Open Data Project
Poster: Very Open Data ProjectPoster: Very Open Data Project
Poster: Very Open Data Project
Edward Blurock
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
University of California Curation Center
 

What's hot (20)

RDAP 15 Navigating the Rocky Road to Research Data Acceptance
RDAP 15 Navigating the Rocky Road to Research Data AcceptanceRDAP 15 Navigating the Rocky Road to Research Data Acceptance
RDAP 15 Navigating the Rocky Road to Research Data Acceptance
 
Data Science BD2K Update for NIH
Data Science BD2K Update for NIH Data Science BD2K Update for NIH
Data Science BD2K Update for NIH
 
Poster RDAP13: Data information literacy multiple paths to a single goal
Poster RDAP13: Data information literacy multiple paths to a single goalPoster RDAP13: Data information literacy multiple paths to a single goal
Poster RDAP13: Data information literacy multiple paths to a single goal
 
Poster: Very Open Data Project
Poster: Very Open Data ProjectPoster: Very Open Data Project
Poster: Very Open Data Project
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
 
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FuturePoster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
 
NISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management PlanNISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management Plan
 
NIH BD2K DataMed model, DATS
NIH BD2K DataMed model, DATSNIH BD2K DataMed model, DATS
NIH BD2K DataMed model, DATS
 
There is No Intelligent Life Down Here
There is No Intelligent Life Down HereThere is No Intelligent Life Down Here
There is No Intelligent Life Down Here
 
The Vision for Data @ the NIH
The Vision for Data @ the NIHThe Vision for Data @ the NIH
The Vision for Data @ the NIH
 
RDAP 033111
RDAP 033111RDAP 033111
RDAP 033111
 
Towards the Digital Research Enterprise
Towards the Digital Research EnterpriseTowards the Digital Research Enterprise
Towards the Digital Research Enterprise
 
Research Data Management for SOE
Research Data Management for SOEResearch Data Management for SOE
Research Data Management for SOE
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
 
Highlights from NIH Data Science
Highlights from NIH Data ScienceHighlights from NIH Data Science
Highlights from NIH Data Science
 
Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & Innovation
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 
Meeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human HealthMeeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human Health
 
Open Science: Some Possible Actions by University Leaders on Behalf of Resear...
Open Science:Some Possible Actions by University Leaders on Behalf of Resear...Open Science:Some Possible Actions by University Leaders on Behalf of Resear...
Open Science: Some Possible Actions by University Leaders on Behalf of Resear...
 

Similar to The NIH as a Digital Enterprise: Implications for PAG

Similar to The NIH as a Digital Enterprise: Implications for PAG (20)

The Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHThe Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIH
 
AMIA 2014
AMIA 2014AMIA 2014
AMIA 2014
 
Yale Day of Data
Yale Day of Data Yale Day of Data
Yale Day of Data
 
PSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical ResearchPSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical Research
 
Data at the NIH
Data at the NIHData at the NIH
Data at the NIH
 
Opportunities and Challenges for International Cooperation Around Big Data
Opportunities and Challenges for International Cooperation Around Big DataOpportunities and Challenges for International Cooperation Around Big Data
Opportunities and Challenges for International Cooperation Around Big Data
 
Workshop intro090314
Workshop intro090314Workshop intro090314
Workshop intro090314
 
BD2K Update
BD2K UpdateBD2K Update
BD2K Update
 
Data!
Data!Data!
Data!
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
A Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital EnterpriseA Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital Enterprise
 
3 dvc nsf-062813
3 dvc nsf-0628133 dvc nsf-062813
3 dvc nsf-062813
 
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH     Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
PhRMA Some Early Thoughts
PhRMA Some Early ThoughtsPhRMA Some Early Thoughts
PhRMA Some Early Thoughts
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?
 
Foundations for Discovery Informatics
Foundations for Discovery InformaticsFoundations for Discovery Informatics
Foundations for Discovery Informatics
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 
Magle data curation in libraries
Magle data curation in librariesMagle data curation in libraries
Magle data curation in libraries
 

More from Philip Bourne

More from Philip Bourne (20)

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
 
The UVA School of Data Science
The UVA School of Data ScienceThe UVA School of Data Science
The UVA School of Data Science
 
The Most Important Ten Simple Rules
The Most Important Ten Simple RulesThe Most Important Ten Simple Rules
The Most Important Ten Simple Rules
 

Recently uploaded

QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lessonQUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
httgc7rh9c
 

Recently uploaded (20)

Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf arts
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx
 
Economic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food AdditivesEconomic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food Additives
 
Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17
 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & Systems
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
What is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxWhat is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lessonQUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17
 
VAMOS CUIDAR DO NOSSO PLANETA! .
VAMOS CUIDAR DO NOSSO PLANETA!                    .VAMOS CUIDAR DO NOSSO PLANETA!                    .
VAMOS CUIDAR DO NOSSO PLANETA! .
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 

The NIH as a Digital Enterprise: Implications for PAG

  • 1. The NIH as a Digital Enterprise: Implications for PAG Philip E. Bourne, PhD Associate Director for Data Science National Institutes of Health PAG San Diego January 11, 2015
  • 2. What do we mean by the notion of a Digital Enterprise?
  • 3. Start by considering how far we have come in just one researcher’s career….
  • 4.
  • 5. Biomedical Research is Becoming More Digital and FAIR  Finding  Accessing  Integrating  Reusing digital research objects
  • 6. This move from an observational science to a more analytical science is being driven by ever increasing amounts of digital data
  • 7. The NIH Fire Hose Slide
  • 8. And This May Just be the Beginning  Evidence: – Google car – 3D printers – Waze – Robotics From: The Second Machine Age: Work, Progress, and Prosperity in a Time of Brilliant Technologies by Erik Brynjolfsson & Andrew McAfee
  • 9. Further Perturbation: The Story of Meredith http://fora.tv/2012/04/20/Congress_Unplugged_ Phil_Bourne Stephen Friend
  • 10. 47/53 “landmark” publications could not be replicated [Begley, Ellis Nature, 483, 2012] [Carole Goble]
  • 11. ADDS Mission Statement To foster an open ecosystem that enables biomedical* research to be conducted as a digital enterprise that enhances health, lengthens life and reduces illness and disability * Includes biological, biomedical, behavioral, social, environmental, and clinical studies that relate to understanding health and disease.
  • 12. Some Goals of the Digital Enterprise  Cost savings through sharing of best practices  Sustainability of digital assets  Collaboration through identification of collaborators at the point of data collection not publication  Improved reproducibility through data and methods sharing  Integration of data types and data and literature to accelerate discovery
  • 13. Some of Today’s Observations  Bad News – We do not yet have a data sustainability plan – Global policies define the why but not the how – We do not know how all the data we currently have are used – We can’t estimate future supply and demand – We need to ramp up training programs in data science  Good news – Genuine willingness to address the problem – Global communities are emerging – Efficiencies can be achieved – BD2K is the beginnings of a plan – We are beginning to quantify the issues
  • 14. Sustainability 101 Source Michael Bell http://homepages.cs.ncl.ac.uk/m.j.bell1/blog/?p=830
  • 15. What is the NIH Doing to Fulfill That Promise?
  • 16. Elements of The Digital Enterprise Community Policy Infrastructure • Sustainability • Collaboration • Training
  • 17. Elements of The Digital Enterprise Community Policy Infrastructure • Sustainability Collaboration • Training Virtuous Research Cycle
  • 18. Policies – Now & Forthcoming  Data Sharing – Genomic data sharing announced – Data sharing plans on all research awards – Data sharing plan enforcement • Machine readable plan • Repository requirements to include grant numbers http://www.nih.gov/news/health/aug2014/od-27.htm
  • 19. Policies - Forthcoming  Data Citation – Goal: legitimize data as a form of scholarship – Process: • Machine readable standard for data citation (done) • Endorsement of data citation for inclusion in NIH bib sketch, grants, reports, etc. • Example formats for human readable data citations • Slowly work into NLM/NCBI workflow
  • 21. The Commons Digital Objects (with UIDs) Search (indexed metadata) Computing Platform TheCommons Vivien Bonazzi George Komatsoulis
  • 22. The Commons: Compute Platforms The Commons Conceptual Framework Public Cloud Platforms Super Computing (HPC) Platforms Other Platforms ?  Google, AWS (Amazon)  Microsoft (Azure), IBM, other?  In house compute solutions  Private clouds, HPC – Pharma – The Broad – Bionimbus  Traditionally low access by NIH
  • 24. How Might PAG’s Participate?  Consider contributing digital research objects into the Commons – data, software, standards, narrative, course materials …  Initiate your own moves from cylinders of excellence to more integrated and multi-functional data sources  Work to define new business models for the scientific enterprise
  • 25. Accelerate This Kind of Study Pfenning et al 2014 Science 346 1333
  • 26. Generic Needs  Homogenization of disparate large unstructured datasets  Deriving structure from unstructured data  Feature mapping and comparison from image data  Visualization and analysis of multi-dimensional phenotypic datasets  Causal modeling of large scale dynamic networks and subsequent discovery  Utilize data that are sparsely and irregularly sampled and noisy BD2K will offer reference datasets and points of domain expertise to explore these questions
  • 27. 1) Build an OPEN digital framework for data science training: NIH Data Science Workforce Development Center 1) Develop short-term training opportunities: Courses, educational resources, etc. 1) Develop the discipline of biomedical data science and support cross-training – OPEN courseware Community: Training Data Science Training Goals All goals have a diversity component and manate
  • 28. Associate Director for Data Science Commons BD2K Efficiency Sustainability Education Innovation Process • Cloud – Data & Compute • Search • Security • Reproducibility Standards • App Store • Coordinate • Hands-on • Syllabus • MOOCs • Community • Centers • Training Grants • Catalogs • Standards • Analysis • Data Resource Support • Metrics • Best Practices • Evaluation • Portfolio Analysis The Biomedical Research Digital Enterprise Partnerships Collaboration rogrammatic Theme Deliverable Example Features • IC’s • Researchers • Federal Agencies • International Partners • Computer Scientists Scientific Data Council External Advisory Board Training
  • 29. NIHNIH…… Turning Discovery Into HealthTurning Discovery Into Health philip.bourne@nih.gov
  • 30. Potential Outcomes  Mobility: improve the outcomes of surgeries in children with cerebral palsy and gait pathology  Wellness: markers derived from constantly monitored eHealth/mobile health devices – apply to smoking cessation, weight loss  Cancer: further personalization of treatment  Mental Health: better identify factors that resist and promote brain disease e.g., schizophrenia, bipolar disorder, major depression, attention deficit hyperactivity disorder (ADHD), obsessive compulsive disorder (OCD), autism  Addiction: utilizing social media to track and treat drug use and addiction

Editor's Notes

  1. Ioannidis JPA (2005) Why Most Published Research Findings Are False. PLoS Med 2(8): e124. doi:10.1371/journal.pmed.0020124 http://www.reuters.com/article/2012/03/28/us-science-cancer-idUSBRE82R12P20120328