SlideShare a Scribd company logo
What Is It Going To Cost
And What Is In It For Me?
Philip E. Bourne PhD, FACMI
Stephenson Chair of Data Science
Director, Data Science Institute
Professor of Biomedical Engineering
peb6a@virginia.edu
https://www.slideshare.net/pebourne
1
@pebourne
Forecasting Costs and Preserving, Archiving, & Promoting Access to Biomedical Data07/12/19
My Perspective
207/12/19
Why This Title –
3
What Is It Going To Cost And What
Is In It For Me?
Because whomever is considering questions of
data management/preservation/access
these are the only questions that seem to matter
07/12/19
Consider the problem from the perspective of
stakeholders in a supply chain ….
407/12/19
Stakeholders in
Supply Chains
5
Funders
Publishers
NIH Directors
Congress
Deans
Presidents
Resource
Developers
Readers
Authors
Researchers
Faculty
Students
07/12/19
When it comes to data …
None of these supply chains is sustainable in its
current form ….
607/12/19
• Even for a resource so heavily used 5-year funding cycles are not assured
• There is little international cooperation at the funder level
• Funders have ownership issues too
• Developers are reluctant to seek private funding as they fear it will impact
their federal funding
7
Funders
Resource
Developers
07/12/19
• Only large publishers have the
means to sustain a data
ecosystem – they are large
because they are profit
making
• Lack of expertise
• Authors want to publish their next
paper not deposit high quality data
because there is little reward
• Data are only accessed a small
fraction of the time
• Data are move valuable in
aggregate
8
Publishers
Readers
Authors
07/12/19
• The distinction between data
science and data management is
not clear
• Experimental mindedness
• Need to support alternative
business models
• Need to put teeth into data
management plans
• Need to think business models
• Need to move beyond a sense of
entitlement
9
NIH Directors
Congress
Researchers
07/12/19
• May not appreciate the value of
data – think its free
• Have yet to realize how data are
critical to the future of the
institution
• Lack appropriate access even to
their own data
10
Deans
Presidents
Faculty
Students
07/12/19
Moreover, “Forecasting Costs” whether you
believe the system is sustainable or unsustainable
is very difficult.. and becoming more difficult …
Here is why …
1107/12/19
Story of the Trauma Surgeon …
• What does this story tell us?
• It’s the promise of things to come
• Data integration by new types of researchers leading to important biomedical
outcomes
• Suddenly biomedical data is only part of the story to be told
• That data must be preserved collectively if the story is to be reproduced
• There is no repository as suitable support for this story
• It’s the tip of an iceberg
1207/12/19
How Disruptive Could this Be?
(with Apologies)
1307/12/19
From a 2015 presentation to the Advisory Board to the NIH Director
Digitization
Deception
Disruption
Demonetization
Dematerialization
Democratization
Time
Volume,Velocity,Variety
Digital camera invented by
Kodak but shelved
Megapixels & quality improve slowly;
Kodak slow to react
Film market collapses;
Kodak goes bankrupt
Phones replace
cameras
Instagram,
Flickr become the
value proposition
Digital media becomes bona fide
form of communication
From a 2015 presentation to the Advisory Board to the NIH Director
Example - Photography
1407/12/19
15
Machine Learning
& Analytics
Yet Another Wake Up Call
07/12/19
Yet Another Wake Up Call
16
https://www.sciencemag.org/news/2018/12/google-s-deepmind-aces-protein-folding
https://moalquraishi.wordpress.com/2018/12/09/alphafold-casp13-what-just-happened/
Machine Learning
& Analytics
07/12/19
Further Drivers of Change (ELSI Notwithstanding)
• Training data is doubling every two
years
• Robust and reusable tools in Python
and R
• More advanced tools e.g., Deep
Artificial Neural Networks (DNNs)
• New computing power e.g., GPUs,
the cloud
• Advances coming from the private
sector NOT academia
• Successful integration into lifestyles
- patients will demand it
17
Pastur-Romay et al. 2016 doi:10.3390/ijms17081313
https://www.ebi.ac.uk/uniprot/TrEMBLstats
Contents of the Protein Data Bank07/12/19
Lets summarize with respect to our original
questions…
What is it going to cost?
As much as you are willing to spend
What is in it for me?
A significant part of the future of biomedical
research proportional to your spend
07/12/19 18
These answers are not very satisfactory to say the
least…
Let us consider possible solutions at least at the
academic institution level
07/12/19 19
One institution with an
important opportunity
07/12/19 20
We would not exist if
not for open data
We Need to Change the Institutional Culture
Surrounding Data
• We need use cases of “eat your own dog food” to show value
• We need to embrace the institutional libraries role as one beyond
data preservation to that of analyst
• We need to reward reproducible science and open science where
data plays a major role:
• Part of the faculty/staff handbook
• Part of the hiring process
• Part of the promotion process
• We need better data governance
07/12/19 21
We need the institutional infrastructure for data …
07/12/19 22
https://blog.lexicata.com/wp-content/uploads/2015/03/platform-model-
750x410.png
We need to move from pipes to platforms
We Need a Realistic Business Model
• Tuition
• Students use and reuse data and hence should pay for that quality data
• Federal Funding
• It’s a part of the solution, but not the whole solution, it will not scale
• Philanthropy
• Most philanthropists are not aware of the importance of data in what they give
money to support – Advancement offices need to be educated first
• Public Private Partnership
• Funding agencies should encourage this – it is more than SBIRs – witness capstones
07/12/19 23
We Are Not Alone
Data Science Offerings at Research Universities (n=116)
07/12/19 24
2019 N> 160
What Should be Done?
• A data deluge and opportunities lost are what happens when you are
forecasting costs
• Demand (science) far outweighs supply (data resources) – support
those resources that make the most strategic sense
• Broaden the responsibility for data to include academic institutions
and the private sector
• Develop incentives to support institutional data resources that impact the
culture
• Resource institutional/biomedical libraries
• Foster public private partnerships that support public data
07/12/19 25
My answers to the original questions…
What is it going to cost?
Less if we consider data as part of a broader
ecosystem with many stakeholders
What is in it for me?
Improved research and healthcare outcomes
07/12/19 26
Conversation Cards
• What role do you think institutions should play in support of data?
• Does the emergence of data as a science – data science – present
opportunities?
• What role should the private sector play?
07/12/19 27

More Related Content

What's hot

5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
DuraSpace
 
EC Open Access Co-ordination workshop - 4th May 2011
EC Open Access Co-ordination workshop - 4th May 2011EC Open Access Co-ordination workshop - 4th May 2011
EC Open Access Co-ordination workshop - 4th May 2011Jisc
 
Foundations for Discovery Informatics
Foundations for Discovery InformaticsFoundations for Discovery Informatics
Foundations for Discovery Informatics
Philip Bourne
 
Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...
Academy of Science of South Africa (ASSAf)
 
Jisc visions: research
Jisc visions: researchJisc visions: research
Jisc visions: research
Jisc
 
The Information Challenge
The Information ChallengeThe Information Challenge
The Information Challenge
Opportunity Links
 
The Information Challenge
The Information ChallengeThe Information Challenge
The Information Challenge
ennui2342
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Jisc
 
Moving Forward with Open Data Science - SWOT Analysis
Moving Forward with Open Data Science - SWOT AnalysisMoving Forward with Open Data Science - SWOT Analysis
Moving Forward with Open Data Science - SWOT Analysis
Philip Bourne
 
Open science and data sharing: the DataFirst experience/Martin Wittenberg
Open science and data sharing: the DataFirst experience/Martin WittenbergOpen science and data sharing: the DataFirst experience/Martin Wittenberg
Open science and data sharing: the DataFirst experience/Martin Wittenberg
African Open Science Platform
 
Hampson "Our Open Future"
Hampson "Our Open Future"Hampson "Our Open Future"
If Data Are The New Oil, How Do We Prevent Global Warming?
If Data Are The New Oil, How Do We Prevent Global Warming?If Data Are The New Oil, How Do We Prevent Global Warming?
If Data Are The New Oil, How Do We Prevent Global Warming?
Philip Bourne
 
School Libraries Under Threat: How to Ensure Survival?
School Libraries Under Threat:  How to Ensure Survival?School Libraries Under Threat:  How to Ensure Survival?
School Libraries Under Threat: How to Ensure Survival?Johan Koren
 
Research data spring: streamlining deposit
Research data spring: streamlining depositResearch data spring: streamlining deposit
Research data spring: streamlining deposit
Jisc RDM
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?
Philip Bourne
 
New emerging assistive technologies - Jisc Digifest 2016
New emerging assistive technologies - Jisc Digifest 2016New emerging assistive technologies - Jisc Digifest 2016
New emerging assistive technologies - Jisc Digifest 2016
Jisc
 
Where is Open Going?
Where is Open Going?Where is Open Going?
Where is Open Going?
Philip Bourne
 
The African Open Science Platform/Geoffrey Boulton
The African Open Science Platform/Geoffrey BoultonThe African Open Science Platform/Geoffrey Boulton
The African Open Science Platform/Geoffrey Boulton
African Open Science Platform
 
UKSG 2018 Breakout - Setting your cites to open I4OC - Maccallum
UKSG 2018 Breakout - Setting your cites to open I4OC - MaccallumUKSG 2018 Breakout - Setting your cites to open I4OC - Maccallum
UKSG 2018 Breakout - Setting your cites to open I4OC - Maccallum
UKSG: connecting the knowledge community
 
African Open Science Platform
African Open Science PlatformAfrican Open Science Platform
African Open Science Platform
Academy of Science of South Africa (ASSAf)
 

What's hot (20)

5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
 
EC Open Access Co-ordination workshop - 4th May 2011
EC Open Access Co-ordination workshop - 4th May 2011EC Open Access Co-ordination workshop - 4th May 2011
EC Open Access Co-ordination workshop - 4th May 2011
 
Foundations for Discovery Informatics
Foundations for Discovery InformaticsFoundations for Discovery Informatics
Foundations for Discovery Informatics
 
Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...Benefits of Open Data and Policy Developments, perspectives from research ins...
Benefits of Open Data and Policy Developments, perspectives from research ins...
 
Jisc visions: research
Jisc visions: researchJisc visions: research
Jisc visions: research
 
The Information Challenge
The Information ChallengeThe Information Challenge
The Information Challenge
 
The Information Challenge
The Information ChallengeThe Information Challenge
The Information Challenge
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
Moving Forward with Open Data Science - SWOT Analysis
Moving Forward with Open Data Science - SWOT AnalysisMoving Forward with Open Data Science - SWOT Analysis
Moving Forward with Open Data Science - SWOT Analysis
 
Open science and data sharing: the DataFirst experience/Martin Wittenberg
Open science and data sharing: the DataFirst experience/Martin WittenbergOpen science and data sharing: the DataFirst experience/Martin Wittenberg
Open science and data sharing: the DataFirst experience/Martin Wittenberg
 
Hampson "Our Open Future"
Hampson "Our Open Future"Hampson "Our Open Future"
Hampson "Our Open Future"
 
If Data Are The New Oil, How Do We Prevent Global Warming?
If Data Are The New Oil, How Do We Prevent Global Warming?If Data Are The New Oil, How Do We Prevent Global Warming?
If Data Are The New Oil, How Do We Prevent Global Warming?
 
School Libraries Under Threat: How to Ensure Survival?
School Libraries Under Threat:  How to Ensure Survival?School Libraries Under Threat:  How to Ensure Survival?
School Libraries Under Threat: How to Ensure Survival?
 
Research data spring: streamlining deposit
Research data spring: streamlining depositResearch data spring: streamlining deposit
Research data spring: streamlining deposit
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?
 
New emerging assistive technologies - Jisc Digifest 2016
New emerging assistive technologies - Jisc Digifest 2016New emerging assistive technologies - Jisc Digifest 2016
New emerging assistive technologies - Jisc Digifest 2016
 
Where is Open Going?
Where is Open Going?Where is Open Going?
Where is Open Going?
 
The African Open Science Platform/Geoffrey Boulton
The African Open Science Platform/Geoffrey BoultonThe African Open Science Platform/Geoffrey Boulton
The African Open Science Platform/Geoffrey Boulton
 
UKSG 2018 Breakout - Setting your cites to open I4OC - Maccallum
UKSG 2018 Breakout - Setting your cites to open I4OC - MaccallumUKSG 2018 Breakout - Setting your cites to open I4OC - Maccallum
UKSG 2018 Breakout - Setting your cites to open I4OC - Maccallum
 
African Open Science Platform
African Open Science PlatformAfrican Open Science Platform
African Open Science Platform
 

Similar to What Is It Going To Cost And What Is In It For Me?

Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?
Philip Bourne
 
Big Data and Data Science: Opportunities for Biomedical Engineering
Big Data and Data Science: Opportunities for Biomedical EngineeringBig Data and Data Science: Opportunities for Biomedical Engineering
Big Data and Data Science: Opportunities for Biomedical Engineering
Philip Bourne
 
3 dvc nsf-062813
3 dvc nsf-0628133 dvc nsf-062813
3 dvc nsf-062813
Philip Bourne
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
Philip Bourne
 
Data!
Data!Data!
It's Just Not FAIR
It's Just Not FAIRIt's Just Not FAIR
It's Just Not FAIR
Philip Bourne
 
Ps rwebinar january2019final
Ps rwebinar january2019finalPs rwebinar january2019final
Ps rwebinar january2019final
Margaret Henderson
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
Soil and Water Conservation Society
 
Yale Day of Data
Yale Day of Data Yale Day of Data
Yale Day of Data
Philip Bourne
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
Philip Bourne
 
The PDB An Exemplar for Data Science To Date, But What About the Future?
The PDB An Exemplar for Data Science To Date, But What About the Future?The PDB An Exemplar for Data Science To Date, But What About the Future?
The PDB An Exemplar for Data Science To Date, But What About the Future?
Philip Bourne
 
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEWUSING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
Nellore Harilakshmi
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
Philip Bourne
 
Data at the NIH
Data at the NIHData at the NIH
Data at the NIH
Philip Bourne
 
Research Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the ChallengeResearch Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the Challenge
Spencer Keralis
 
Big Data and Analytics Across the Interdisciplinary Divide
Big Data and Analytics Across the Interdisciplinary DivideBig Data and Analytics Across the Interdisciplinary Divide
Big Data and Analytics Across the Interdisciplinary Divide
Philip Bourne
 
The Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHThe Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIH
Philip Bourne
 
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
National Information Standards Organization (NISO)
 
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...University of California Curation Center
 
Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approach
Megan O'Donnell
 

Similar to What Is It Going To Cost And What Is In It For Me? (20)

Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?
 
Big Data and Data Science: Opportunities for Biomedical Engineering
Big Data and Data Science: Opportunities for Biomedical EngineeringBig Data and Data Science: Opportunities for Biomedical Engineering
Big Data and Data Science: Opportunities for Biomedical Engineering
 
3 dvc nsf-062813
3 dvc nsf-0628133 dvc nsf-062813
3 dvc nsf-062813
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
 
Data!
Data!Data!
Data!
 
It's Just Not FAIR
It's Just Not FAIRIt's Just Not FAIR
It's Just Not FAIR
 
Ps rwebinar january2019final
Ps rwebinar january2019finalPs rwebinar january2019final
Ps rwebinar january2019final
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
 
Yale Day of Data
Yale Day of Data Yale Day of Data
Yale Day of Data
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
The PDB An Exemplar for Data Science To Date, But What About the Future?
The PDB An Exemplar for Data Science To Date, But What About the Future?The PDB An Exemplar for Data Science To Date, But What About the Future?
The PDB An Exemplar for Data Science To Date, But What About the Future?
 
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEWUSING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Data at the NIH
Data at the NIHData at the NIH
Data at the NIH
 
Research Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the ChallengeResearch Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the Challenge
 
Big Data and Analytics Across the Interdisciplinary Divide
Big Data and Analytics Across the Interdisciplinary DivideBig Data and Analytics Across the Interdisciplinary Divide
Big Data and Analytics Across the Interdisciplinary Divide
 
The Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIHThe Thinking Behind Big Data at the NIH
The Thinking Behind Big Data at the NIH
 
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
 
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
 
Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approach
 

More from Philip Bourne

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
Philip Bourne
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
Philip Bourne
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
Philip Bourne
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
Philip Bourne
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
Philip Bourne
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
Philip Bourne
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
Philip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
Philip Bourne
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
Philip Bourne
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
Philip Bourne
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
Philip Bourne
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
Philip Bourne
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
Philip Bourne
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
Philip Bourne
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
Philip Bourne
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
Philip Bourne
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
Philip Bourne
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
Philip Bourne
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
Philip Bourne
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
Philip Bourne
 

More from Philip Bourne (20)

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
 

Recently uploaded

Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 

Recently uploaded (20)

Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 

What Is It Going To Cost And What Is In It For Me?

  • 1. What Is It Going To Cost And What Is In It For Me? Philip E. Bourne PhD, FACMI Stephenson Chair of Data Science Director, Data Science Institute Professor of Biomedical Engineering peb6a@virginia.edu https://www.slideshare.net/pebourne 1 @pebourne Forecasting Costs and Preserving, Archiving, & Promoting Access to Biomedical Data07/12/19
  • 3. Why This Title – 3 What Is It Going To Cost And What Is In It For Me? Because whomever is considering questions of data management/preservation/access these are the only questions that seem to matter 07/12/19
  • 4. Consider the problem from the perspective of stakeholders in a supply chain …. 407/12/19
  • 5. Stakeholders in Supply Chains 5 Funders Publishers NIH Directors Congress Deans Presidents Resource Developers Readers Authors Researchers Faculty Students 07/12/19
  • 6. When it comes to data … None of these supply chains is sustainable in its current form …. 607/12/19
  • 7. • Even for a resource so heavily used 5-year funding cycles are not assured • There is little international cooperation at the funder level • Funders have ownership issues too • Developers are reluctant to seek private funding as they fear it will impact their federal funding 7 Funders Resource Developers 07/12/19
  • 8. • Only large publishers have the means to sustain a data ecosystem – they are large because they are profit making • Lack of expertise • Authors want to publish their next paper not deposit high quality data because there is little reward • Data are only accessed a small fraction of the time • Data are move valuable in aggregate 8 Publishers Readers Authors 07/12/19
  • 9. • The distinction between data science and data management is not clear • Experimental mindedness • Need to support alternative business models • Need to put teeth into data management plans • Need to think business models • Need to move beyond a sense of entitlement 9 NIH Directors Congress Researchers 07/12/19
  • 10. • May not appreciate the value of data – think its free • Have yet to realize how data are critical to the future of the institution • Lack appropriate access even to their own data 10 Deans Presidents Faculty Students 07/12/19
  • 11. Moreover, “Forecasting Costs” whether you believe the system is sustainable or unsustainable is very difficult.. and becoming more difficult … Here is why … 1107/12/19
  • 12. Story of the Trauma Surgeon … • What does this story tell us? • It’s the promise of things to come • Data integration by new types of researchers leading to important biomedical outcomes • Suddenly biomedical data is only part of the story to be told • That data must be preserved collectively if the story is to be reproduced • There is no repository as suitable support for this story • It’s the tip of an iceberg 1207/12/19
  • 13. How Disruptive Could this Be? (with Apologies) 1307/12/19 From a 2015 presentation to the Advisory Board to the NIH Director
  • 14. Digitization Deception Disruption Demonetization Dematerialization Democratization Time Volume,Velocity,Variety Digital camera invented by Kodak but shelved Megapixels & quality improve slowly; Kodak slow to react Film market collapses; Kodak goes bankrupt Phones replace cameras Instagram, Flickr become the value proposition Digital media becomes bona fide form of communication From a 2015 presentation to the Advisory Board to the NIH Director Example - Photography 1407/12/19
  • 15. 15 Machine Learning & Analytics Yet Another Wake Up Call 07/12/19
  • 16. Yet Another Wake Up Call 16 https://www.sciencemag.org/news/2018/12/google-s-deepmind-aces-protein-folding https://moalquraishi.wordpress.com/2018/12/09/alphafold-casp13-what-just-happened/ Machine Learning & Analytics 07/12/19
  • 17. Further Drivers of Change (ELSI Notwithstanding) • Training data is doubling every two years • Robust and reusable tools in Python and R • More advanced tools e.g., Deep Artificial Neural Networks (DNNs) • New computing power e.g., GPUs, the cloud • Advances coming from the private sector NOT academia • Successful integration into lifestyles - patients will demand it 17 Pastur-Romay et al. 2016 doi:10.3390/ijms17081313 https://www.ebi.ac.uk/uniprot/TrEMBLstats Contents of the Protein Data Bank07/12/19
  • 18. Lets summarize with respect to our original questions… What is it going to cost? As much as you are willing to spend What is in it for me? A significant part of the future of biomedical research proportional to your spend 07/12/19 18
  • 19. These answers are not very satisfactory to say the least… Let us consider possible solutions at least at the academic institution level 07/12/19 19
  • 20. One institution with an important opportunity 07/12/19 20 We would not exist if not for open data
  • 21. We Need to Change the Institutional Culture Surrounding Data • We need use cases of “eat your own dog food” to show value • We need to embrace the institutional libraries role as one beyond data preservation to that of analyst • We need to reward reproducible science and open science where data plays a major role: • Part of the faculty/staff handbook • Part of the hiring process • Part of the promotion process • We need better data governance 07/12/19 21
  • 22. We need the institutional infrastructure for data … 07/12/19 22 https://blog.lexicata.com/wp-content/uploads/2015/03/platform-model- 750x410.png We need to move from pipes to platforms
  • 23. We Need a Realistic Business Model • Tuition • Students use and reuse data and hence should pay for that quality data • Federal Funding • It’s a part of the solution, but not the whole solution, it will not scale • Philanthropy • Most philanthropists are not aware of the importance of data in what they give money to support – Advancement offices need to be educated first • Public Private Partnership • Funding agencies should encourage this – it is more than SBIRs – witness capstones 07/12/19 23
  • 24. We Are Not Alone Data Science Offerings at Research Universities (n=116) 07/12/19 24 2019 N> 160
  • 25. What Should be Done? • A data deluge and opportunities lost are what happens when you are forecasting costs • Demand (science) far outweighs supply (data resources) – support those resources that make the most strategic sense • Broaden the responsibility for data to include academic institutions and the private sector • Develop incentives to support institutional data resources that impact the culture • Resource institutional/biomedical libraries • Foster public private partnerships that support public data 07/12/19 25
  • 26. My answers to the original questions… What is it going to cost? Less if we consider data as part of a broader ecosystem with many stakeholders What is in it for me? Improved research and healthcare outcomes 07/12/19 26
  • 27. Conversation Cards • What role do you think institutions should play in support of data? • Does the emergence of data as a science – data science – present opportunities? • What role should the private sector play? 07/12/19 27