SlideShare a Scribd company logo
It’s Just Not FAIR
Philip E. Bourne PhD, FACMI
Stephenson Chair of Data Science
Director, Data Science Institute
Professor of Biomedical Engineering
peb6a@virginia.edu
https://www.slideshare.net/pebourne
July 23, 2019 ISMB 2019 1
@pebourne
My Role Here
• Co-developer of the RCSB PDB
• First President of FORCE11
• An author of the FAIR Principles
• Chair of Data Policy Committee at PLOS
• Acting Dean of a proposed school that
will reward open
• What follows are my own opinions & I
can’t contribute much to the technology
discussion
Printer's Device of Johannes Froben
https://commons.wikimedia.org/wiki/File:Printer%27s_Device_of_Johannes_Froben.jpg
July 23, 2019 ISMB 2019 2
Probability of
finding the data
associated with a
paper declined by
17% every year
Vines, Timothy et al. “The
Availability of Research Data
Declines Rapidly with Article Age.”
Current Biology (June 1, 2014)
Image: Nature doi:10.1038/nature.2013.14416
Just One Motivator –
Data Availability Declines Over Time
ALMOST ALL DATA LOST 10-15 YRS AFTER PUBLICATION
From Emma Ganley @ PLOS
July 23, 2019 4
In 2005 I Had a Dream
0. Paper is but one view
1. User clicks on thumbnail
2. FAIR data provide a
rendered image that
can be annotated
3. Selecting a features
provides a
database/literature
mashup
4. That leads to new
papers
1. A link brings up figures
from the paper
0. Full text of PLoS papers stored
in a database
2. Clicking the paper figure retrieves
data from the PDB which is
analyzed
3. A composite view of
journal and database
content results
4. The composite view has
links to pertinent blocks
of literature text and back to the PDB
1.
2.
3.
4.
PLoS Comp. Biol. 2005 1(3) e34
ISMB 2019
Why has the dream not been realized?
FAIR is like broccoli …
You know it is good for you,
but no one wants to eat it
By Fir0002 - Own work, GFDL 1.2, https://commons.wikimedia.org/w/index.php?curid=5772317
The incentives are not there
https://www.wideopeneats.com/recipes/broccoli-topped-cheese-sauce/July 23, 2019 ISMB 2019 5
What is the secret sauce …..
July 23, 2019 ISMB 2019 6
Need to Impact All Aspects of the Research
Lifecycle
Publishershttps://www.vertigoventures.com/lesson/embedding-impact/impact-research-life-cycle/
Funders
Academic
Institutions
Publishers
July 23, 2019 ISMB 2019 7
The role of institutions …
The secret sauce is they know they need to
change, but change is hard…
Needs to happen slowly with exemplars …
Data science initiatives represent those exemplars
July 23, 2019 ISMB 2019 8
One institution with an
important opportunity
July 23, 2019 9
We would not exist if
not for open data
ISMB 2019
We Need to Change the Institutional Culture
Surrounding Data
• We need use cases of “eat your own dog food” to show value
• We need to embrace the institutional libraries role as one beyond
data preservation to that of curator & analyst
• We need to reward reproducible science and open science where
data plays a major role:
• Part of the faculty/staff handbook
• Part of the hiring process
• Part of the promotion process
• We need better data governance
July 23, 2019 10ISMB 2019
We need the institutional infrastructure for data …
July 23, 2019 11
https://blog.lexicata.com/wp-content/uploads/2015/03/platform-model-
750x410.png
We need to move from pipes to platforms
ISMB 2019
What is it Going to Cost and What is in it for
Me?
July 23, 2019 ISMB 2019 12
We Need a Realistic Business Model
• Tuition
• Students use and reuse data and hence should pay for that quality data
• Federal Funding
• It’s a part of the solution, but not the whole solution, it will not scale
• Philanthropy
• Most philanthropists are not aware of the importance of data in what they give
money to support – Advancement offices need to be educated first
• Public Private Partnership
• Funding agencies should encourage this – it is more than SBIRs – witness capstones
July 23, 2019 13ISMB 2019
We Are Not Alone
Data Science Offerings at USA Research Universities (n=116)
July 23, 2019 14
2019 N> 160
ISMB 2019
Need to Impact All Aspects of the Research
Lifecycle
Publishershttps://www.vertigoventures.com/lesson/embedding-impact/impact-research-life-cycle/
Funders
Academic
Institutions
Publishers
July 23, 2019 ISMB 2019 15
PLOS Data Policy
• PLOS journals require authors to make all data underlying the findings
described in their manuscript fully available without restriction at the
time of publication. When specific legal or ethical requirements
prohibit public sharing of a dataset, authors must indicate how
researchers may obtain access to the data.
• When submitting a manuscript, authors must provide a Data
Availability Statement describing compliance with PLOS's policy. If the
article is accepted for publication, the data availability statement will
be published as part of the accepted article.
• Reliance on resources whose sustainability is unknown
• Who checks?
July 23, 2019 ISMB 2019 16
>100,000
papers published with a data statement at PLOS
<0.1%
of submissions rejected due to authors’
unwillingness or inability to share data
~20%
of submissions use data repositories
From Emma Ganley @ PLOS
Natural Tension & Resistance
https://commons.wikimedia.org/wiki/File:Water_surface_tension_2.jpg
Challenges
• Research areas such as clinical studies require more
complex data sharing considerations and data release
mechanisms. Community input is important for policy
implementation
• Data citations and mechanisms to provide author
credit need to see a stronger uptake
• Metadata of published data sets is often lacking,
needs community-agreed standards
• It is not always clear what constitutes compliance
From Emma Ganley @ PLOS
Related PLOS Efforts Beyond Data
• Moving towards a software sharing policy
• Working a pilot with CodeOcean to include executable code in
publications
• Notion of reproducible models
• Emphasizing benchmarking section to show the value of data
July 23, 2019 ISMB 2019 20
Summary - Why Has FAIR (Let Alone My
Dream) Not Been Realized (Optimistic View)?
• It starts with funder incentives – things are looking up
• Institutions have a role to play – the value of data is slowly
being realized
• Small publishers like PLOS have limited leverage - publishers
need to coordinate
July 23, 2019 ISMB 2019 21
Thank You!
Questions?
peb6a@virginia.edu @pebourne
July 23, 2019 ISMB 2019 22

More Related Content

What's hot

Data from experiments
Data from experimentsData from experiments
Data from experiments
Tomohiro Nagashima
 
Data Standards and Linked Data: Challenges & Use Cases in Europe and the Unit...
Data Standards and Linked Data: Challenges & Use Cases in Europe and the Unit...Data Standards and Linked Data: Challenges & Use Cases in Europe and the Unit...
Data Standards and Linked Data: Challenges & Use Cases in Europe and the Unit...
Jonathan Pichot
 
A Sea of Information
A Sea of InformationA Sea of Information
A Sea of Information
Latia Ward
 
Copy of OSTP RFI on Big Data and Privacy
Copy of OSTP RFI on Big Data and PrivacyCopy of OSTP RFI on Big Data and Privacy
Copy of OSTP RFI on Big Data and Privacy
Micah Altman
 
Presentasjon
PresentasjonPresentasjon
Presentasjon
UNSW
 
How to Learn More about Data Journalism by Ron Nixon - Philadelphia NewsTrain...
How to Learn More about Data Journalism by Ron Nixon - Philadelphia NewsTrain...How to Learn More about Data Journalism by Ron Nixon - Philadelphia NewsTrain...
How to Learn More about Data Journalism by Ron Nixon - Philadelphia NewsTrain...
News Leaders Association's NewsTrain
 
Promoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analyticsPromoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analytics
Jisc
 
Krystyn J. Van Vliet Advanced Manufacturing
Krystyn J. Van Vliet Advanced Manufacturing Krystyn J. Van Vliet Advanced Manufacturing
Krystyn J. Van Vliet Advanced Manufacturing
MIT Startup Exchange
 
Innovation, KM, and Data.gov
Innovation, KM, and Data.govInnovation, KM, and Data.gov
Innovation, KM, and Data.gov
Jeanne Holm
 
Understanding the Big Data Enterprise
Understanding the Big Data EnterpriseUnderstanding the Big Data Enterprise
Understanding the Big Data Enterprise
Philip Bourne
 
Brian Anthony MIT STEX Automation Workshop June 17, 2015
Brian Anthony MIT STEX Automation Workshop June 17, 2015Brian Anthony MIT STEX Automation Workshop June 17, 2015
Brian Anthony MIT STEX Automation Workshop June 17, 2015
MIT Startup Exchange
 
Big Data Analytics and Open Data
Big Data Analytics and Open Data Big Data Analytics and Open Data
Big Data Analytics and Open Data
Sharjeel Imtiaz
 

What's hot (13)

Data from experiments
Data from experimentsData from experiments
Data from experiments
 
Data Standards and Linked Data: Challenges & Use Cases in Europe and the Unit...
Data Standards and Linked Data: Challenges & Use Cases in Europe and the Unit...Data Standards and Linked Data: Challenges & Use Cases in Europe and the Unit...
Data Standards and Linked Data: Challenges & Use Cases in Europe and the Unit...
 
A Sea of Information
A Sea of InformationA Sea of Information
A Sea of Information
 
Copy of OSTP RFI on Big Data and Privacy
Copy of OSTP RFI on Big Data and PrivacyCopy of OSTP RFI on Big Data and Privacy
Copy of OSTP RFI on Big Data and Privacy
 
Presentasjon
PresentasjonPresentasjon
Presentasjon
 
How to Learn More about Data Journalism by Ron Nixon - Philadelphia NewsTrain...
How to Learn More about Data Journalism by Ron Nixon - Philadelphia NewsTrain...How to Learn More about Data Journalism by Ron Nixon - Philadelphia NewsTrain...
How to Learn More about Data Journalism by Ron Nixon - Philadelphia NewsTrain...
 
Promoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analyticsPromoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analytics
 
MIT Biotech startups
MIT Biotech startupsMIT Biotech startups
MIT Biotech startups
 
Krystyn J. Van Vliet Advanced Manufacturing
Krystyn J. Van Vliet Advanced Manufacturing Krystyn J. Van Vliet Advanced Manufacturing
Krystyn J. Van Vliet Advanced Manufacturing
 
Innovation, KM, and Data.gov
Innovation, KM, and Data.govInnovation, KM, and Data.gov
Innovation, KM, and Data.gov
 
Understanding the Big Data Enterprise
Understanding the Big Data EnterpriseUnderstanding the Big Data Enterprise
Understanding the Big Data Enterprise
 
Brian Anthony MIT STEX Automation Workshop June 17, 2015
Brian Anthony MIT STEX Automation Workshop June 17, 2015Brian Anthony MIT STEX Automation Workshop June 17, 2015
Brian Anthony MIT STEX Automation Workshop June 17, 2015
 
Big Data Analytics and Open Data
Big Data Analytics and Open Data Big Data Analytics and Open Data
Big Data Analytics and Open Data
 

Similar to It's Just Not FAIR

What Is It Going To Cost And What Is In It For Me?
What Is It Going To Cost And What Is In It For Me?What Is It Going To Cost And What Is In It For Me?
What Is It Going To Cost And What Is In It For Me?
Philip Bourne
 
Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?
Philip Bourne
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?
Philip Bourne
 
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
Karthikeyan Umapathy
 
Digital insights impact on strategy
Digital insights impact on strategyDigital insights impact on strategy
Digital insights impact on strategy
Jisc
 
From Data Policy Towards FAIR Data For All: How standardised data policies ca...
From Data Policy Towards FAIR Data For All: How standardised data policies ca...From Data Policy Towards FAIR Data For All: How standardised data policies ca...
From Data Policy Towards FAIR Data For All: How standardised data policies ca...
Rebecca Grant
 
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
Karthikeyan Umapathy
 
Research Data Alliance March 19, 2013
Research Data Alliance March 19, 2013Research Data Alliance March 19, 2013
Research Data Alliance March 19, 2013
Philip Bourne
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
Philip Bourne
 
Towards a Platform for Global Health
Towards a Platform for Global HealthTowards a Platform for Global Health
Towards a Platform for Global Health
Philip Bourne
 
Are Funders and Academic Institutions Approaches to Data Science Aligned
Are Funders and Academic Institutions Approaches to Data Science AlignedAre Funders and Academic Institutions Approaches to Data Science Aligned
Are Funders and Academic Institutions Approaches to Data Science Aligned
Philip Bourne
 
What role can publishers play in the open data ecosystem?
What role can publishers play in the open data ecosystem?What role can publishers play in the open data ecosystem?
What role can publishers play in the open data ecosystem?
Varsha Khodiyar
 
Help Build Impact Indicators May 8 ENSULIB webinar.pdf
Help Build Impact Indicators May 8 ENSULIB webinar.pdfHelp Build Impact Indicators May 8 ENSULIB webinar.pdf
Help Build Impact Indicators May 8 ENSULIB webinar.pdf
Environment, Sustainability and Libraries Section IFLA
 
If Data Are The New Oil, How Do We Prevent Global Warming?
If Data Are The New Oil, How Do We Prevent Global Warming?If Data Are The New Oil, How Do We Prevent Global Warming?
If Data Are The New Oil, How Do We Prevent Global Warming?
Philip Bourne
 
Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approach
Megan O'Donnell
 
Information use in natural habitats: a comparative study of graduates in the ...
Information use in natural habitats: a comparative study of graduates in the ...Information use in natural habitats: a comparative study of graduates in the ...
Information use in natural habitats: a comparative study of graduates in the ...
IL Group (CILIP Information Literacy Group)
 
Information Use in Natural Habitats: A Comparative Study of Graduates in the ...
Information Use in Natural Habitats: A Comparative Study of Graduates in the ...Information Use in Natural Habitats: A Comparative Study of Graduates in the ...
Information Use in Natural Habitats: A Comparative Study of Graduates in the ...
Siobhán Dunne
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
Philip Bourne
 
RDM and FAIR initiatives
RDM and FAIR initiativesRDM and FAIR initiatives
RDM and FAIR initiatives
Sarah Jones
 

Similar to It's Just Not FAIR (20)

What Is It Going To Cost And What Is In It For Me?
What Is It Going To Cost And What Is In It For Me?What Is It Going To Cost And What Is In It For Me?
What Is It Going To Cost And What Is In It For Me?
 
Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?Data Science Meets Academia - What Comes Next?
Data Science Meets Academia - What Comes Next?
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?
 
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
 
Digital insights impact on strategy
Digital insights impact on strategyDigital insights impact on strategy
Digital insights impact on strategy
 
From Data Policy Towards FAIR Data For All: How standardised data policies ca...
From Data Policy Towards FAIR Data For All: How standardised data policies ca...From Data Policy Towards FAIR Data For All: How standardised data policies ca...
From Data Policy Towards FAIR Data For All: How standardised data policies ca...
 
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
 
Research Data Alliance March 19, 2013
Research Data Alliance March 19, 2013Research Data Alliance March 19, 2013
Research Data Alliance March 19, 2013
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
 
Towards a Platform for Global Health
Towards a Platform for Global HealthTowards a Platform for Global Health
Towards a Platform for Global Health
 
Are Funders and Academic Institutions Approaches to Data Science Aligned
Are Funders and Academic Institutions Approaches to Data Science AlignedAre Funders and Academic Institutions Approaches to Data Science Aligned
Are Funders and Academic Institutions Approaches to Data Science Aligned
 
What role can publishers play in the open data ecosystem?
What role can publishers play in the open data ecosystem?What role can publishers play in the open data ecosystem?
What role can publishers play in the open data ecosystem?
 
Help Build Impact Indicators May 8 ENSULIB webinar.pdf
Help Build Impact Indicators May 8 ENSULIB webinar.pdfHelp Build Impact Indicators May 8 ENSULIB webinar.pdf
Help Build Impact Indicators May 8 ENSULIB webinar.pdf
 
If Data Are The New Oil, How Do We Prevent Global Warming?
If Data Are The New Oil, How Do We Prevent Global Warming?If Data Are The New Oil, How Do We Prevent Global Warming?
If Data Are The New Oil, How Do We Prevent Global Warming?
 
Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approach
 
Information use in natural habitats: a comparative study of graduates in the ...
Information use in natural habitats: a comparative study of graduates in the ...Information use in natural habitats: a comparative study of graduates in the ...
Information use in natural habitats: a comparative study of graduates in the ...
 
Information Use in Natural Habitats: A Comparative Study of Graduates in the ...
Information Use in Natural Habitats: A Comparative Study of Graduates in the ...Information Use in Natural Habitats: A Comparative Study of Graduates in the ...
Information Use in Natural Habitats: A Comparative Study of Graduates in the ...
 
Research Data Management: Policy Development
Research Data Management: Policy DevelopmentResearch Data Management: Policy Development
Research Data Management: Policy Development
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
RDM and FAIR initiatives
RDM and FAIR initiativesRDM and FAIR initiatives
RDM and FAIR initiatives
 

More from Philip Bourne

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
Philip Bourne
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
Philip Bourne
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
Philip Bourne
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
Philip Bourne
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
Philip Bourne
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
Philip Bourne
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
Philip Bourne
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
Philip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
Philip Bourne
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
Philip Bourne
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
Philip Bourne
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
Philip Bourne
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
Philip Bourne
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
Philip Bourne
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
Philip Bourne
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
Philip Bourne
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
Philip Bourne
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
Philip Bourne
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
Philip Bourne
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
Philip Bourne
 

More from Philip Bourne (20)

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
 

Recently uploaded

1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 

Recently uploaded (20)

1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 

It's Just Not FAIR

  • 1. It’s Just Not FAIR Philip E. Bourne PhD, FACMI Stephenson Chair of Data Science Director, Data Science Institute Professor of Biomedical Engineering peb6a@virginia.edu https://www.slideshare.net/pebourne July 23, 2019 ISMB 2019 1 @pebourne
  • 2. My Role Here • Co-developer of the RCSB PDB • First President of FORCE11 • An author of the FAIR Principles • Chair of Data Policy Committee at PLOS • Acting Dean of a proposed school that will reward open • What follows are my own opinions & I can’t contribute much to the technology discussion Printer's Device of Johannes Froben https://commons.wikimedia.org/wiki/File:Printer%27s_Device_of_Johannes_Froben.jpg July 23, 2019 ISMB 2019 2
  • 3. Probability of finding the data associated with a paper declined by 17% every year Vines, Timothy et al. “The Availability of Research Data Declines Rapidly with Article Age.” Current Biology (June 1, 2014) Image: Nature doi:10.1038/nature.2013.14416 Just One Motivator – Data Availability Declines Over Time ALMOST ALL DATA LOST 10-15 YRS AFTER PUBLICATION From Emma Ganley @ PLOS
  • 4. July 23, 2019 4 In 2005 I Had a Dream 0. Paper is but one view 1. User clicks on thumbnail 2. FAIR data provide a rendered image that can be annotated 3. Selecting a features provides a database/literature mashup 4. That leads to new papers 1. A link brings up figures from the paper 0. Full text of PLoS papers stored in a database 2. Clicking the paper figure retrieves data from the PDB which is analyzed 3. A composite view of journal and database content results 4. The composite view has links to pertinent blocks of literature text and back to the PDB 1. 2. 3. 4. PLoS Comp. Biol. 2005 1(3) e34 ISMB 2019
  • 5. Why has the dream not been realized? FAIR is like broccoli … You know it is good for you, but no one wants to eat it By Fir0002 - Own work, GFDL 1.2, https://commons.wikimedia.org/w/index.php?curid=5772317 The incentives are not there https://www.wideopeneats.com/recipes/broccoli-topped-cheese-sauce/July 23, 2019 ISMB 2019 5
  • 6. What is the secret sauce ….. July 23, 2019 ISMB 2019 6
  • 7. Need to Impact All Aspects of the Research Lifecycle Publishershttps://www.vertigoventures.com/lesson/embedding-impact/impact-research-life-cycle/ Funders Academic Institutions Publishers July 23, 2019 ISMB 2019 7
  • 8. The role of institutions … The secret sauce is they know they need to change, but change is hard… Needs to happen slowly with exemplars … Data science initiatives represent those exemplars July 23, 2019 ISMB 2019 8
  • 9. One institution with an important opportunity July 23, 2019 9 We would not exist if not for open data ISMB 2019
  • 10. We Need to Change the Institutional Culture Surrounding Data • We need use cases of “eat your own dog food” to show value • We need to embrace the institutional libraries role as one beyond data preservation to that of curator & analyst • We need to reward reproducible science and open science where data plays a major role: • Part of the faculty/staff handbook • Part of the hiring process • Part of the promotion process • We need better data governance July 23, 2019 10ISMB 2019
  • 11. We need the institutional infrastructure for data … July 23, 2019 11 https://blog.lexicata.com/wp-content/uploads/2015/03/platform-model- 750x410.png We need to move from pipes to platforms ISMB 2019
  • 12. What is it Going to Cost and What is in it for Me? July 23, 2019 ISMB 2019 12
  • 13. We Need a Realistic Business Model • Tuition • Students use and reuse data and hence should pay for that quality data • Federal Funding • It’s a part of the solution, but not the whole solution, it will not scale • Philanthropy • Most philanthropists are not aware of the importance of data in what they give money to support – Advancement offices need to be educated first • Public Private Partnership • Funding agencies should encourage this – it is more than SBIRs – witness capstones July 23, 2019 13ISMB 2019
  • 14. We Are Not Alone Data Science Offerings at USA Research Universities (n=116) July 23, 2019 14 2019 N> 160 ISMB 2019
  • 15. Need to Impact All Aspects of the Research Lifecycle Publishershttps://www.vertigoventures.com/lesson/embedding-impact/impact-research-life-cycle/ Funders Academic Institutions Publishers July 23, 2019 ISMB 2019 15
  • 16. PLOS Data Policy • PLOS journals require authors to make all data underlying the findings described in their manuscript fully available without restriction at the time of publication. When specific legal or ethical requirements prohibit public sharing of a dataset, authors must indicate how researchers may obtain access to the data. • When submitting a manuscript, authors must provide a Data Availability Statement describing compliance with PLOS's policy. If the article is accepted for publication, the data availability statement will be published as part of the accepted article. • Reliance on resources whose sustainability is unknown • Who checks? July 23, 2019 ISMB 2019 16
  • 17. >100,000 papers published with a data statement at PLOS <0.1% of submissions rejected due to authors’ unwillingness or inability to share data ~20% of submissions use data repositories From Emma Ganley @ PLOS
  • 18. Natural Tension & Resistance https://commons.wikimedia.org/wiki/File:Water_surface_tension_2.jpg
  • 19. Challenges • Research areas such as clinical studies require more complex data sharing considerations and data release mechanisms. Community input is important for policy implementation • Data citations and mechanisms to provide author credit need to see a stronger uptake • Metadata of published data sets is often lacking, needs community-agreed standards • It is not always clear what constitutes compliance From Emma Ganley @ PLOS
  • 20. Related PLOS Efforts Beyond Data • Moving towards a software sharing policy • Working a pilot with CodeOcean to include executable code in publications • Notion of reproducible models • Emphasizing benchmarking section to show the value of data July 23, 2019 ISMB 2019 20
  • 21. Summary - Why Has FAIR (Let Alone My Dream) Not Been Realized (Optimistic View)? • It starts with funder incentives – things are looking up • Institutions have a role to play – the value of data is slowly being realized • Small publishers like PLOS have limited leverage - publishers need to coordinate July 23, 2019 ISMB 2019 21

Editor's Notes

  1. Study by Tim Vines Probability of finding the data declined by 17% every year End result – almost all data lost 10-15 yrs after publication
  2. 4
  3. There is a natural tension for scientists and researchers who are busy and already tend to work long hrs They want recognition for their work, and the current currency is published manuscripts They don’t want to do things that they perceive as unnecessary extras – providing more info, adhering to reporting guidelines, gathering all metadata, doing something sensible with data somewhere etc. https://commons.wikimedia.org/wiki/File:Water_surface_tension_2.jpg By Kaldari (Own work) [Public domain], via Wikimedia Commons
  4. 22