In the Future Will a Biological
Database Really be Different than a
Biological Journal?
Philip E. Bourne PhD
pbourne@ucsd.edu

iDASH October 18, 2013

I am speaking to you today as
someone who...
• Maintains a major biological database – the
PDB – used by over 300,000 scientists per
month
• Is the Founding Editor in Chief of PLOS
Computational Biology

A Question First Posed in August 2005

PLOS Comp Biol 2005 1(3): e34

Here is one reason why the question is
important….

The Paper As Experiment
0. Full text of PLoS papers stored in a database
1. A link brings up figures from the paper
2. Clicking the paper figure retrieves data from the PDB, which is analyzed
3. A composite view of journal and database content results
4. The composite view has links to pertinent blocks of literature text and back to the PDB

1. User clicks on thumbnail
2. Metadata and a webservices call provide a renderable image that can be annotated
3. Selecting a feature provides a database/literature mashup
4. That leads to new papers

PLoS Comp. Biol. 2005 1(3) e34

The answer 8 years ago, as it is now, is…

In principle there is no difference, but
the way in which each is perceived is
still very different…
Yet progress has been made and we
will focus on what we can do to further
accelerate change

Why Bother?
Better integration of data and the
knowledge derived from it can
accelerate discovery and improve the
comprehension and dissemination of
science

Let's take a step back...
What got me thinking this way?

Data Are Becoming More Complex:
Witness The World Wide Protein Data Bank

http://www.wwpdb.org

• The single worldwide
repository for data on
the structure of
biological
macromolecules
• Vital for drug
discovery and the life
sciences
• 43 years old
• Free to all

The World Wide Protein Data Bank
Places High Value on Data
• Paper not published
unless data are
deposited – strong
data to literature
correspondence
• Highly structured data
conforming to an
extensive ontology
• DOIs assigned to
every structure
http://www.wwpdb.org

The PLoS Corpus
• Established in 2000
• Identified as
high-quality publications
• Currently 8 journals
with healthy growth
• Open Access – free to
all
• PLOS ONE a huge
success

Similar Processes Lead to Similar Resources
Journal                               Database
Author Submission via the Web         Depositor Submission via the Web
Syntax Checking                       Syntax Checking
Review by Scientists & Editors        Review by Annotators
Corrections by Author                 Corrections by Depositor
Publish – Web Accessible              Release – Web Accessible

The scientific processes for handling data
and publications are not that
different, but the end product is
perceived very differently

Unfortunately the Metrics of
Success Remain…

[Carole Goble]

This makes no sense when you ask
yourself the question:
What is more valuable: a dataset used
and cited by 100 scientists, or a paper
you wrote that only you cite?
Case in point…

What can you do today to change the
situation?

Think Globally Act Locally
• Support emergent community commons/portals
• Be involved in the support and development of
metadata standards
• Contribute to workflow development etc. to drive
an open research lifecycle
• Educate your mentors on the importance of
open science and scholarly communication
• Write software thinking of an App model

Pressure Your Institutions to Play a
Greater Role
• We need institutional data/knowledge sharing
plans
• We need digital universities
• We need data/information scientists to be
better recognized by institutions – it's not all
about papers – this implies new metrics

Committee on Academic
Promotions
• What Counts
– Money
– Grants
– Papers
– Teaching
– Service

• What Does Not
– Sharing data
– Sharing software
– Open access
– Collaboration
– Patents
– Startups

Ten Simple Rules for Getting Ahead as a Computational Biologist in Academia
2011 PLOS Comp Biol 7(1) e1002001

We Need to Bend the Traditional System
The Wikipedia Experiment – Topic Pages

• Identify areas of Wikipedia that
relate to the journal that are
missing or stubs
• Develop a Wikipedia page in the
sandbox
• Have a Topic Page Editor review
the page
• Publish the copy of record with
associated rewards
• Release the living version into
Wikipedia

We Need Innovative Contributions
to the Research Lifecycle
• Authoring Tools
• Data Capture
• Lab Notebooks
• Software Repositories
• Analysis Tools
• Scholarly Communication
• Visualization

IDEAS – HYPOTHESES – EXPERIMENTS – DATA – ANALYSIS – COMPREHENSION – DISSEMINATION

• Commercial & Public Tools
• Discipline-Based Metadata Standards
• Community Portals
• Git-like Resources By Discipline
• Data Journals
• New Reward Systems
• Training
• Institutional Repositories
• Commercial Repositories

Example Interoperability: The Database View
www.rcsb.org/pdb/explore/literature.do?structureId=1TIM

BMC Bioinformatics 2010 11:220
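
The literature view above is addressed by a structure identifier in the query string. As a minimal sketch, assuming the URL pattern shown on this slide (which dates from 2013 and may no longer resolve) and an `http` scheme, a link from a paper to the database view could be built as follows. The helper name `literature_url` is hypothetical and not part of any RCSB library.

```python
from urllib.parse import urlencode

# Base path copied from the slide; the http scheme is an assumption.
LITERATURE_VIEW = "http://www.rcsb.org/pdb/explore/literature.do"


def literature_url(structure_id: str) -> str:
    """Build the literature-view link for one PDB entry (hypothetical helper)."""
    pdb_id = structure_id.strip().upper()
    # Classic PDB identifiers are four alphanumeric characters, e.g. 1TIM.
    if len(pdb_id) != 4 or not pdb_id.isalnum():
        raise ValueError(f"not a 4-character PDB ID: {structure_id!r}")
    query = urlencode({"structureId": pdb_id})
    return LITERATURE_VIEW + "?" + query


print(literature_url("1tim"))
```

A journal page could store only the identifier alongside a figure and generate the link on demand, which keeps the paper and the database entry loosely coupled.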

This is asking a lot of us, but our job is
being made easier by what is going on
around us

Open Access to Data and the
Literature is no Longer a Curiosity, but
Mainstream

Conservative Bodies Are Recognizing
Change
• Anyone, anything, anytime
• publication access, data, models, source codes, resources, transparent methods, standards, formats, identifiers, APIs, licenses, education, policies

• “accessible, intelligible,
assessable, reusable”
[Carole Goble]

http://royalsociety.org/policy/projects/science-public-enterprise/report/

Governments Are Recognizing Change
G8 Open Data Charter

http://opensource.com/government/13/7/open-data-charter-g8

Funding Agencies are Changing

Publishing is Changing
• Today:
• Approx 10,000 publishers
• Publishing approx 25,000 journals
• Which publish approx 1.5 million articles per year
(almost 1 million of which appear in PubMed)
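
To put those approximate figures in proportion, the averages they imply can be worked out directly. This is only back-of-the-envelope arithmetic on the rounded numbers from the slide, and the variable names are illustrative.

```python
# Rounded figures quoted on the slide (all approximate).
publishers = 10_000
journals = 25_000
articles_per_year = 1_500_000
pubmed_articles_per_year = 1_000_000  # the slide says "almost 1 million"

# Simple averages implied by the rounded totals.
journals_per_publisher = journals / publishers
articles_per_journal = articles_per_year / journals
pubmed_share = pubmed_articles_per_year / articles_per_year

print(f"{journals_per_publisher:.1f} journals per publisher on average")
print(f"{articles_per_journal:.0f} articles per journal per year on average")
print(f"about {pubmed_share:.0%} of articles appear in PubMed")
```

Averages like these hide a very skewed distribution, which is the point of the mega-journal discussion that follows: a single journal can publish far more than the mean.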

Witness the ‘Open Access Mega
Journal’
1. Very very large
– Publishing thousands of articles per year
– and benefiting from economies of scale

2. Open Access
– Because no one will pay a subscription fee for a journal that
large (and growing that fast)
– and using an OA Business Model where each article pays for its
own costs

3. (Preferably) without any ‘artificial’ constraints on
its ability to grow
– For example, a desire to only publish ‘high impact’ papers
[Pete Binfield]

[Figure: chart of publications by PLoS ONE per quarter since launch, Q1 2007 to Q2 2011; vertical axis 0 to 3500]
[Pete Binfield]

“Open Access Mega Journals”
– One Name, Two Flavours
• ‘Clones’ of PLoS ONE (not selective)
– SAGE Open
– BMJ Open
– Scientific Reports (Nature)
– AIP Advances (Am Inst Physics)
– G3 (Genetics Soc of America)
– Biology Open (Company of Biologists)

• ‘Pseudo-Clones’ of PLoS ONE (probably selective)
– Physical Review X (Am Physical Society)
– Open Biology (Royal Society)
– Cell Reports (Elsevier, Cell Press)
[Pete Binfield]

Attitudes are Changing

datasets
data collections
algorithms
configurations
tools and apps
codes
workflows
scripts
code libraries
services
system software
infrastructure
compilers
hardware
[Carole Goble]

“An article about computational
science in a scientific publication
is not the scholarship itself, it is
merely advertising of the
scholarship. The actual
scholarship is the complete
software development
environment, [the complete data]
and the complete set of
instructions which generated the
figures.”
David Donoho, “Wavelab and Reproducible
Research,” 1995

Morin et al Shining Light into Black Boxes
Science 13 April 2012: 336(6078) 159-160

Ince et al The case for open computer
programs, Nature 482, 2012

Flaws Are Becoming More
Obvious

Out of 18 microarray papers, results
from 10 could not be reproduced

More retractions:
>15X increase in last decade
At the current rate of increase, by 2045 as many papers will be retracted as published
[Carole Goble]

1. Ioannidis et al., 2009. Repeatability of published microarray gene expression analyses. Nature Genetics 41: 14
2. Science publishing: The trouble with retractions http://www.nature.com/news/2011/111005/full/478026a.html
3. Bjorn Brembs: Open Access and the looming crisis in science https://theconversation.com/open-access-and-the-looming-crisis-in-science-14950

Science is Being Deinstitutionalized

Daniel Hulshizer/Associated Press

In Summary
• Question (2005): In the Future Will a Biological
Database Really be Different than a Biological
Journal?
• Answer:
– Less different than they were in 2005
– We still have a long way to go to improve science
– Change is accelerating
– What one does on a daily basis as a scholar is very
different from when I was in graduate school and it
will be very different again

pbourne@ucsd.edu

Questions?

More Related Content

What's hot

Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey BoultonLEARN Project
 
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Jisc
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13DataDryad
 
The Landscape of Research Data Management
The Landscape of Research Data Management The Landscape of Research Data Management
The Landscape of Research Data Management Alastair Dunning
 
Responsible metrics for research - Jisc Digifest 2016
Responsible metrics for research - Jisc Digifest 2016Responsible metrics for research - Jisc Digifest 2016
Responsible metrics for research - Jisc Digifest 2016Jisc
 
Winning the Tour de France, Research Data and Data Stewardship
Winning the Tour de France, Research Data and Data StewardshipWinning the Tour de France, Research Data and Data Stewardship
Winning the Tour de France, Research Data and Data StewardshipAlastair Dunning
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data ThingsKatina Toufexis
 
Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)Katina Toufexis
 
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...OAbooks
 
Introduction to Research Data Management at UWA
Introduction to Research Data Management at UWAIntroduction to Research Data Management at UWA
Introduction to Research Data Management at UWAKatina Toufexis
 
THOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierTHOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierMaaike Duine
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Anita de Waard
 
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challengeScott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challengeGigaScience, BGI Hong Kong
 
Implementing Archivematica, research data network
Implementing Archivematica, research data networkImplementing Archivematica, research data network
Implementing Archivematica, research data networkJisc RDM
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) CommonsJames Hendler
 
Research Data Alliance Plenary 9: DDRI Working Group Session
Research Data Alliance Plenary 9: DDRI Working Group SessionResearch Data Alliance Plenary 9: DDRI Working Group Session
Research Data Alliance Plenary 9: DDRI Working Group Sessionamiraryani
 
Research Information, Networking, and Collaboration - strategy and challenges
Research Information, Networking, and Collaboration - strategy and challengesResearch Information, Networking, and Collaboration - strategy and challenges
Research Information, Networking, and Collaboration - strategy and challengesLibrary_Connect
 
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM ToolkitLEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM ToolkitLEARN Project
 

What's hot (19)

Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey Boulton
 
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
 
The Landscape of Research Data Management
The Landscape of Research Data Management The Landscape of Research Data Management
The Landscape of Research Data Management
 
Responsible metrics for research - Jisc Digifest 2016
Responsible metrics for research - Jisc Digifest 2016Responsible metrics for research - Jisc Digifest 2016
Responsible metrics for research - Jisc Digifest 2016
 
Winning the Tour de France, Research Data and Data Stewardship
Winning the Tour de France, Research Data and Data StewardshipWinning the Tour de France, Research Data and Data Stewardship
Winning the Tour de France, Research Data and Data Stewardship
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data Things
 
Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)
 
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
 
Introduction to Research Data Management at UWA
Introduction to Research Data Management at UWAIntroduction to Research Data Management at UWA
Introduction to Research Data Management at UWA
 
THOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierTHOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing Elsevier
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challengeScott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
 
Implementing Archivematica, research data network
Implementing Archivematica, research data networkImplementing Archivematica, research data network
Implementing Archivematica, research data network
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
Research Data Alliance Plenary 9: DDRI Working Group Session
Research Data Alliance Plenary 9: DDRI Working Group SessionResearch Data Alliance Plenary 9: DDRI Working Group Session
Research Data Alliance Plenary 9: DDRI Working Group Session
 
Research Information, Networking, and Collaboration - strategy and challenges
Research Information, Networking, and Collaboration - strategy and challengesResearch Information, Networking, and Collaboration - strategy and challenges
Research Information, Networking, and Collaboration - strategy and challenges
 
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM ToolkitLEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
 

Similar to Is a Biological Database Really Different than a Biological Journal?

What Bioinformaticians Need to Know About Digital Publishing Beyond the PDF
What Bioinformaticians Need to Know About Digital Publishing Beyond the PDFWhat Bioinformaticians Need to Know About Digital Publishing Beyond the PDF
What Bioinformaticians Need to Know About Digital Publishing Beyond the PDFPhilip Bourne
 
Public engagement while you sleep
Public engagement while you sleepPublic engagement while you sleep
Public engagement while you sleepUoLResearchSupport
 
Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...UoLResearchSupport
 
Public engagement while you sleep
Public engagement while you sleep Public engagement while you sleep
Public engagement while you sleep Kirsten Thompson
 
Ucsd library10182010
Ucsd library10182010Ucsd library10182010
Ucsd library10182010Philip Bourne
 
Presentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical SocietyPresentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical Societyosimod
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Susanna-Assunta Sansone
 
Open Science: Where Theory Meets Practice
Open Science: Where Theory Meets PracticeOpen Science: Where Theory Meets Practice
Open Science: Where Theory Meets PracticePhilip Bourne
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reusevoginip
 
Data are the new black : Susan Robbins
Data are the new black : Susan RobbinsData are the new black : Susan Robbins
Data are the new black : Susan Robbinstherese nolan-brown
 
Science20brussels osimo april2013
Science20brussels osimo april2013Science20brussels osimo april2013
Science20brussels osimo april2013osimod
 
UCSD Library Presentation 10182010
UCSD Library Presentation 10182010UCSD Library Presentation 10182010
UCSD Library Presentation 10182010Philip Bourne
 
Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince Smith
 
Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Maryann Martone
 
Institutional Repositories
Institutional RepositoriesInstitutional Repositories
Institutional RepositoriesSridhar Gutam
 
Digital Resources for Open Science
Digital Resources for Open ScienceDigital Resources for Open Science
Digital Resources for Open ScienceMartin Donnelly
 
Birgit Schmidt: RDA for Libraries from an International Perspective
Birgit Schmidt: RDA for Libraries from an International PerspectiveBirgit Schmidt: RDA for Libraries from an International Perspective
Birgit Schmidt: RDA for Libraries from an International Perspectivedri_ireland
 
How practising open research can benefit you
How practising open research can benefit youHow practising open research can benefit you
How practising open research can benefit youUoLResearchSupport
 
Open Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesOpen Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesMartin Donnelly
 

Similar to Is a Biological Database Really Different than a Biological Journal? (20)

The Era of Open
The Era of OpenThe Era of Open
The Era of Open
 
What Bioinformaticians Need to Know About Digital Publishing Beyond the PDF
What Bioinformaticians Need to Know About Digital Publishing Beyond the PDFWhat Bioinformaticians Need to Know About Digital Publishing Beyond the PDF
What Bioinformaticians Need to Know About Digital Publishing Beyond the PDF
 
Public engagement while you sleep
Public engagement while you sleepPublic engagement while you sleep
Public engagement while you sleep
 
Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...
 
Public engagement while you sleep
Public engagement while you sleep Public engagement while you sleep
Public engagement while you sleep
 
Ucsd library10182010
Ucsd library10182010Ucsd library10182010
Ucsd library10182010
 
Presentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical SocietyPresentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical Society
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014
 
Open Science: Where Theory Meets Practice
Open Science: Where Theory Meets PracticeOpen Science: Where Theory Meets Practice
Open Science: Where Theory Meets Practice
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reuse
 
Data are the new black : Susan Robbins
Data are the new black : Susan RobbinsData are the new black : Susan Robbins
Data are the new black : Susan Robbins
 
Science20brussels osimo april2013
Science20brussels osimo april2013Science20brussels osimo april2013
Science20brussels osimo april2013
 
UCSD Library Presentation 10182010
UCSD Library Presentation 10182010UCSD Library Presentation 10182010
UCSD Library Presentation 10182010
 
Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notext
 
Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11
 
Institutional Repositories
Institutional RepositoriesInstitutional Repositories
Institutional Repositories
 
Digital Resources for Open Science
Digital Resources for Open ScienceDigital Resources for Open Science
Digital Resources for Open Science
 
Birgit Schmidt: RDA for Libraries from an International Perspective
Birgit Schmidt: RDA for Libraries from an International PerspectiveBirgit Schmidt: RDA for Libraries from an International Perspective
Birgit Schmidt: RDA for Libraries from an International Perspective
 
How practising open research can benefit you
How practising open research can benefit youHow practising open research can benefit you
How practising open research can benefit you
 
Open Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesOpen Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practices
 

More from Philip Bourne

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationPhilip Bourne
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingPhilip Bourne
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityPhilip Bourne
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?Philip Bourne
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug DiscoveryPhilip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AlonePhilip Bourne
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchPhilip Bourne
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data SciencePhilip Bourne
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewPhilip Bourne
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptxPhilip Bourne
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Philip Bourne
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision EducationPhilip Bourne
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Philip Bourne
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Philip Bourne
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance SustainabilityPhilip Bourne
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesPhilip Bourne
 

Is a Biological Database Really Different than a Biological Journal?

  • 1. In the Future Will a Biological Database Really be Different than a Biological Journal? Philip E. Bourne PhD pbourne@ucsd.edu iDASH October 18, 2013
  • 2. I am speaking to you today as someone who: • Maintains a major biological database, the PDB, used by over 300,000 scientists per month • Is the Founding Editor in Chief of PLOS Computational Biology
  • 3. A Question First Posed in August 2005 PLOS Comp Biol 2005 1(3): e34
  • 4. Here is one reason why the question is important…
  • 5. The Paper As Experiment. Figure labels: 0. Full text of PLoS papers stored in a database 1. A link brings up figures from the paper 2. Clicking the paper figure retrieves data from the PDB, which is analyzed 3. A composite view of journal and database content results 4. The composite view has links to pertinent blocks of literature text and back to the PDB. User steps: 1. User clicks on thumbnail 2. Metadata and a web services call provide a renderable image that can be annotated 3. Selecting a feature provides a database/literature mashup 4. That leads to new papers. PLoS Comp. Biol. 2005 1(3) e34
  • 6. The answer 8 years ago, as it is now, is… In principle there is no difference, but the way in which each is perceived is still very different… Yet progress has been made, and we will focus on what we can do to further accelerate change
  • 7. Why Bother? Better integration of data and the knowledge derived from it can accelerate discovery and improve the comprehension and dissemination of science
  • 8. Let's take a step back... What got me thinking this way?
  • 9. Data Are Becoming More Complex: Witness The World Wide Protein Data Bank http://www.wwpdb.org • The single worldwide repository for data on the structure of biological macromolecules • Vital for drug discovery and the life sciences • 43 years old • Free to all
  • 10. The World Wide Protein Data Bank Places High Value on Data • Paper not published unless data are deposited (strong data to literature correspondence) • Highly structured data conforming to an extensive ontology • DOIs assigned to every structure http://www.wwpdb.org
  • 11. The PLoS Corpus • Established in 2000 • Identified as high-quality publications • Currently 8 journals with healthy growth • Open Access, free to all • PLOS ONE a huge success
  • 12. Similar Processes Lead to Similar Resources. Journal: Author Submission via the Web → Syntax Checking → Review by Scientists & Editors → Corrections by Author → Publish, Web Accessible. Database: Depositor Submission via the Web → Syntax Checking → Review by Annotators → Corrections by Depositor → Release, Web Accessible
  • 13. The scientific processes for handling data and publications are not that different, but the end products are perceived very differently
  • 14. Unfortunately the Metrics of Success Remain… [Carole Goble]
  • 15. This makes no sense when you ask yourself the question: What is more valuable, a dataset used and cited by 100 scientists or a paper you wrote that only you cite? Case in point…
  • 16. What can you do today to change the situation?
  • 17. Think Globally, Act Locally • Support emergent community commons/portals • Be involved in the support and development of metadata standards • Contribute to workflow development etc. to drive an open research lifecycle • Educate your mentors on the importance of open science and scholarly communication • Write software thinking of an App model
  • 18. Pressure Your Institutions to Play a Greater Role • We need institutional data/knowledge sharing plans • We need digital universities • We need data/information scientists to be better recognized by institutions; it's not all about papers, and this implies new metrics
  • 19. Committee on Academic Promotions • What Counts: Money, Grants, Papers, Teaching, Service • What Does Not: Sharing data, Sharing software, Open access, Collaboration, Patents, Startups. Ten Simple Rules for Getting Ahead as a Computational Biologist in Academia 2011 PLOS Comp Biol 7(1) e1002001
  • 20. We Need to Bend the Traditional System. The Wikipedia Experiment: Topic Pages • Identify areas of Wikipedia that relate to the journal that are missing or stubs • Develop a Wikipedia page in the sandbox • Have a Topic Page Editor review the page • Publish the copy of record with associated rewards • Release the living version into Wikipedia
  • 21. We Need Innovative Contributions to the Research Lifecycle Authoring Tools Data Capture Lab Notebooks Software Repositories Analysis Tools Scholarly Communication Visualization IDEAS – HYPOTHESES – EXPERIMENTS – DATA – ANALYSIS – COMPREHENSION – DISSEMINATION Commercial & Public Tools Discipline-Based Metadata Standards Community Portals Git-like Resources By Discipline Data Journals New Reward Systems Training Institutional Repositories Commercial Repositories
  • 22. We Need Innovative Contributions to the Research Lifecycle Authoring Tools Data Capture Lab Notebooks Software Repositories Analysis Tools Scholarly Communication Visualization IDEAS – HYPOTHESES – EXPERIMENTS – DATA – ANALYSIS – COMPREHENSION – DISSEMINATION Commercial & Public Tools Discipline-Based Metadata Standards Community Portals Git-like Resources By Discipline Data Journals New Reward Systems Training Institutional Repositories Commercial Repositories
  • 23. Example Interoperability: The Database View www.rcsb.org/pdb/explore/literature.do?structureId=1TIM BMC Bioinformatics 2010 11:220
  • 24. This is asking a lot of us, but our job is being made easier by what is going on around us
  • 25. Open Access to Data and the Literature is no Longer a Curiosity, but Mainstream
  • 26. Conservative Bodies Are Recognizing Change • Anyone, anything, anytime • Publication access, data, models, source codes, resources, transparent methods, standards, formats, identifiers, APIs, licenses, education, policies • “accessible, intelligible, assessable, reusable” [Carole Goble] http://royalsociety.org/policy/projects/science-public-enterprise/report/
  • 27. Governments Are Recognizing Change G8 Open Data Charter http://opensource.com/government/13/7/open-data-charter-g8
  • 28. Funding Agencies are Changing
  • 29. Publishing is Changing • Today: • Approx 10,000 publishers • Publishing approx 25,000 journals • Which publish approx 1.5 million articles per year (almost 1 million of which appear in PubMed)
  • 30. Witness the ‘Open Access Mega Journal’ 1. Very, very large: publishing thousands of articles per year and benefiting from economies of scale 2. Open Access: because no one will pay a subscription fee for a journal that large (and growing that fast), and using an OA business model where each article pays for its own costs 3. (Preferably) without any ‘artificial’ constraints on its ability to grow: for example, a desire to only publish ‘high impact’ papers [Pete Binfield]
  • 31. [Chart: Publications by PLoS ONE per quarter since launch, Q1 2007 to Q2 2011; vertical axis 0 to 3500] [Pete Binfield]
  • 32. “Open Access Mega Journals”: One Name, Two Flavours • ‘Clones’ of PLoS ONE (not selective): SAGE Open; BMJ Open; Scientific Reports (Nature); AIP Advances (Am Inst Physics); G3 (Genetics Soc of America); Biology Open (Company of Biologists) • ‘Pseudo-Clones’ of PLoS ONE (probably selective): Physical Review X (Am Physical Society); Open Biology (Royal Society); Cell Reports (Elsevier, Cell Press) [Pete Binfield]
  • 33. Attitudes are Changing: datasets, data collections, algorithms, configurations, tools and apps, codes, workflows, scripts, code libraries, services, system software, infrastructure, compilers, hardware [Carole Goble] “An article about computational science in a scientific publication is not the scholarship itself, it is merely advertising of the scholarship. The actual scholarship is the complete software development environment, [the complete data] and the complete set of instructions which generated the figures.” David Donoho, “Wavelab and Reproducible Research,” 1995. Morin et al., Shining Light into Black Boxes, Science 13 April 2012: 336(6078) 159-160. Ince et al., The case for open computer programs, Nature 482, 2012
  • 34. Flaws Are Becoming More Obvious • Out of 18 microarray papers, results from 10 could not be reproduced • More retractions: >15X increase in last decade • At the current rate of increase, by 2045 as many papers will be retracted as published [Carole Goble] 1. Ioannidis et al., 2009. Repeatability of published microarray gene expression analyses. Nature Genetics 41: 14 2. Science publishing: The trouble with retractions http://www.nature.com/news/2011/111005/full/478026a.html 3. Bjorn Brembs: Open Access and the looming crisis in science https://theconversation.com/open-access-and-the-looming-crisis-in-science-14950
  • 35. Science is Being Deinstitutionalized [Photo: Daniel Hulshizer/Associated Press]
  • 36. Science is Being Deinstitutionalized [Photo: Daniel Hulshizer/Associated Press]
  • 37. In Summary • Question (2005): In the Future Will a Biological Database Really be Different than a Biological Journal? • Answer: Less different than they were in 2005; We still have a long way to go to improve science; Change is accelerating; What one does on a daily basis as a scholar is very different from when I was in graduate school, and it will be very different again