ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do open science and who gets the credit?

results may vary
reproducibility, open science
and all that jazz
Professor Carole Goble
The University of Manchester, UK
carole.goble@manchester.ac.uk
@caroleannegoble
Keynote ISMB/ECCB 2013 Berlin, Germany, 23 July 2013

“knowledge turning”
New Insight

• life sciences
• systems biology
• translational
medicine
• biodiversity
• chemistry
• heliophysics
• astronomy
• social science
• digital libraries
• language analysis

[Josh Sommer, Chordoma Foundation]

Goble et al Communications in Computer and Information Science 348, 2013

automate: workflows,
pipeline & service
integrative frameworks

scientific
software
engineering

CS
SE

pool, share &
collaborate
web systems

semantics & ontologies
machine readable documentation

nanopub

coordinated execution of services,
codes, resources
transparent, step-wise methods
auto documentation, logging
reuse variants

http://www.seek4science.org
store/organise/link
data, models, sops,
experiments,
publications

explore/annotate
data, models, sops

yellow pages, find
peers and experts

open and controlled curation & data
pooling & credit
mgt support

catalogue and gateway to
local and public resources
APIs

simulate models

governance & policies

reproducibility
a principle of the
scientific method
separates scientists
from other researchers
and normal people

http://xkcd.com/242/

datasets
data collections
algorithms
configurations
tools and apps
codes
workflows
scripts
code libraries
services,
system software
infrastructure,
compilers
hardware

“An article about
computational science in a
scientific publication is not the
scholarship itself, it is merely
advertising of the scholarship.
The actual scholarship is the
complete software
development environment,
[the complete data] and the
complete set of instructions
which generated the figures.”
David Donoho, “Wavelab and
Reproducible Research,” 1995

Morin et al Shining Light into Black Boxes
Science 13 April 2012: 336(6078) 159-160

Ince et al The case for open computer
programs, Nature 482, 2012

• Workshop Track (WK03) What
Bioinformaticians need to know
about digital publishing beyond the
PDF
• Workshop Track (WK02):
Bioinformatics Cores Workshop,
• ICSB Public Policy Statement on
Access to Data

hope over experience
“an experiment is reproducible until
another laboratory tries to repeat it.”
Alexander Kohn

even computational ones

hand-wringing,
weeping, wailing,
gnashing of teeth.
Nature checklist.
Science
requirements for
data and code
availability.
attacks on authors,
editors, reviewers,
publishers, funders,
and just about
everyone.
http://www.nature.com/nature/focus/reproducibility/index.html

47/53 “landmark” publications
could not be replicated
[Begley, Ellis Nature, 483, 2012]

Nekrutenko & Taylor, Next-generation sequencing data interpretation:
enhancing, reproducibility and accessibility, Nature Genetics 13 (2012)

59% of papers in the 50 highest-IF journals comply with
(often weak) data sharing rules.
Alsheikh-Ali et al Public Availability of Published Research Data in High-Impact Journals.
PLoS ONE 6(9) 2011

170 journals, 2011-2012
Required as condition of publication
Required but may not affect decisions
Explicitly encouraged may be reviewed
and/or hosted
Implied
No mention

Required as condition of publication
Required but may not affect decisions
Explicitly encouraged may be reviewed
and/or hosted
Implied
No mention

Stodden V, Guo P, Ma Z (2013) Toward Reproducible Computational
Research: An Empirical Analysis of Data and Code Policy Adoption by
Journals. PLoS ONE 8(6): e67111. doi:10.1371/journal.pone.0067111

replication gap
Out of 18 microarray papers, results
Out of 18 microarray papers, results
from 10 could not be reproduced
from 10 could not be reproduced

More retractions:
>15X increase in last decade
At current % > by 2045 as many papers published as
retracted
1. Ioannidis et al., 2009. Repeatability of published microarray gene expression analyses. Nature Genetics 41: 14
2. Science publishing: The trouble with retractions http://www.nature.com/news/2011/111005/full/478026a.html
3. Bjorn Brembs: Open Access and the looming crisis in science https://theconversation.com/open-access-and-the-looming-crisis-in-science-14950

“When I use a word," Humpty Dumpty
said in rather a scornful tone, "it means
just what I choose it to mean - neither
more nor less.”
[Lewis Carroll]
conceptual replication
“show A is true by doing B
rather than doing A again”
verify but not falsify
[Yong, Nature 485, 2012]

regenerate
the figure

replicate
rerun

repeat

re-compute
recreate
revise
regenerate

redo

restore
recycle
reuse

re-examine

reconstruct

review

repurpose

repeat
same
experiment
same lab

replicate

test

same
experiment
different set up

reproduce

same
experiment
different lab
different
experiment
some of same

reuse

Drummond C Replicability is not Reproducibility: Nor is it Good Science, online
Peng RD, Reproducible Research in Computational Science Science 2 Dec 2011: 1226-1227.

validation

verification

assurance meets the
needs of a
stakeholder
e.g. error
measurement,
documentation

complies with a
regulation,
requirement,
specification, or
imposed condition
e.g. a model

science review: articles, algorithms, methods
technical review: code, data, systems
V. Stodden, “Trust Your Science? Open Your Data and Code!”
Amstat News, 1 July 2011

defend
repeat

Sound

Design
Design

Collection
Collection

review1/certify
replicate
Peer
Peer
Review
Review

Prediction
Prediction

Peer
Peer
Reuse
Reuse

Execution
Execution
Publish
Publish
Result Analysis
Result Analysis

make&run&document

report&review&support

review2compare
reproduce

transfer
reuse

* Adapted from Mesirov, J. Accessible Reproducible Research Science 327(5964), 415-416 (2010)

disorganisation

“I can’t immediately reproduce the
research in my own laboratory. It
took an estimated 280 hours for
an average user to approximately
reproduce the paper.
Data/software versions. Workflows
are maturing and becoming
helpful”
Phil Bourne

Garijo et al. 2013 Quantifying Reproducibility in Computational Biology:
The Case of the Tuberculosis Drugome PLOS ONE under review.

fraud Corbyn, Nature Oct 2012

inherent

rigour reporting & experimental design
cherry picking data
misapplication use of black box software*
software misconfigurations, random seed reporting
non-independent bias, poor positive and negative controls
dodgy normalisation, arbitrary cut-offs, premature data triage
un-validated materials, improper statistical analysis, poor
statistical power, stop when “get to the right answer”
*8% validation Joppa, et al, Troubling Trends in Scientific Software Use SCIENCE 340 May 2013

http://www.nature.com/authors/policies/checklist.pdf

• anyone anything
anytime
• publication access, data,
models, source codes,
resources, transparent
methods, standards,
formats, identifiers, apis,
licenses, education,
policies

• “accessible, intelligible,
assessable, reusable”

http://royalsociety.org/policy/projects/science-public-enterprise/report/

G8 open data charter

http://opensource.com/government/13/7/open-data-charter-g8

regulation of science
institution cores public services
libraries
republic of science*
*Merton’s four norms of scientific behaviour (1942)

a meta-manifesto (I)

• all X should be available and assessable
forever
• the copyright of X should be clear
• X should have citable, versioned identifiers
• researchers using X should visibly credit X’s
creators
• credit should be assessable and count in all
assessments
• X should be curated, available, linked to all
necessary materials, and intelligible

What’s the real issue?

we do pretty well
•
•
•
•
•
•
•
•

major public data repositories
multiple declarations for depositing data
thriving open source community
plethora of data standardisation efforts
core facilities
heroic data campaigns
international and national bioinformatics coordination
diy biology movement

• great stories- Shiga-Toxin strain of E. coli, Hamburg, May
2011, China BGI Open data crowd sourcing effort.
• Oh, wait…University of Münster/University of Göttingen
squabble http://www.nature.com/news/2011/110721/full/news.2011.430.html

hard: patient data
(inter)national complications
bleeding heart paternalism
defensive research
informed consent
fortresses

[John Wilbanks]

http://www.broadinstitute.org/files/news/pd
fs/GAWhitePaperJune3.pdf

Kotz, J. SciBX 5(25) 2012

massive centralisation –
clouds, curated core
facilities
long tail massive
decentralisation –
investigator held datasets
fragmentation & fragility
a data scarcity at point of
delivery
RIP data
quality/trust/utility
Acta Crystallographica
section B or C

data/code as
first class citizen

we are not bad people
we make progress
there was never a golden age
there never is

a reproducibility paradox
big, fast,
complicated,
multi-step,
multi-type
multi-field

expectations
of
reproducibility

diy publishing
greater access

pretty stories shiny results feedback loop
announce a result, convince us its correct
novel, attention grabbing
neat, only positive
review: the direction of
science, the next paper,
how I would do it.
reject papers purely based
on public data
obfuscate to avoid scrutiny
PLoS and F1000 counter

the scientific sweatshop
no resources, time, accountability

getting it published not getting it right
game changing benefit to justify disruption

citation distortion

Micropublications arxive reference

Clark et al Micropublications 2013 arXiv:1305.3506

[Tim Clark]

Greenberg How citation distortions create unfounded authority: analysis of a citation network.
British Medical Journal 2009, 339:b2680.
Simkin, Roychowdhury Stochastic modeling of citation slips. Scientometrics 2005, 62(3):367-384.

independent replication studies
self-correcting science

“blue collar

• hostility
• hard
• resource
intensive
• no funding, time,
recognition, place
to publish
• invisible to
science” originators

John Quackenbush

independent review
self-correcting science

“blue collar

• hostility
• hard
• resource
intensive
• no funding, time,
recognition, place
to publish
• invisible to
science” originators

John Quackenbush

what is the point: “no one will want it”
“the questions don’t change but the answers do”*
• two years time when the paper is written
• reviewers want additional work
• statistician wants more runs
• analysis may need to be repeated
• post-doc leaves, student arrives
• new data, revised data
• updated versions of algorithms/codes
quid pro quo citizenship
• trickle down theory: more open more use more credit*
others might
• meta-analysis
• novel discovery
• other methods
* Dan Reed

emerging reproducible system ecosystem
App Store needed!
instrumented desktop tools
hosted services
packaging and archiving
repositories, catalogues
online sharing platforms
integrated authoring
integrative frameworks

XworX

ReproZip

Sweave

integrated database and journal
http://www.gigasciencejournal.com

copy editing computational workflows
from 10 scripts + 4 modules + >20 parameters
to Galaxy workflows

2-3 months
2-3 weeks

made reproducible

galaxy.cbiit.cuhk.edu.hk
[Peter Li]

supporting data reproducibility
Open-Paper

to

d
ke
Lin OI
D

Open-Data
Data sets

78GB CC0 data

DO
I

Lin
ke
DOI:10.1186/2047-217X-1-18 d

>11000 accesses

to

DOI:10.5524/100038

Analyses

Open-Pipelines
Open-Workflows
DOI:10.5524/100044

Open-Review
8 reviewers tested data in ftp server & named reports published

Open-Code

Enabled code to being picked apart by bloggers in wiki
http://homolog.us/wiki/index.php?title=SOAPdenovo2

Code in sourceforge under GPLv3:
>5000 downloads http://soapdenovo2.sourceforge.net/

[Scott Edmunds]

Here is What I Want – The Paper
As Experiment

0. Full text of PLoS papers stored
in a database

4. The composite view has
links to pertinent blocks
of literature text and back to the PDB

4.

1.
1. A link brings up figures
from the paper

2.
[Phil Bourne]

3. A composite view of
journal and database
content results

3.

2. Clicking the paper figure retrieves
data from the PDB which is
analyzed

1. User clicks on thumbnail
2. Metadata and a
webservices call provide
a renderable image that
can be annotated
3. Selecting a features
provides a
database/literature
mashup
4. That leads to new
papers
PLoS Comp. Biol. 2005 1(3) e34

"A single pass approach to reducing sampling variation, removing errors, and
scaling de novo assembly of shotgun sequences"
http://arxiv.org/abs/1203.4802

born reproducible
http://ged.msu.edu/papers/2012-diginorm/
http://ivory.idyll.org/blog/replication-i.html

[C. Titus Brown]

made reproducible
http://getutopia.com

[Pettifer, Attwood]

The Research Lifecycle
Authoring
Tools
Lab
Notebooks

Data
Capture

Software
Repositories

Analysis
Tools

Scholarly
Communication

Visualization

IDEAS – HYPOTHESES – EXPERIMENTS – DATA - ANALYSIS - COMPREHENSION - DISSEMINATION

Commercial &
Public Tools

DisciplineBased Metadata
Standards

Git-like
Resources
By Discipline

Community Portals

Data Journals

New Reward
Systems

Training
Institutional Repositories
Commercial Repositories

[Phil Bourne]

message #1: lower friction
born reproducible

Process

=

Interest
Friction

Number
x people
reach

the neylon equation
Cameron Neylon, BOSC 2013, http://cameronneylon.net/

4+1 architecture of reproducibility
“development” view

“logical” view

social scenarios

“process” view

“physical” view

“logical view”

rigour
reporting
reassembly

recognition
review
reuse

resources
responsibility
reskilling

reporting
documentation

availability

observations
• the strict letter of the law
• (methods) modeller/ workflow makers vs (data)
experimentalists
• young researchers, support from PIs
• buddy reproducibility testing, curation help
• just enough just in time
• staff leaving and project ends
• public scrutiny, competition
• decaying local systems
• long term safe haven commitment
• funder commitment from the start

(Lusch, Vargo 2008)
(Harris and Miller 2011)

(Nowak 2006)
(Clutton-Brock 2009)
Tenopir et al 2011)
Borgman, 2012)

(Malone 2010)
(Benkler 2011)

[Kristian Garza]

(Thomson, Perry, and Miller 2009)
(Wood and Gray 1991)
(Roberts and Bradley 1991)
(Shrum and Chompalov 2007)

scientific ego-system
trust, reciprocity, collaboration to compete
blame
scooped
uncredited
misinterpretation
scrutiny
cost
loss
distraction
left behind
Merton’s four norms of scientific behaviour (1942)
dependency

fame
competitive
advantage
productivity
credit
adoption
kudos
for love

Fröhlich’s principles of scientific communication (1998)

Malone, Laubacher & Dellarocas The Collective Intelligence Genome, Sloan Management Review,(2010)

local asset economies
economics of scarce prized
commodities

• local investment
– protective

• collective purchasing
trade

– share

• sole provider
– broadcast

[Nielson] [Roffel]

(Lusch, Vargo 2008)
(Harris and Miller 2011

asymmetrical reciprocity
•
•
•
•
•
•
•
•

hugging
flirting
voyerism
inertia
sharing creep
credit drift
local control
code throwaway

family
friends
acquaintances strangers
rivals
ex-friends

Tenopir, et al. Data Sharing by Scientists: Practices and Perceptions. PLoS ONE 6(6) 2012
Borgman The conundrum of sharing research data, JASIST 2012

1 0 JA N UA RY 2 0 1 3 | VO L 4 9 3 | N AT U R E | 1 5 9

recognition

“all research products and all scholarly labour
are equally valued except by promotion and
review committees”

message #2
visible reciprocity contract

citation is like ♥ not $
large data providers
infrastructure codes
“click and run”
instrument platforms
make credit count
Rung, Brazma Reuse of public wide gene expression data Nature Review Genetics 2012
Duck et al bioNerDS: exploring bioinformatics' database and software use through literature mining.
BMC Bioinformatics. 2013
Piwowar et al Sharing Detailed Research Data Is Associated with Increased Citation Rate PLoS
ONE 2007

Victoria Stodden, AMP 2011 http://www.stodden.net/AMP2011/,
Workshop: Reproducible Research: Tools and Strategies for Scientific Computing
Special Issue Reproducible Research Computing in Science and Engineering July/August 2012, 14(4)

in perpetuity
“its not ready yet”, “I need another publication”
shame
“its too ugly”, “I didn’t work out the details”
effort
“we don’t have the skills/resources”, “the reviewers
don’t need it”
loss
“the student left”, “we can’t find it”
insecurity
“you wouldn’t understand it”, “I made it so no one
could understand it”.
Randall J. LeVeque ,Top Ten Reasons To Not Share Your Code (and why you should anyway) April
2013 SIAM News

the goldilocks paradox
“the description needed
to make an experiment
reproducible is too much
for the author and too
little for the reader”
just enough just in time
Galaxy Luminosity Profiling
José Enrique Ruiz (IAA-CSIC)

http://www.rightfield.org.uk

1. Enrich Spreadsheet Template

reducing the
friction of
curation

2. Use in Excel or OpenOffice

3. Extract and Process

RDF Graph

:

anonymous reuse is hard
nearly always negotiated

reskilling: software making practices
Zeeya Merali , Nature 467, 775-777 (2010) | doi:10.1038/467775a
Computational science: ...Error…why scientific programming does not compute.

“As a general rule,
researchers do not
test or document
their programs
rigorously, and they
rarely release their
codes, making it
almost impossible
to reproduce and
verify published
results generated
by scientific
software”

http://matt.might.net/articles/crapl/

http://sciencecodemanifesto.org/

Greg
Wilson

better
software

C Titus Brown

better
research

data
carpentry

a word on reinventing
Sean Eddy

author HMMER and Infernal software
suites for sequence analysis

innovation is algorithms and methodology.
rediscovery of profile stochastic context-free grammars
(re)coding is reproducing.
reinvent what is innovative.
reuse what is utility.

Goble, seven deadly sins of bioinformatics, 35.5K views
http://www.slideshare.net/dullhunk/the-seven-deadly-sins-of-bioinformatics

message #3
placing value on reproducibility
take action
Organisation

Culture

Execution

Metrics

Process
[Daron Green]

(re)assembly
Gather the bits together
Find and get the bits
Bits broken/changed/lost
Have other bits
Understand the bits and
how to put together
Bits won’t work together
What bit is critical?
Can I use a different
tool?
Can’t operate the tool
Who’s job is this?

specialist codes

gateways

libraries, platforms, tools

data collections
catalogues
commodity
platforms

my data
my process
my codes

integrative
frameworks

service based

software
sepositories
(cloud)
hosted
services

Diff

Orig

repeat
(re-run)

replicate reproduce
(regenerate)

(recreate)

reuse
(repurpose/extend)

Actors
Results
Experiment
Materials

(datasets,
parameters, seeds)

Methods
(techniques, algorithms,
spec of the steps)

Setup
Instruments

(codes, services, scripts,
underlying libraries)

Laboratory
(sw and hw infrastructure,
systems software,
integrative platforms)

snapshot spectrum

materials

use workflows
capture the steps

method

instruments and laboratory

standardised pipelines
auto record of
experiment and set-up
report & variant reuse
buffered infrastructure

BioSTIF

interactive
local & 3rd party independent resources
shielded heterogeneous infrastructures

use provenance
the link between computation and results
static verifiable record
track changes
repair
partially repeat/reproduce
carry citation
calc data quality/trust
select data to keep/release
compare diffs/discrepancies

d1

d2

d1'

d2

S0

S1

S0

S1

w

z

w

S2

S'2

y

y'

S4

S4

df

df'

(i) Trace A

W3C PROV standard

(ii) Trace B

PDIFF: comparing provenance traces to
diagnose divergence across experimental
results [Woodman et al, 2011]

“an experiment is as transparent
as the visibility of its steps”
black boxes
closed codes &
services, proprietary
licences, magic cloud
services, manual
manipulations, poor
provenance/version
reporting, unknown
peer review, mis-use,
platform calculation
dependencies
Joppa et al SCIENCE 340 May 2013; Morin et al Science 336 2012

dependencies & change
degree of self-contained preservation
open world, distributed, alien hosted
data/software versions and accessibility hamper replication
spin-rate of versions

[Zhao et al. e-Science 2012]

“all you need to do is copy the box that the internet is in”

preservation & distribution
portability / packaging
VM

availability
open

[Adapted Freire, 2013]

gather
dependencies
capture
steps

variability
sameness

description
intelligibility

Reproducibility
framework

packaging bickering
byte execution
virtual machine
black box

description
archived record
white box

data+compute
co-location cloud
packaging
ELIXIR Embassy Cloud

reproduce

repeat

“in-nerd-tia”

big data big compute
community facilities
cloud host costs and confidence
data scales
dump and file
capability

message #4:
“the reproducible window”
all experiments become less reproducible over time
icanhascheezburger.com

how, why and what matters
benchmarks for codes
plan to preserve
repair on demand
description persists
use frameworks

results may vary

partial replication
approximate reproduction
verification

Sandve, Nekrutenko, Taylor, Hovig Ten simple rules for reproducible in silico research,
PLoS Comp Bio submitted

message #5: puppies aren’t free
long term reliability of hosts
multiple stewardship
fragmented
business models
reproducibility service industry

24% NAR services unmaintained after three years Schultheiss et al. (2010) PLoS Comp

•
•
•
•
•
•

the meta-manifesto

all X should be available and assessable forever
the copyright of X should be clear
X should have citable, versioned identifiers
researchers using X should visibly credit X’s creators
credit should be assessable and count in all assessments
X should be curated, available, linked to all necessary materials, and
intelligible

• making X reproducible/open should be from
cradle to grave, continuous, routine, and
easier
• tools/repositories should be made to help, be
maintained and be incorporated into working
practices
• researchers should be able to adapt their
working practices, use resources, and be
trained to reproduce
• cost and responsibility should be transparent,
planned for, accounted and borne collectively
• we all should start small, be imperfect but
take action. Today.
http://www.force11.org

• evolution of a body
• fork, pull, merge
• subpart different cycles,
stewardship, authors
• refactored granularity
• software release
practices for
workflows, scripts,
services, data and
articles
• thread the salami across
parts, repositories and
journals
• chop up and microattribute

research is like
software

Faculty1000

Jennifer Schopf, Treating Data Like Software: A Case for Production Quality Data, JCDL 2012

http://www.researchobject.org/

http://www.w3.org/community/rosc/

bundles and relates digital resources of a
scientific experiment or investigation using
standard mechanisms

towards a release app store
• checklists for
descriptive
reproducibility
• packaging for multihosted research
(executable)
components
• exchange between
tools and researchers
• framework for research
release and threaded
publishing using core
standards

TT43 Lounge 81

those messages again
•
•
•
•
•

lower friction, born reproducible
credit is like love
take action, use (workflow) frameworks
prepare for the reproducible window
puppies aren’t free

final message

The revolution is not
an apple that falls
when it is ripe. You
have to make it drop.

acknowledgements
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•

David De Roure
Tim Clark
Sean Bechhofer
Robert Stevens
Christine Borgman
Victoria Stodden
Marco Roos
Jose Enrique Ruiz del Mazo
Oscar Corcho
Ian Cottam
Steve Pettifer
Magnus Rattray
Chris Evelo
Katy Wolstencroft
Robin Williams
Pinar Alper
C. Titus Brown
Greg Wilson
Kristian Garza

•

Wf4ever, SysMO, BioVel, UTOPIA and myGrid teams

•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•

Juliana Freire
Jill Mesirov
Simon Cockell
Paolo Missier
Paul Watson
Gerhard Klimeck
Matthias Obst
Jun Zhao
Pinar Alper
Daniel Garijo
Yolanda Gil
James Taylor
Alex Pico
Sean Eddy
Cameron Neylon
Barend Mons
Kristina Hettne
Stian Soiland-Reyes
Rebecca Lawrence

Mr Cottam

10th anniversary today!

summary

[Jenny Cham]

https://twitter.com/csmcr/status/361835508994813954

•

myGrid
–

•

http://www.biovel.eu

Force11
–

•
•

http://www.software.ac.uk

BioVeL
–

•

http://www.wf4ever-project.org

Software Sustainability Institute
–

•

http://www.getutopia.com

Wf4ever
–

•

http://www.rightfield.org.uk

UTOPIA Documents
–

•

http://www.sysmo-db.org

Rightfield
–

•

http://www.biocatalogue.org

SysMO-SEEK
–

•

http://www.myexperiment.org

BioCatalogue
–

•

http://www.taverna.org.uk

myExperiment
–

•

http://www.mygrid.org.uk

Taverna
–

•

Further Information

http://www.force11.org

http://reproducibleresearch.net
http;//reproduciblescience.org

ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do open science and who gets the credit?

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do open science and who gets the credit?

Similar to ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do open science and who gets the credit? (20)

More from Carole Goble

More from Carole Goble (20)

Recently uploaded

Recently uploaded (20)

ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do open science and who gets the credit?

Editor's Notes