Data Publishing and
Post-Publication Reviews
Tobias Kuhn
http://www.tkuhn.ch
@txkuhn
ETH Zurich
PEERE Meeting: Peer Review: Past, Present and Future
ETH Zurich
22 April 2015
Peer Review in the Digital Age
Traditionally, peer review has two purposes:
(1) Assessment of the validity, accuracy, relevance, and quality of
presentation of the given scientific work ...
but this assessment is not made public (except for the fact that
some papers are accepted)
(2) Distribution of a scarce resource, namely pages in printed
journals, ...
but this scarce resource no longer exists (except for presentation
slots at conferences)
Is the current peer review system broken?
It’s at least terribly old-fashioned and inefficient.
Tobias Kuhn, ETH Zurich Data Publishing and Post-Publication Reviews 2 / 19
Publishing has Gone from Print to Online
The Increasing Importance of Digital Data Sets
(London Underground staff sorting 4M used tickets to analyse line use in 1939)
http://www.telegraph.co.uk/travel/picturegalleries/9791007/The-history-of-the-Tube-in-pictures-150-years-of-London-Underground.html?frame=2447159
Approaches for the
Future of Scientific Publishing
Short term:
• Open post-publication reviews
• Data publishing
• ...
Longer term:
• Publish results and reviews as Linked Data
• Global reputation system that includes artificial agents (“bots”)
• ...
Open Post-Publication Reviews
• Publish first — collect reviews afterwards
• Make reviews public: Open Evaluation (similar to Open Access)
Such open post-publication reviews have existed for a long time in the case
of books and book reviews.
See e.g.: Kriegeskorte. Open evaluation: a vision for entirely transparent post-publication peer review and rating for
science. Frontiers in Computational Neuroscience 6, 2012.
Benefits of Open Post-Publication Reviews
• Fast publication
• Transparent reviewing
• Publicly accessible detailed assessments of scientific works (as
compared to just knowing that a paper has been accepted for a
given journal)
• Incentives for reviewers
• Possibility for a multitude of evaluation metrics to be used in
parallel
Data Publishing
Data sets are becoming increasingly important for most scientific
disciplines, but so far they are not well integrated into the publishing
and reviewing process.
Current developments:
• Supplementary material for papers
• Data journals
• Data repositories: Figshare, Dryad, ...
Publishing and Reviewing Scientific Data
Scientific data that should be published and reviewed:
• Input/output data of scientific processes (raw/aggregated)
• Scientific interpretations/conclusions (e.g. causal relation
between gene and disease)
• Meta-data (e.g. involved researchers, citations, used methods)
• All of the above should ideally be formatted as Linked Data
• Source code of software
Nanopublications:
Provenance-Aware Semantic Publishing
Nanopublications are small pieces of scientific data with their
provenance information, represented in a machine-interpretable
language (RDF).
[Figure: a nanopublication consists of three parts: assertion, provenance, and publication info]
http://nanopub.org / @nanopub_org
Vision: Changing Scholarly Communication
Now: narrative articles at the center
Future: nanopublications at the center
Images from Mons et al. The value of data. Nature genetics, 43(4):281–283, 2011
Nanopublications:
Provenance-Aware Semantic Publishing
Nanopub0001
Assertion:
ns1:mosquito ns3:transmission ns2:malaria
Provenance:
prov:wasDerivedFrom d:DataSetX
Publication Information:
dc:created “2013-01-01”
pav:createdBy p:Isabelle_Dubois
Nanopub0042
Assertion:
p:Giuseppe r:givesNegativeReviewFor pub:123456
Provenance:
prov:wasInfluencedBy s:CommentX
Publication Information:
dc:created “2013-05-01”
pav:createdBy p:Giuseppe
http://nanopub.org / @nanopub_org
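The three-part structure of the first example can be sketched in a few lines of Python. This is only an illustration, assuming the assertion reads as the triple mosquito, transmission, malaria. The `Nanopub` class, the `render` function, the `ex:` prefix and the `_assertion` / `_provenance` / `_pubinfo` graph names are made up for this sketch, and the output is merely TriG-like text, not validated RDF.

```python
from dataclasses import dataclass

Triple = tuple[str, str, str]  # (subject, predicate, object)


@dataclass(frozen=True)
class Nanopub:
    """Hypothetical container for the three parts of a nanopublication."""
    name: str
    assertion: tuple[Triple, ...]
    provenance: tuple[Triple, ...]
    pubinfo: tuple[Triple, ...]

    def graphs(self) -> dict[str, tuple[Triple, ...]]:
        # Graph names are an assumption of this sketch.
        return {
            f"{self.name}_assertion": self.assertion,
            f"{self.name}_provenance": self.provenance,
            f"{self.name}_pubinfo": self.pubinfo,
        }


def render(pub: Nanopub) -> str:
    """Write each part as its own named block of triples."""
    blocks = []
    for graph_name, triples in pub.graphs().items():
        body = "\n".join(f"  {s} {p} {o} ." for s, p, o in triples)
        blocks.append(f"{graph_name} {{\n{body}\n}}")
    return "\n".join(blocks)


nanopub0001 = Nanopub(
    name="ex:Nanopub0001",
    assertion=(("ns1:mosquito", "ns3:transmission", "ns2:malaria"),),
    # The provenance describes the assertion, the publication info the whole nanopublication.
    provenance=(("ex:Nanopub0001_assertion", "prov:wasDerivedFrom", "d:DataSetX"),),
    pubinfo=(
        ("ex:Nanopub0001", "dc:created", '"2013-01-01"'),
        ("ex:Nanopub0001", "pav:createdBy", "p:Isabelle_Dubois"),
    ),
)
print(render(nanopub0001))
```

A review such as Nanopub0042 fits the same container: its assertion is the review statement, and its provenance and publication info are filled in the same way.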
How to Publish Data?
Published data should be:
• Verifiable (Is this really the data I am looking for?)
• Immutable (Can I be sure that it hasn’t been modified?)
• Permanent (Will it be available in 1, 5, 20 years from now?)
• Trustworthy (Can I trust the source?)
• Reliable (Can it be efficiently retrieved whenever needed?)
• Granular (Can I refer to individual data entries?)
These points are important (much more so than for papers) because
data can and will be consumed and produced automatically by
algorithms (“bots”).
Trusty URIs: Cryptographic Hash Values for
Verifiable and Immutable Web Identifiers
Example:
http://example.org/r1.RA5AbXdpz5DcaYXCh9l3eI9ruBosiL5XDU3rxBbBaUO70.trig
[Figure: artifacts with trusty URIs (http://...RAcbjcRI..., http://...RAQozo2w..., http://...RABMq4Wc..., http://...RAUx3Pqu..., http://...RARz0AX-...) that reference each other and plain resources (http://.../resource23, http://.../resource55); the marked region is the range of verifiability]
Kuhn, Dumontier. Making Digital Artifacts on the Web Verifiable and Reliable. IEEE Transactions on Knowledge and
Data Engineering. To appear. / Kuhn, Dumontier. Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts
for Linked Data. ESWC 2014.
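The basic idea of putting a cryptographic hash into the identifier can be sketched as follows. This is a simplified illustration, not the algorithm of the cited papers: it hashes raw bytes with SHA-256, whereas the `RA` identifiers shown on the slide are computed over RDF content. The function names and the two-letter label `XX` are placeholders invented for this sketch, and a trailing file extension such as `.trig` is not handled.

```python
import base64
import hashlib

MODULE = "XX"  # made-up placeholder label, not a real module identifier


def make_hash_uri(base_uri: str, content: bytes) -> str:
    """Append a hash of the content to the URI, so the URI pins the content."""
    digest = hashlib.sha256(content).digest()
    code = base64.urlsafe_b64encode(digest).decode("ascii").rstrip("=")
    return f"{base_uri}.{MODULE}{code}"


def verify(uri: str, content: bytes) -> bool:
    """Check that the content is exactly what the URI was minted for."""
    base_uri, _, _ = uri.rpartition(".")
    return uri == make_hash_uri(base_uri, content)


uri = make_hash_uri("http://example.org/r1", b"some data entry")
print(uri)
print(verify(uri, b"some data entry"))   # content matches the identifier
print(verify(uri, b"some data entry!"))  # any modification is detected
```

Because the identifier changes whenever the content changes, whoever holds the URI can check a retrieved copy without trusting the server it came from. If the content itself contains such URIs, the check extends to everything it references.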
Nanopublication Indexes
Sets of nanopublications can be described as indexes that are
nanopublications themselves. This allows us to define arbitrary sets,
while keeping the individual addressability of the data entries:
[Figure: panels (a) to (f) showing indexes and nanopublications connected by three relations: has element, has sub-index, appends to]
Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of
Data. arXiv:1411.2749.
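One way to read the three relations is sketched below, under the assumption that an index contains its own elements, the elements of its sub-indexes, and the elements of the index it appends to. The `Index` class, the `resolve` function and the identifiers are hypothetical; in the described approach the indexes would themselves be nanopublications rather than Python objects.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Index:
    """Hypothetical in-memory stand-in for an index nanopublication."""
    elements: set[str] = field(default_factory=set)      # "has element"
    sub_indexes: set[str] = field(default_factory=set)   # "has sub-index"
    appends_to: Optional[str] = None                     # "appends to"


def resolve(index_id: str, store: dict[str, Index],
            seen: Optional[set[str]] = None) -> set[str]:
    """Collect every nanopublication reachable from the given index."""
    seen = set() if seen is None else seen
    if index_id in seen:  # defensive guard against revisiting an index
        return set()
    seen.add(index_id)
    index = store[index_id]
    result = set(index.elements)
    for sub in index.sub_indexes:
        result |= resolve(sub, store, seen)
    if index.appends_to is not None:
        result |= resolve(index.appends_to, store, seen)
    return result


store = {
    "index1": Index(elements={"np:a", "np:b"}),
    "index2": Index(elements={"np:c"}, appends_to="index1"),
    "top": Index(elements={"np:d"}, sub_indexes={"index2"}),
}
print(sorted(resolve("top", store)))
```

Every entry keeps its own identifier, so a set can be cited as a whole through its index while individual nanopublications remain addressable.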
Nanopublication Server Network
[Figure: nanopublications with trusty URIs in a network of servers that supports publication, retrieval, and propagation / archiving]
http://npmonitor.inn.ac
Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of
Data. arXiv:1411.2749.
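The three functions named above (publication, retrieval, propagation / archiving) can be illustrated with a toy in-memory network. This is only a sketch under simplifying assumptions: the `Server` class is invented here, propagation is an immediate push to peers, and a plain SHA-256 hex digest stands in for a trusty URI. The slides do not describe the actual server protocol.

```python
import hashlib


def content_key(content: str) -> str:
    """Stand-in for a hash-based identifier."""
    return hashlib.sha256(content.encode("utf-8")).hexdigest()


class Server:
    """Toy server: stores entries by hash and forwards them to its peers."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.store: dict[str, str] = {}
        self.peers: list["Server"] = []

    def publish(self, content: str) -> str:
        key = content_key(content)
        self._accept(key, content)
        return key

    def _accept(self, key: str, content: str) -> None:
        if key in self.store:
            return  # already archived here, which also stops propagation loops
        self.store[key] = content
        for peer in self.peers:
            peer._accept(key, content)

    def retrieve(self, key: str):
        content = self.store.get(key)
        # Return the entry only if it still matches its identifier.
        if content is None or content_key(content) != key:
            return None
        return content


a, b, c = Server("a"), Server("b"), Server("c")
a.peers, b.peers, c.peers = [b], [c], [a]

key = a.publish("ns1:mosquito ns3:transmission ns2:malaria")
print(c.retrieve(key))  # retrievable from a server it was never sent to directly
```

Since entries are identified by their hash, any server can hold a copy and a client can verify whatever it retrieves, so no single server has to be trusted or permanently available.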
How to Review Data?
• Large data sets only allow for samples of data entries to be
manually reviewed
• Data is often produced and consumed by algorithms: How can
these data be linked to the algorithms? How should the data and
the algorithms be reviewed?
• Certain kinds of technical reviewing can be automated!
• Can all this be achieved in a fashion that is decentralized,
open, and real-time?
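Two of these points can be combined in a short sketch: an automated technical check that runs over every entry, and a reproducible random sample that is handed to human reviewers. The entry layout (a dict with `assertion`, `provenance` and `pubinfo` keys), the function names and the check itself are assumptions made for illustration only.

```python
import random

REQUIRED_PARTS = ("assertion", "provenance", "pubinfo")


def technical_issues(entry: dict) -> list[str]:
    """Automated check: report every required part that is missing or empty."""
    return [f"missing or empty part: {part}"
            for part in REQUIRED_PARTS if not entry.get(part)]


def review_sample(entries: list, k: int, seed: int) -> list:
    """Draw a reproducible random sample for manual review."""
    rng = random.Random(seed)
    return rng.sample(entries, min(k, len(entries)))


entries = [
    {"id": i, "assertion": f"claim {i}", "provenance": "d:DataSetX", "pubinfo": "p:someone"}
    for i in range(1000)
]
entries[7]["provenance"] = ""  # deliberately broken entry

flagged = [e["id"] for e in entries if technical_issues(e)]
print("flagged by the automated check:", flagged)
print("ids for manual review:", [e["id"] for e in review_sample(entries, 5, seed=2015)])
```

Publishing the seed together with the review lets others reconstruct exactly which entries were inspected by hand.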
Bots and Reputation Mechanisms
Robust automatic calculation of reputation metrics in a decentralized
and open system:
[Figure: example networks with two edge types, "gives positive assessment for" and "is contributed by"; nodes are labelled with their eigenvector centrality (0-100), computed once on the plain network and once with bidirectional contribution edges]
Kuhn. Science Bots: A Model for the Future of Scientific Computation? SAVE-SD 2015.
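Eigenvector centrality on a 0-100 scale can be computed with plain power iteration, as sketched below. This is a generic textbook version, not necessarily the exact procedure behind the figure: the edge list is invented, every edge passes reputation from its source to its target, and each node keeps its own previous score in every step (a shift by the identity matrix) so that the iteration settles on cyclic graphs.

```python
def eigenvector_centrality(edges, iterations=200):
    """Power iteration; returns scores scaled so that the top node gets 100."""
    nodes = sorted({node for edge in edges for node in edge})
    score = {node: 1.0 for node in nodes}
    for _ in range(iterations):
        nxt = dict(score)  # keep the old score (identity shift)
        for source, target in edges:
            nxt[target] += score[source]  # reputation flows along the edge
        top = max(nxt.values())
        score = {node: value / top for node, value in nxt.items()}
    return {node: 100.0 * value for node, value in score.items()}


# Hypothetical example: (source, target) means that the source gives a
# positive assessment for, or otherwise passes reputation to, the target.
edges = [("a", "c"), ("b", "c"), ("c", "a")]
for node, value in sorted(eigenvector_centrality(edges).items()):
    print(node, round(value, 1))
```

In this small example, `a` and `c` endorse each other and end up near the top of the scale. The score of `b`, which nobody assesses, tends to zero. Handing out assessments alone therefore earns no reputation.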
Thank you for your attention!
Questions?

More Related Content

What's hot

Why should we study Data Science
Why should we study Data ScienceWhy should we study Data Science
Why should we study Data ScienceAn Mai
 
Getting started with CitNetExplorer
Getting started with CitNetExplorerGetting started with CitNetExplorer
Getting started with CitNetExplorer
Nees Jan van Eck
 
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
University of Groningen (The Netherlands)
 
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
Paolo Nesi
 
OU Rise library analytics viz
OU Rise library analytics vizOU Rise library analytics viz
OU Rise library analytics vizTony Hirst
 
The drawbridge to knowledge - Linking scholarly publications and research inf...
The drawbridge to knowledge - Linking scholarly publications and research inf...The drawbridge to knowledge - Linking scholarly publications and research inf...
The drawbridge to knowledge - Linking scholarly publications and research inf...
Lukas Koster
 
rOpenGov: an R ecosystem for open government data and computational social sc...
rOpenGov: an R ecosystem for open government data and computational social sc...rOpenGov: an R ecosystem for open government data and computational social sc...
rOpenGov: an R ecosystem for open government data and computational social sc...
Leo Lahti
 
Large-scale analysis of bibliometric data sources
Large-scale analysis of bibliometric data sourcesLarge-scale analysis of bibliometric data sources
Large-scale analysis of bibliometric data sources
Nees Jan van Eck
 
Multiple perspectives on bibliometric data
Multiple perspectives on bibliometric dataMultiple perspectives on bibliometric data
Multiple perspectives on bibliometric data
Nees Jan van Eck
 

What's hot (9)

Why should we study Data Science
Why should we study Data ScienceWhy should we study Data Science
Why should we study Data Science
 
Getting started with CitNetExplorer
Getting started with CitNetExplorerGetting started with CitNetExplorer
Getting started with CitNetExplorer
 
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
 
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
 
OU Rise library analytics viz
OU Rise library analytics vizOU Rise library analytics viz
OU Rise library analytics viz
 
The drawbridge to knowledge - Linking scholarly publications and research inf...
The drawbridge to knowledge - Linking scholarly publications and research inf...The drawbridge to knowledge - Linking scholarly publications and research inf...
The drawbridge to knowledge - Linking scholarly publications and research inf...
 
rOpenGov: an R ecosystem for open government data and computational social sc...
rOpenGov: an R ecosystem for open government data and computational social sc...rOpenGov: an R ecosystem for open government data and computational social sc...
rOpenGov: an R ecosystem for open government data and computational social sc...
 
Large-scale analysis of bibliometric data sources
Large-scale analysis of bibliometric data sourcesLarge-scale analysis of bibliometric data sources
Large-scale analysis of bibliometric data sources
 
Multiple perspectives on bibliometric data
Multiple perspectives on bibliometric dataMultiple perspectives on bibliometric data
Multiple perspectives on bibliometric data
 

Viewers also liked

Buku tamu gugus depan
Buku tamu gugus depanBuku tamu gugus depan
Buku tamu gugus depan
OSPPMD
 
2013 Report on AngeI Investing Activity in Canada: Accelerating the Asset Class
2013 Report on AngeI Investing Activity in Canada: Accelerating the Asset Class2013 Report on AngeI Investing Activity in Canada: Accelerating the Asset Class
2013 Report on AngeI Investing Activity in Canada: Accelerating the Asset Class
Ioana Stoica
 
A Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural LanguageA Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural Language
Tobias Kuhn
 
Controlled Natural Language and Opportunities for Standardization
Controlled Natural Language and Opportunities for StandardizationControlled Natural Language and Opportunities for Standardization
Controlled Natural Language and Opportunities for Standardization
Tobias Kuhn
 
How to Evaluate Controlled Natural Languages
How to Evaluate Controlled Natural LanguagesHow to Evaluate Controlled Natural Languages
How to Evaluate Controlled Natural Languages
Tobias Kuhn
 
Underspecified Scientific Claims in Nanopublications
Underspecified Scientific Claims in NanopublicationsUnderspecified Scientific Claims in Nanopublications
Underspecified Scientific Claims in Nanopublications
Tobias Kuhn
 
An Introduction to AceWiki
An Introduction to AceWikiAn Introduction to AceWiki
An Introduction to AceWiki
Tobias Kuhn
 
AceWiki: A Natural and Expressive Semantic Wiki
AceWiki: A Natural and Expressive Semantic WikiAceWiki: A Natural and Expressive Semantic Wiki
AceWiki: A Natural and Expressive Semantic Wiki
Tobias Kuhn
 
nanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for Nanopublicationsnanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for Nanopublications
Tobias Kuhn
 
Meme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation NetworksMeme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation Networks
Tobias Kuhn
 
A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of DataA Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
Tobias Kuhn
 
Finding and Accessing Diagrams in Biomedical Publications
Finding and Accessing Diagrams in Biomedical PublicationsFinding and Accessing Diagrams in Biomedical Publications
Finding and Accessing Diagrams in Biomedical Publications
Tobias Kuhn
 
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...Tobias Kuhn
 
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...Tobias Kuhn
 
AceWiki: Controlled English in a Semantic Wiki
AceWiki: Controlled English in a Semantic WikiAceWiki: Controlled English in a Semantic Wiki
AceWiki: Controlled English in a Semantic Wiki
Tobias Kuhn
 
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
Tobias Kuhn
 
Broadening the Scope of Nanopublications
Broadening the Scope of NanopublicationsBroadening the Scope of Nanopublications
Broadening the Scope of Nanopublications
Tobias Kuhn
 

Viewers also liked (18)

Buku tamu gugus depan
Buku tamu gugus depanBuku tamu gugus depan
Buku tamu gugus depan
 
2013 Report on AngeI Investing Activity in Canada: Accelerating the Asset Class
2013 Report on AngeI Investing Activity in Canada: Accelerating the Asset Class2013 Report on AngeI Investing Activity in Canada: Accelerating the Asset Class
2013 Report on AngeI Investing Activity in Canada: Accelerating the Asset Class
 
A Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural LanguageA Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural Language
 
Controlled Natural Language and Opportunities for Standardization
Controlled Natural Language and Opportunities for StandardizationControlled Natural Language and Opportunities for Standardization
Controlled Natural Language and Opportunities for Standardization
 
AceWiki
AceWikiAceWiki
AceWiki
 
How to Evaluate Controlled Natural Languages
How to Evaluate Controlled Natural LanguagesHow to Evaluate Controlled Natural Languages
How to Evaluate Controlled Natural Languages
 
Underspecified Scientific Claims in Nanopublications
Underspecified Scientific Claims in NanopublicationsUnderspecified Scientific Claims in Nanopublications
Underspecified Scientific Claims in Nanopublications
 
An Introduction to AceWiki
An Introduction to AceWikiAn Introduction to AceWiki
An Introduction to AceWiki
 
AceWiki: A Natural and Expressive Semantic Wiki
AceWiki: A Natural and Expressive Semantic WikiAceWiki: A Natural and Expressive Semantic Wiki
AceWiki: A Natural and Expressive Semantic Wiki
 
nanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for Nanopublicationsnanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for Nanopublications
 
Meme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation NetworksMeme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation Networks
 
A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of DataA Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
 
Finding and Accessing Diagrams in Biomedical Publications
Finding and Accessing Diagrams in Biomedical PublicationsFinding and Accessing Diagrams in Biomedical Publications
Finding and Accessing Diagrams in Biomedical Publications
 
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
 
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
 
AceWiki: Controlled English in a Semantic Wiki
AceWiki: Controlled English in a Semantic WikiAceWiki: Controlled English in a Semantic Wiki
AceWiki: Controlled English in a Semantic Wiki
 
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
 
Broadening the Scope of Nanopublications
Broadening the Scope of NanopublicationsBroadening the Scope of Nanopublications
Broadening the Scope of Nanopublications
 

Similar to Data Publishing and Post-Publication Reviews

Scientific Data Publishing
Scientific Data PublishingScientific Data Publishing
Scientific Data Publishing
Tobias Kuhn
 
Nanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingNanopublications and Decentralized Publishing
Nanopublications and Decentralized Publishing
Tobias Kuhn
 
Nanopubs
NanopubsNanopubs
Nanopubs
Tobias Kuhn
 
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
Tobias Kuhn
 
Being an Open Scholar in a Connected World
Being an Open Scholar in a Connected WorldBeing an Open Scholar in a Connected World
Being an Open Scholar in a Connected World
Stian Håklev
 
Challenges in Extracting and Managing References
Challenges in Extracting and Managing ReferencesChallenges in Extracting and Managing References
Challenges in Extracting and Managing References
GESIS
 
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
Bertram Ludäscher
 
The culture of researchData
The culture of researchDataThe culture of researchData
The culture of researchData
petermurrayrust
 
Enabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data InfrastructuresEnabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data Infrastructures
LIBER Europe
 
The culture of researchData
The culture of researchData The culture of researchData
The culture of researchData
TheContentMine
 
The Culture of Research Data, by Peter Murray-Rust
The Culture of Research Data, by Peter Murray-RustThe Culture of Research Data, by Peter Murray-Rust
The Culture of Research Data, by Peter Murray-Rust
LEARN Project
 
Trends in Scholarly Publishing
Trends in Scholarly PublishingTrends in Scholarly Publishing
Trends in Scholarly Publishing
ETH-Bibliothek
 
Linked Data Publishing with Nanopublications
Linked Data Publishing with NanopublicationsLinked Data Publishing with Nanopublications
Linked Data Publishing with Nanopublications
Tobias Kuhn
 
Jankowski presentation-scholarly-publishing-9dec14
Jankowski presentation-scholarly-publishing-9dec14Jankowski presentation-scholarly-publishing-9dec14
Jankowski presentation-scholarly-publishing-9dec14
Nick Jankowski
 
LIBER and LERU roadmap towards Open Access
LIBER and LERU roadmap towards Open AccessLIBER and LERU roadmap towards Open Access
LIBER and LERU roadmap towards Open Access
LIBER Europe
 
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
TimelessFuture
 
Drowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research fundingDrowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research funding
Andrea Scharnhorst
 
Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...
Jean Burgess
 
Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017
Europeana
 

Similar to Data Publishing and Post-Publication Reviews (20)

Scientific Data Publishing
Scientific Data PublishingScientific Data Publishing
Scientific Data Publishing
 
Nanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingNanopublications and Decentralized Publishing
Nanopublications and Decentralized Publishing
 
Nanopubs
NanopubsNanopubs
Nanopubs
 
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
 
Being an Open Scholar in a Connected World
Being an Open Scholar in a Connected WorldBeing an Open Scholar in a Connected World
Being an Open Scholar in a Connected World
 
Challenges in Extracting and Managing References
Challenges in Extracting and Managing ReferencesChallenges in Extracting and Managing References
Challenges in Extracting and Managing References
 
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
 
The culture of researchData
The culture of researchDataThe culture of researchData
The culture of researchData
 
Ngsp
NgspNgsp
Ngsp
 
Enabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data InfrastructuresEnabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data Infrastructures
 
The culture of researchData
The culture of researchData The culture of researchData
The culture of researchData
 
The Culture of Research Data, by Peter Murray-Rust
The Culture of Research Data, by Peter Murray-RustThe Culture of Research Data, by Peter Murray-Rust
The Culture of Research Data, by Peter Murray-Rust
 
Trends in Scholarly Publishing
Trends in Scholarly PublishingTrends in Scholarly Publishing
Trends in Scholarly Publishing
 
Linked Data Publishing with Nanopublications
Linked Data Publishing with NanopublicationsLinked Data Publishing with Nanopublications
Linked Data Publishing with Nanopublications
 
Jankowski presentation-scholarly-publishing-9dec14
Jankowski presentation-scholarly-publishing-9dec14Jankowski presentation-scholarly-publishing-9dec14
Jankowski presentation-scholarly-publishing-9dec14
 
LIBER and LERU roadmap towards Open Access
LIBER and LERU roadmap towards Open AccessLIBER and LERU roadmap towards Open Access
LIBER and LERU roadmap towards Open Access
 
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
 
Drowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research fundingDrowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research funding
 
Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...
 
Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017
 

More from Tobias Kuhn

Genuine semantic publishing
Genuine semantic publishingGenuine semantic publishing
Genuine semantic publishing
Tobias Kuhn
 
The Controlled Natural Language of Randall Munroe’s Thing Explainer
The Controlled Natural Language of Randall Munroe’s Thing Explainer The Controlled Natural Language of Randall Munroe’s Thing Explainer
The Controlled Natural Language of Randall Munroe’s Thing Explainer
Tobias Kuhn
 
Science Bots: A Model for the Future of Scientific Computation?
Science Bots: A Model for the Future of Scientific Computation?Science Bots: A Model for the Future of Scientific Computation?
Science Bots: A Model for the Future of Scientific Computation?Tobias Kuhn
 
Citation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureCitation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureTobias Kuhn
 
Citation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureCitation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureTobias Kuhn
 
Automatische Übersetzung in einem multilingualen, semantischen Wiki
Automatische Übersetzung in einem multilingualen, semantischen WikiAutomatische Übersetzung in einem multilingualen, semantischen Wiki
Automatische Übersetzung in einem multilingualen, semantischen WikiTobias Kuhn
 
Improving Text Mining with Controlled Natural Language: A Case Study for Prot...
Improving Text Mining with Controlled Natural Language: A Case Study for Prot...Improving Text Mining with Controlled Natural Language: A Case Study for Prot...
Improving Text Mining with Controlled Natural Language: A Case Study for Prot...
Tobias Kuhn
 
AceRules: Executing Rules in Controlled Natural Language
AceRules: Executing Rules in Controlled Natural LanguageAceRules: Executing Rules in Controlled Natural Language
AceRules: Executing Rules in Controlled Natural Language
Tobias Kuhn
 
How Controlled English can Improve Semantic Wikis
How Controlled English can Improve Semantic WikisHow Controlled English can Improve Semantic Wikis
How Controlled English can Improve Semantic Wikis
Tobias Kuhn
 
Wissensrepräsentation in kontrolliertem Englisch
Wissensrepräsentation in kontrolliertem EnglischWissensrepräsentation in kontrolliertem Englisch
Wissensrepräsentation in kontrolliertem Englisch
Tobias Kuhn
 
Codeco: A Grammar Notation for Controlled Natural Language in Predictive Editors
Codeco: A Grammar Notation for Controlled Natural Language in Predictive EditorsCodeco: A Grammar Notation for Controlled Natural Language in Predictive Editors
Codeco: A Grammar Notation for Controlled Natural Language in Predictive Editors
Tobias Kuhn
 

Data Publishing and Post-Publication Reviews

  • 1. Data Publishing and Post-Publication Reviews Tobias Kuhn http://www.tkuhn.ch @txkuhn ETH Zurich PEERE Meeting: Peer Review: Past, Present and Future ETH Zurich 22 April 2015
  • 2. Peer Review in the Digital Age Traditionally, peer review has two purposes: (1) Assessment of the validity, accuracy, relevance, and quality of presentation of the given scientific work ... but this assessment is not made public (except for the fact that some papers are accepted) (2) Distribution of a scarce resource, namely pages in printed journals, ... but this scarce resource no longer exists (except for presentation slots at conferences) Is the current peer review system broken? It’s at least terribly old-fashioned and inefficient.
  • 3. Publishing has Gone from Print to Online
  • 4. The Increasing Importance of Digital Data Sets (London Underground staff sorting 4M used tickets to analyse line use in 1939) http://www.telegraph.co.uk/travel/picturegalleries/9791007/The-history-of-the-Tube-in-pictures-150-years-of-London-Underground.html?frame=2447159
  • 5. Approaches for the Future of Scientific Publishing Short term: • Open post-publication reviews • Data publishing • ... Longer term: • Publish results and reviews as Linked Data • Global reputation system that includes artificial agents (“bots”) • ...
  • 6. Open Post-Publication Reviews • Publish first, collect reviews afterwards • Make reviews public: Open Evaluation (similar to Open Access) Such open post-publication reviews have existed for a long time in the case of books and book reviews. See e.g.: Kriegeskorte. Open evaluation: a vision for entirely transparent post-publication peer review and rating for science. Frontiers in Computational Neuroscience 6, 2012.
  • 7. Benefits of Open Post-Publication Reviews • Fast publication • Transparent reviewing • Publicly accessible detailed assessments of scientific works (as compared to just knowing that a paper has been accepted for a given journal) • Incentives for reviewers • Possibility for a multitude of evaluation metrics to be used in parallel
  • 8. Data Publishing Data sets are becoming increasingly important for most scientific disciplines, but so far they are not well integrated into the publishing and reviewing process. Current developments: • Supplementary material for papers • Data journals • Data repositories: Figshare, Dryad, ...
  • 9. Publishing and Reviewing Scientific Data Scientific data that should be published and reviewed: • Input/output data of scientific processes (raw/aggregated) • Scientific interpretations/conclusions (e.g. causal relation between gene and disease) • Meta-data (e.g. involved researchers, citations, used methods) • All of the above should ideally be formatted as Linked Data • Source code of software
  • 10. Nanopublications: Provenance-Aware Semantic Publishing Nanopublications are small pieces of scientific data with their provenance information, represented in a machine-interpretable language (RDF). [Figure: a nanopublication comprising assertion, provenance, and publication info] http://nanopub.org / @nanopub_org
  • 11. Vision: Changing Scholarly Communication Now: narrative articles at the center. Future: nanopublications at the center. Images from Mons et al. The value of data. Nature Genetics, 43(4):281–283, 2011
  • 12. Nanopublications: Provenance-Aware Semantic Publishing Nanopub0001: Assertion: ns1:mosquito ns3:transmission ns2:malaria; Provenance: prov:wasDerivedFrom d:DataSetX; Publication Information: dc:created “2013-01-01”, pav:createdBy p:Isabelle_Dubois. Nanopub0042: Assertion: p:Giuseppe r:givesNegativeReviewFor pub:123456; Provenance: prov:wasInfluencedBy s:CommentX; Publication Information: dc:created “2013-05-01”, pav:createdBy p:Giuseppe. http://nanopub.org / @nanopub_org
  • 13. How to Publish Data? Published data should be: • Verifiable (Is this really the data I am looking for?) • Immutable (Can I be sure that it hasn’t been modified?) • Permanent (Will it be available in 1, 5, 20 years from now?) • Trustworthy (Can I trust the source?) • Reliable (Can it be efficiently retrieved whenever needed?) • Granular (Can I refer to individual data entries?) These points are important (much more so than for papers) because data can and will be consumed and produced automatically by algorithms (“bots”).
  • 14. Trusty URIs: Cryptographic Hash Values for Verifiable and Immutable Web Identifiers Example: http://example.org/r1.RA5AbXdpz5DcaYXCh9l3eI9ruBosiL5XDU3rxBbBaUO70.trig [Figure: resources linked to one another via trusty URIs, showing the range of verifiability] Kuhn, Dumontier. Making Digital Artifacts on the Web Verifiable and Reliable. IEEE Transactions on Knowledge and Data Engineering. To appear. / Kuhn, Dumontier. Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linked Data. ESWC 2014.
  • 15. Nanopublication Indexes Sets of nanopublications can be described as indexes that are nanopublications themselves. This allows us to define arbitrary sets, while keeping the individual addressability of the data entries. [Figure: indexes (a) to (f) connected by the relations “has element”, “has sub-index”, and “appends to”] Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of Data. arXiv:1411.2749.
  • 16. Nanopublication Server Network [Figure: server network for nanopublications with trusty URIs, covering publication, retrieval, and propagation/archiving] http://npmonitor.inn.ac Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of Data. arXiv:1411.2749.
  • 17. How to Review Data? • Large data sets only allow for samples of data entries to be manually reviewed • Data is often produced and consumed by algorithms: How can these data be linked to the algorithms? How should the data and the algorithms be reviewed? • Certain kinds of technical reviewing can be automated! • Can all this be achieved in a fashion that is decentralized, open, and real-time?
  • 18. Bots and Reputation Mechanisms Robust automatic calculation of reputation metrics in a decentralized and open system. [Figure: network with the edge types “gives positive assessment for” and “is contributed by”; nodes are labelled with eigenvector centrality scores (0-100), computed with and without bidirectional contribution edges] Kuhn. Science Bots: A Model for the Future of Scientific Computation? SAVE-SD 2015.
  • 19. Thank you for your attention! Questions?
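
Slides 10 and 12 describe a nanopublication as an assertion plus its provenance and publication information. The following is only a rough sketch of that three-part structure as a plain Python data class. The class name `Nanopub`, the helper `creators`, the identifier `np:Nanopub0001#assertion`, and the exact pairing of triples with parts are assumptions made for illustration; real nanopublications are expressed in RDF.

```python
from dataclasses import dataclass

# A triple is (subject, predicate, object), written here as prefixed names.
Triple = tuple[str, str, str]


@dataclass(frozen=True)
class Nanopub:
    """Hypothetical container mirroring the three parts named on the slides."""
    uri: str
    assertion: tuple[Triple, ...]    # the scientific claim itself
    provenance: tuple[Triple, ...]   # where the assertion came from
    pubinfo: tuple[Triple, ...]      # who published the nanopublication, and when

    def creators(self) -> list[str]:
        """Return every object of a pav:createdBy triple in the publication info."""
        return [o for (_s, p, o) in self.pubinfo if p == "pav:createdBy"]


# Example values follow slide 12; the grouping into parts is an assumption.
np1 = Nanopub(
    uri="np:Nanopub0001",
    assertion=(("ns1:mosquito", "ns3:transmission", "ns2:malaria"),),
    provenance=(("np:Nanopub0001#assertion", "prov:wasDerivedFrom", "d:DataSetX"),),
    pubinfo=(
        ("np:Nanopub0001", "dc:created", "2013-01-01"),
        ("np:Nanopub0001", "pav:createdBy", "p:Isabelle_Dubois"),
    ),
)
```

Keeping the three parts separate lets a consumer (human or bot) inspect the claim, its origin, and its publisher independently, for example via `np1.creators()`.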
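
Slide 14 presents trusty URIs as web identifiers that carry a cryptographic hash of the content they denote. A minimal sketch of that idea is given here. It assumes SHA-256 with unpadded base64url encoding, which matches the length of the 43-character code in the slide's example, and it hashes raw bytes. The approach in the cited papers works on normalized RDF content, so the codes produced here are not compatible with real trusty URIs. All function names are hypothetical.

```python
import base64
import hashlib


def artifact_code(content: bytes, module: str = "RA") -> str:
    """Return a module prefix followed by the base64url SHA-256 of the content.

    Assumption: raw bytes are hashed directly. This is a simplification and
    does not reproduce the RDF normalization of the actual trusty URI method.
    """
    digest = hashlib.sha256(content).digest()
    encoded = base64.urlsafe_b64encode(digest).decode("ascii").rstrip("=")
    return module + encoded


def make_uri(base: str, content: bytes) -> str:
    """Append the artifact code to a base URI, separated by a dot."""
    return f"{base}.{artifact_code(content)}"


def verify(uri: str, content: bytes) -> bool:
    """Check that the code at the end of the URI matches the given content."""
    code = uri.rsplit(".", 1)[-1]
    return code == artifact_code(content)


uri = make_uri("http://example.org/r1", b"some data")
```

Because the identifier depends on the content, anyone who retrieves the data can call `verify(uri, content)` to confirm it is unmodified, which is what makes such identifiers verifiable and immutable in the sense of slide 13.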
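
Slide 18 names eigenvector centrality, scaled to 0-100, as a reputation metric. The sketch below shows one common way to approximate it, power iteration, on a small graph of "gives positive assessment for" edges. It is not the method of the cited paper: the function name, the edge direction (assessor to assessed), the added self-contribution that keeps the iteration from oscillating, and the toy graph are all assumptions.

```python
def eigenvector_centrality(edges, iterations=100):
    """Approximate eigenvector centrality by power iteration, scaled to 0-100.

    edges: iterable of (assessor, assessed) pairs; reputation flows to 'assessed'.
    Each node also keeps its own previous score (a shift by the identity matrix),
    which leaves the eigenvectors unchanged but avoids oscillation on
    periodic graphs.
    """
    edges = list(edges)
    nodes = sorted({node for edge in edges for node in edge})
    if not nodes:
        return {}
    score = {node: 1.0 for node in nodes}
    for _ in range(iterations):
        new = dict(score)                  # self-contribution (identity shift)
        for src, dst in edges:
            new[dst] += score[src]         # reputation passed along the edge
        top = max(new.values())
        score = {node: value / top for node, value in new.items()}
    return {node: round(100 * value) for node, value in score.items()}


# Toy graph: x and y each assess z positively, and z assesses both of them.
scores = eigenvector_centrality([("x", "z"), ("y", "z"), ("z", "x"), ("z", "y")])
```

In this toy graph `z` receives the top score of 100 because it is assessed by two nodes, while `x` and `y` end up tied below it.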