Varsha Khodiyar, PhD
Data Curation Editor, Scientific Data
Nature Publishing Group
@varsha_khodiyar
@scientificdata
Twitter friendly talk
Data Publishing and Institutional Repositories
FORCE16 : Libraries United in Opening New Scholarly Platforms
18th April 2016
2
www.nature.com/scientificdata
Why data papers? – Credit for data producers
3
Why data papers? - Data reuse is easier
4
“The Data Descriptor made it easier
to use the data, for me it was critical
that everything was there…all the
technical details like voxel size.”
Professor Daniele Marinazzo
Results
Discussion
Analysis & Conclusions
What was done to generate the data?
How was the data processed?
Where is the data?
Who did what and when?
Methods and technical analyses supporting the quality of the measurements.
Do not contain tests of new scientific hypotheses
Comparison of data paper to traditional article
Data papers at Scientific Data
6
Human readable
representation of
study
i.e. article (HTML &
PDF)
Human readable
component
i.e. article text
(HTML & PDF)
Machine
accessible
component
i.e. metadata
(ISA format)
Scientific Data’s Repository List
Almost 80 recommended data repositories listed
1. Is there a public data-specific repository for your data?
2. If there is no public data-specific repository for your data
exists, does your funder or institution mandate deposition to
a particular repository?
www.nature.com/sdata/data-policies/repositories
7
Authors can use their own IR when submitting to
Scientific Data
For institutional repositories (such as ScholarsArchive@OSU), select
‘DataCite DOI’ as Repository Name during submission
DataCite
8
+
IR
Scientific Data’s IR workflow
10
Final Data Citation
What do data journals require from a repository?
1. Recognized within their scientific community
2. Long-term data preservation plan
3. Implementation of community reporting standards
4. Stable identifiers for published datasets
5. Allow open access to data without unnecessary
restrictions e.g. no commercial use restrictions
Questionnaire for new repositories requesting listing:
http://www.nature.com/sdata/data-policies#repo-suggest
11
Visit nature.com/sdata
Email scientificdata@nature.com
Tweet @ScientificData
Honorary Academic Editor
Susanna-Assunta Sansone
Managing Editor
Andrew L. Hufton
Data Curation Editor
Varsha K. Khodiyar
Advisory Panel and Editorial
Board including senior researchers,
funders, librarians and curators
Supported by

Data Publishing and Institutional Repositories

  • 1.
    Varsha Khodiyar, PhD DataCuration Editor, Scientific Data Nature Publishing Group @varsha_khodiyar @scientificdata Twitter friendly talk Data Publishing and Institutional Repositories FORCE16 : Libraries United in Opening New Scholarly Platforms 18th April 2016
  • 2.
  • 3.
    Why data papers?– Credit for data producers 3
  • 4.
    Why data papers?- Data reuse is easier 4 “The Data Descriptor made it easier to use the data, for me it was critical that everything was there…all the technical details like voxel size.” Professor Daniele Marinazzo
  • 5.
    Results Discussion Analysis & Conclusions Whatwas done to generate the data? How was the data processed? Where is the data? Who did what and when? Methods and technical analyses supporting the quality of the measurements. Do not contain tests of new scientific hypotheses Comparison of data paper to traditional article
  • 6.
    Data papers atScientific Data 6 Human readable representation of study i.e. article (HTML & PDF) Human readable component i.e. article text (HTML & PDF) Machine accessible component i.e. metadata (ISA format)
  • 7.
    Scientific Data’s RepositoryList Almost 80 recommended data repositories listed 1. Is there a public data-specific repository for your data? 2. If there is no public data-specific repository for your data exists, does your funder or institution mandate deposition to a particular repository? www.nature.com/sdata/data-policies/repositories 7
  • 8.
    Authors can usetheir own IR when submitting to Scientific Data For institutional repositories (such as ScholarsArchive@OSU), select ‘DataCite DOI’ as Repository Name during submission DataCite 8
  • 9.
  • 10.
  • 11.
    What do datajournals require from a repository? 1. Recognized within their scientific community 2. Long-term data preservation plan 3. Implementation of community reporting standards 4. Stable identifiers for published datasets 5. Allow open access to data without unnecessary restrictions e.g. no commercial use restrictions Questionnaire for new repositories requesting listing: http://www.nature.com/sdata/data-policies#repo-suggest 11
  • 12.
    Visit nature.com/sdata Email scientificdata@nature.com Tweet@ScientificData Honorary Academic Editor Susanna-Assunta Sansone Managing Editor Andrew L. Hufton Data Curation Editor Varsha K. Khodiyar Advisory Panel and Editorial Board including senior researchers, funders, librarians and curators Supported by

Editor's Notes

  • #3 Scientific Data is an open-access, peer-reviewed publication for descriptions of scientifically valuable datasets. Our primary article-type, the Data Descriptor, is designed to make your data more discoverable, interpretable and reusable.
  • #4 Biology example: Sequence data from this Data Descriptor, was used to help develop and test this groundbreaking assembly algorithm published at Nature. Data publication does NOT in any way preclude collaboration and co-authorship – our observations so far suggest very much the opposite.  
  • #5 Daniele knew about the dataset prior to Chris’ paper being published, as Chris had shared this in Torrent Exchange.  However he did not access the data from this.  He saw on Twitter when the SciData paper was published and then read the paper. Daniele said “I would never have collected this data myself, as it’s not my primary field of work”. He said the Data Descriptor made it easier to use the data “for me it was critical that everything was there [in the Data Descriptor ]…all the technical details like voxel size.”
  • #6 Data Descriptors are methodologically driven while traditional research articles are hypothesis driven. Nature-titled journals have agreed that prior publication of a Data Descriptor will not compromise the novelty of new manuscript submissions as long as those manuscripts go substantially beyond a descriptive analysis of the data, and report important new scientific findings appropriate for the journal. See full policy online: http://www.nature.com/sdata/for-authors/editorial-and-publishing-policies/#prior-pub
  • #8 We currently list almost 80 repositories, across biological, medical, physical and social sciences When required, we provide guidance to authors on the best place to store their data