Dataset citation and identification
I gave this presentation to the STM Publishers Association Innovation Conference in London on 4 December 2009. It frames the data citation problem and introduces DataCite, the international data citation initiative.

Published in: Technology, Education


  • 1. Dataset citation and identification
    Adam Farquhar, PhD
    Head of Digital Library Technology, The British Library
    President, DataCite
    December 2009
  • 2. Widening gap
    A widening gap in the scientific record between published research and the data that underlies it:
      • Published work held by libraries
      • Datasets held by data centres
      • No effective way to link between datasets and articles
      • No widely used method to identify datasets
      • No widely used method to cite datasets
    As a result, datasets are:
      • Difficult to discover
      • Difficult to access
      • Second-class citizens in the scientific record
  • 3. Datasets – first class citizens?
    Datasets:
      • Data is difficult to manage after project funding ceases
      • Informal networks provide the primary means of sharing
      • Only 21% use a national or international facility
      • Datasets are not included in impact analysis
      • Good luck finding it or getting permission to use it (your discipline may vary)
    Published articles:
      • Libraries ensure long-term storage and management
      • Established funded services provide the primary means of access
      • Nearly all published articles are held in multiple national libraries
      • Articles and citations form the backbone of impact analysis
      • Catalogues and full-text search support discovery
    Source: UKRDS Study
  • 4. Dataset citation using Digital Object Identifiers (DOIs)
    The DOI system offers an easy way to connect the article with the underlying data.
    Several organisations assign DOIs to datasets:
      • IUCR, ICPSR, OECD through CrossRef
      • PANGAEA, Mare, and others through TIB (German Science Library)
    Example of an article that cites its dataset:
      • Dataset: G. Yancheva, N. R. Nowaczyk et al (2007) Rock magnetism and X-ray fluorescence spectrometry analyses on sediment cores of the Lake Huguang Maar, Southeast China, PANGAEA doi:10.1594/PANGAEA.587840
      • Article: G. Yancheva, N. R. Nowaczyk et al (2007) Influence of the intertropical convergence zone on the East Asian monsoon, Nature 445, 74-77 doi:10.1038/nature05431
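
As a rough illustration of how a DOI connects a citation to a resolvable link, here is a minimal Python sketch. It assumes the public DOI proxy at https://doi.org/ as the resolver; the function names `normalize_doi`, `split_doi`, and `doi_to_url` are hypothetical helpers invented for this example and do not come from the slides or from any DataCite tool.

```python
from urllib.parse import quote

# Assumption: DOIs are resolved through the public DOI proxy.
DOI_RESOLVER = "https://doi.org/"


def normalize_doi(raw: str) -> str:
    """Strip common prefixes ("doi:", resolver URLs) and return the bare DOI."""
    doi = raw.strip()
    for prefix in ("doi:", "https://doi.org/", "http://dx.doi.org/"):
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    # A DOI has the form "10.<registrant>/<suffix>".
    if not doi.startswith("10.") or "/" not in doi:
        raise ValueError(f"not a DOI: {raw!r}")
    return doi


def split_doi(raw: str) -> tuple:
    """Return (prefix, suffix); the prefix identifies the registrant."""
    prefix, suffix = normalize_doi(raw).split("/", 1)
    return prefix, suffix


def doi_to_url(raw: str) -> str:
    """Build a resolver URL that redirects to the object's current location."""
    return DOI_RESOLVER + quote(normalize_doi(raw), safe="/")


if __name__ == "__main__":
    for citation_doi in ("doi:10.1594/PANGAEA.587840", "doi:10.1038/nature05431"):
        print(split_doi(citation_doi), doi_to_url(citation_doi))
```

The dataset and the article on this slide carry different prefixes (10.1594 and 10.1038), yet both resolve through the same mechanism, which is what allows an article to cite its underlying data the same way it cites another article.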
  • 5. DataCite – International Data Citation Initiative
    Our long term vision is to support researchers by providing methods for them to locate, identify, and cite research datasets with confidence.
    Milestones:
      • 2005, Hannover: TIB begins to issue DOIs for datasets
      • March 2009, Paris: Memorandum signed at ICSTI
      • December 2009, London: DataCite Association founded
    (DataCite : Data Centres :: CrossRef : Publishers)
  • 6. Global partnership
      • Germany - Technische Informationsbibliothek (TIB)
      • United Kingdom - The British Library
      • France - L’Institut de l’Information Scientifique et Technique (INIST)
      • Switzerland - Library of the ETH Zürich
      • Denmark - Technical Information Center
      • Netherlands - Library of TU Delft
      • Canada - Canadian Institute for Scientific and Technical Information (CISTI)
      • Australia - National Data Service (ANDS)
      • USA - California Digital Library
      • USA - Purdue University
  • 7. DataCite
    The DataCite registration agency:
      • Maintains the resolution infrastructure
      • Maintains a searchable database of metadata
      • Manages the identifiers over the long term
      • Establishes and shares best practice
    Publishing agents (data centres, research institutes, publishers) are responsible for:
      • Quality assurance
      • Content storage and access
      • Creating the identifier
      • Creating and updating metadata
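
The split of responsibilities on this slide can be sketched as a toy in-memory registry. This is only an illustration under stated assumptions: `ToyRegistry`, `DatasetRecord`, and every method name are hypothetical, the example.org URLs are placeholders, and nothing here reflects the actual DataCite infrastructure or API. The registry stands in for the registration agency (resolution plus searchable metadata), while the caller plays the publishing agent that creates identifiers and keeps metadata and locations current.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class DatasetRecord:
    doi: str        # identifier created by the publishing agent
    url: str        # where the data centre currently serves the dataset
    metadata: Dict  # descriptive metadata supplied by the publishing agent


class ToyRegistry:
    """Hypothetical stand-in for a registration agency's core services."""

    def __init__(self) -> None:
        self._records: Dict[str, DatasetRecord] = {}

    def register(self, doi: str, url: str, metadata: Dict) -> None:
        key = doi.lower()  # DOI names are case-insensitive
        if key in self._records:
            raise ValueError(f"{doi} is already registered")
        self._records[key] = DatasetRecord(doi, url, dict(metadata))

    def update_url(self, doi: str, new_url: str) -> None:
        # The identifier stays fixed even when the content moves.
        self._records[doi.lower()].url = new_url

    def resolve(self, doi: str) -> str:
        return self._records[doi.lower()].url

    def search(self, term: str) -> List[str]:
        # Case-insensitive substring match over all metadata values.
        needle = term.lower()
        return [
            rec.doi
            for rec in self._records.values()
            if any(needle in str(v).lower() for v in rec.metadata.values())
        ]


if __name__ == "__main__":
    registry = ToyRegistry()
    registry.register(
        "10.1594/PANGAEA.587840",
        "https://example.org/datasets/587840",  # placeholder landing page
        {"title": "Rock magnetism and X-ray fluorescence analyses", "year": 2007},
    )
    registry.update_url("10.1594/PANGAEA.587840", "https://example.org/archive/587840")
    print(registry.resolve("10.1594/pangaea.587840"))
    print(registry.search("rock magnetism"))
```

The point of the design is that a citation only ever records the DOI: when the data centre moves the content, it updates the registered URL and existing citations keep resolving.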
  • 8. DataCite Structure (organisation chart)
      • International DOI Foundation: DataCite is a member
      • DataCite: managing agent is TIB
      • Under DataCite: Member Institution, Member Institution, Associate Stakeholder, …
      • Each of these works with Data Centres
  • 17. Research Data in Articles
  • 18. How can we work together?
    DataCite supports researchers by enabling them to locate, identify, and cite research datasets with confidence.
    This is the start of a conversation. We welcome your comments, questions, and ideas!
      • Help to establish best practices
      • Adjust author policies to require clear unambiguous citations for datasets
      • Integrate links to datasets into delivery platforms
      • Collaborate to understand evolving roles and responsibilities among publishers, data centres, and libraries
      • Help me to rewrite this list!
    Contact: adam.farquhar {@}, jan.brase {@}