
FSCI Persistent Identifiers


Identifying and linking data using persistent identifiers:
What are persistent identifiers and how do they help research discovery, accessibility and reproducibility?
Which identifier should you choose and when?

Published in: Data & Analytics


  1. Force 11 Scholarly Communications Institute Summer School, 31 July – 4 August 2017, University of California, San Diego. Data in the Scholarly Communications Lifecycle. Natasha Simons, Senior Research Data Specialist
  2. Wednesday 2 August. Session two: persistent identifiers for research (data). Duration: 30 mins
     • Why do we need PiDs?
     • What are PiDs?
     • Why use PiDs?
     • Why are there so many PiDs?
     • Examples: Handles, DOIs, ORCIDs
     • Which PiD to choose?
     • Power of linking PiDs
     • PiD fails
     • PiD community
  3. What’s the problem?
  4. What are Persistent Identifiers (PiDs)? A persistent identifier is a long-lasting reference to a digital resource. Photo attribution: Jan Hettenhausen (reproduced with permission)
  5. Why use PiDs? PiDs play a key role in the discoverability, accessibility and reproducibility of research. Use PiDs to connect:
     • Researchers
     • Publications
     • Data
     • Software
     • Methods
     • Equipment
     • ???
  6. Why are there so many PiDs? PiD systems are marked by differences in:
     • Purpose
     • Scope
     • Underlying technology
     • Governance and social infrastructure
     • Metadata collected
     • Cost
     • Extent of use
     Examples: ARK, PURL, NLA party ID
  7. Example: The Handle System
     • Run by CNRI
     • Robust system
     • Widely used in publication repositories
     • Used to identify research datasets
  8. How do Handles work? Example: = resolver service / 11343 = prefix identifying the assigning body (University of Melbourne) / 130078 = suffix identifying the resource (a University of Melbourne report)
  9. Example: Digital Object Identifiers (DOIs)
     • Run by the International DOI Foundation
     • Robust: built on the Handle System
     • Origins in the publishing industry
     • Used to identify and cite publications and research datasets
     • The most widely used PiD for research data
  10. How do DOIs work? This is an example from Griffith University: = resolver service / 10.4225 = prefix identifying the assigning body (ANDS) / 01 = suffix 1, the institution identifier (Griffith University) / 4F3DB08617645 = suffix 2, the resource item or collection identifier (a dataset held in the Griffith data repository)
  11. More about DOIs
      • Metadata required! Example: DataCite Metadata Schema
      • DOI search services, e.g. DataCite
      • There is a cost involved, but some agencies such as ANDS offer a free service
      • To get a DOI through the ANDS service: machine-to-machine (m2m) or manual minting
  12. Example: ORCIDs
      • Run by the ORCID organisation
      • Identifier for people (researchers)
      • Links people with their research ‘works’
      • Widely used internationally
      • Australian research sector-wide endorsement
      • Embedded in scholarly workflows
  13. How do ORCIDs work?
      • 16-digit identifier based on an ISNI block
      • Prototype: Thomson Reuters ResearcherID
      • Most metadata fields are optional
      • Free for researchers, fee for members (organisations)
      • Public API (free) and premium API (members)
      • Transparent governance and development process
  14. The power of linking PiDs
      • International efforts to link ORCIDs (researchers) with DOIs (publications and data)
      • The Scholix initiative:
        • a global framework to improve the links between publications and data
        • beneficial for all, especially publishers (who can display this link in journals) and repositories (which can link back to data held in repositories)
  15. Which PiD to choose? Evaluate the PiD service:
      • Purpose
      • Scope
      • Underlying technology
      • Governance and social infrastructure
      • Metadata collected
      • Cost
      • Extent of use
      • Trustworthiness?
      Choose the best-fit PiD for the type of resource and its point in the research lifecycle. Better to choose one than none!
  16. PiDs sound great, but hang on…? Have PiD systems ever failed? What’s the guarantee they will stay “long lasting”?
      Erm…
      • Recent PiD crises: PURL, LSID
      • “Zombie PiDs”?
      Remember:
      • PiDs are both social and technical systems
      • Governance and organisations can be the Achilles heel of PiD systems
      See: Klump, J. & Huber, R. (2017). 20 Years of Persistent Identifiers – Which Systems are Here to Stay? Data Science Journal, 16, p. 9. DOI:
  17. Cool and groovy international PiD community
  18. Summary
      • PiDs play a key role in the discovery, accessibility and reproducibility of research.
      • There are many PiD systems, which vary in purpose, scope, underlying technology, governance and social infrastructure, metadata collected, cost and extent of use.
      • When evaluating which PiD to assign to a resource, consider:
        • The differences above and, importantly, trustworthiness
        • It is better to assign one or more PiDs than no PiD at all
      • Remember that PiDs are about social as well as technical infrastructure. It is the responsibility of the PiD owner (e.g. a university) to update the PiD if the resource location changes.
      • PiDs are evolving, so get your geek on and join in the discussions!
  19. Want more?
      Have a go at:
      • Thing 14 – Identifiers and linked data
      Read:
      • ANDS website for PiD Guides, DOI service, Handle service:
      • More about DataCite
      • More about ORCID
      • ICSU/CODATA Data Science Journal special issue: 20 years of Persistent Identifiers
      Watch:
      • ANDS PiDs short bites webinar series (persistent identifiers playlist); more to come in this series!
      • THOR Project webinar series
  20. With the exception of logos, third party images or where otherwise indicated, this work is licensed under the Creative Commons Australia Attribution 3.0 Licence. ANDS is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy Program. Monash University leads the partnership with the Australian National University and CSIRO. Natasha Simons, Tw: @n_simons, ORCID:
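
The identifier anatomy described in slides 8, 10 and 13 can be sketched in a few lines of Python. This is a minimal illustration, not part of the original deck: the function names are hypothetical, and the resolver base URLs in the demo (`https://hdl.handle.net` and `https://doi.org`) are assumptions, because the example URLs from the slides did not survive in this transcript. The ORCID check-digit rule (ISO 7064 MOD 11-2) is not mentioned on the slides; it is included to show what the final character of the 16-character identifier encodes. The test ORCID iD is ORCID's well-known fictitious example, not a real researcher.

```python
import re
from typing import NamedTuple


class ParsedPid(NamedTuple):
    prefix: str  # identifies the assigning body (e.g. "11343" or "10.4225")
    suffix: str  # identifies the resource within that body's namespace


def split_pid(pid: str) -> ParsedPid:
    """Split a Handle or DOI of the form 'prefix/suffix' at the first slash."""
    prefix, sep, suffix = pid.partition("/")
    if not sep or not prefix or not suffix:
        raise ValueError(f"not a prefix/suffix identifier: {pid!r}")
    return ParsedPid(prefix, suffix)


def resolver_url(pid: str, resolver: str) -> str:
    """Join a resolver base URL (an assumption, passed in by the caller) and a PiD."""
    return resolver.rstrip("/") + "/" + pid


def orcid_check_digit(base_digits: str) -> str:
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid: str) -> bool:
    """Check the shape (15 digits plus a digit or X) and the check character."""
    compact = orcid.replace("-", "").upper()
    if not re.fullmatch(r"[0-9]{15}[0-9X]", compact):
        return False
    return orcid_check_digit(compact[:15]) == compact[15]


if __name__ == "__main__":
    # Handle from slide 8: prefix = assigning body, suffix = resource.
    print(split_pid("11343/130078"))
    print(resolver_url("11343/130078", "https://hdl.handle.net"))

    # DOI from slide 10: the suffix is split again by the convention the slide
    # describes (institution identifier, then item identifier).
    doi = split_pid("10.4225/01/4F3DB08617645")
    institution, item = doi.suffix.split("/", 1)
    print(doi.prefix, institution, item)
    print(resolver_url("10.4225/01/4F3DB08617645", "https://doi.org"))

    # Fictitious example ORCID iD used in ORCID's own documentation.
    print(is_valid_orcid("0000-0002-1825-0097"))
```

Splitting at the first slash only is deliberate: the part after the prefix is opaque to the resolver, so any further structure inside the suffix, such as the institution and item parts in the Griffith example, is a local convention of the assigning body rather than something the DOI or Handle system enforces.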