Identifiers and Data Management
Upcoming SlideShare
Loading in...5
×
 

Identifiers and Data Management

on

  • 664 views

How can persistent identifiers like digital object identifiers (DOIs) and archival resource keys (ARKs) help with data management? When should you use a DOI vs and ARK? These "talking point" slides ...

How can persistent identifiers like digital object identifiers (DOIs) and archival resource keys (ARKs) help with data management? When should you use a DOI vs and ARK? These "talking point" slides give you sample text you can use in your data management plans--and more!

Statistics

Views

Total Views
664
Views on SlideShare
657
Embed Views
7

Actions

Likes
3
Downloads
7
Comments
0

1 Embed 7

https://twitter.com 7

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • DOIs are one kind of persistent identifier.But what is an identifier?An identifier is an alphanumeric string assigned to an object, and if that assignment is managed with some metadata and the object is made available over time, the identifier becomes a VERY reliable way of keeping track of that object.
  • Let’s take a look at one.So you can see that with just the identifier and a simple set of metadata, you get:Location for VERIFICATIONEXPOSURE & CITATION TRACKING(this is not an actual DOI, nor an actual study)
  • And here’s that same DOI some time later.THE STRING NEVER CHANGES. This means it can be cited, tracked and associated with all kinds of metadata. More on that in a minute.
  • More on that idea about getting credit for data
  • DATA CITATION1. Get full credit for your research2. Ensure transparency & accountability3. Get more citations & track the impact of your work4. Promote scientific re-use of your work
  • How can EZID be in the business of issuing DataCite DOIs? California Digital Library was one of the founding members.DataCite was indeed formed in 2009 by 10 Libraries and Research Centers with a Mission: “"Helping you find, access, and reuse data“The number has now grown to 17. In addition there are 5 associate members, including the Korea Institute of Science and Technology Information, the National Research Council of Thailand and Beijing Genomics Institute (BGI), so there is a presence in Asia.DATACITE’s primary methodology for achieving this mission: issuing DOIs (Digital Object Identifiers) for datasets.
  • In BETA
  • Image credit: http://www.flickr.com/photos/lgh75/6096247026/By lgh75
  • Image credit: http://www.flickr.com/photos/lgh75/6096247026/By lgh75OSTP Public Access Memo:http://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_public_access_memo_2013.pdfIncreased citations: Piwowar’s 2007 study: http://www.plosone.org/article/info:doi%2F10.1371%2Fjournal.pone.0000308
  • Image credit: http://www.flickr.com/photos/lgh75/6096247026/By lgh75Reference for “documentation and metadata”: http://libraries.mit.edu/guides/subjects/data-management/metadata.html
  • Image credit: http://www.flickr.com/photos/abbot45/187640227/By *USB*And that in turn, means that researchers can-->Build on previous work-->Conduct new research-->Avoid duplicating previous workAnd if that isn’t enough of an incentive, we now have the federal policy statements calling for …“Develop approaches for identifying and providing appropriate attribution to scientific data sets” New OSTP policy

Identifiers and Data Management Identifiers and Data Management Presentation Transcript

  • Identifiers and Data Management Joan Starr California Digital Library
  • What is an identifier?
  • What is an identifier? What you see: alphanumeric string (never changes) Associated with: location of object (such as a URL) Optional: who, what, when, etc (i.e. metadata) By Joelk75: http://www.flickr.com/photos/75001512@N00/2728233597/
  • Identifier example string: doi:10.9999/FK40K2GTV html version: http://dx.doi.org/10.9999/FK40K2GTV location: http://www.bologna.edu/biology/xfg/123.xls metadata creator: Dr. Felix Kottor title: Data for chromosomal study of catfish (Ictalurus punctatus) publisher: University of Bologna date: 8/31/2012
  • Identifier example string: doi:10.9999/FK40K2GTV html version: http://dx.doi.org/10.9999/FK40K2GTV location: http://www.state.edu/ecology/783sdr/123.xls metadata creator: Dr. Felix Kottor title: Data for chromosomal study of catfish (Ictalurus punctatus) publisher: Dryad Data Repository date: 10/01/2013
  • Why Identifiers are Important
  • Allow readers to find data products Get credit for data and publications Promote reproducibility Better measure of research impact Example: Sidlauskas, B. 2007. Data from: Testing for unequal rates of morphological diversification in the absence of a detailed phylogeny: a case study from characiform fishes. Dryad Digital Repository. doi:10.5061/dryad.20 Why Identifiers are Important
  • A&I Indexing and #altmetrics
  • • DataCite DOIs for data, linking to scholarly research • Credit to data producers and data publishers • Exposure and research metrics for datasets (Web of Knowledge, Google) Primary Functions 1. Create identifiers 2. Manage identifiers (and metadata) over time 3. Resolve identifiers EZID Long term identifiers made easy @ezidCDL http://n2t.net/ezid
  • DataCite Services 1. DOIs for data! 2. Local service & support 3. Usage stats 4. Citation formatter 5. Content negotiation 6. Metadata search 7. OAI provider 8. DataCite-to-ORCID hookup* 9. Your ideas here…
  • ARKs DOIs IDF EZID CLIENTS DOIs DOIs EZID and DataCite together
  • EZID: DOIs & ARKs DOIs ARKs Strict metadata requirements Flexible metadata guidelines From the scholarly communication community From the archives and museums community Established “brand name” Option-rich, open source Use case: Data Citation Use case: Data Documentation
  • DOIs and ARKs in data management plans
  • DOIs in data management plans What it looks like: Sample plan language: Aguilée R, Lambert A, Claessen D (2011) Data from: Ecological speciation in dynamic landscapes. Journal of Evolutionary Biology doi:10.5061/dryad.74024 Publication of data shall occur during the project, if appropriate, or at the end of the project, consistent with normal scientific practices. [Team] follows a standardized data product citation including DOI, that indicates the version and how to obtain a copy of that product. Why it’s important: OSTP mandate to: identify and provide “appropriate attribution to scientific data sets” Researcher benefits: Credit, increased citations, increased productivity Data Citation
  • ARKs in data management plans What it looks like: At top-level directory/folder: Project Title Unique Identifier Date (yyyy or yyyy.mm.dd) At sub-directories: optional identifiers at granular levels Sample plan language: [Team] follows the recommended best practice for good data management by assigning unique identifiers (ARKs) to the data as part of the data documentation. Why it’s important: Researcher benefits: Data documentation helps you keep track of (and remember) aspects of your data throughout the research project. Data Documentation
  • Identifiers for data management Identifiers + data= • Easy to access, • Easy to re-use • Easy to verify
  • EZID DMP Tool Email Twitter http://ezid.cdlib.org https://dmptool.org/ ezid@ucop.edu @ezidCDL