Your SlideShare is downloading. ×
0
Identifiers and
Data Management
Joan Starr
California Digital Library
What is an identifier?
What is an identifier?
What you see: alphanumeric string (never changes)
Associated with: location of object (such as a UR...
Identifier example
string: doi:10.9999/FK40K2GTV
html version: http://dx.doi.org/10.9999/FK40K2GTV
location: http://www.bo...
Identifier example
string: doi:10.9999/FK40K2GTV
html version: http://dx.doi.org/10.9999/FK40K2GTV
location: http://www.st...
Why Identifiers are Important
Allow readers to find data products
Get credit for data and publications
Promote reproducibility
Better measure of researc...
A&I Indexing and #altmetrics
• DataCite DOIs for data, linking to
scholarly research
• Credit to data producers and data
publishers
• Exposure and rese...
DataCite Services
1. DOIs for data!
2. Local service & support
3. Usage stats
4. Citation formatter
5. Content negotiation...
ARKs
DOIs
IDF
EZID CLIENTS
DOIs
DOIs
EZID and DataCite
together
EZID: DOIs & ARKs
DOIs ARKs
Strict metadata requirements Flexible metadata guidelines
From the scholarly communication
com...
DOIs and ARKs in
data management plans
DOIs in data management plans
What it looks like: Sample plan language:
Aguilée R, Lambert A, Claessen D (2011)
Data from:...
ARKs in data management plans
What it looks like:
At top-level directory/folder:
Project Title
Unique Identifier
Date (yyy...
Identifiers for data management
Identifiers + data=
• Easy to access,
• Easy to re-use
• Easy to verify
EZID
DMP Tool
Email
Twitter
http://ezid.cdlib.org
https://dmptool.org/
ezid@ucop.edu
@ezidCDL
Identifiers and Data Management
Upcoming SlideShare
Loading in...5
×

Identifiers and Data Management

867

Published on

How can persistent identifiers like digital object identifiers (DOIs) and archival resource keys (ARKs) help with data management? When should you use a DOI vs and ARK? These "talking point" slides give you sample text you can use in your data management plans--and more!

Published in: Technology
0 Comments
3 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
867
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
9
Comments
0
Likes
3
Embeds 0
No embeds

No notes for slide
  • DOIs are one kind of persistent identifier.But what is an identifier?An identifier is an alphanumeric string assigned to an object, and if that assignment is managed with some metadata and the object is made available over time, the identifier becomes a VERY reliable way of keeping track of that object.
  • Let’s take a look at one.So you can see that with just the identifier and a simple set of metadata, you get:Location for VERIFICATIONEXPOSURE & CITATION TRACKING(this is not an actual DOI, nor an actual study)
  • And here’s that same DOI some time later.THE STRING NEVER CHANGES. This means it can be cited, tracked and associated with all kinds of metadata. More on that in a minute.
  • More on that idea about getting credit for data
  • DATA CITATION1. Get full credit for your research2. Ensure transparency & accountability3. Get more citations & track the impact of your work4. Promote scientific re-use of your work
  • How can EZID be in the business of issuing DataCite DOIs? California Digital Library was one of the founding members.DataCite was indeed formed in 2009 by 10 Libraries and Research Centers with a Mission: “"Helping you find, access, and reuse data“The number has now grown to 17. In addition there are 5 associate members, including the Korea Institute of Science and Technology Information, the National Research Council of Thailand and Beijing Genomics Institute (BGI), so there is a presence in Asia.DATACITE’s primary methodology for achieving this mission: issuing DOIs (Digital Object Identifiers) for datasets.
  • In BETA
  • Image credit: http://www.flickr.com/photos/lgh75/6096247026/By lgh75
  • Image credit: http://www.flickr.com/photos/lgh75/6096247026/By lgh75OSTP Public Access Memo:http://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_public_access_memo_2013.pdfIncreased citations: Piwowar’s 2007 study: http://www.plosone.org/article/info:doi%2F10.1371%2Fjournal.pone.0000308
  • Image credit: http://www.flickr.com/photos/lgh75/6096247026/By lgh75Reference for “documentation and metadata”: http://libraries.mit.edu/guides/subjects/data-management/metadata.html
  • Image credit: http://www.flickr.com/photos/abbot45/187640227/By *USB*And that in turn, means that researchers can-->Build on previous work-->Conduct new research-->Avoid duplicating previous workAnd if that isn’t enough of an incentive, we now have the federal policy statements calling for …“Develop approaches for identifying and providing appropriate attribution to scientific data sets” New OSTP policy
  • Transcript of "Identifiers and Data Management"

    1. 1. Identifiers and Data Management Joan Starr California Digital Library
    2. 2. What is an identifier?
    3. 3. What is an identifier? What you see: alphanumeric string (never changes) Associated with: location of object (such as a URL) Optional: who, what, when, etc (i.e. metadata) By Joelk75: http://www.flickr.com/photos/75001512@N00/2728233597/
    4. 4. Identifier example string: doi:10.9999/FK40K2GTV html version: http://dx.doi.org/10.9999/FK40K2GTV location: http://www.bologna.edu/biology/xfg/123.xls metadata creator: Dr. Felix Kottor title: Data for chromosomal study of catfish (Ictalurus punctatus) publisher: University of Bologna date: 8/31/2012
    5. 5. Identifier example string: doi:10.9999/FK40K2GTV html version: http://dx.doi.org/10.9999/FK40K2GTV location: http://www.state.edu/ecology/783sdr/123.xls metadata creator: Dr. Felix Kottor title: Data for chromosomal study of catfish (Ictalurus punctatus) publisher: Dryad Data Repository date: 10/01/2013
    6. 6. Why Identifiers are Important
    7. 7. Allow readers to find data products Get credit for data and publications Promote reproducibility Better measure of research impact Example: Sidlauskas, B. 2007. Data from: Testing for unequal rates of morphological diversification in the absence of a detailed phylogeny: a case study from characiform fishes. Dryad Digital Repository. doi:10.5061/dryad.20 Why Identifiers are Important
    8. 8. A&I Indexing and #altmetrics
    9. 9. • DataCite DOIs for data, linking to scholarly research • Credit to data producers and data publishers • Exposure and research metrics for datasets (Web of Knowledge, Google) Primary Functions 1. Create identifiers 2. Manage identifiers (and metadata) over time 3. Resolve identifiers EZID Long term identifiers made easy @ezidCDL http://n2t.net/ezid
    10. 10. DataCite Services 1. DOIs for data! 2. Local service & support 3. Usage stats 4. Citation formatter 5. Content negotiation 6. Metadata search 7. OAI provider 8. DataCite-to-ORCID hookup* 9. Your ideas here…
    11. 11. ARKs DOIs IDF EZID CLIENTS DOIs DOIs EZID and DataCite together
    12. 12. EZID: DOIs & ARKs DOIs ARKs Strict metadata requirements Flexible metadata guidelines From the scholarly communication community From the archives and museums community Established “brand name” Option-rich, open source Use case: Data Citation Use case: Data Documentation
    13. 13. DOIs and ARKs in data management plans
    14. 14. DOIs in data management plans What it looks like: Sample plan language: Aguilée R, Lambert A, Claessen D (2011) Data from: Ecological speciation in dynamic landscapes. Journal of Evolutionary Biology doi:10.5061/dryad.74024 Publication of data shall occur during the project, if appropriate, or at the end of the project, consistent with normal scientific practices. [Team] follows a standardized data product citation including DOI, that indicates the version and how to obtain a copy of that product. Why it’s important: OSTP mandate to: identify and provide “appropriate attribution to scientific data sets” Researcher benefits: Credit, increased citations, increased productivity Data Citation
    15. 15. ARKs in data management plans What it looks like: At top-level directory/folder: Project Title Unique Identifier Date (yyyy or yyyy.mm.dd) At sub-directories: optional identifiers at granular levels Sample plan language: [Team] follows the recommended best practice for good data management by assigning unique identifiers (ARKs) to the data as part of the data documentation. Why it’s important: Researcher benefits: Data documentation helps you keep track of (and remember) aspects of your data throughout the research project. Data Documentation
    16. 16. Identifiers for data management Identifiers + data= • Easy to access, • Easy to re-use • Easy to verify
    17. 17. EZID DMP Tool Email Twitter http://ezid.cdlib.org https://dmptool.org/ ezid@ucop.edu @ezidCDL
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×