Metadata as Linked Data for Research Data Repositories - Andrea Huang
“Every man has his own cosmology and who can say that his own is right,” Einstein said. The same holds when we come to data semantics: one dataset may be interpreted differently by different data creators, curators, and re-users. How, then, do we build a better research data repository?
We start from the point made by Willis, Greenberg, and White (2012) that metadata for research data increases access to and reuse of that data. Stanford, Harvard, and Cornell likewise regard linked data technologies as a promising way to gather contextual information about research resources.
Looking for tools that can meet research repositories' urgent need for innovative solutions providing feature-rich data-publishing services, such as visualization, validation, and reuse in different applications (Assante et al., 2016), our first choice is CKAN (the Comprehensive Knowledge Archive Network), a major solution that makes linked metadata available, citable, and validated.
Original file: http://m.odw.tw/u/odw/m/metadata-as-linked-data-for-research-data-repositories/
Giving Credit Where Credit is Due: Author and Funder IDs - Andrea Payant
A process for including standardized funder and author identifiers in institutional repository and ILS records associated with funded research data
Linking Scientific Metadata (presented at DC2010) - Jian Qin
Linked entity data in metadata records builds a foundation for the semantic web. Even though metadata records contain rich entity data, there is no linking between associated entities such as persons, datasets, projects, publications, or organizations. We conducted a small experiment using the dataset collection from the Hubbard Brook Ecosystem Study (HBES), in which we converted the entities and their relationships into RDF triples and linked the URIs contained in the RDF triples to the corresponding entities in the Ecological Metadata Language (EML) records. Through a transformation program written in the Extensible Stylesheet Language (XSL), we turned a plain EML record display into an interlinked semantic web of ecological datasets. The experiment demonstrates the methodological feasibility of incorporating linked entity data into metadata records. The paper also argues for the need to change the scientific as well as the general metadata paradigm.
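To make the conversion concrete, here is a minimal sketch, assuming Python with the rdflib library, of how entities extracted from an EML record (a dataset, its creator, and the publishing organization) might be expressed as interlinked RDF triples. The URIs, names, and the HBES namespace are hypothetical illustrations, not identifiers from the actual HBES collection.

```python
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import DCMITYPE, DCTERMS, FOAF, RDF

# Hypothetical namespace standing in for the HBES dataset collection.
HBES = Namespace("http://example.org/hbes/")

g = Graph()
g.bind("dcterms", DCTERMS)
g.bind("foaf", FOAF)

dataset = HBES["dataset/42"]
person = HBES["person/j-doe"]
org = HBES["org/hubbard-brook"]

# Entities and relationships as they might be extracted from one EML record.
g.add((dataset, RDF.type, DCMITYPE.Dataset))
g.add((dataset, DCTERMS.title, Literal("Stream chemistry, watershed 6")))
g.add((dataset, DCTERMS.creator, person))   # dataset-to-person link
g.add((dataset, DCTERMS.publisher, org))    # dataset-to-organization link
g.add((person, RDF.type, FOAF.Person))
g.add((person, FOAF.name, Literal("J. Doe")))
g.add((org, RDF.type, FOAF.Organization))
g.add((org, FOAF.name, Literal("Hubbard Brook Ecosystem Study")))

print(g.serialize(format="turtle"))
```

Each URI can then be rendered as a link in the transformed record display, which is what turns a flat EML record into a navigable web of entities.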
As BioPharma adapts to incorporate nimble networks of suppliers, collaborators, and regulators, the ability to link data is critical for dynamic interoperability. Adoption of the linked data paradigm allows BioPharma to focus on its core business: delivering valuable therapeutics in a timely manner.
Slides from Friday 3rd August - Data in the Scholarly Communications Life Cycle course, which is part of the FORCE11 Scholarly Communications Institute.
Presenter - Natasha Simons
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen... - DuraSpace
Hot Topics: The DuraSpace Community Webinar Series, Series Six: “Research Data in Repositories.” Curated by David Minor, Research Data Curation Program, UC San Diego Library. Webinar 2: “Metadata and Repository Services for Research Data Curation.”
Presented by Declan Fleming, Chief Technology Strategist; Arwen Hutt, Metadata Librarian; and Matt Critchlow, Manager of Development and Web Services, UC San Diego Library.
Mitigating the Risk: Identifying Strategic University Partnerships for Compli... - Andrea Payant
Payant, A., Rozum, B., Woolcott, L. (2016). Mitigating the Risk: Identifying Strategic University Partnerships for Compliance Tracking of Research Data and Publications. International Federation of Library Associations (IFLA) Satellite Conference: Data in Libraries: The Big Picture
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe... - Carole Goble
Over the past 5 years we have seen a change in expectations for the management of all the outcomes of research, that is, the “assets” of data, models, codes, SOPs and so forth. Don't stop reading. Data management isn't likely to win anyone a Nobel prize. But publications should be supported and accompanied by data, methods, procedures, etc. to assure reproducibility of results. Funding agencies expect data (and increasingly software) management, retention and access plans as part of the proposal process for projects to be funded. Journals are raising their expectations of the availability of data and codes for pre- and post-publication. The multi-component, multi-disciplinary nature of Systems Biology demands the interlinking and exchange of assets and the systematic recording of metadata for their interpretation.
The FAIR Guiding Principles for scientific data management and stewardship (http://www.nature.com/articles/sdata201618) have been an effective rallying cry for EU and USA Research Infrastructures. The FAIRDOM (Findable, Accessible, Interoperable, Reusable Data, Operations and Models) Initiative has 8 years of experience of asset sharing and data infrastructure, ranging across European programmes (SysMO and EraSysAPP ERANets), national initiatives (de.NBI, the German Virtual Liver Network, UK SynBio centres) and PIs' labs. It aims to support Systems and Synthetic Biology researchers with data and model management, with an emphasis on standards smuggled in by stealth and sensitivity to asset sharing and credit anxiety.
This talk will use the FAIRDOM Initiative to discuss the FAIR management of data, SOPs, and models for Systems Biology, highlighting the challenges of and approaches to sharing, credit, citation and asset infrastructures in practice. I'll also highlight recent experiments in influencing sharing using behavioural interventions.
http://www.fair-dom.org
http://www.fairdomhub.org
http://www.seek4science.org
Presented at COMBINE 2016, Newcastle, 19 September.
http://co.mbine.org/events/COMBINE_2016
Data Citation Implementation Guidelines by Tim Clark - datascienceiqss
This talk presents a set of detailed technical recommendations for operationalizing the Joint Declaration of Data Citation Principles (JDDCP) - the most widely agreed set of principle-based recommendations for direct scholarly data citation.
We will provide initial recommendations on identifier schemes, identifier resolution behavior, required metadata elements, and best practices for realizing programmatic machine actionability of cited data.
We hope that these recommendations along with the new NISO JATS document schema revision, developed in parallel, will help accelerate the wide adoption of data citation in scholarly literature. We believe their adoption will enable open data transparency for validation, reuse and extension of scientific results; and will significantly counteract the problem of false positives in the literature.
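One widely used mechanism behind machine actionability is content negotiation on the DOI resolver: instead of the landing page, a client asks for a machine-readable metadata record. Below is a minimal sketch, assuming Python with the requests library and the DataCite content-negotiation media type; the DOI is the fictitious example from the identifier slides later on this page, so it will not actually resolve.

```python
import requests

# Fictitious DOI from the slides below; substitute a real DataCite DOI.
doi = "10.9999/FK40K2GTV"

# Ask the resolver for DataCite metadata rather than the landing page.
resp = requests.get(
    f"https://doi.org/{doi}",
    headers={"Accept": "application/vnd.datacite.datacite+json"},
    timeout=30,
)
if resp.ok:
    record = resp.json()
    print(record.get("titles"), record.get("publisher"))
else:
    print("Resolution failed:", resp.status_code)
```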
In this webinar, we gave a general introduction to the dkNET portal and showed how dkNET can be used to address a variety of use cases, including:
1) Find funding sources for your research of interest
2) Determine which study sections have reviewed this type of research
3) Help with new NIH guidelines for rigor and reproducibility
Publishing of Scientific Data - Science Foundation Ireland Summit 2010 - jodischneider
Slides prepared for the Publishing of Scientific Data workshop at the Science Foundation Ireland Summit 2010. I was one of three panelists. We had a lively discussion!
Keynote: SemSci 2017: Enabling Open Semantic Science
1st International Workshop co-located with ISWC 2017, October 2017, Vienna, Austria,
https://semsci.github.io/semSci2017/
Abstract
We have all grown up with the research article and article collections (let’s call them libraries) as the prime means of scientific discourse. But research output is more than just the rhetorical narrative. The experimental methods, computational codes, data, algorithms, workflows, Standard Operating Procedures, samples and so on are the objects of research that enable reuse and reproduction of scientific experiments, and they too need to be examined and exchanged as research knowledge.
We can think of “Research Objects” as different types of packages holding all the components of an investigation. If we stop thinking of publishing papers and start thinking of releasing Research Objects (as we release software), then scholarly exchange is a new game: ROs and their content evolve; they are multi-authored and their authorship evolves; they are a mix of virtual and embedded, and so on.
But first, some baby steps before we get carried away with a new vision of scholarly communication. Many journals (e.g. eLife, F1000, Elsevier) are just figuring out how to package together the supplementary materials of a paper. Data catalogues are figuring out how to virtually package multiple datasets scattered across many repositories to keep the integrated experimental context.
Research Objects [1] (http://researchobject.org/) is a framework by which the many, nested and contributed components of research can be packaged together in a systematic way, and their context, provenance and relationships richly described. The brave new world of containerisation provides the containers and Linked Data provides the metadata framework for the container manifest construction and profiles. It’s not just theory, but also in practice with examples in Systems Biology modelling, Bioinformatics computational workflows, and Health Informatics data exchange. I’ll talk about why and how we got here, the framework and examples, and what we need to do.
[1] Sean Bechhofer, Iain Buchan, David De Roure, Paolo Missier, John Ainsworth, Jiten Bhagat, Philip Couch, Don Cruickshank, Mark Delderfield, Ian Dunlop, Matthew Gamble, Danius Michaelides, Stuart Owen, David Newman, Shoaib Sufi, Carole Goble, Why linked data is not enough for scientists, In Future Generation Computer Systems, Volume 29, Issue 2, 2013, Pages 599-611, ISSN 0167-739X, https://doi.org/10.1016/j.future.2011.08.004
Reproducible and citable data and models: an introduction - FAIRDOM
Prepared and presented by Carole Goble (University of Manchester), Wolfgang Mueller (HITS), and Dagmar Waltemath (University of Rostock) at the Reproducible and Citable Data and Models Workshop, Warnemünde, Germany, September 14th - 16th 2015.
Dataset Catalogs as a Foundation for FAIR* Data - Tom Plasterer
BioPharma and the broader research community are faced with the challenge of simply finding the appropriate internal and external datasets for downstream analytics, knowledge generation and collaboration. With datasets as the core asset, we wanted to promote both human and machine exploitability, using web-centric data cataloguing principles as described in the W3C Data on the Web Best Practices. To do so, we adopted DCAT (Data CATalog Vocabulary) and VoID (Vocabulary of Interlinked Datasets) for both RDF and non-RDF datasets at the summary, version and distribution levels. Further, we've described datasets using a limited set of well-vetted public vocabularies, focused on the cross-omics analytes and clinical features of the catalogued datasets.
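As a flavor of the approach, here is a minimal sketch, assuming Python with the rdflib library (whose bundled DCAT namespace is used below), of a catalog entry describing one dataset with a versioned distribution. All URIs, titles, and values are invented for the example and are not from the catalog described in the talk.

```python
from rdflib import Graph, Literal, Namespace, URIRef
from rdflib.namespace import DCAT, DCTERMS, RDF

# Hypothetical catalog namespace, invented for this example.
EX = Namespace("http://example.org/catalog/")

g = Graph()
g.bind("dcat", DCAT)
g.bind("dcterms", DCTERMS)

catalog = EX["catalog"]
dataset = EX["dataset/study-1"]
dist = EX["dataset/study-1/v2/csv"]

g.add((catalog, RDF.type, DCAT.Catalog))
g.add((catalog, DCAT.dataset, dataset))      # the catalog lists the dataset

g.add((dataset, RDF.type, DCAT.Dataset))     # summary-level description
g.add((dataset, DCTERMS.title, Literal("Omics study 1")))
g.add((dataset, DCTERMS.hasVersion, Literal("2")))   # version level
g.add((dataset, DCAT.distribution, dist))    # distribution level

g.add((dist, RDF.type, DCAT.Distribution))
g.add((dist, DCAT.mediaType, Literal("text/csv")))
g.add((dist, DCAT.downloadURL, URIRef("http://example.org/files/study-1-v2.csv")))

print(g.serialize(format="turtle"))
```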
FAIRy stories: tales from building the FAIR Research Commons - Carole Goble
Plenary Lecture Presented at INCF Neuroinformatics 2019 https://www.neuroinformatics2019.org
Title: FAIRy stories: tales from building the FAIR Research Commons
Findable, Accessible, Interoperable, Reusable. The “FAIR Principles” for research data, software, computational workflows, scripts, or any kind of Research Object is a mantra; a method; a meme; a myth; a mystery. For the past 15 years I have been working on FAIR in a range of projects and initiatives in the Life Sciences as we try to build the FAIR Research Commons. Some are top-down, like the European Research Infrastructures ELIXIR, ISBE and IBISBA, and the NIH Data Commons. Some are bottom-up, supporting FAIR for investigator-led projects (FAIRDOM), biodiversity analytics (BioVeL), and FAIR drug discovery (Open PHACTS, FAIRplus). Some have become movements, like Bioschemas, the Common Workflow Language and Research Objects. Others focus on cross-cutting approaches in reproducibility, computational workflows, metadata representation and scholarly sharing & publication. In this talk I will relate a series of FAIRy tales. Some of them are Grimm. There are villains and heroes. Some have happy endings; all have morals.
Analysing & Improving Learning Resources Markup on the Web - Stefan Dietze
Talk at WWW2017 on LRMI adoption, quality and usage. Full paper here: http://papers.www2017.com.au.s3-website-ap-southeast-2.amazonaws.com/companion/p283.pdf.
4.2.15 Slides, “Hydra: many heads, many connections. Enriching Fedora Reposit... - DuraSpace
Hot Topics: The DuraSpace Community Webinar Series
Series 11: Integrating ORCID Persistent Identifiers with DSpace, Fedora and VIVO
Webinar 2: “Hydra: many heads, many connections. Enriching Fedora Repositories with ORCID.”
Thursday, April 2, 2015
Curated by Josh Brown, ORCID
Presented by: Laura Paglione, Technical Director, ORCID and Rick Johnson, Head of Digital Library Services, University of Notre Dame
This presentation was provided by Anne Washington of the University of Houston during the NISO virtual conference, Open Data Projects, held on Wednesday, June 13, 2018.
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear... - dkNET
The NIDDK Information Network (dkNET; http://dknet.org) is an open community resource for basic and clinical investigators in metabolic, digestive and kidney disease. dkNET's portal facilitates access to a collection of diverse research resources (i.e. the multitude of data, software tools, materials, services, projects and organizations available to researchers in the public domain) that advance the mission of the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK). This webinar was presented by dkNET principal investigator Dr. Jeffrey Grethe.
An introduction to the FAIR principles and a discussion of key issues that must be addressed to ensure data is findable, accessible, interoperable and re-usable. The session explored the role of the CDISC and DDI standards for addressing these issues.
Presented by Gareth Knight at the ADMIT Network conference, organised by the Association for Data Management in the Tropics, in Antwerp, Belgium on December 1st 2015.
Data Wrangling in SQL & Other Tools :: Data Wranglers DC :: June 4, 2014 - Ryan B Harvey, CSDP, CSM
I gave a talk on the basics of SQL and its utility for data preprocessing and analysis tasks to the Data Wranglers DC meetup group, a member meetup of the Data Community DC (http://datacommunitydc.org).
The talk covered an introduction to relational data, database tools, and the SQL standard, as well as the basics of SQL select statements, common table expressions, and creating views from select statements. In addition, the use of relevant libraries in R and Python to connect to data in relational databases was explained using examples with PostgreSQL, IPython notebooks, and RMarkdown.
Talk information: http://www.meetup.com/Data-Wranglers-DC/events/171768162/
Talk materials: https://github.com/nihonjinrxs/dwdc-june2014
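To give a flavor of the SQL portion of the talk, here is a self-contained sketch of a view and a common table expression. It uses Python's built-in sqlite3 module rather than PostgreSQL so it runs without a database server; the table and data are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (customer TEXT, amount REAL);
    INSERT INTO orders VALUES ('alice', 30.0), ('bob', 12.5), ('alice', 7.5);

    -- A view created from a select statement, as covered in the talk.
    CREATE VIEW customer_totals AS
        SELECT customer, SUM(amount) AS total
        FROM orders
        GROUP BY customer;
""")

# A common table expression (CTE) selecting from the view.
rows = conn.execute("""
    WITH big_spenders AS (
        SELECT customer, total FROM customer_totals WHERE total > 20
    )
    SELECT customer, total FROM big_spenders ORDER BY total DESC
""").fetchall()

for customer, total in rows:
    print(customer, total)
```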
Data “publication” attempts to appropriate for data the prestige of publication in the scholarly literature. While the scholarly communication community substantially endorses the idea, it hasn’t fully resolved what a data publication should look like or how data peer review should work. To contribute an important and neglected perspective on these issues, we surveyed ~250 researchers across the sciences and social sciences, asking what expectations “data publication” raises and what features would be useful to evaluate the trustworthiness and impact of a data publication and the contribution of its creator(s).
In early 2014, we asked science and social science researchers...
• What expectations do the terms publication and peer review raise in reference to data?
• What features would be useful to evaluate the trustworthiness and impact, and to enhance the prestige, of a data publication?
To facilitate data sharing from within the University of California system and beyond, the University of California Curation Center (UC3) is developing a new ingest and discovery layer for our data curation service, Dash. Dash uses the Merritt repository for preservation and a self-service overlay layer for submission and discovery of research datasets. The new overlay, dubbed Stash (STore And SHare), will feature an enhanced user interface with a simple and intuitive deposit workflow, while still accommodating rich metadata. Stash will enable individual scholars to upload data through local file browse or drag-and-drop operation; describe data in terms of scientifically meaningful metadata, including methods, references, and geospatial information; identify datasets for persistent citation and retrieval; preserve and share data in an appropriate repository; and discover, retrieve, and reuse data through faceted search and browse. Stash can be implemented in conjunction with any standards-compliant repository that supports the SWORD protocol for deposit and the OAI-PMH protocol for metadata harvesting. Stash will feature native support for the DataCite or Dublin Core metadata schemas, but is designed to accommodate other schemas to support discipline-specific applications. By alleviating many of the barriers that have historically precluded wider adoption of open data principles, Stash empowers individual scholars to assert active curation control over their research outputs; encourages more widespread data preservation, publication, sharing, and reuse; and promotes open scholarly inquiry and advancement.
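Because Stash targets any repository exposing OAI-PMH for metadata harvesting, a minimal harvest is easy to sketch. The example below assumes Python with the requests library; the endpoint URL is a placeholder rather than Merritt's actual address, while the verb, metadataPrefix, and XML namespaces are standard OAI-PMH.

```python
import requests
import xml.etree.ElementTree as ET

# Placeholder endpoint; substitute a real OAI-PMH base URL.
BASE_URL = "https://repository.example.org/oai"

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# ListRecords with the mandatory Dublin Core metadata format.
resp = requests.get(
    BASE_URL,
    params={"verb": "ListRecords", "metadataPrefix": "oai_dc"},
    timeout=30,
)
root = ET.fromstring(resp.content)

for record in root.iterfind(".//oai:record", NS):
    identifier = record.findtext(".//oai:identifier", namespaces=NS)
    title = record.findtext(".//dc:title", namespaces=NS)
    print(identifier, "->", title)
```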
EZID: Easy dataset identification & management
Joan Starr, Manager, Strategic and Project Planning and EZID Service Manager, California Digital Library
Data and data curation are assuming a growing role in today's research library. New approaches are needed both to address the resulting challenges and take advantage of the emerging opportunities. Long-term identifiers represent one such tool. In this presentation, Joan Starr will introduce identifiers and an application designed to make them easy to create and manage: EZID. She will provide a closer look at two identifier types, DOIs and ARKs, and discuss what bringing an identifier service to your institution might mean.
DataCite – Bridging the gap and helping to find, access and reuse data – Herb... - OpenAIRE
OpenAIRE Interoperability Workshop (8 Feb. 2013).
DataCite – Bridging the gap and helping to find, access and reuse data – Herbert Gruttemeier, INIST-CNRS
RO-Crate: packaging metadata love notes into FAIR Digital Objects - Carole Goble
Abstract
slides available at: https://zenodo.org/record/7147703#.Y7agoxXP2F4
The Helmholtz Metadata Collaboration aims to make the research data [and software] produced by Helmholtz Centres FAIR for their own and the wider science community by means of metadata enrichment [1]. Why metadata enrichment, and why FAIR? Because the whole scientific enterprise depends on a cycle of finding, exchanging, understanding, validating, reproducing, integrating and reusing research entities across a dispersed community of researchers.
Metadata is not just “a love note to the future” [2], it is a love note to today's collaborators and peers. Moreover, a FAIR Commons must cater for the metadata of all the entities of research – data, software, workflows, protocols, instruments, geo-spatial locations, specimens, samples, people (as well as traditional articles) – and their interconnectivity. That is a lot of metadata love notes to manage, bundle up and move around. Notes written in different languages at different times by different folks, produced and hosted by different platforms, yet referring to each other, and building an integrated picture of a multi-part and multi-party investigation. We need a crate!
RO-Crate [3] (http://researchobject.org/) is an open, community-driven, and lightweight approach to packaging research entities along with their metadata in a machine-readable manner. Following key principles of “just enough” and “developer and legacy friendliness”, RO-Crate simplifies the process of making research outputs FAIR while also enhancing research reproducibility and citability. As a self-describing and unbounded “metadata middleware” framework, RO-Crate shows that a little bit of packaging goes a long way to realise the goals of FAIR Digital Objects (FDO) [4], and to not just overcome platform diversity but celebrate it while retaining investigation contextual integrity.
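To make “a little bit of packaging” concrete, here is a sketch, in Python using only the standard library, of writing the JSON-LD metadata file at the heart of an RO-Crate. The layout (a metadata file descriptor plus a root dataset) follows my reading of the RO-Crate 1.1 conventions; the names and file are invented, and the context URL should be verified against the published specification.

```python
import json

# A minimal RO-Crate metadata file: one root dataset with one attached file.
crate = {
    "@context": "https://w3id.org/ro/crate/1.1/context",
    "@graph": [
        {   # Descriptor: states that this file describes the crate root "./".
            "@id": "ro-crate-metadata.json",
            "@type": "CreativeWork",
            "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
            "about": {"@id": "./"},
        },
        {   # The root dataset: the investigation being packaged.
            "@id": "./",
            "@type": "Dataset",
            "name": "Example modelling study",   # hypothetical
            "hasPart": [{"@id": "results.csv"}],
        },
        {   # A data entity inside the crate.
            "@id": "results.csv",
            "@type": "File",
            "name": "Simulation results",        # hypothetical
        },
    ],
}

with open("ro-crate-metadata.json", "w") as f:
    json.dump(crate, f, indent=2)
```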
In this talk I will present the why, and how Research Object packaging eases Metadata Collaboration using examples in big data and mixed object exchange, mixed object archiving and publishing, mass citation, and reproducibility. Some examples come from the HMC, others from EOSC, USA and Australia, and from different disciplines.
Metadata is a love note to the future; RO-Crate is the delivery package.
[1] https://helmholtz-metadaten.de/en
[2] Scott, Jason The Metadata Mania, http://ascii.textfiles.com/archives/3181, June 2011
[3] Soiland-Reyes, Stian et al. “Packaging Research Artefacts with RO-Crate”. Data Science, 2022; 5(2):97-138, DOI: 10.3233/DS-210053
[4] De Smedt K, Koureas D, Wittenburg P. “FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units”. Publications. 2020; 8(2):21. https://doi.org/10.3390/publications8020021
RDAP13 John Kunze: The Data Management Ecosystem - ASIS&T
John Kunze, University of California, Curation Center
California Digital Library (CDL)
The Data Management Ecosystem
Panel: Partnerships between institutional repositories, domain repositories, and publishers
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
EZID makes it simple for researchers and others to obtain and manage long-term identifiers for their digital content. The service can create and resolve identifiers, and it also allows entry and maintenance of information about the identifier (metadata). This presentation was given as part of a webinar series.
Although there is consensus that datasets should be treated like “first class” research objects in how they are discovered, cited, and recognized, this is still far from a reality. Datasets are poorly indexed by search engines, and they are rarely cited in formal reference lists. A solution that a number of journals are implementing is to publish discovery and citation proxy objects in the form of peer-reviewed “data papers.” A strength of this approach is that it requires dataset creators to write up rich and useful metadata for the paper, but an accompanying weakness is that busy creators are not always willing to invest the necessary time and energy. To enhance dataset discoverability without burdening creators, EZID (easy-eye-dee) will begin using dataset metadata to automatically generate lightweight, non-peer reviewed publications that will increase the exposure of the metadata to search engines. EZID (ezid.cdlib.org) maintains public DataCite metadata records for over 167,000 datasets, any of which could be viewed as HTML or as a dynamically generated PDF. In cases where the creator has submitted only the required DataCite metadata, the document will function as a cover-sheet or landing page. If the creator chooses to submit optional Abstract and Methods metadata (over 2,000 records already contain Abstracts), the document expands to more closely resemble a traditional journal article, while retaining the linking functionality of a landing page. A potential bonus is that providing an incrementally improved document in exchange for the effort of submitting incrementally improved metadata may encourage authors to submit more than the minimum required metadata.
Software development should build on the successful work of others. The DMPTool helps researchers with data management planning, but what about other phases of the data life cycle? In this webinar, we will discuss what software integration with the DMPTool might look like, and why it is important. Topics include:
1. Background: why tools integration is important; why we are talking about this in terms of the DMPTool.
2. Details and plans for DMPTool2 regarding software integration and compatibility.
3. Future possibilities for software integration for DMPTool2.
4. Example of successful integration of tools: work at the Center for Open Science.
Data management plans existed long before the NSF started requiring them. DMPs have inherent value despite being relatively unknown to researchers until now. Proper, thorough data management plans are potentially a major time saver and a huge asset for a project. In this webinar, we will cover how to go beyond funder requirements and develop more thorough DMPs. The Gulf of Mexico Research Initiative requires an extensive data management plan for projects it funds; we will hear about their efforts and how they are planning to use the DMPTool going forward.
Information technology and resources are an integral and indispensable part of the contemporary academic enterprise. In particular, technological advances have nurtured a new paradigm of data-intensive research. However, far too much of this activity still takes place in silos, to the detriment of open scholarly inquiry, integrity, and advancement. To counteract this tendency, the University of California Curation Center (UC3) has been developing and deploying a comprehensive suite of curation services that facilitate widespread data management, preservation, publication, sharing, and reuse. Through these services UC3 is engaging with new communities of use: in addition to its traditional stakeholders in cultural heritage memory organizations, e.g., libraries, museums, and archives, the UC3 service suite is now attracting significant adoption by research projects, laboratories, and individual faculty researchers. This webinar will present an introduction to five specific services – DMPTool, DataUp, EZID, Merritt, Web Archiving Service (WAS) – applicable to data curation throughout the scholarly lifecycle, two recent initiatives in collaboration with UC campuses, UC Berkeley Research Hub and UC San Francisco DataShare, and the ways in which they encourage and promote new communities of practice and greater transparency in scholarly research.
This webinar will discuss the special needs of digital humanities researchers and help you learn how to talk to them about their information management needs.
Topics that will be covered:
What is humanities data?
What special considerations are involved in creating DMPs for humanities data?
Where can you store humanities data?
What will humanities funding agencies be looking for? What regulations apply to humanities data (e.g., data sharing, data management, data availability)?
What librarians should know before meeting with a humanist; how humanists differ from other researchers in the way they think about their data.
The thorough integration of information technology and resources into scientific workflows has nurtured a new paradigm of data-intensive science. However, far too much research activity still takes place in silos, to the detriment of open scientific inquiry and advancement. Data-intensive science would be facilitated by more universal adoption of good data management practices ensuring the ongoing viability and usability of all legitimate research outputs, including data, and the encouragement of data publication and sharing for reuse. The centerpiece of such data sharing is the digital repository, acting as the foundation for external value-added services supporting and promoting effective data acquisition, publication, discovery, and dissemination. Since a general-purpose curation repository will not be able to offer the same level of specialized user experience provided by disciplinary tools and portals, a layered model built on a stable repository core is an appropriate division of labor, taking best advantage of the relative strengths of the concerned systems.
The Merritt repository, operated by the University of California Curation Center (UC3) at the California Digital Library (CDL), functions as a curation core for several data sharing initiatives, including the eScholarship open access publishing platform, the DataONE network, and the Open Context archaeological portal. This presentation will highlight two recent examples of external integration for purposes of research data sharing: DataShare, an open portal for biomedical data at UC San Francisco; and Research Hub, an Alfresco-based content management system at UC Berkeley. They both significantly extend Merritt's coverage of the full research data lifecycle and workflows, both upstream, with augmented capabilities for data description, packaging, and deposit, and downstream, with enhanced domain-specific discovery. These efforts showcase the catalyzing effect that coupled integration of curation repositories and well-known public disciplinary search environments can have on research data sharing and scientific advancement.
More from University of California Curation Center (20)
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
DevOps and Testing slides at DASA ConnectKari Kakkonen
Slides by me and Rik Marselis from the DASA Connect conference on 30 May 2024. We discuss what testing is, then what agile testing is, and finally what testing in DevOps looks like. We also ran a lovely workshop with the participants, exploring different ways to think about quality and testing in the different parts of the DevOps infinity loop.
7. How?
• Key identifying elements
• Emerging recommendations
• Variation among the domains
• In common: Persistent identifier
8. DataCite
German National Library of Economics (ZBW)
German National Library of Science and Technology (TIB)
German National Library of Medicine (ZB MED)
GESIS - Leibniz Institute for the Social Sciences, Germany
Australian National Data Service (ANDS)
ETH Zurich, Switzerland
Canada Institute for Scientific and Technical Information (CISTI)
Technical Information Center of Denmark
Institute for Scientific & Technical Information (INIST-CNRS), France
TU Delft Library, The Netherlands
The Swedish National Data Service (SNDS)
The British Library, UK
California Digital Library (CDL), USA
Office of Scientific & Technical Information (OSTI), USA
Purdue University Library
9. What is an identifier?
What you see: alphanumeric string (never changes)
Associated with: location of object (such as a URL)
Optional: who, what, when, etc. (i.e., metadata)
By Joelk75: http://www.flickr.com/photos/75001512@N00/2728233597/
10. Identifier example
string: doi:10.9999/FK40K2GTV
html version: http://dx.doi.org/10.9999/FK40K2GTV
location: http://www.bologna.edu/biology/xfg/123.xls
metadata
creator: Dr. Felix Kottor
title: Data for chromosomal study of catfish (Ictalurus punctatus)
publisher: University of Bologna
date: 8/31/2011
11. Identifier example
string: doi:10.9999/FK40K2GTV
html version: http://dx.doi.org/10.9999/FK40K2GTV
location: http://www.state.edu/ecology/783sdr/123.xls
metadata
creator: Dr. Felix Kottor
title: Data for chromosomal study of catfish (Ictalurus punctatus)
publisher: Dryad Data Repository
date: 10/01/2011
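The two snapshots make the key point: the DOI string is fixed while its registered location (and even publisher) can change underneath it. A minimal sketch of checking where a DOI currently resolves, using only Python's standard library; note the DOI above is fictional, per the slides, so it will not actually resolve:

```python
import urllib.request

# Ask the DOI proxy where an identifier currently points. Substitute a
# real DOI to try this out; the one below is the slides' fictional example.
doi = "10.9999/FK40K2GTV"
request = urllib.request.Request(f"https://dx.doi.org/{doi}", method="HEAD")
with urllib.request.urlopen(request) as response:
    # The identifier string never changes; the resolved URL is whatever
    # location is currently registered for it (bologna.edu one year,
    # state.edu the next).
    print("identifier :", f"doi:{doi}")
    print("resolves to:", response.geturl())
```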
12. EZID: long-term identifiers made easy
Take control of the management and distribution of your research, share and get credit for it, and build your reputation through its collection and documentation.
Primary Functions
1. Create persistent identifiers
2. Manage identifiers over time
3. Manage associated metadata over time
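For programmatic use, EZID also exposes an HTTP API. Here is a hedged sketch of minting a test identifier; the endpoint, the ark:/99999/fk4 test shoulder, the "apitest" account, and the ANVL body format are assumptions from memory, to verify against the current EZID API documentation before use:

```python
import urllib.request

# ANVL-style metadata: a target URL plus simple who/what/when elements.
# (Assumed request format; check the EZID API docs.)
metadata = "\n".join([
    "_target: http://www.example.edu/dataset/123",  # where the ID resolves
    "erc.who: Example Researcher",
    "erc.what: Example dataset",
    "erc.when: 2011",
])

request = urllib.request.Request(
    "https://ezid.cdlib.org/shoulder/ark:/99999/fk4",  # assumed test shoulder
    data=metadata.encode("utf-8"),
    headers={"Content-Type": "text/plain; charset=UTF-8"},
)

# EZID uses HTTP Basic authentication; "apitest" is assumed to be its
# public test account.
passwords = urllib.request.HTTPPasswordMgrWithDefaultRealm()
passwords.add_password(None, "https://ezid.cdlib.org", "apitest", "apitest")
opener = urllib.request.build_opener(urllib.request.HTTPBasicAuthHandler(passwords))

with opener.open(request) as response:
    print(response.read().decode("utf-8"))  # e.g. "success: ark:/99999/fk4..."
```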
17. DataCite Metadata V. 2.2
• Small required set = citation elements
• Optional descriptive set:
– extendable lists
– can refer to other standards, schemes
– domain-neutral
– rich ability to describe relationships to other digital objects
• Metadata Search (MDS) is full-text indexed
18. DataCite Metadata V. 2.2
Required properties
1. Identifier (with type attribute)
2. Creator (with name identifier attributes)
3. Title (with optional type attribute)
4. Publisher
5. PublicationYear
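A minimal sketch of a record carrying only these five required properties, built with Python's standard library. The element names and the kernel-2.2 namespace are believed to match the published schema at schema.datacite.org (verify there before use); the values reuse the fictional catfish example from the earlier slides:

```python
import xml.etree.ElementTree as ET

NS = "http://datacite.org/schema/kernel-2.2"  # assumed kernel-2.2 namespace
ET.register_namespace("", NS)

# Build the resource with the five required properties, in schema order.
resource = ET.Element(f"{{{NS}}}resource")
ET.SubElement(resource, f"{{{NS}}}identifier", identifierType="DOI").text = (
    "10.9999/FK40K2GTV"
)
creators = ET.SubElement(resource, f"{{{NS}}}creators")
creator = ET.SubElement(creators, f"{{{NS}}}creator")
ET.SubElement(creator, f"{{{NS}}}creatorName").text = "Kottor, Felix"
titles = ET.SubElement(resource, f"{{{NS}}}titles")
ET.SubElement(titles, f"{{{NS}}}title").text = (
    "Data for chromosomal study of catfish (Ictalurus punctatus)"
)
ET.SubElement(resource, f"{{{NS}}}publisher").text = "Dryad Data Repository"
ET.SubElement(resource, f"{{{NS}}}publicationYear").text = "2011"

print(ET.tostring(resource, encoding="unicode"))
```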
19. DataCite Metadata V. 2.2
Optional properties
6. Subject (with schema attribute)
7. Contributor (with type & name identifier attributes)
8. Date (with type attribute)
9. Language
10. ResourceType (with description attribute)
11. AlternateIdentifier (with type attribute)
12. RelatedIdentifier (with type & relation type attributes)
13. Size
14. Format
15. Version
16. Rights
17. Description (with type attribute)
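Property 12 is the distinctive one: RelatedIdentifier's relationType attribute links a dataset to other digital objects through paired types such as IsCitedBy & Cites or IsSupplementTo & IsSupplementedBy. A hedged sketch of adding one, continuing the ElementTree example above (both identifiers here are fictional, and the element names should again be checked against the published schema):

```python
import xml.etree.ElementTree as ET

NS = "http://datacite.org/schema/kernel-2.2"  # assumed kernel-2.2 namespace
resource = ET.Element(f"{{{NS}}}resource")  # required properties omitted for brevity

# relatedIdentifiers holds one entry per linked object; relationType says
# how this dataset relates to it.
related = ET.SubElement(resource, f"{{{NS}}}relatedIdentifiers")
rel = ET.SubElement(
    related,
    f"{{{NS}}}relatedIdentifier",
    relatedIdentifierType="DOI",
    relationType="IsSupplementTo",  # the dataset supplements a journal article
)
rel.text = "10.9999/EXAMPLE.ARTICLE"  # hypothetical DOI of that article

print(ET.tostring(resource, encoding="unicode"))
```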
21. Data Management Planning
By NASA Goddard Photo and Video: http://www.flickr.com/photos/gsfc/3720663276/
22. A life cycle approach
CDL Curation and Publishing Services
http://www.cdlib.org
Create, edit, share, and save data management plans
Open source add-in for Microsoft Excel as a data collection tool
Create and manage persistent identifiers
Curation repository: store, manage, and share research data
Open access scholarly publishing services: papers, journals, books, seminars & more
Data Publication: an infrastructure to publish and get credit for sharing research data
23. Identifiers and data management
Track your results
Organize your data
Get more citations
Meet funder requirements
24. Next Steps
DataCite
• Dublin Core application profile
• Content Service
• Metadata v. 2.3
EZID
• UI redesign
• Automated link checking
• Exposure for metadata
By Nicola Whitaker http://www.flickr.com/photos/nicolawhitaker/111009156/
25. Next Steps
Library
• service center
• information center
• your ideas here
By Nicola Whitaker http://www.flickr.com/photos/nicolawhitaker/111009156/
26. For more information
EZID
EZID application: http://n2t.net/ezid/
EZID website:
http://www.cdlib.org/services/uc3/ezid/
DataCite
DataCite Home: http://datacite.org/
DataCite Metadata Schema:
http://schema.datacite.org/meta/kernel-2.2/index.html
DataCite Metadata Search: http://search.datacite.org
27. Questions?
by Horia Varlan
http://www.flickr.com/photos/horiavarlan/4273168957/in/photostream/
Joan Starr: uc3@ucop.edu
@joan_starr
Editor's Notes
Thank you for this opportunity to speak with you today about dataset metadata. Let me give special thanks to Meghan for asking me to speak.
Image credits:
By MDB 28: http://www.flickr.com/photos/mdb28/3787828482/
By davecurlee: http://www.flickr.com/photos/davecurlee/4689603488/
By sabarishr: http://www.flickr.com/photos/sabarishr/5422105775/
By rkrichardson: http://www.flickr.com/photos/45126397@N06/4506403367/
By awsheffield: http://www.flickr.com/photos/awsheffield/5932294950/
By Scutter: http://www.flickr.com/photos/scutter/109698478/
By Amy the Nurse: http://www.flickr.com/photos/amyashcraft/4522601466/
By Anita & Greg: http://www.flickr.com/photos/anita__greg/2849453715/
My library: serving the 10 UC campuses, 226,000 students, and 134,000 faculty and staff, working collaboratively with libraries, data centers, museums, archives, faculty, and researchers. CDL has historically provided strategic, integrated technical and program services in a broad portfolio, including groundbreaking licensing agreements, union bibliographic services, data curation & preservation tools, and open access publishing services. CDL: http://www.cdlib.org/
My group: the UC Curation Center is a creative partnership between the CDL, the ten UC campuses, and peer institutions in the community. A community of shared concern and practice: we provide solutions, services, and resources for digital assets, and pool and distribute diverse experience, expertise, and resources.
Access: the researchers' requirements (per ESIP, Earth Science Information Partners: http://wiki.esipfed.org/index.php/Interagency_Data_Stewardship/Citations/provider_guidelines) are: to provide fair credit to those responsible (exposure); to aid scientific reproducibility (re-use); to ensure scientific transparency and reasonable accountability (verification); and to aid in tracking the impact of the work (citation tracking). Preservation: easy to maintain. The funders' requirements are for data management, and the library's charge is to preserve our institutions' scholarly assets.
How are we going to meet these needs? If we go back to what the domains are doing... From ESIP, Earth Science Information Partners (same link):
Author(s): the people or organizations responsible for the intellectual work to develop the data set. The data creators.
Release Date: when the particular version of the data set was first made available for use (and potential citation) by others.
Title: the formal title of the data set.
Version: the precise version of the data used. Careful version tracking is critical to accurate citation.
Archive and/or Distributor: the organization distributing or caring for the data, ideally over the long term.
Locator/Identifier: this could be a URL, but ideally it should be a persistent service, such as a DOI, Handle, or ARK, that resolves to the current location of the data in question.
Access Date and Time: because data can be dynamic and changeable in ways that are not always reflected in release dates and versions, it is important to indicate when online data were accessed.
From ICPSR, Inter-university Consortium for Political and Social Research (http://www.icpsr.umich.edu/icpsrweb/ICPSR/curation/citations.jsp): Title, Author, Date, Version, Persistent identifier (such as the Digital Object Identifier, Uniform Resource Name URN, or Handle System).
What’s in common: the persistent identifier.
DataCite was formed in 2009 by 10 libraries and research centers with a mission: "Helping you find, access, and reuse data." The number has now grown to 15. In addition there are 3 associate members, including the Korea Institute of Science and Technology Information, so there is a presence in Asia. California Digital Library was one of the founding members. DataCite's primary methodology for achieving this mission: issuing DOIs (Digital Object Identifiers) for datasets.
DOIs are one kind of persistent identifier. But what is an identifier? An identifier is an alphanumeric string assigned to an object, and if that assignment is managed with some metadata and the object is made available over time, the identifier becomes a VERY reliable way of keeping track of that object.
Let's take a look at one. So you can see that with just the identifier and a simple set of metadata, you get: location for VERIFICATION; EXPOSURE & CITATION TRACKING. (This is not an actual DOI, nor an actual study.)
And here's that same DOI some time later. THE STRING NEVER CHANGES. This means it can be cited, tracked, and associated with all kinds of metadata. More on that in a minute.
EZID is CDL’s application for offering DataCite DOIs as well as other identifiers.
If you go to the Home Page, you can use the UI to test EZID. CLICK for HELP TAB.
On the Help screen, you have the choice of creating a test ARK or DOI. [CLICK] Click the Create button.
ARKs and DOIs:
ARKs: flexible; case-sensitive; special features support granularity; can be deleted; inexpensive.
DOIs: established brand in publishing; indexed by major A&I citation databases; DataCite policies apply; cannot be deleted; more costly.
DOIs should be assigned to objects that are under good long-term management, and where there is an intention to make the object persistently available. DOIs must be registered exclusively with metadata that is available to public view.
Can DOIs and ARKs work together? Yes. For example, researchers may choose to use ARKs for unpublished materials associated with an object that has been registered with a DOI. These two identifier schemes can work well together, and EZID offers them both, along with policy support consistent across both schemes.
EZID creates the identifier and sends you to the MANAGE tab, where you have the opportunity to enter a target URL and other metadata. UI support: Dublin Kernel, Dublin Core, DataCite Kernel. API support: all of the above, plus the full DataCite Schema.
When you hover over a field, it opens up for editing as you can see here. This is where you would go if you wanted to maintain the metadata or the target URL.
Now let's take a look at the full DataCite Metadata set. MDS = Metadata Search. Remember, we said that any solution needed to: ALLOW the submitter to accurately describe the object so that anyone accessing it knows what they are getting; ALLOW the submitter to give credit where credit is due; and PROVIDE support for *data management*: format, version, rights.
The 5 required properties = basic citation elements. Identifier = DOI now; in the future this may open up. Creator is repeatable; Name can have a nameIdentifier and schema, as in an ORCID ID. Title is repeatable and has an optional type attribute for AlternativeTitle, Subtitle, and TranslatedTitle. Publisher: "In the case of datasets, 'publish' is understood to mean making the data available to the community of researchers." IDENTIFIER = VERIFICATION. ALLOW the submitter to give credit where credit is due: EXPOSURE & CITATION TRACKING. If the Year field isn't quite what you want, use the repeatable Date field in the optional set.
Optional elements. Includes support for data management: FORMAT, VERSION, RIGHTS. In addition, some of these expand the required set: Contributor expands Creator; Date expands PublicationYear. But the distinctive strength comes from number 12. [CLICK]
Optional elements. The family jewels = RelatedIdentifier with relationType: IsCitedBy & Cites; IsSupplementTo & IsSupplementedBy; IsContinuedBy & Continues; IsNewVersionOf & IsPreviousVersionOf; IsPartOf & HasPart; IsDocumentedBy & Documents; IsCompiledBy & Compiles; IsVariantFormOf & IsOriginalFormOf. COMING IN 2.3: IsIdenticalTo.
"Data Management Planning" is a popular phrase these days. As metadata and preservation librarians, I think you'll find many of the concepts to be very familiar, if wearing new clothes. Let me tell you a little story about the life of a dataset. You start out in a laptop (or a tablet), travelling around, or under a desk. Maybe then you get emailed across the country or around the world. Years can go by as you get updated and altered. Eventually, maybe you have a day in the sun: your researcher decides to write up the results and cite you. Then, perhaps, it's back to a server in the dark. Or you move from server to server. Will you be forgotten?
That's why we at California Digital Library have taken a life cycle approach with an array of tools. CDL has developed tools and services ranging from the first stage of developing a data management plan through to formal publication. We encourage researchers to assign an ID early in the process: to provide a credible data management plan for funders; to make the later stages easier; and to manage situations where changes might occur during the course of the research (a researcher changes institutions, or a research team changes the location of their data, for example).
A Dublin Core application profile is available for the DataCite Metadata Schema; we'll keep it up to date and in sync. From the DCMI: "A DCAP is designed to promote interoperability within the constraints of the Dublin Core model and to encourage harmonization of usage and convergence on 'emerging semantics' around its edges."
The Content Service exposes our metadata stored in the DataCite Metadata Store (MDS) using multiple formats. Alpha version: the service can be accessed at http://data.datacite.org
EZID: UI redesign; activity reporting; browse & search; enhanced persistence support; automated link checking in support of our new tombstone pages (a web page returned for a resource no longer found at its target location of record; the tombstone may provide "last known" metadata, including the original owner); exposure for metadata, with evidence that citations will increase (Heather Piwowar's work): Thomson-Reuters (Web of Knowledge), Elsevier (Scopus), OAI? RSS? Google Scholar.
Library as a service center: consulting, EZID, DMP, DCXL, IR. Library as an information center: pointing people to standards and tools; helping make connections.
The next step for you as individuals is to get more information and try things for yourselves.