The document discusses the need for an ecosystem to better manage research data through its entire lifecycle, from creation to publication to sharing and reuse. It proposes that libraries can play a key role in this ecosystem by providing services like curation repositories, identifiers, metadata, and tools to help researchers publish, share, and get credit for their data. The goal is to improve data discovery, access, attribution, and incentivize data sharing to make research data as integral to the scholarly record as journal articles.
Presented by Stuart Macdonald at the College of Science and Engineering "What's new for you in the Library" event, Murray Library, King's Buildings, University of Edinburgh, 28 May 2014.
Covers research data, research data management, funder policies and the University's RDM policy, RDM services and support, awareness raising, training, and progress so far.
Presentation given by Stuart Macdonald at the International Workshop on ICT and e-Knowledge for the Developing World at the Shanghai International Convention Center, Pudong, Shanghai.
Presented by Peter Burnhill at the ALA Annual Holdings Update Forum, Universal and repurposed holdings information -- Emerging initiatives and projects, Morial Convention Center, New Orleans, Louisiana, USA, 25 June 2011
Delivered by Peter Burnhill, Director of EDINA, at the PRELIDA Consolidation and Dissemination workshop on 17/18 October 2014 (http://prelida.eu/consolidation-workshop).
Summary: The web changes over time, and significant reference rot inevitably occurs. Web archiving delivers only a 50% chance of success. So in addition to the original URI, the link should be augmented with temporal context to increase robustness.
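The "temporal context" suggested here has since been formalized in the Robust Links convention from the same Memento/Hiberlink line of work. A minimal Python sketch, assuming the data-versiondate/data-versionurl attribute names from that convention; the snapshot URL is illustrative:

```python
from datetime import datetime, timezone
from typing import Optional

def robust_link(uri: str, text: str, snapshot_uri: Optional[str] = None) -> str:
    """Render a hyperlink augmented with temporal context, following the
    Robust Links convention (data-versiondate / data-versionurl)."""
    linked_on = datetime.now(timezone.utc).strftime("%Y-%m-%d")
    attrs = [f'href="{uri}"', f'data-versiondate="{linked_on}"']
    if snapshot_uri:  # a web-archive capture made near the time of linking
        attrs.append(f'data-versionurl="{snapshot_uri}"')
    return f'<a {" ".join(attrs)}>{text}</a>'

# With the original URI plus a dated snapshot, a reader or tool can fall
# back to an archived copy when the live link rots.
print(robust_link(
    "http://prelida.eu/consolidation-workshop",
    "PRELIDA workshop",
    snapshot_uri="https://web.archive.org/web/20141017/http://prelida.eu/",
))
```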
Presented by Peter Burnhill, Director of EDINA, at Beyond Books: What STM & Social Science publishing should learn from each other, London, 22 April 2010.
Presentation given by Peter Burnhill, Director of EDINA, at #ReCon_15: Beyond the paper: publishing data, software and more. Edinburgh, 19 June 2015
Peter Burnhill
http://reconevent.com/
The International Federation of Library Associations and Institutions (IFLA) is responsible for the development and maintenance of International Standard Bibliographic Description (ISBD), UNIMARC, and the "Functional Requirements" family for bibliographic records (FRBR), authority data (FRAD), and subject authority data (FRSAD). ISBD underpins the MARC family of formats used by libraries world-wide for many millions of catalog records, while FRBR is a relatively new model optimized for users and the digital environment. These metadata models, schemas, and content rules are now being expressed in the Resource Description Framework language for use in the Semantic Web.
This webinar provides a general update on the work being undertaken. It describes the development of an Application Profile for ISBD to specify the sequence, repeatability, and mandatory status of its elements. It discusses issues involved in deriving linked data from legacy catalogue records based on monolithic and multi-part schemas following ISBD and FRBR, such as the duplication which arises from copy cataloging and FRBRization. The webinar provides practical examples of deriving high-quality linked data from the vast numbers of records created by libraries, and demonstrates how a shift of focus from records to linked-data triples can provide more efficient and effective user-centered resource discovery services.
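To make the record-to-triples shift concrete, here is a minimal Python (rdflib) sketch of one catalogue record expressed as linked-data triples. The ISBD namespace URI and the element P1004 ("has title proper") are assumptions about the registered IFLA vocabulary, and the resource URI is hypothetical:

```python
from rdflib import Graph, Literal, Namespace, URIRef
from rdflib.namespace import DCTERMS

# ISBD element set as registered on the IFLA namespace server; P1004
# ("has title proper") is assumed -- verify against iflastandards.info.
ISBD = Namespace("http://iflastandards.info/ns/isbd/elements/")

g = Graph()
g.bind("isbd", ISBD)
g.bind("dcterms", DCTERMS)

# One legacy catalogue record becomes a handful of triples about one
# resource; duplicates from copy cataloguing or FRBRization can then be
# merged simply by asserting the same triples about the same subject.
resource = URIRef("http://example.org/resource/1")  # hypothetical URI
g.add((resource, ISBD.P1004, Literal("On the origin of species")))
g.add((resource, DCTERMS.creator, Literal("Darwin, Charles")))

print(g.serialize(format="turtle"))
```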
A controversial discussion of the utility of DBpedia as authority data with examples from a project at the Library of Congress. Part of an ExLibris-sponsored panel discussion at ALA Chicago 2009.
Metadata plays an increasingly central role as a tool enabling the large-scale, distributed management of resources. However, metadata communities which have traditionally worked in relative isolation have struggled to make their specifications interoperate with others in the shared web environment.
This webinar explores how metadata standards with significantly different characteristics can productively coexist and how previously isolated metadata communities can work towards harmonization. The webinar presents a solution-oriented analysis of current issues in metadata harmonization with a focus on specifications of importance to the learning technology and library environments, notably Dublin Core, IEEE Learning Object Metadata, and W3C's Resource Description Framework. Providing concrete illustrations of harmonization problems and a roadmap for designing metadata for maximum interoperability, this webinar will provide a bird's-eye perspective on the respective roles of metadata syntaxes, formats, semantics, abstract models, vocabularies, and application profiles in achieving metadata harmonization.
70+ slides of highlights and quotes from all of MozCon Day #1. See all of our coverage at http://www.contentharmony.com/blog/mozcon-2013-coverage/ & http://www.contentharmony.com/blog/mozcon-2013-tools/
Functional and Architectural Requirements for Metadata: Supporting Discovery... (Jian Qin)
The tremendous growth in digital data has led to an increase in metadata initiatives for different types of scientific data, as evident in Ball's survey (2009). Although individual communities have specific needs, there are shared goals that need to be recognized if systems are to effectively support data sharing within and across all domains. This paper considers this need, and explores systems requirements that are essential for metadata supporting the discovery and management of scientific data. The paper begins with an introduction and a review of selected research specific to metadata modeling in the sciences. Next, the paper's goals are stated, followed by the presentation of essential systems requirements. The results include a base model with three chief principles: the principle of least effort, infrastructure service, and portability. The principles are intended to support "data user" tasks. Results also include a set of defined user tasks and functions, and application scenarios.
Slides from a talk I gave at the Perspectives Workshop on Semantic Web, http://www.dagstuhl.de/en/program/calendar/semhp/?semnr=09271 ... Dagstuhl, Germany, 2009-06-29. The title was from Jim Hendler!
Open data is a crucial prerequisite for inventing and disseminating the innovative practices needed for agricultural development. To be usable, data must not just be open in principle—i.e., covered by licenses that allow re-use. Data must also be published in a technical form that allows it to be integrated into a wide range of applications. The webinar will be of interest to any institution seeking ways to publish and curate data in the Linked Data cloud.
This webinar describes the technical solutions adopted by a widely diverse global network of agricultural research institutes for publishing research results. The talk focuses on AGRIS, a central and widely-used resource linking agricultural datasets for easy consumption, and AgriDrupal, an adaptation of the popular, open-source content management system Drupal optimized for producing and consuming linked datasets.
Agricultural research institutes share many of the constraints faced by libraries and other documentation centers, in developing countries and beyond: institutions are expected to expose their information on the Web in re-usable form on shoestring budgets, with technical staff working in local languages and continually lured away by higher-paying work in the private sector. Technical solutions must therefore be easy to adopt and freely available.
Slides from my talk "Unicorns and Other Wild Things" at the IA Summit 2013 in Baltimore, MD.
Audio and transcripts: http://library.iasummit.org/podcasts/unicorns-and-other-wild-things/
http://lanyrd.com/2013/iasummit/sccqpm/
About the Webinar
In May 2012, the Library of Congress announced a new modeling initiative focused on reflecting the MARC 21 library standard as a Linked Data model for the Web, with an initial model to be proposed by the consulting company Zepheira. The goal of the initiative is to translate the MARC 21 format to a Linked Data model while retaining the richness and benefits of existing data in the historical format.
In this webinar, Eric Miller of Zepheira will report on progress towards this important goal, starting with an analysis of the translation problem and concluding with potential migration scenarios for a broad-based transition from MARC to a new bibliographic framework.
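The target vocabulary of the new bibliographic framework was still open at the time, so the Python sketch below maps two familiar MARC 21 fields to Dublin Core terms purely to illustrate the shape of the translation problem; the record, field selection, and URI pattern are hypothetical, not Zepheira's model:

```python
from rdflib import Graph, Literal, URIRef
from rdflib.namespace import DCTERMS

# A toy stand-in for one parsed MARC 21 record: tag -> subfield values.
marc_record = {
    "001": "ocm12345678",                        # control number
    "100": {"a": "Miller, Eric"},                # main entry, personal name
    "245": {"a": "Linked data for libraries"},   # title statement
}

g = Graph()
g.bind("dcterms", DCTERMS)
work = URIRef(f"http://example.org/work/{marc_record['001']}")  # hypothetical URI pattern
g.add((work, DCTERMS.creator, Literal(marc_record["100"]["a"])))
g.add((work, DCTERMS.title, Literal(marc_record["245"]["a"])))
print(g.serialize(format="turtle"))
```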
As described in the April NISO/DCMI webinar by Dan Brickley, schema.org is a search-engine initiative aimed at helping webmasters use structured data markup to improve the discovery and display of search results. Drupal 7 makes it easy to markup HTML pages with schema.org terms, allowing users to quickly build websites with structured data that can be understood by Google and displayed as Rich Snippets.
Improved search results are only part of the story, however. Data-bearing documents become machine-processable once you find them. The subject matter, important facts, calendar events, authorship, licensing, and whatever else you might like to share are there for the taking. Sales reports, RSS feeds, industry analysis, maps, diagrams, and process artifacts can now connect back to other data sets to provide linkage to context and related content. The key to this is the adoption of standards for both the data model (RDF) and the means of weaving it into documents (RDFa). Drupal 7 has become the leading content platform to adopt these standards.
This webinar will describe how RDFa and Drupal 7 can improve how organizations publish information and data on the Web for both internal and external consumption. It will discuss what is required to use these features and how they impact publication workflow. The talk will focus on high-level and accessible demonstrations of what is possible. Technical people should learn how to proceed while non-technical people will learn what is possible.
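As a rough illustration of what such markup yields, the Python (rdflib) sketch below builds the triples a crawler would extract from a schema.org-annotated page; the RDFa fragment in the comment and the event details are invented for the example:

```python
from rdflib import BNode, Graph, Literal
from rdflib.namespace import RDF, SDO  # SDO binds https://schema.org/

# The triples a crawler would extract from a Drupal 7 page whose template
# carries RDFa such as:
#   <div vocab="https://schema.org/" typeof="Event">
#     <span property="name">NISO/DCMI webinar</span>
#     <time property="startDate" datetime="2012-01-25">25 Jan 2012</time>
#   </div>
g = Graph()
event = BNode()
g.add((event, RDF.type, SDO.Event))
g.add((event, SDO.name, Literal("NISO/DCMI webinar on RDFa and Drupal 7")))
g.add((event, SDO.startDate, Literal("2012-01-25")))  # hypothetical date
print(g.serialize(format="turtle"))
```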
RDAP13 John Kunze: The Data Management Ecosystem (ASIS&T)
John Kunze, University of California, Curation Center
California Digital Library (CDL)
The Data Management Ecosystem
Panel: Partnerships between institutional repositories, domain repositories, and publishers
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving... (Sarah Anna Stewart)
Presentation given at the M25 Consortium of Academic Libraries, CPD25 Event on 'The Role of the Library in Supporting Research'. Provides an introduction to data, software and PIDs and a brief look at how libraries can enable researchers to gain impact and credit for their research data and software.
Research Data Management in GLAM: Managing Data for Cultural Heritage (Sarah Anna Stewart)
Presentation given at the 'Open Science Infrastructures for Big Cultural Data' - Advanced International Masterclass in Plovdiv, Bulgaria. Dec. 13-15, 2018
2013 DataCite Summer Meeting - Purdue University Research Repository (PURR)... (datacite)
2013 DataCite Summer Meeting - Making Research better
DataCite. Co-sponsored by CODATA.
Thursday, 19 September 2013 at 13:00 - Friday, 20 September 2013 at 12:30
Washington, DC. National Academy of Sciences
http://datacite.eventbrite.co.uk/
Enabling better science - Results and vision of the OpenAIRE infrastructure a... (Paolo Manghi)
Enabling better science: a presentation on the results and vision of the OpenAIRE infrastructure and the RDA Publishing Data Services Working Group in this direction.
The traditional process of achieving metadata standards has failed; I speak from experience with Dublin Core, BagIt, Z39.50, URLs, and ARKs.
We must think outside the box or we will keep failing. YAMZ (Yet Another Metadata Zoo) is not a standard. Instead it is a dictionary of terms, some fixed and others still evolving, that are meant to be selectively referenced by future standards. Terms are otherwise decoupled from standards that reference them. Each term is a kind of nano-specification with a unique persistent identifier that tracks the term from evolving to mature to deprecated.
YAMZ.net is a tool for taxonomy building. Metadata vocabulary standardization ranks among the most awful design-by-committee experiences, whether at the international standards level or at the working group level. YAMZ instead offers a crowdsourced metadata dictionary with reputation-based voting, in which every term gets a unique persistent identifier. The second half of the session consists of exercises to see how it all works in practice.
Two themes:
1. Proposed metadata for “persistence statements”
   - what you mean by persistence
   - informing user linking choices
2. Metadata hardened in the open yamz.net dictionary (see the sketch below)
   - crowdsourced, but with reputation-based voting
   - every term has a unique persistent identifier (PID)
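A minimal sketch of what such a dictionary entry might look like. This is a hypothetical shape, not the actual yamz.net schema, but it captures the ingredients named above: a PID per term, a maturity status that moves from evolving to mature to deprecated, and reputation-weighted votes:

```python
from dataclasses import dataclass

# Hypothetical shape of a yamz.net dictionary entry -- not the actual
# YAMZ schema, just the ingredients the talk names.
@dataclass
class Term:
    pid: str                    # e.g. an ARK that tracks the term for life
    name: str
    definition: str
    status: str = "evolving"    # evolving -> mature -> deprecated
    score: float = 0.0          # reputation-weighted up/down votes

    def promote_if_stable(self, threshold: float = 25.0) -> None:
        """Fix a term once community consensus is strong enough."""
        if self.status == "evolving" and self.score >= threshold:
            self.status = "mature"

t = Term(pid="ark:/99152/h1234",  # hypothetical PID
         name="dataset",
         definition="A coherent, identified collection of data.")
t.score = 30.0
t.promote_if_stable()
print(t.pid, t.status)
```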
An identifier's scheme determines almost nothing about its behavior; what matters is a resolver ready to map it to various services. When resolver infrastructure is shared across schemes instead of siloed, all schemes benefit. With suitable prefixing, dozens of well-known, so-called non-actionable schemes can become available from a single unified base URL. The idealized resolver would adopt a fully open infrastructure and support all schemes and the best features of modern resolvers -- deduplication, content negotiation, link checking, inflections, suffix passthrough, etc.
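A minimal sketch of that unified-base-URL idea, in the style of N2T (n2t.net); the example identifiers are illustrative:

```python
import urllib.parse

# Scheme-agnostic resolution through one shared base URL: the resolver,
# not the scheme, determines what services an identifier supports.
RESOLVER = "https://n2t.net/"

def resolve(identifier: str, inflection: str = "") -> str:
    """Build a resolvable URL for a prefixed identifier.

    Works identically for ark:, doi:, pmid:, etc.  Appending an
    inflection such as "?" asks the resolver about the object
    (e.g. for metadata) rather than for the object itself.
    """
    return RESOLVER + urllib.parse.quote(identifier, safe=":/") + inflection

print(resolve("ark:/13030/tf5p30086k"))          # ARK
print(resolve("doi:10.5063/F1HT2M7Q"))           # DOI via the same base URL
print(resolve("pmid:16894156", inflection="?"))  # inflection: ask about it
```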
A huge amount of incredibly diverse research data remains beyond the reach of internet search engines, peer review processes, and systematic cataloging. The ability by consumers to annotate data is an important mitigation, harnessing "the crowd" to make it easier for everyone to discover and re-use data.
1. The Data Management Ecosystem
4 April 2013
University of California Curation Center
California Digital Library
2. The research data problem
How research data compares with the journal article:
• Uniquely and persistently identified? Journal article: yes. Research data: nope.
• Concept of “publish”? Journal article: yes. Research data: not really.
• Multiple copies? Journal article: yes. Research data: typically one.
• Easily findable? Journal article: yes. Research data: difficult.
• Services (impact metrics, citation tracking, etc.)? Journal article: yes. Research data: nope.
Research data is seen as a second-class citizen in the scholarly record.
3. An ecosystem of inter-dependent partners
Besides data repository and publisher partners...
• researchers
• educators
• citizen science groups
• funders
• tenure and promotion committees
Libraries as neutral connection partners
4. Where can libraries make a difference?
[Cycle diagram: the Research & Scholarship Lifecycle, with Collect, Publish, Share, Save, and Research cycling around "Create Knowledge"]
5. Collect > Publish > Share > Save > Research
• Create, edit, share, and save data management plans (DMPTool)
• Open source curation add-in for Microsoft Excel (DataUp)
• Capture today's web; build tomorrow's archives (Web Archiving Service)
6. Collect > Publish > Share > Save > Research
• Create and manage persistent identifiers: ARKs, DOIs, etc.
• An infrastructure to publish and get credit for sharing research data
7. Collect > Publish > Share > Save > Research
• Curation repository: store, manage, preserve, and share research data (Merritt)
• Open deposit, open access repository for spreadsheet data
• Data Observation Network for Earth (DataONE)
8. Collect > Publish > Share > Save > Research
What's missing to complete the “incentive” circuit?
• Impact measures, citation tracking
“Connecting the data to the research it informs”
• Altmetrics tools to measure non-traditional products and uses, etc.
9. Stable storage: Merritt repository
• Curation repository open to the UC community and beyond
• Discipline / content agnostic
• Micro-services architecture
• Easy-to-use UI or API
• Hosted or locally deployed
10. EZID: Long term identifiers made easy
• Precise identification of a dataset (DOI or ARK)
• Credit to data producers and data publishers
• A link from the traditional literature to the data (DataCite)
• Exposure and research metrics for datasets (Web of Knowledge, Google)
Take control of the management and distribution of your research, share and get credit for it, and build your reputation through its collection and documentation.
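For a sense of how lightweight this is in practice, here is a sketch of minting a test ARK through the EZID HTTP API, based on its public documentation; the shoulder, metadata values, and credentials are placeholders:

```python
import requests

# Minting a test ARK via the EZID API: POST ANVL-formatted metadata to a
# shoulder.  ark:/99999/fk4 is a throwaway test shoulder; the credentials
# below are placeholders for a real EZID account.
anvl = "\n".join([
    "_target: http://example.org/my/dataset",  # where the ARK should resolve
    "erc.who: Kunze, John",
    "erc.what: Demonstration dataset",
])
r = requests.post(
    "https://ezid.cdlib.org/shoulder/ark:/99999/fk4",
    data=anvl.encode("utf-8"),
    headers={"Content-Type": "text/plain; charset=UTF-8"},
    auth=("username", "password"),  # your EZID account
)
print(r.status_code, r.text)  # e.g. "success: ark:/99999/fk4..."
```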
11. Discovery: the DataCite consortium
• Technische Informationsbibliothek (TIB), Germany
• Canada Institute for Scientific and Technical Information (CISTI)
• L’Institut de l’Information Scientifique et Technique (INIST), France
• Australian National Data Service (ANDS)
• The British Library
• Library of the ETH Zürich
• California Digital Library, USA
• Library of TU Delft, The Netherlands
• Office of Scientific and Technical Information, US Department of Energy
• Purdue University, USA
• Technical Information Center of Denmark
12. New distributed framework
A flexible, scalable, sustainable network of diverse institutions.
Coordinating Nodes:
• retain complete metadata catalog
• perform basic indexing
• provide network-wide services
• ensure data availability (preservation)
• provide replication services
Member Nodes:
• hold a subset of all data
• serve their local community
• provide resources for managing their data
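This is the DataONE coordinating/member node framework. As a sketch of one network-wide service, the snippet below lists the federation's nodes from a coordinating node, assuming the listNodes endpoint and base URL from the DataONE architecture docs:

```python
import requests

# Listing the nodes of the DataONE federation from a coordinating node
# (CNCore.listNodes).  The base URL and API version are assumptions to
# verify against the current DataONE documentation.
r = requests.get("https://cn.dataone.org/cn/v2/node", timeout=30)
r.raise_for_status()
print(r.text[:500])  # XML nodeList: member and coordinating nodes
```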
13. The rest of the story
www.cdlib.org/uc3
John.Kunze@ucop.edu
uc3@ucop.edu for service questions
Editor's Notes
Panel: Partnerships between institutional repositories, domain repositories, and publishers. 20-25 mins, 9:30-11am. The 'data management ecosystem' angle seems appropriate for the panel, but feel free to share some of the technical aspects with the audience, too. Partnerships happen via conventions and APIs, e.g. data citation conventions. Libraries are chipping away on several fronts to try to shrink this "data curation" problem to a more manageable size, and they are offering a great deal of support for data management planning, data citation, identifier and repository services, repository federation, and “data publication”.
Research data can be seen to fit in a kind of ecosystem of inter-dependent stakeholder niches, each niche depending on others. In a broad sense, partnerships are about dependencies. Besides explicit partnerships between publishers and institutional and domain repositories, there are other critical inter-dependencies, essentially implicit partnerships. Libraries act as neutral connectors to sub-partners in system development and collection building, linking with museums and archives.
Development partners: DMPTool: U Va, Smithsonian, DCC, et al. DataUp: MSRC, GBMF, D1. WAS: LC, UNT, NYU, et al. User partners (clients, patrons, customers): any.
Partners: JISC/EDINA, paying customers on two continents
D1 network partners all over the world
Partnering with eScholarship and UC campuses for collection building
Partnering with JISC/EDINA, DataCite, the Research Data Alliance
Each member partners with regional data repositories. DataCite partners with publishers (eg, T-R) for a data citation index: credit, discovery, impact tracking; helping data authors verify use of their data, and helping identify how others have used the data. With archiving: re-use and reproducibility.