An Overview of Data Citation Principles Synthesis Activity
1. Prepared for
Research Data Alliance 2nd Plenary
Sept 2013
An Overview of Data Citation Principles
Synthesis Activity
<bit.ly/altmandatacite>
Dr. Micah Altman
<escience@mit.edu>
Director of Research, MIT Libraries
Non-Resident Senior Fellow, The Brookings Institution
2. Why a synthesis effort?
3. “The nice thing about standards is that you have so many to choose from.”
– Andrew S. Tanenbaum
4. Relatively Recent Relevant Recommendations
• 1999-2004
– Example systems: NESSTAR, Virtual Data Center
– Core recommendations: cite research data in publications; use persistent identifiers; facilitate direct access to data through URIs
– Key references: [Ryssevik & Musgrave 2001], [Altman, et al. 2001]
• 2005-2009
– Example systems: Dataverse Network System, TIB Data DOI Registration
– Core recommendations: include versioning, fixity, and granularity for verification; use permanent institutions; facilitate attribution
– Key references: [Buneman 2006], [Altman & King 2007]
• 2009-
– Example systems: DataCite; Thomson-Reuters Data Citation Index; FigShare; Data Dryad
– Core recommendations: include data citations in standard locations; index data citations in catalogs; facilitate data mining
– Key references: [OECD 2009], [NAS 2012], [Force11 2013], [CODATA 2013]
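To make the recurring recommendations (persistent identifiers, versioning, fixity for verification) concrete, here is a minimal sketch of what a machine-checkable citation record might look like. It is an illustration under stated assumptions, not a format endorsed by any of the groups listed: the `DataCitation` class, its field names, and the rendered layout are hypothetical, and a plain SHA-256 digest stands in for whatever fixity scheme a repository actually uses.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class DataCitation:
    """Hypothetical citation record; field names are illustrative only."""
    authors: str
    year: int
    title: str
    identifier: str  # persistent identifier, e.g. a DOI or Handle
    version: str     # which version of the data was cited
    fixity: str      # digest of the cited bytes, used for verification

    def render(self) -> str:
        # Human-readable form; the identifier and digest remain machine-parsable.
        return (f"{self.authors} ({self.year}). {self.title}. "
                f"Version {self.version}. {self.identifier}. "
                f"sha256:{self.fixity}")


def fingerprint(data: bytes) -> str:
    """SHA-256 hex digest (an assumed stand-in for a real fixity scheme)."""
    return hashlib.sha256(data).hexdigest()


def verify(citation: DataCitation, data: bytes) -> bool:
    """Return True if the bytes in hand match the digest in the citation."""
    return fingerprint(data) == citation.fixity
```

A reader holding a copy of the data could call `verify(citation, data)` to check that it is the same content the author cited; any change to the bytes makes the check fail.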
5. What’s been done?
6. Ongoing Synthesis Group
• Hosted by Force 11:
http://www.force11.org/node/4432
• Met weekly through summer
• Reviewed current recommendations and efforts:
– Amsterdam Manifesto
– CODATA
– Digital Curation Centre
– DataCite
– Research Data Alliance
• Identified core principles that are consistent across
recommendation groups
• Identify gaps and areas lacking consensus
• Formulate a draft synthesis of:
– Principles
– Examples
– Key references
7. Sneak Preview
• Large core of agreement: need for citation; use of persistent
identifiers; support for human and machine access;
facilitation of verification and attribution
• One key to successful synthesis is maintaining conceptual
boundaries between data citation, data publication, and data
evaluation
• Convergence on principles; focus on identifying key
examples and reference documents
• Out of scope: detailed standards, protocols, infrastructure,
tools *
* “A mere matter of implementation” - PHB
8. What’s next?
9. Tomorrow
Data Citation Synthesis Group
Open Workshop
National Academy of Sciences
2100 C Street NW
18-19 September 2013
• Wed 1:15-5:15
In-depth review of draft synthesis principles
• Thu 8:45-11:50
Discussion of any proposed modifications.
Dissemination plan.
10. After Tomorrow
• Revision
– Public draft of synthesis
– Disseminate through professional meetings and
publications
• Adoption
– Target stakeholder organizations to adopt
declaration of principles
11. References
• [Ryssevik & Musgrave 2001]
J. Ryssevik, S. Musgrave. 2001. The Social Science Dream Machine. Social Science Computer Review.
• [Altman, et al. 2001]
M. Altman, et al. 2001. A Digital Library for the Dissemination and Replication of Quantitative Social Science Research: The Virtual Data Center. Social Science Computer Review.
• [Buneman 2006]
P. Buneman. 2006. How to Cite Curated Databases and Make Them Citable. SSDBM ’06.
• [Altman & King 2007]
M. Altman, G. King. 2007. A Proposed Standard for the Scholarly Citation of Quantitative Data. D-Lib Magazine.
• [OECD 2009]
T. Green. 2009. We Need Publishing Standards for Datasets and Data Tables. OECD.
• [NAS 2012]
P. Uhlir (ed.). 2012. For Attribution: Developing Data Attribution and Citation Practices and Standards. National Academies of Sciences.
• [Force11 2013]
M. Crosas, T. Carpenter, C. Borgman, D. Shotton. 2013. The Amsterdam Manifesto on Data Citation Principles. Force11.
• [CODATA 2013]
CODATA-ICSTI Task Group on Data Citation. 2013. Out of Cite, Out of Mind: The Current State of Practice, Policy, and Technology for the Citation of Data. Data Science Journal.
12. Questions?
Questions for me:
escience@mit.edu
Questions about the synthesis group:
http://www.force11.org/node/4432
Questions for the synthesis group:
datacitationworkgroup@force11.org
Editor's Notes
This work, by Micah Altman (http://micahaltman.com), is licensed under the Creative Commons Attribution-Share Alike 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-sa/3.0/us/ or send a letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA.