This work “Trustworthy Repositories, Organizations & Infrastructure”, by Micah Altman (http://redistricting.info) is licensed under the Creative Commons Attribution-Share Alike 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-sa/3.0/us/ or send a letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA.
Linking Data to Publications through Citation and Virtual Archives Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for the 2011 SSP 33rd Annual Meeting June 2011
Leonid Andreev, Ed Bachman, Adam Buchbinder, Ken Bollen, Bryan Beecher, Steve Burling, Kevin Condon, Jonathan Crabtree, Merce Crosas, Gary King , Patrick King, Tom Lipkis, Freeman Lo, Jared Lyle, Marc Maynard, Nancy McGovern, Lois Timms-Ferrarra, Akio Sone, Bob Treacy
Thanks to the Library of Congress (PA#NDP03-1), the National Science Foundation (DMS-0835500, SES 0112072), IMLS (LG-05-09-0041-09), the Harvard University Library, the Institute for Quantitative Social Science, the Harvard-MIT Data Center, and the Murray Research Archive.
Linking Data to Publications through Citation and Virtual Archives * And co-conspirators
Reprints available from: http://maltman.hmdc.harvard.edu
M. Altman, Adams, M., Crabtree, J., Donakowski, D., Maynard, M., Pienta, A., & Young, C. 2009. "Digital preservation through archival collaboration: The Data Preservation Alliance for the Social Sciences." The American Archivist . 72(1): 169-182
M. Altman and G. King. 2007. “A Proposed Standard for the Scholarly Citation of Quantitative Data”, D-Lib, 13, 3/4 (March/April).
M. Altman,2008, "A Fingerprint Method for Verification of Scientific Data" in, Advances in Systems, Computing Sciences and Software Engineering , (Proceedings of the International Conference on Systems, Computing Sciences and Software Engineering 2007) , Springer Verlag.
M. Crosas, 2011, “The Dataverse Network: An Open-Source Application for Sharing, Discovering and Preserving Data”, D-Lib Magazine 17(1/2).
G. King, 2007, " An Introduction to the Dataverse Network as an Infrastructure for Data Sharing", Sociological Methods and Research , Vol. 32, No. 2, pp. 173-199
Linking Data to Publications through Citation and Virtual Archives
The state tax authority: counting people, estimating wealth
Reformers use data to assess the performance of the state
Science informs public policy continually
In modern democracy: the public needs a direct source of information
Linking Data to Publications through Citation and Virtual Archives Source: “Propaganda” http://www.media-studies.ca/articles/images/berlin_wall.jpg Motivations Elements Citing Data Virtual Archives
Open Data Enables New Forms of Science and Education
Data Intensive Science
Increased opportunities for interdisciplinarity
Science modeling reality across multiple scales
Continuous, complete, fine-grained information on physical processes, systems, human behavior
Open Data Democratizes Science
Institutions outside of the inner circle of research
Open data eases transition from education to research
In addition, sharing data increases citation rates [Gleditsch 2003; Wilson 2008; Piowar 2007]
Linking Data to Publications through Citation and Virtual Archives Visualization from multiple experiments using Community Climate Systems Model, through Earth Science Grid. Source: “ Beyond Being There”, National Science Foundation, 2008. Motivations Elements Citing Data Virtual Archives
was published in the NIH Guide on February 26, 2003.
“ Starting with the October 1, 2003 receipt date, investigators submitting an NIH application seeking $500,000 or more in direct costs in any single year are expected to include a plan for data sharing or state why data sharing is not possible. “
No later than the main findings from the final data set are accepted for publication
NSF, All proposals must (as of 1/1/2011) include a data management plan.
Specific requirements vague, for the most part: “will be determined by the community of interest through the process of peer review and program management.”
“ will review data management and sharing plans, and any costs involved in delivering them, as an integral part of the funding decision ”
Linking Data to Publications through Citation and Virtual Archives Motivations Elements Citing Data Virtual Archives