Prepared and presented by Jessica Gallinger, Systems & Data Librarian, Okanagan College. How faculty can utilize Open Data in their own research and with their classes.
2. INTRODUCTION TO OPEN ACCESS
Thomas Jefferson, the third President of the United States, described knowledge in the
following way: "He who receives an idea from me, receives instruction himself without lessening
mine; as he who lights his taper at mine, receives light without darkening me." In doing so,
Jefferson anticipated the modern concept of a public good.
… A public good [depends on] non-rivalrous consumption – the consumption of one individual
does not detract from that of [another]. Knowledge of a mathematical theorem clearly satisfies
[this] attribute: if I teach you the theorem, I continue to enjoy the knowledge of the theorem at
the same time that you do.
Stiglitz (1999). Knowledge as a Global Public Good.
3. OUTLINE
Introduction to Open Access
What is open research data?
What are some examples of open data?
Merits of publishing research data
Finding repositories
Finding datasets
Research data management
References
Appendixes
4. WHAT IS OPEN RESEARCH DATA?
Format: machine-readable digital files
License: for re-use
“Research Data”
  material: sample, biological, specimen, chemical, artifacts
  data:
    tabular: microdata [.sav, .txt], aggregate [.xls]
    spatial [.shp, .kml]
    image: lab notes [.pdf], pictures [.jpg]
    AV: audio [.wav], video [.mp4]
    text transcript [.txt]
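The file formats listed above can be grouped programmatically when sorting a folder of research outputs. The following is a minimal Python sketch, assuming only the extension-to-category pairs shown on this slide; the function name `classify`, the category labels and the sample filenames are hypothetical, and real collections will contain formats beyond these.

```python
from pathlib import Path

# Extension-to-category pairs taken from the slide above.
# The labels are illustrative, not a standard vocabulary.
DATA_TYPES = {
    ".sav": "tabular (microdata)",
    ".txt": "tabular (microdata) or text transcript",
    ".xls": "tabular (aggregate)",
    ".shp": "spatial",
    ".kml": "spatial",
    ".pdf": "image (lab notes)",
    ".jpg": "image (pictures)",
    ".wav": "AV (audio)",
    ".mp4": "AV (video)",
}


def classify(filename: str) -> str:
    """Return the research-data category for a filename, or 'unknown'."""
    return DATA_TYPES.get(Path(filename).suffix.lower(), "unknown")


if __name__ == "__main__":
    # Hypothetical filenames, for illustration only.
    for name in ("survey.sav", "parcels.shp", "interview.wav", "notes.docx"):
        print(name, "->", classify(name))
```

Note that `.txt` appears twice on the slide (microdata and transcripts), so an extension alone cannot always settle the category.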
5. WHAT ARE SOME EXAMPLES OF OPEN DATA?
Example: Inter-university Consortium for Political and Social Research (ICPSR)
Example: Kelowna Open Data Catalogue
Example: Abacus Dataverse Network
6. MERITS OF PUBLISHING DATA
May increase citation frequency
Permits mashups
May increase integrity
Metadata record: discoverability, searchability, contact-ability
* Data repositories accept null results
* Data repositories help you comply with Tri-Council funding requirements
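The metadata record mentioned above is what makes a deposited dataset discoverable, searchable and contactable. The following is a minimal Python sketch of such a record, with field names loosely modelled on the mandatory properties of the DataCite Metadata Schema (identifier, creator, title, publisher, publication year, resource type). The `contact` field, the helper `missing_fields` and every value shown are hypothetical placeholders; individual repositories define their own required fields.

```python
# Field names loosely follow DataCite's mandatory properties.
# "contact" is an added, hypothetical field; all values are placeholders.
REQUIRED = (
    "identifier",
    "creators",
    "title",
    "publisher",
    "publication_year",
    "resource_type",
)

record = {
    "identifier": "doi:10.xxxx/placeholder",
    "creators": ["Researcher, A."],
    "title": "Placeholder survey dataset",
    "publisher": "Example Repository",
    "publication_year": 2015,
    "resource_type": "Dataset",
    "contact": "researcher@example.org",
}


def missing_fields(rec: dict) -> list:
    """List required fields that are absent or empty in a record."""
    return [field for field in REQUIRED if not rec.get(field)]


if __name__ == "__main__":
    print("Missing fields:", missing_fields(record))
```

A check like this, run before deposit, can catch an incomplete record early; it does not replace the repository's own validation.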
Sharing Detailed Research Data Is Associated with Increased Citation Rate: Principal Findings
[The study] examined the citation history of 85 cancer microarray clinical trial publications with respect to the availability of their data. The 48% of trials with publicly available microarray data received 85% of the aggregate citations. Publicly available data was significantly (p = 0.006) associated with a 69% increase in citations, independently of journal impact factor, date of publication, and author country of origin using linear regression.
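The two shares quoted above (48% of trials, 85% of citations) can be turned into a rough per-trial comparison. The following Python sketch is a back-of-envelope calculation only; the variable names are illustrative. The resulting raw ratio is not the study's 69% estimate, which comes from a regression that controls for journal impact factor, publication date and author country, so the two numbers are not expected to agree.

```python
# Shares quoted in the study's principal findings.
shared_trials = 0.48      # fraction of trials with public data
shared_citations = 0.85   # fraction of all citations those trials received

# Citation share per unit of trial share, for each group.
per_trial_shared = shared_citations / shared_trials
per_trial_unshared = (1 - shared_citations) / (1 - shared_trials)

# Unadjusted ratio of average citations per trial (sharing vs not sharing).
raw_ratio = per_trial_shared / per_trial_unshared

print(f"Unadjusted ratio of mean citations per trial: {raw_ratio:.1f}x")
```

Averages like these are sensitive to a handful of highly cited papers, which is one reason the study reports an adjusted estimate instead.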
7. FINDING REPOSITORIES
Types of repositories:
Curated vs uncurated
Restricted use vs on demand
Disclosure risk
Digital preservation (or not)
Registry of Research Data Repositories: re3data.org
9. DATA MANAGEMENT PLANS
DMP Assistant is a bilingual tool for preparing data management plans (DMPs). The
tool follows best practices in data stewardship and walks researchers step-by-step
through key questions about data management.
10. REFERENCES
King, G. (2007). An Introduction to the Dataverse Network as an Infrastructure for Data Sharing. Sociological Methods and Research 36: 173-199. Copy at http://j.mp/iHJcAa
Piwowar, H.A., Day, R.S. and Fridsma, D.B. (2007). Sharing Detailed Research Data Is Associated with Increased Citation Rate. PLoS ONE 2(3): e308. doi:10.1371/journal.pone.0000308
Stiglitz, J. (1999). Knowledge as a global public good, in Kaul, I., Grunberg, I. and Stern, M. (eds.), Global Public Goods: International cooperation in the 21st century. New York: Oxford University Press, pp. 308–325.
Wicherts, J.M., Bakker, M. and Molenaar, D. (2011). Willingness to Share Research Data Is Related to the Strength of the Evidence and the Quality of Reporting of Statistical Results. PLoS ONE 6(11): e26828. doi:10.1371/journal.pone.0026828
Willinsky, J. (2010). Open access and academic reputation. Annals of Library and Information Studies 57: 296-302. http://goo.gl/UzTAW
11. APPENDIX OF RESOURCES
Highlighted repositories:
Inter-university Consortium for Political and Social Research (ICPSR)
Kelowna Open Data Catalogue
Abacus Dataverse Network
Registry of Research Data Repositories: re3data.org
DataCite Metadata Search
Portage DMP Assistant
12. APPENDIX: TRI-COUNCIL POLICIES
Funding Agency Open Data Policies
SSHRC: Research Data Archiving Policy
NSERC: Responsibilities and Accountability (see: Intellectual Property)
CIHR: Tri-Agency Open Access Policy: Publication-related Research Data