Data Collections<br />Bernadette Duffy and Abraham de Jesus<br />LIBR 580<br />Louise Broadley<br />October 5, 2011<br />
What are Data Collections?<br />Data from surveys, opinion polls, climate data<br />Numeric data in machine-readable form ...
Data Lifecyclefrom DataOne https://www.dataone.org/content/education<br />
Libraries and Data Collections<br />Important in academic and special libraries<br />Used by researchers and policy analys...
UBC Library Data Serviceshttp://data.library.ubc.ca/<br />
Data suppliers - UBC<br /><ul><li>Statistics Canada http://www.statcan.gc.ca/Canadian Census, labour, health, income, trade
The Roper Center for Public Opinion Research at the University of Connecticut http://www.ropercenter.uconn.edu/ Opinion polls
Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan http://data.library.ub...
abacus - data set Part 1<br />
abacus - data set Part 2<br />
Data file<br />
Challenge - Cost<br />Strategies to reduce cost for subscription data sets<br />Collaborative purchase with several depart...
Challenge - Selection<br />Decisions are based on<br /><ul><li>Collection policy</li></ul>Knowledge of what is available<b...
Challenge - Supporting Access<br />Make visible in Library Catalogue. <br />Convert file formats for use in statistical pr...
Infrastructure<br />Data sets can be highly variable in size.<br />This creates certain infrastructural challenges for sto...
Storage<br />Scalability: “the ability of a system, network, or process, to handle growing amounts of work in a graceful m...
Systems Support<br />Network: Can the network handle downloading of large datasets? <br />Hardware: Can the systems suppor...
UN Gender Info<br />
Institutional Support<br />Workflows: Can your data collections be integrated into the larger collections management frame...
Preservation<br />Best practices for data preservation mean that preservation concerns enter in at the earliest point in t...
Criteria for Preservation<br />Obligation<br />Value<br />Uniqueness<br />Verification<br />Other Cultural Reasons<br />
Metadata<br />Plagued by a lack of standards.<br />No international metadata standard for data sets.<br />Needs to give en...
Upcoming SlideShare
Loading in …5
×

Management of Data Collections

509 views

Published on

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
509
On SlideShare
0
From Embeds
0
Number of Embeds
4
Actions
Shares
0
Downloads
4
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Management of Data Collections

  1. 1. Data Collections<br />Bernadette Duffy and Abraham de Jesus<br />LIBR 580<br />Louise Broadley<br />October 5, 2011<br />
  2. 2. What are Data Collections?<br />Data from surveys, opinion polls, climate data<br />Numeric data in machine-readable form <br />To make use of the data files need Codebooks and other supporting files<br />
  3. 3. Data Lifecyclefrom DataOne https://www.dataone.org/content/education<br />
  4. 4. Libraries and Data Collections<br />Important in academic and special libraries<br />Used by researchers and policy analysts<br />Academic libraries starting to get involved in the preservation of research data from own institution<br />
  5. 5. UBC Library Data Serviceshttp://data.library.ubc.ca/<br />
  6. 6. Data suppliers - UBC<br /><ul><li>Statistics Canada http://www.statcan.gc.ca/Canadian Census, labour, health, income, trade
  7. 7. The Roper Center for Public Opinion Research at the University of Connecticut http://www.ropercenter.uconn.edu/ Opinion polls
  8. 8. Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan http://data.library.ubc.ca/gen/icpsr.htmlSocial Sciences data</li></li></ul><li>abacus<br />
  9. 9. abacus - data set Part 1<br />
  10. 10. abacus - data set Part 2<br />
  11. 11. Data file<br />
  12. 12. Challenge - Cost<br />Strategies to reduce cost for subscription data sets<br />Collaborative purchase with several departments (UC Berkeley)<br />University consortium (UBC, SFU, UVic, UNBC combined to form BC Research Libraries’ Data Services consortium – abacus http://abacus.library.ubc.ca/<br />
  13. 13. Challenge - Selection<br />Decisions are based on<br /><ul><li>Collection policy</li></ul>Knowledge of what is available<br />Understanding user need<br />Cost<br />Individual patron need<br />If the data would be useful to multiple users<br />
  14. 14. Challenge - Supporting Access<br />Make visible in Library Catalogue. <br />Convert file formats for use in statistical programs<br />Outreach / education in use of data collection and statistical tools<br />Workshops on data literacy<br />Create a Data Lab<br />Become embedded in course requiring use of data collections<br />
  15. 15. Infrastructure<br />Data sets can be highly variable in size.<br />This creates certain infrastructural challenges for storage, institution’s system, and the institution itself. <br />
  16. 16. Storage<br />Scalability: “the ability of a system, network, or process, to handle growing amounts of work in a graceful manner or its ability to be enlarged to accommodate that growth.” (Wikipedia)<br />Location: Does your institution expect to host the data produced by researchers at that institution?<br />
  17. 17. Systems Support<br />Network: Can the network handle downloading of large datasets? <br />Hardware: Can the systems support computation over disparate data sets?<br />Software: Do you have statistical programs (like SPSS or R) available for your users?<br />Flexibility: Can your system handle the wide variety of data formats, sizes, and uses? <br />Example of a good system: http://www.devinfo.info/genderinfo/<br />
  18. 18. UN Gender Info<br />
  19. 19. Institutional Support<br />Workflows: Can your data collections be integrated into the larger collections management framework?<br />Faculty Partnerships: Will faculty work with the library to create data management plans? <br />Mandate: Does your institution consider data collections a priority?<br />
  20. 20. Preservation<br />Best practices for data preservation mean that preservation concerns enter in at the earliest point in the data management cycle: creation. <br />
  21. 21. Criteria for Preservation<br />Obligation<br />Value<br />Uniqueness<br />Verification<br />Other Cultural Reasons<br />
  22. 22. Metadata<br />Plagued by a lack of standards.<br />No international metadata standard for data sets.<br />Needs to give enough context for the data to be understandable. <br />No clear citation practice has emerged for data sets. <br />Data Documentation Initiative (DDI)<br />
  23. 23. Wrap-Up<br />What is a data collection? A collection of the data resulting from research.<br />They have unique challenges for selection, access, infrastructure, and preservation. <br />Data Curation is an up and coming field in librarianship. <br />Librarians are uniquely poised to be involved in the recent surge of interest in data. <br />

×