Getting Started with Data in Your Library
Director, Research Data Management
Research Data means recorded information, regardless of form or the media
in which it may be recorded, which constitute the original observations and
methods of a study and the analyses of these original data that are necessary
for reconstruction and evaluation of the Report(s) of a study made by one or
more Investigators. Research Data also includes all such recorded information
gathered in anticipation of a Report. Research Data differ among disciplines.
The term may include but is not limited to technical information, computer
software, laboratory and other notebooks, printouts, worksheets, other
media, survey, memoranda, evaluations, notes, databases, clinical case
history records, study protocols, statistics, findings, conclusions, samples,
physical collections, other supporting materials created or gathered in the
course of the Research, Tangible Research Property, unique Research
resources such as synthetic compounds, organisms, cell lines, viruses, cell
products, cloned DNA as well as genetic sequences and mapping information,
crystallographic coordinates, plants, animals and spectroscopic data, and
other compilations formed by selecting and assembling preexisting materials
in a unique way.
Big data is like teenage sex: everyone talks about
it, nobody really knows how to do it, everyone
thinks everyone else is doing it, so everyone
claims they are doing it…”— Dan Ariely
“If it gives you pain, it is Big Data.”
- Donald Brown, new Director of Virginia Integrative Data Institute, speaking
at Research Data and Technology Fair presented by Claude Moore Health
Sciences Library, University of Virginia Health System
Lots of Small Data = Big Data
Heidorn, P. Bryan. (2008) Shedding Light on the Dark Data in the Long Tail of Science. Library Trends, 57(2), pp.280-299.
Libraries and Data
Library and information professionals:
• need to become more involved with semantic
web or users will reinvent wheel (i.e. ontologies)
• have the interpersonal and subject specialization
for reference/consultation that IT doesn't have
• continue to help users find the information they
Stuart, David.(2011) Facilitating Access to the Web of Data: a Guide for Librarians. Facet
What Can You Do to Help?
Learn about data.
Learn about data in your organization.
Find the gaps.
Help people figure out what they need.
Help people find what they need.
What other skills do you have? Analysis?
Visualization? Programming? Etc.
Learn About Data
UK Data Archive http://www.data-archive.ac.uk/home
• Kdnuggets http://www.kdnuggets.com/
Who Else is Involved?
What are Your Organizational Needs?
Reuse of Data
Author ID –
– Citation of
• Regulatory requirements
• Grant and funding requirements
• Journal requirements
• Patent/ Intellectual Property documentation
• Need to keep market share
• Purdue Data Curation Profile
• Data Management Plan https://dmp.cdlib.org/ or
http://library.umassmed.edu/data_management_frameworks.pdf (pg 21)
to go over basics
• Research and work flow story
Keep Learning More
Library school classes and programs
Statistics and analysis http://www.gapminder.org/
Twitter (#data or follow a particular group,
person, or meeting)
– Kevin the Librarian http://kevinthelibrarian.wordpress.com/
– Strata (O’Reilly) http://strata.oreilly.com/
- Flowing Data
• Databib http://databib.org/
• OpenDOAR (Directory of Open Access
• DATA.gov http://www.data.gov/
• ICPSR (Inter-University Consortium for Political
and Social Research) subscription
Find a Niche
• Disciplinary Metadata
• “Data Analysis” (Coursera) videos
• Research Data Management: Principles, Practices,
and Prospects (DataRes project, library response to
• Keep track of what you do.
• Keep track of what is done with the
information you find.
• Altmetrics http://impactstory.org/
• Data citation “Out of Cite, Out of Mind”