Basics of Data
Photo by Johnathan Nightengale http://www.flickr.com/photos/johnath/7270799658/
5. Gold, Anna. Cyberinfrastructure, Data, and
Librarians: Parts 1 & 2. D-Lib.
Salo, Dorothea. Retooling Libraries for the
Data Challenge. Ariadne. July 2010:64.
2012 Priscilla Mayden Lecture from the
University of Utah: eScience and the
Evolution of Library Services
For advanced readings, review Dorothea
Salo’s 855 syllabus. http://dsalo.info/wp-
Slide 3: Photo by John (cygnus921) http://www.flickr.com/photos/cygnus921/2678359760
Slide 4: Photo by Johnathan Nightengale http://www.flickr.com/photos/johnath/7270799658/
Slide 5: Gold, Anna. Cyberinfrastructure, Data, and Librarians: Parts 1 & 2. D-Lib.
Salo, Dorothea. Retooling Libraries for the Data Challenge. Ariadne. July 2010:64.
2012 Priscilla Mayden Lecture from the University of Utah: eScience and the Evolution
of Library Services http://library.med.utah.edu/or/pmayden/home.php
Salo, Dorothea. Syllabus: LIS 855, Digital Curation.
Slide 6: Stuart, David. Facilitating Access to the Web of Data: a Guide for Librarians. October 2011:
Facet Publishing. London.
Milton, Michael. Head First Excel. 2010: O’Reilly. Sebastopol, CA.
Caldwell, Sally. Statistics Unplugged. 3rd Ed. 2010: Cengage Learning. Wadsworth, MA.
Slide 7: http://www.icpsr.umich.edu/icpsrweb/ICPSR/
Slide 8: http://datacurationprofiles.org/
Why am I on this stage?*AAAS Confererence*Started at UIC in Dec 2010 (Public Small Town to Urban R1 Medical)*Just following the NSF DMP Mandate*Database Designer –been working with data, cleaning, building –this appealed. *Started seeing Data related things turning up in job ads*Who is doing it in libraries? GIS, IT, and Catalogers—Linked library data, FRBR, RDA
So I’m supposed to be at least slightly autodidactic, right? I dug, I read I listened. Today we’re going to go through the what, where, and who that I’ve identified thus farNOT Comprehensive but a launch point for you or someone at your institution who is trying to get up to speed
Basics of Data: *Comes in all shapes and sizes*Clean*Good? *Good doesn’t always mean complex or vice versa*I charge by the hour to clean data*Also includes STATISTICS and DATA ANALYSIS *We don’t all need to statisticians*Software: SPSS/SAS/R, Mathematica (what’s behind Wolfram Alpha), and Advanced Excel. Excel –also works for grocery lists, though it’s harder to do calculations on those tables. *Library ComponentsMetadata and OntologiesSources of DataData Management Plans Basically: Find/Acquire, Describe, and Provide Access. Gee, now doesn’t that sound familiar…. Metadata—lot easier to find, we’ve been doing that for much longer, now it’s finding content appropriate ontologiesSources—best thing I can recommend is go out and look. I’ll give you some links, go digging around in other people’s LibGuidesE.G. Federal Government –alcohol DMPs are a natural for us.