Extending our Reach: Librarians Supporting the Data Life CycleCC BY-SA 3.0This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License
Open data: growing governmental interest in making publicly-funded research more transparent and more available (NIH, NSF)The Office of Management and Budget (OMB) Circular A-110 provides the federal administrative requirements for grants and agreements with institutions of higher education, hospitals and other non-profit organizations. In1999, revised to provide public access under some circumstances to research data through the Freedom of Information Act (FOIA). Funding agencies have implemented the OMB requirement in various ways. Broader critical review: greater interest evaluating original research data (Nature)Technological advances: sharing of research results easier and faster (Repositories, Web 2.0)Reuse/preservation of research data: increased consideration of the cost and value of research data and need to ensure its longevityhttp://www.guardian.co.uk/news/datablog/2011/jul/07/government-transparency-data-releasesScience (December 2, 2011, Volume: 334, Issue: 6060) with a special section focusing on data replication and reproducibility in the sciences http://pineda-krch.com/2011/12/16/special-section-in-science-on-reproducible-research/Cyberinfrastructure and Software Development Researchhttp://www.osc.edu/research/csd/http://www.anneofcarversville.com/fp/liberals-have-different-dopamine-gene-more-friends-say-scien.htmlhttp://blogs.ischool.utexas.edu/digitalcuration/2010/09/26/a-survey-of-scientific-data-repositories/
Publishers are beginning to require authors to submit datasets with their completed manuscripts Sustainable Digital Data Preservation and Access Network Partners (DataNet)NSF DataNET Awards: to create a set of national data research infrastructure organizations Major DataONE grant ($20 million) Global data access and preservation network for environmental sciences Now seeking Librarians/IT people to work on the implementation, coordination, collaboration aspectsDC Lead by Library at Johns Hopkins Univ.Funding agencies also require faculty to include a data management plan as a component of grant applications/NIH since 2003 Data Sharing Requirement for awards over $500,000NSF require that grantee share their data within a reasonable length of time ( since 2002)Reduces cost if share data.
National Science Board’s Report to the National Science Foundation (Sept. 2005)Long-Lived Digital Data Collections: Enabling Research and Education in the 21st CenturyDetailed Roles and Responsibilities of researchers and institutions. Purdue’s Data Research Scientist was based on the Data ScientistsDescribed in the report as those who are crucial to the successful management of digital data collections (described as librarians)From 2006, ARL Workshop:To Stand the Test of Time: Long-term Stewardship of Digital Data Sets in Science and EngineeringResearch libraries should provide access both to the body of scientific literature and to the dataupon which this literature is based.November 2007, ARL Task Force on Library Support for E-Science final Report was published. Charge of the Task Force focused on raising awareness and positioning researchlibraries to be players in this new arenaOUTCOME 4: Research libraries as active participants in the conceptualization anddevelopment of research infrastructure, including systems and services to support theprocesses of research and the full life cycle of research assets.
Pressure to abide by regulations by granting agencies and publishers/journalsChanging regulations, confusing regulations. Pressure to bring in the money, need grantsLack of formal data management training/understandingMany scientists do not know how to mange their data, now they have labs with more students, working collaborative across institutions.Not aware of problems with Data Management until someone leaves. Or until someone asks for their data. Or worse, their data is destroyed/lost!Not a priority for most researchers (takes time away from the work that is rewarded) Data Management takes time, away from what they like doing… the research, the science. They would rather be doing this than managing their dataData is expensive (time, instrumentation, inability to reproduce)Instrumentation is expensive, when you buy time on a telescope, that data is irriplaceable. Expensive, Research data is being lost because of poor data management and lack of institutional repositoriesResearchers have collected all this “data” during their project and now what do they do with it. Where will they put it, even if they want to. More often they create webpages. URL breaks.
If we think about it Institutions, like UVa, face the same challenges that researchers face.ProblemResearch is becoming more data driven.Increasingly, data has research value beyond the project it was generated for. Data Deluge – the amount of data being produced is increasing exponentially, and the systems, tools and infrastructures to organize, manage, disseminate and preserve data have not kept pace.
Early 2009, We saw Data Services (Research … diff. variations) an emerging service at a variety of peer institutions. These institutions set priorities with data services consistent with their research library missions.Other Institutions used their existing librarians/liaisons and created services and web pages to help support Data Management.These new services were behind the UK (creation of its Digital Curation Center to help support HE) and Australia (National Data Service).
Researchers?VPR? (Vice President for Research)CIO? (Chief Information Officer/IT)OSP? (Office of Sponsored Programs)UL? (University Library)
To date, Subject Librarians have supported the needs of researchers through relatively traditional services revolving around information discovery, collection development and some elements of information management. publication process tailored to deliver the peer-reviewed scholarly article shifting from a focus on managing information in its published form to managing the digital data sets on which findings may be published.Think about traditional libraries being involved in the downstream (the publication). Preserving the data is an “Upstream” role – to have a say (input) into what is preservered and how. We can have a greater influence here so it would be easier to handle downstream.Libraries have long played critical roles within this ecosystem, in the pre-digital world of knowledge-making and knowledge-sharing. Libraries’ role as custodians of “downstream” knowledge – the reports of research communicated in articles and books – is well established. Libraries also contribute to the knowledge lifecycle through less visible but crucial interventions “upstream,” by advising researchers and teaching new scholars how to use the communicative apparatus of their field.
Literature shows currently: evidence for a shift in the nature of support and services, which are becoming more diverse and extensive, and moving beyond purely information-‐related activities and towards a greater emphasis on research data management in particular. clear trend towards providing support for research that is driven more by the requirements of researchers than it has been in the recent past, and a movement in some institutions towards a more proactive model of engagement with researchers.
Extending our Reach: LibrariansSupporting the Research Life Cycle Sherry Lake July 30, 2012 University of Florida Data Management Workshop
Challenge for InstitutionsData is expensive Time, instrumentation, inability to reproduceIncreasing regulation Granting agencies and journals require submissionInadequate training No formal data management curriculumPreservation is not a priority Where is the Institutional Repository?Data management plans (DMPs) and Policies A challenge for research administration and compliance, but really just the tip of the iceberg
Roles for Libraries “What role do the research and academic libraries envision for themselves and do scientists envision for librarians in a digital data framework…?” - To Stand the Test of Time…(2006) ARL. p.24
In the Beginning: Purdue University 2004 initiative for Libraries to collaborate with faculty across campus—apply library science knowledge and expertise to research problems: manage, organize, describe, disseminate, preserve information. Particular emphasis on addressing Jim Mullins data curation issues.Dean of LibrariesPurdue University
The Data Research Scientist Help the Libraries move research strategically forward. Help ramp up interaction with research faculty on campus. Leverage interdisciplinary research collaborations. Address the social, cultural and organizational aspects of data curation.
Trends in Academic LibrariesData Research Scientist Research Data Management Librarian Lorry I. Lokey Science Data Services Librarian Science Data Librarian
Why the Library? Neutral: works across the entire institution Strong in relationship building: has experience fostering discussion and relationships, and cultivates an existing support network Intellectual Property experts: has dealt with copyright, can translate to data Service-oriented: uniquely positioned as an intellectual service unit within the institution
Access. Knowledge. Success Research traditionally moves in this direction“published” unpublished “published” published secondary data/ research research research tertiary datasets traditional/non non-traditional traditional resources analyzed data/ Library service traditionally moves in this direction datasetsprocessed Given the changing nature of research and data/ scholarly communication (i.e., e-research), datasets how do librarians change to adapt to new ideas regarding what they collect to support “raw” research & education and how they provide data/ access to it? datasets Modified from: Brandt, D.S. “Scholarly Communication” (in To Stand the Test of Time: Long-Term Stewardship of Digital Data Sets in Science and Engineering.: Final Report of Workshop New Collaborative Relationships: Academic Libraries in the Digital Data Universe. ARL, Washington, DC, September 2006.)
Research Life Cycle Data Re- Data Deposit Discovery Use ArchiveProposal Project Data Data Data End ofPlanning Start Up Collection Analysis Sharing ProjectWriting Re- Purpose Data Life Cycle
Questions? Sherry Lake Senior Scientific Data Consultant, UVA Library firstname.lastname@example.org Twitter: shlakeuva Web: http://www.lib.virginia.edu/brown/data
References US National Science Board. Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century. Septem- ber 2005. http://www.nsf.gov/pubs/2005/ nsb0540/nsb0540_1.pdf Association of Research Libraries. To Stand the Test of Time: Long-term Stewardship of Digital Data Sets in Science and Engineering. A report to the National Science Foundation from the ARL Workshop on New Collaborative Relationships: the Role of Academic Libraries in the Digital Data Universe. September 26-27, 2006, Arlington, VA. http://www.arl.org/ bm~doc/digdatarpt.pdf
Image ReferencesTitle Slide 1. http://www.flickr.com/photos/grantloy/4577061495/ By grant_loy 2. http://www.flickr.com/photos/catchesthelight/3187651801/ By catchesthelight 3. http://www.flickr.com/photos/stevenm_61/2673806520/ By StevenM_61 4. http://www.flickr.com/photos/gregory-moine/4302464123/ By Gregory Moine 5. http://www.flickr.com/photos/54485353@N05/5784339186/ By janna487Slide 2 http://www.guardian.co.uk/news/datablog/2011/jul/07/government-transparency-data-releases http://www.osc.edu/research/csd/ http://blogs.ischool.utexas.edu/digitalcuration/2010/09/26/a-survey-of-scientific-data-repositories/Slide 5 http://s184.photobucket.com/albums/x311/mdreamweaver/?action=view¤t=under_pressure.jpg& http://digiex.net/attachments/general-discussions/computer-technology-discussion/1174d1231879288-post-your- desktops-pc-desktop.jpg http://faculty.virginia.edu/brodie/butch.html http://amazon.evsc.virginia.edu/gallery23/main.php/v/equipment/PHCK_Met_2004/PC170012.jpg.html http://www.cardcow.com/269117/national-radio-astronomy-observatory-300-foot-transit-telescope-green-bank-west- virginia/ http://www.azoquantum.com/equipment-details.aspx?EquipID=103 http://grenkblog.com/wp-content/uploads/2011/01/e-waste1.jpgSlide 8 http://www.krannert.purdue.edu/konline/2012s/viewpoints/mullins-p1.html