Before we get in to the drivers and barriers for data sharing I would like to ‘share’ 2 things about me with you.. First of all, I am a librarian. I work as project officer for LIBER, which is the Association of European Research Libraries. We have 380 member libraries from all over Europe. Our projects really focus on developing the role of the library as part of the Europeana Research Infrastructure and they fall into 3 main categories.
So, waves and surfing are anaologies that are often used when referring to the data deluge and research data sharing. This report ‘Riding the Wave’ which was written by the High Level Expert Group n Scientific Data in october 2010 talks about how Europe can gain from the rising tide of scientific data.
Called for a frameworkk for collaborative data infrastructure to outline how different stakeholders interact with the data sharing system
The role of libraries in data exchange
The role of libraries indata exchangeSusan Reilly, Twitter:@skreillyIFLA Science & Technology Section, Helsinki, 13thAug 2012
LIBER & the European Research InfrastructureAssociation of European Research Libraries Our projects: Content Europeana Libraries Europeana Newspapers Policy MEDOANET Infrastructure APARSEN AAA Study ODE
Now and Next• Authentication & authorisation• New skills
The Opportunities for Data Exchange Project• identify, collate, interpret and deliver evidence of emerging best practices in sharing, re-using, preserving and citing data, the drivers for these changes and barriers impeding progress, in forms suited to each audience• policy makers, funders, infrastructure operators, data centres, data providers and users, libraries and publishers It’s multi- stakeholder!
The DataPublication Pyramid (1) Data contained and explained within the article (2) Further data explanations in any kind of supplementary (3) Data files to articles referenced from the article and held in data centers and (4) Data repositoriespublications, describing available datasets (5) Data in drawers and on disks at the institute
Implications for librariesLevel of integration Implication for libraryData contained within the article Prepare for adequate preservation strategiesData published in supplementary files to Presentation and preservationarticles mechanisms Persistent linkDatasets referenced from the articles Citability of dataset Persistent link Perpetual access to datasetData published independently from written Support publication processpublications (“data publication”) Curation of datasets Metadata and documentationData in drawers and on disks at the Engage in data managementinstitute planning
Background Workshop at LIBER on linking data to publications Clear need to explore demand and need for new skills 8
The Survey • What is the role of libraries in data exchange? • In what areas is the demand for support from researchers? • What skills do libraries need to develop to support researchers in data exchange? • How do we develop these skills ? 9
The Survey• Survey sent to circa 800 librarians, 110 respondents• Also sent to AU/US libraries active in this area• Seven criteria: 1. Availability 2. Findability 3. Interpretability 4. Reusability 5. Citability 6. Curation 7. Preservation 10
The Results• 81% of respondents see demand for support provision• Supply is nowhere near matching demand e.g.50% said there was a demand to support the writing of data management plans, only 13% provide this support• Only 32% of libraries believe they have the right skills in place to provide support (53% working on developing these skills• The best place for underlying data is in official data repositories and archives (84%) 11
Differences with US/AUS libraries More priority for data More manage priority ment for data plans citation Agree that archiving data is most in demand
The future of data publishing The best place for underlying data is in official data repositories and archives 84% agree75% US/AUS Publications should always contain links to the underlying research data 74% agree 75% US/AUS Data archives should have a system in place for persistent identifiers that 74% agree properly support citation of datasets 87.5% Research journals should have much stricter editorial policies on data 64% agree availability 25% Underlying data should be part of the peer review process 54% agree 37.5% Underlying data should be cited separately in the reference list 46% agree 37.5% Publishers and editors should only accept in supplements the summary 26% agree datasets that are of direct relevance to the article 12.5% There are not sufficient trustworthy data archives available for authors to +/- ¼ agree deposit their data 24% agree 62.5%
What Skills ?88% of expert libraries see subject expertise as important!
Developing Skills W hich me a sure for de ve loping skills do you conside r a ppropria te ? Integrate data management 3% into professional education 1% 28% Data management courses 27% for librarians as continuing professional development Practical literature and guidelines N/A Other (please specify) 41%