Research Data Management at Edinburgh: Effecting Culture Change


Slides presented at the Spanish Agency of Science and Technology (FECYT) and the network of Spanish repositories (RECOLECTA) Research Data Management Webinar Series - see url:

Published in: Education, Technology
  • All urls and links will be available on the last slide
  • Members are: social science researchers and scientists who produce and use micro- and macro-level social data; information specialists who preserve social data, manage facilities and provide services that promote the secondary use of social data; and methodologists and computing specialists who advance technical methods to manipulate and analyze social data. Membership is usually 60% US / Canada, 30% Europe, less than 10% others.
  • Level of use or existence of the following within Spanish institutions in the first instance: Data Audit Framework, Data Curation Profiles, DMP Online
  • IASSIST – and how it can help or support data professionals from academic / research institutions in Latin speaking countries
  • Data library role not seen as prominent; this may include the macro-economic and social indicator-based resources / GIS. Centralised / departmental; computing services / library / national body / research data centres
  • Closed mailing list, Twitter account, blog, Facebook, LinkedIn, Flickr, IASSIST Quarterly (peer-reviewed journal)
  • NORC – National Opinion Research Center, part of the University of Chicago, based in Washington; Inter-American Development Bank & Instituto Nacional de Estadística y Geografía (INEGI); Costa Rican Population Center. If you want to know more about this, please get in touch or visit the IASSIST website. Proposal that IASSIST be held in Spain / Portugal
  • 25 years ago disk storage was expensive. Researchers interested in working with data came together to petition the PLU and the University’s Library, wanting a university-wide provision for files that were too large to be stored on individual computing accounts. Early holdings were research data from the universities of Edinburgh, Glasgow and Strathclyde
  • UKBORDERS, Digimap Collection, Go-Geo!, Agcensus; moving pictures and sound services (EIG, newsFilm Online); plus A&I databases; The Depot, HILT, GetRef, LOCKSS, PePRS
  • Primarily social sciences but not exclusively so: large-scale government surveys (micro data), macro-economic time series data (country-level data), election studies, geospatial data, financial datasets, population census data. Free on internet / subscription / through national data centres/archives / resource discovery portals. Registration / authorisation and authentication / special conditions / budget to pay for data. SPSS, STATS, SAS, R, ArcGIS – interpret documentation/codebooks, merge and match users' data with other data (via look-up tables), subset data. Data Catalogue
  • Training for postgraduates and early career researchers. These were the School of Divinity; School of History, Classics and Archaeology; School of Biomedical Sciences; School of Molecular and Clinical Medicine; and School of Physics and Astronomy. Also, the School of Geosciences
  • Digital Curation Centre, Data Library, Information Services Infrastructure, Research Computing, Library & Collections. Concern is both for the shorter term – ensuring competitive advantage through secure and easy-to-use access, and for the longer term – ensuring enduring access and usability to the research community into the future and compliance with legislation. 2 working groups: RDS working group, RDM working group
  • IS Infrastructure, DCC, Digital Library, Data Library
  • Funded by JISC as part of its UK programme, Managing Research Data, to develop online learning materials to help researchers manage their digital assets. IAD – set up to deliver training and development for postgraduate students and staff – via online courses, Virtual Learning Environments, transferable skills training
  • A set of Multi- or Cross-Disciplinary online learning resources
  • Shareable Content Object Reference Model – XML-based
  • JorumOpen - national OER repository
  • Loss of passwords / access and permissions seems to depend on operating system / browser
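The Data Library note above mentions merging and matching users' data with other data via look-up tables, and subsetting data. A minimal sketch of that idea in plain Python follows. The survey rows, the region codes and the function names `merge_with_lookup` and `subset` are all hypothetical and chosen for illustration only; in practice this kind of work would more likely be done in the packages named in the note (SPSS, SAS, R, ArcGIS).

```python
# Hypothetical survey rows keyed by a region code (illustrative data only).
survey = [
    {"respondent": 1, "region_code": "S12", "age": 34},
    {"respondent": 2, "region_code": "S46", "age": 51},
    {"respondent": 3, "region_code": "S12", "age": 27},
    {"respondent": 4, "region_code": "X99", "age": 45},
]

# Hypothetical look-up table mapping region codes to region names.
lookup = {"S12": "Edinburgh", "S46": "Glasgow"}


def merge_with_lookup(rows, table, key, new_field):
    """Return copies of rows with new_field filled in from the look-up table.

    Rows whose key has no match get None, so unmatched records stay visible.
    """
    return [{**row, new_field: table.get(row[key])} for row in rows]


def subset(rows, predicate):
    """Keep only the rows for which predicate(row) is true."""
    return [row for row in rows if predicate(row)]


merged = merge_with_lookup(survey, lookup, "region_code", "region_name")
edinburgh_only = subset(merged, lambda r: r["region_name"] == "Edinburgh")
unmatched = subset(merged, lambda r: r["region_name"] is None)
```

Keeping unmatched rows (with `None`) rather than dropping them is a deliberate choice in this sketch: it makes gaps in the look-up table easy to spot before any analysis.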
  • Research Data Management at Edinburgh: Effecting Culture Change

    1. 1. Stuart Macdonald Associate Data Librarian EDINA & Data Library University of Edinburgh [email_address] RECOLECTA Online Seminar on Research Data Management and Curation, 24 November 2011 Research Data Management at Edinburgh: Effecting Culture Change
    2. 2. Overview
          • IASSIST Latin Engagement Strategic Action Group
          • Research Data Management at University of Edinburgh
          • Research Data MANTRA
    3. 3. International Association for Social Science Information Services and Technology – IASSIST. An international organisation of professionals working with IT and data services to support research and teaching in the social sciences. Its 320 members work in a variety of settings, including data archives, statistical agencies, research centers, libraries, academic departments, government departments, and non-profit organisations.
    4. 4. IASSIST Latin Engagement Strategic Action Group Co-Chairs: Luis Martínez-Uribe Instituto Juan March, Madrid Stuart Macdonald EDINA & Data Library, University of Edinburgh Mission : To propose a set of activities that IASSIST could undertake to further engage with data professionals from Ibero-american institutions
    5. 5. Benefits
          • Identify data infrastructures and resources from Spanish & Portuguese speaking countries to further the establishment of collaborative data networks.
          • Reach out to Spanish/Portuguese speaking academic and support communities that could benefit from the existing network of expertise, guidance and knowledge that is the IASSIST community
    6. 6. IASSIST Latin Members
          • Two members from Spain in 1993 and 2001
          • One member from Mexico 2009
          • Representation in 2011 from Spain and Portugal
          Phase I: Spain
          • Desk research (navigating institutional websites, annual reports, mailing lists)
          • Combination of online survey questionnaire & telephone interviews (with script)
          • 3 areas of interest: traditional data library role; research data management & curation; IASSIST
          • Responses from library & support services from national & regional universities, Data Centres
    7. 7. Findings: traditional data library
          • Only one 'traditional' data library
          • Libraries provide training courses and support for subscription data services
          • Minimal data cataloguing!
          Findings: research data management
          • Growing area of interest in repository community
          • Some institutions have started to deposit data in IRs - others are conducting scoping & requirement gathering exercises
          • New skills needed – unclear about whose responsibility
    8. 8. IASSIST
          • Half of respondents knew of IASSIST
          • Perceived benefits that IASSIST can offer:
          • Information about data support / data libraries
          • Basic skills to handle social science data
          • Links to free social science data resources
          • Data analysis and visualisation tools
          • Tools to manage and share research data
          • A peer to peer support network
          • Access to IASSIST online facilities
          • Help to raise awareness and share knowledge about RDM issues (remove duplication of effort)
    9. 9. The Latin Engagement Action Group has come up with a number of outreach activities aimed at supporting data professionals from Spanish and Portuguese speaking educational institutions, namely:
          • Participation in Research Data Management Webinars for Spanish/Portuguese data specialists
          • Preparation of a Latin-American session in next IASSIST annual conference in collaboration with outreach committee
          • Spanish and Portuguese translation of the main pages of the IASSIST site
          In addition to conducting similar scoping exercises in Portugal & Latin-America
    10. 10. Research Data Management at University of Edinburgh
    11. 11. EDINA & Data Library (EDL)
          • EDINA and University Data Library (EDL) together are a division within Information Services of the University of Edinburgh.
          • EDINA is a JISC-funded National Data Centre providing national online resources for education and research.
          • The Data Library assists Edinburgh University users in the discovery, access, use and management of research datasets.
    12. 12. EDINA National Data Centre
          • Mission statement: “enhance the productivity of research, learning and teaching in UK higher and further education ..”
          • Networked access to a range of online resources for UK FE and HE
          • Services free at the point of use for use by staff and students in learning, teaching and research through institutional subscription
          • Focus is on service but also undertake R&D (projects → services)
          • delivers about 20 online services
          • has about 10 major projects (including services in development)
          • employs about 75 staff (Edinburgh & St Helens)
    13. 13. What is a Data Library? A data library refers to both the content and the services that foster use of collections of numeric, audio-visual, textual or geospatial data sets for secondary use in research. Focus on re-use of data
    14. 14. Data Library services and projects
          • Data Library & consultancy
          • Edinburgh DataShare
          • JISC-funded projects
          • DISC-UK DataShare (2007-2009)
          • Data Audit Framework Implementation (2008)
          • Research Data MANTRA (2010-2011)
    15. 15. Data Library & Consultancy. Building relationships with researchers via postgraduate teaching activities, research support projects, IS Skills workshops, Research Data Management training and through traditional reference interviews.
          • finding …
          • accessing …
          • using …
          • teaching …
          • managing
    16. 16. Edinburgh DataShare was built as an output of the JISC-funded DISC-UK DataShare project (2007-2009)
    17. 17. Edinburgh DataShare. An online institutional repository of multi-disciplinary research datasets produced at the University of Edinburgh, hosted by the Data Library. Researchers producing research data associated with a publication, or which has potential use for other researchers, can upload their dataset for sharing and safekeeping. A persistent identifier and suggested citation will be provided. DataShare is a customised DSpace instance with a selection of standards-compliant metadata fields useful for discovery of datasets, through Google and other search engines via OAI-PMH.
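Slide 17 says dataset metadata is exposed for discovery via OAI-PMH. As a rough illustration of what a harvester does with such an endpoint, the sketch below builds a standard `ListRecords` request URL and extracts identifiers and Dublin Core titles from a response. The base URL, the sample XML and the function names are assumptions made for this example; the actual DataShare endpoint is not given in the slides, and no network call is made here.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Hypothetical base URL: the real repository endpoint is not given in the slides.
BASE_URL = "https://repository.example.org/oai/request"

# Namespaces defined by the OAI-PMH 2.0 protocol and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build a ListRecords request URL for an OAI-PMH endpoint."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def identifiers_and_titles(xml_text):
    """Return (identifier, title) pairs from a ListRecords response."""
    root = ET.fromstring(xml_text)
    pairs = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext(
            "oai:header/oai:identifier", default="", namespaces=NS
        )
        title = record.findtext(".//dc:title", default="", namespaces=NS)
        pairs.append((identifier, title))
    return pairs


# Invented sample response, trimmed to the elements this sketch reads.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:repository.example.org:1234/1</identifier>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example survey dataset</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

request_url = list_records_url(BASE_URL)
records = identifiers_and_titles(SAMPLE_RESPONSE)
```

A real harvester would also follow the protocol's `resumptionToken` to page through large result sets; that is left out to keep the sketch short.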
    18. 18. A decision-making and planning tool for institutions with digital repositories in existence or in development that are considering adding research data to their digital collections. Downloadable as PDF.
    19. 19. Edinburgh Data Audit Framework (DAF) Implementation (May – Dec 2008). A JISC-funded pilot project produced 6 case studies from research units across the University, identifying research data assets and assessing their management, using the DAF methodology developed by the Digital Curation Centre.
          4 main outcomes:
          • Develop online RDM guidance
          • Develop university research data management policy
          • Develop services & support for RDM (in partnership with IS)
          • Develop RDM training
    20. 20. Research Data Management Web Guidance
          • Online suite of web pages for IS website developed in 2009 (will be revamped early 2012)
          • Downloadable PDF handbook
    21. 21. Championed by Vice-Principal & Chief Information Officer Professor Jeff Haywood, the policy for management of research data was approved by the University Court on 16 May, 2011.
          • Along with University of Oxford, the first RDM policies in the UK
          • “The University adopts the following policy on Research Data Management. It is acknowledged that this is an aspirational policy, and that implementation will take some years.” *
          * Quote from University of Edinburgh website
    22. 22. University Research Data Management Policy. In spring 2010, a review commenced at the University of Edinburgh to address the issue of managing the rapidly expanding volume and complexity of data produced by researchers. The Review was overseen by the IT & Library Committee and had twin tracks to look at Research Data Storage, and Data Management, Curation and Preservation. The Review looked at current practice in the University and assessed current practice in peer universities and internationally. Review Committee responsible for drafting policy to support the University’s mission for ‘the creation, dissemination and curation of knowledge’.
    23. 23. The 10 Policy Principles
          1. Research data will be managed to the highest standards throughout the research data lifecycle as part of the University’s commitment to research excellence.
          2. Responsibility for research data management through a sound research data management plan during any research project or programme lies primarily with Principal Investigators (PIs).
          3. All new research proposals must include research data management plans or protocols that explicitly address data capture, management, integrity, confidentiality, retention, sharing and publication.
          4. The University will provide training, support, advice and where appropriate guidelines and templates for research data management and research data management plans.
          5. The University will provide mechanisms and services for storage, backup, registration, deposit and retention of research data assets in support of current and future access, during and after completion of research projects.
    24. 24.
          6. Any data which is retained elsewhere, for example in an international data service or domain repository, should be registered with the University.
          7. Research data management plans must ensure that research data are available for access and re-use where appropriate and under appropriate safeguards.
          8. The legitimate interests of the subjects of research data must be protected.
          9. Research data of future historical interest, and all research data that represent records of the University, including data that substantiate research findings, will be offered and assessed for deposit and retention in an appropriate national or international data service or domain repository, or a University repository.
          10. Exclusive rights to reuse or publish research data should not be handed over to commercial publishers or agents without retaining the rights to make the data openly available for re-use, unless this is a condition of funding.
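The third policy principle on slide 23 lists seven topics that a research data management plan must explicitly address. One way a support service might turn that list into a simple completeness check is sketched below. The dictionary layout, the topic keys and the function name `missing_topics` are assumptions for illustration; the policy itself does not prescribe any format for plans.

```python
# The seven topics named in the policy principle on new research proposals.
REQUIRED_TOPICS = (
    "capture",
    "management",
    "integrity",
    "confidentiality",
    "retention",
    "sharing",
    "publication",
)


def missing_topics(plan):
    """Return the required topics that have no non-empty text in the plan.

    `plan` is assumed to be a dict mapping topic name to free text.
    """
    missing = []
    for topic in REQUIRED_TOPICS:
        value = plan.get(topic)
        if not isinstance(value, str) or not value.strip():
            missing.append(topic)
    return missing


# Hypothetical draft plan with some sections still to be written.
draft_plan = {
    "capture": "Interview audio recorded on encrypted devices.",
    "management": "Files named by date and participant code.",
    "confidentiality": "Transcripts anonymised before analysis.",
    "retention": "   ",
    "sharing": "Anonymised transcripts deposited in a repository.",
}

gaps = missing_topics(draft_plan)
```

A check like this only confirms that each topic has been written about, not that what is written is adequate; reviewing the content remains a human task.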
    25. 25. Next Steps
          • IS Implementation group established, led by Director, Library and Collections
          • Research Data Management Policy Implementation Project
          • Working group was set up to define a research data storage strategy
          • Vice Principal has promised half a terabyte of storage per researcher
    26. 26. Research Data MANTRA
          Partnership between:
          • Edinburgh University Data Library
          • Institute for Academic Development
          Funded by JISC Managing Research Data programme (Sept. 2010 – Aug. 2011)
    27. 27. Why manage research data?
          • Data Deluge – exponential growth in the volume of digital research artifacts created within academia
          • Data management is one of the essential areas of responsible conduct of research.
          By managing your data you will:
          • Meet funding body grant requirements.
          • Ensure research integrity and replication.
          • Ensure research data and records are accurate, complete, authentic and reliable.
          • Increase your research efficiency.
          • Enhance data security and minimise the risk of data loss.
          • Prevent duplication of effort by enabling others to use your data.
    28. 28. Project overview
          • Grounded in three disciplinary contexts: social science, clinical psychology and geoscience
          • Aim was to develop online interactive open learning resources for PhD students and early career researchers that will:
          • Raise awareness of the key issues related to research data management & contribute to culture change
          • Provide guidelines for good practice
    29. 29. Selling RDM as a Transferable Skill (voluntary participation). The FRUIT principles: Fun / Feedback, Relevant / Real, Useful, Interesting, Timely
    30. 30. Online learning module
          Eight units with activities, scenarios and videos:
          • Research data explained
          • Data management plans
          • Organising data
          • File formats and transformation
          • Documentation and metadata
          • Storage and security
          • Data protection, rights and access
          • Preservation, sharing and licensing
          Four data handling practicals:
          • SPSS, NVivo, R, ArcGIS
          Video stories from researchers in a variety of settings
          Xerte Online Toolkits – University of Nottingham
    31. 31. Online learning module
          • Delivered online – self-paced, available ‘anytime, anyplace’
          • Emphasis on practical experience and active engagement via online activities
          • One hour per unit
          • Read and work through scenarios & activities (incl. videos etc)
          • CC licence to allow manipulation of content for re-use with attribution
          • Portable content in open standard formats (e.g. SCORM)
    32. 32. MANTRA dissemination
          • Deposit learning materials with an open licence in JorumOpen and Xpert
          • Learning materials to be embedded in three participating postgraduate programmes and made available through IAD programme for use by all postgraduate students and early career researchers: this academic year
          • Public website launch:
          • Download/re-brand/re-purpose materials from JorumOpen in standards-compliant formats (forthcoming)
          • Software modules – data handling practicals (MS Word)
          Image courtesy of the periodic table printmaking project –
    33. 33. Lessons learned
          • Time – underestimated!
          • Authoring content – not an easy task...
          • Re-using existing materials vs writing from scratch
          • Copyright – images, videos (where possible open or CC-licensed materials)
          • Consistency in style, terminology, tempo
          • Setting up Xerte Online Toolkits (XOT) on the server
          • Security issues
          • Issues with the software
    34. 34. Lessons learned
          • Working with a default XOT template
          • Limitations of the template
          • Use ‘reward’ to engage volunteers for user testing
          • Issues with data handling practicals:
          • Tension between data handling and data analysis in terms of actual content
          • Ensuring clear explanation of software specific jargon/technical terminology/process-specific language
          • Retaining consistency across four data handling practicals in terms of look and feel, flow, content, audience
    35. 35. Links
          • IASSIST: http:// /
          • Data Library services:
          • EDINA:
          • Research data management guidance pages: http:// /is/research-data-management
          • Edinburgh University data policy:
          • Policy-making for Research Data in Repositories - A Guide:
          • Edinburgh Data Audit Framework (DAF) Implementation:
          • Research data MANTRA course: http:// /mantra
    36. 36. Acknowledgements
          • Image on Flickr - Green bird by Rakka - CC BY-NC-ND 2.0
          • Image on Flickr - green butterfly abstract with a heart! by David Gunter - CC BY-NC-SA 2.0
          • Image on Flickr - abstract vector background by Vectorportal - CC BY 2.0
          • IASSIST images courtesy of IASSIST website
          • Image on Flickr - Magpie by you get the picture - CC BY-NC-SA 2.0
          • Image on Flickr by Philippe Put - CC BY-ND
          • Image on Flickr - Apple retro by kyz - CC BY 2.0
          • Image on Flickr by FindYourSearch - CC BY-SA 2.0
          • Image on Flickr by miuenski - CC BY-NC-SA 2.0
          • Image on Flickr - DNA by Mark Cummins - CC BY-SA 2.0
          • Image on Flickr by Melissa Venable - CC BY-NC-SA 2.0
          • Image on Flickr by victorfe - CC BY-NC-SA 2.0
          • Image on Flickr by RRCA_again - CC BY-NC-SA 2.0
          • Image on Flickr by Jason Verwey - CC BY-NC-SA 2.0
          • Image on Flickr by laszlo-photo - CC BY 2.0
          • Image on Flickr by Malkav - CC BY 2.0
          • Image on Flickr by M. M. Alvarez, T. Shinbrot, F. J. Muzzio, Rutgers University, Center for Structured Organic Composites - CC BY 2.0
          • Image on Flickr by - CC BY-NC-SA 2.0
          • Image on Flickr by Sean McGrath - CC BY 2.0
          • Image on Flickr by ecstaticist - CC BY-NC-SA 2.0
          • Image on Flickr by libraryman - CC BY-NC-ND
          • Image on Flickr by Gideon Burton - CC BY-SA 2.0
          • Image on Flickr by mhobl - CC BY-NC 2.0
          • Image on Flickr by karen horton - CC BY-NC 2.0
          • Image on Flickr by incurable_hippy - CC BY-NC 2.0
          THANK YOU! [email_address]