• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
RDM LIASA webinar
 

RDM LIASA webinar

on

  • 2,592 views

Presentation given by Sarah Jones and Joy Davidson to a group of South African librarians at a webinar organised by LIASA HELIG. http://www.liasa.org.za/node/977

Presentation given by Sarah Jones and Joy Davidson to a group of South African librarians at a webinar organised by LIASA HELIG. http://www.liasa.org.za/node/977

Statistics

Views

Total Views
2,592
Views on SlideShare
2,450
Embed Views
142

Actions

Likes
1
Downloads
30
Comments
0

2 Embeds 142

http://www.scoop.it 137
https://twitter.com 5

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

CC Attribution-NonCommercial-NoDerivs LicenseCC Attribution-NonCommercial-NoDerivs LicenseCC Attribution-NonCommercial-NoDerivs License

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • Data storageoccure during both the active phase of research and for longer-term preservation. If there isn’t a data repository in your institution, check to see if there are any external subject based repositories that might be a suitable home. BUT REMEMER! If you are planning to deposit data in a repository, check repository policies on the formats that are accepted before you begin. Make sure that any normalisation procedures will not affect the usability of the data.

RDM LIASA webinar RDM LIASA webinar Presentation Transcript

  • Digital curation: why managing andsharing data matters to universitiesSarah Jones and Joy DavidsonDigital Curation Centresarah.jones@glasgow.ac.ukjoy.davidson@glasgow.ac.uka LIASA HELIG webinar, 30th April 2013, www.liasa.org.za/node/977
  • Digital Curation CentreJisc-funded consortium comprising units from the– Universities of Bath (UKOLN)– Edinburgh (DCC Centre)– Glasgow (HATII)Launched 1st March 2004 as a national centre for solving challengesin digital curation that could not be tackled by any single institution ordiscipline
  • Overview of session: four brief modules1. Introduction to digital curation – how does researchdata management fit into the curation lifecycle?2. Benefits and drivers for research data management3. Review of current research data managementactivity in UK Universities4. What role does the library have to play in researchdata management?
  • Please feel free to ask questions at anytime!• During the session you can ask questions.Simply type these into the chat box.• Questions will be gathered and speakers willrespond to selected questions at the end ofeach module.• There will be a chance for additional questionsat the end of the session.
  • DIGITAL CURATION, PRESERVATIONAND RESEARCH DATA MANAGEMENT– AN INTRODUCTION
  • An introduction to digital curation• What is digital curation?• What is the difference betweencuration, preservation and datamanagement?• What sort of activities are involved indigital curation?• Who should be involved in digitalcuration? 6
  • “the active management and appraisal of dataover the lifecycle of scholarly and scientific interest”Data have importance as the evidential baseof scholarly conclusionsCuration is part of good research practiceWhat is data curation?
  • Are data curation, preservation andmanagement different?• Lots of different terms being used - are thethey same or different?• Essentially, they are all part of the curationlifecycle
  • Curation Lifecycle Model
  • Key questions to consider:• what data will be created?• how much storage is needed?• where will data be stored in the short and longer term?• are there ethical issues that require consent?Many funders expect data management & sharing plans at thegrant application stage!Data Management Planning
  • Key questions to consider:What information do users need to understand the data?- descriptions of all variables / fields and their values- code labels, classification schema, abbreviations list- information about the project and data creators- tips on usage e.g. exceptions, quirks, questionable resultsHow will this capture this and who will capture/record it?Are there standards that need to be followed?Metadata & documentation
  • Key questions to consider:• What data must be kept? (for validation, etc)• What must not be kept? (e.g. personal data)• Is it worth keeping the data? – cost/benefits• Where will the data be kept?Selecting what to keep
  • Storing dataKey questions to consider:What amount of storage is available for theactive phase?What facilities are needed in the active phase?- remote access to work from home- file sharing with others- high-levels of security for sensitive dataHow will the data be backed up?Where will data be stored for the longer-term?
  • Institutional data repositoriesNot intended toreplacenational, subject orother establisheddata collectionsAcknowledge hybridenvironmenthttp://datashare.is.ed.ac.ukwww.dspace.cam.ac.uk/https://databank.ora.ox.ac.ukEssex-RDR andDataPool at Southampton
  • External data centresResearch funders’data centres…List of data centres:http://databib.orgStructured databasesDisciplinary&communityinitiatives
  • Finding and reusing dataKey questions to consider:How can researchers make theirdata visible and citable?
  • Data cataloguesDevelop a research dataextension to the cerif standardJISC & DCC planningNational coordinationhttp://cerif4datasets.wordpress.com
  • Who should be involved in curation?ResearchOrganisationsFundersData centresAdvisorybodiesSupport servicesResearchersPublishers
  • BENEFITS AND DRIVERS– THE UK POLICY LANDSCAPE
  • “Data sets arebecoming thenew instrumentsof science”Dan Atkins, University of Michigan
  • Digital data asthe new specialcollections?Sayeed Choudhury, Johns Hopkins
  • Research data:institutionalcrown jewels?http://www.flickr.com/photos/lifes__too_short__to__drink__cheap__wine/4754234186/
  • Expectations of public access“Publicly funded research data are a publicgood, produced in the public interest, which shouldbe made openly available with as few restrictions aspossible in a timely and responsible manner thatdoes not harm intellectual property.”RCUK Common Principles on Data Policyhttp://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx
  • 24http://www.bis.gov.uk/innovatingforgrowth…open data
  • ...personal data
  • Benefits of data sharing (1)www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0“It was unbelievable. Its not sciencethe way most of us have practiced inour careers. But we all realised thatwe would never get biomarkersunless all of us parked our egos andintellectual property noses outsidethe door and agreed that all of ourdata would be publicimmediately.”Dr John Trojanowski, University of Pennsylvania... scientific breakthroughs
  • Benefits of data sharing (2)www.guardian.co.uk/politics/2013/apr/18/uncovered-error-george-osborne-austerity... validation of results“It was a mistake in a spreadsheet that could havebeen easily overlooked: a few rows left out of anequation to average the values in a column.The spreadsheet was used to draw the conclusionof an influential 2010 economics paper: that publicdebt of more than 90% of GDP slows down growth.This conclusion was later cited by the InternationalMonetary Fund and the UK Treasury to justifyprogrammes of austerity that have arguably led toriots, poverty and lost jobs.”
  • Benefits of data sharing (3)“There is evidence that studies that make theirdata available do indeed receive more citationsthan similar studies that do not.”Piwowar H. and Vision T.J 2013 "Data reuse and the open datacitation advantage“ https://peerj.com/preprints/1.pdf9% - 30% increase... more citations
  • Why YOU need a DataManagement Planhttp://blogs.ch.cam.ac.uk/pmr/2011/08/01/why-you-need-a-data-management-planDirect benefits to individuals
  • “Research organisations will ensure that effectivedata curation is provided throughout the full datalifecycle, with ‘data curation’ and ‘data lifecycle’ beingas defined by the Digital Curation Centre. The fullrange of responsibilities associated with data curationover the data lifecycle will be clearly allocated...”www.epsrc.ac.uk/about/standards/researchdata/Pages/expectations.aspx...institutional responsibility
  • Research funder data policieswww.dcc.ac.uk/resources/policy-and-legal/ overview-funders-data-policies
  • Ultimately funders expect:• timely release of data- once patents are filed or on (acceptance for) publication• open data sharing- minimal or no restrictions if possible• preservation of data- typically 5-10+ years if of long-term valueSee the RCUK Common Principles on Data Policy:www.rcuk.ac.uk/research/Pages/DataPolicy.aspx
  • Jisc MRD programmesManaging Research Data programmes funded by the Jisc:• MRD 01: October 2009 – July 2011– £4.3 million investment– www.jisc.ac.uk/whatwedo/programmes/mrd.aspx• MRD 02 – October 2011 – July 2013– £4.6 million investment– www.jisc.ac.uk/whatwedo/programmes/di_researchmanagement/managingresearchdata.aspxProgramme Manager: Simon Hodson s.hodson@jisc.ac.ukTwitter: #jiscmrd
  • The DCC Mission“Helping to buildcapacity, capability andskills in data managementand curation across theUK’s higher educationresearch community”Phase 3 Business Planwww.dcc.ac.uk
  • DCC Institutional EngagementsWith funding from HEFCE we’re:• Working intensively with 21 HEIs to increase RDM capability– 60 days of effort per HEI drawn from a mix of DCC staff– Deploy DCC & external tools, new approaches & best practice• Support varies based on what each institution wants/needs• Lessons & examples will be shared with the communitywww.dcc.ac.uk/community/institutional-engagements
  • Some unis we are working with
  • Common DCC IE activities• Establishing steering groups• Making the case for RDM• Assessing needs• Developing policy and strategy• Piloting tools• Offering DMP consultations• Delivering training• Setting up guidance websites• ...
  • CURRENT RDM INITIATIVES IN UKUNIVERSITIES
  • How to develop RDM servicesGuide and case studies: www.dcc.ac.uk/resources/developing-rdm-services
  • Components of a research data service
  • Institutional RDM policieswww.dcc.ac.uk/resources/policy-and-legal/institutional-data-policies
  • Early research data policies“Statement of commitment” Infrastructure  policy“10 commandments”mutual promisesaspirationalBaseline of RCUK Code+ procedures & supportlegal tone / languagea section in uni DM policyuseful guide as appendixBased on Edin.with a fewadditions
  • RDM strategies and roadmapsA series of blog postswww.dcc.ac.uk/newsLinks to example roadmapshttp://tiny.cc/EPSRCroadmaps
  • University of Bath RDM roadmap• Based on Monash University RDM strategy• Identifies the current position and proposes activity• Defines roles and responsibilities and timeframeshttp://www.bath.ac.uk/rdso/University-of-Bath-Roadmap-for-EPSRC.pdf
  • Guidance webpageswww.gla.ac.uk/datamanagementwww.bath.ac.uk/research/data
  • Disciplinary RDM trainingwww.dcc.ac.uk/training/train-trainer/disciplinary-rdm-training
  • Online training for PhD studentshttp://datalib.edina.ac.uk/mantra
  • Data Management Planning support• Guidelines / templates on what to include in plans• Example answers, guidance and links to local support• A library of successful DMPs to reuse• Tailored consultancy services• Online tools (e.g. customised DMPonline)• Links / flags embedded in grant systems• ...
  • Research data storageBlue Peta at Bristol1st 5TB free per Data Steward then£400 per TB p.a. for disk storage;tape backup £40 per TBhttp://data.bris.ac.uk• £2m funding to date• Petascale facility – expandable• 3 machine rooms – resilience(tape archive 2012)• Available to all researchers forresearch data
  • Institutional data repositoriesNot intended toreplacenational, subject orother establisheddata repositoriesAcknowledge hybridenvironmenthttp://datashare.is.ed.ac.ukwww.dspace.cam.ac.ukhttps://databank.ora.ox.ac.ukResearch Data at Essex andDataPool at Southampton
  • Data cataloguesDevelop a research dataextension to the cerif standardJISC & DCC planningNational coordinationhttp://cerif4datasets.wordpress.com
  • Bringing it all together into a serviceDiagram courtesy of Sally Rumsey, University of Oxford
  • THE ROLE OF THE LIBRARY– RE-SKILLING FOR DATA CURATION
  • How are libraries engaging in RDM?LibraryITResearchOfficeThe library is leading on most DCC institutional engagementswww.dcc.ac.uk/community/institutional-engagementsThey are involved in: defining the institutional strategy developing RDM policy delivering training courses helping researchers to write DMPs advising on data sharing and citation setting up data repositories ...
  • Why should libraries support RDM?• existing data and open access leadership roles• often run publication repositories• have good relationships with researchers• proven liaison and negotiation skills• knowledge of information management, metadata...• highly relevant skill set
  • Possible Library RDM roles• Leading on local (institutional) data policy• Bringing data into undergraduate research-based learning• Teaching data literacy to postgraduate students• Developing researcher data awareness• Providing advice, e.g. on writing DMPs or advice on RDM within a project• Explaining the impact of sharing data, and how to cite data• Signposting who in the Uni to consult in relation to a particular question• Auditing to identify data sets for archiving or RDM needs• Developing and managing access to data collections• Documenting what datasets an institution has• Developing local data curation capacity• Promoting data reuse by making known what is availableRDMRose Lite
  • Training for librarians• RDM for librarians, DCChttp://www.dcc.ac.uk/training/rdm-librarians• RDMRose, University of Sheffieldhttp://rdmrose.group.shef.ac.uk• Data Intelligence for librarians, 3TU, Netherlandshttp://dataintelligence.3tu.nl/en/about-the-course• DIY Training Kit for Librarians, University of Edinburghhttp://datalib.edina.ac.uk/mantra/libtraining.html• SupportDM modules, University of East Londonhttp://www.uel.ac.uk/trad/outputs/resources
  • RDM for Librarians• 3 hour course by the DCC covering:– Research data and RDM– Data management planning– Data sharing– Skills– RDM at [INSERT YOUR UNI]• Slides and accompanying handbook• Used UKDA guide as pre-reading• http://www.dcc.ac.uk/training/rdm-librarians
  • RDMRose• Taught and CPD learning materials in RDM tailoredfor information professionals, by the Uni of Sheffield• 8 sessions, each of which is half day of study• Strong emphasis on practical hands-on activities• Also offer a short (2hr) course called RDMRose Lite• http://rdmrose.group.shef.ac.uk
  • Data Intelligence for Librarians• A course produced by 3TU, a consortium of technicaluniversities in the Netherlands• Combination of online and face-to-face education• Four meetings to learn and share knowledge• Theory (on website) and assignments are conductedbetween sessions• http://dataintelligence.3tu.nl/en/home
  • DIY Training Kit for Librarians• By EDINA and Data Library at University of Edinburgh• Self-directed course, intended to be used by a group oflibrarians to build confidence in supporting researchers• MANTRA modules as pre-reading, shortpresentation, reflective questions and exercises to guidediscussion• Five face-to-face sessions– Data Management Planning– Organising and documenting data– Data security and storage– Ethics and copyright– Data sharing
  • SupportDM• By the TraD project at the University of East London• SupportDM comprises five sessions– About research data management– Providing guidance and support for researchers– Data Management Planning– Selecting which data to keep– Cataloguing and sharing data• Each topic is introduced in a face-to-face session andexplored via exercises and discussion• Learning is reinforced via an online tutorial and practicalexercises to do before the next session• http://www.uel.ac.uk/trad/outputs/resource
  • Thanks – any questions?DCC guidance, tools and case studies:www.dcc.ac.uk/resourcesFollow us on twitter:@digitalcuration and #ukdcc