Managing and Sharing Research Data
Upcoming SlideShare
Loading in...5
×
 

Managing and Sharing Research Data

on

  • 516 views

Keynote presentation from White Rose Perspectives on Research Data Management event, University of York, 24 May 2012

Keynote presentation from White Rose Perspectives on Research Data Management event, University of York, 24 May 2012

Statistics

Views

Total Views
516
Views on SlideShare
516
Embed Views
0

Actions

Likes
2
Downloads
3
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • The DCC developed the curation lifecycle model to explain the range of activities involved in creating, preserving and sharing digital content.In RDM terms ‘curation’ is simply managing & sharing data. The DCC argues that this is just part of good research practice.
  • for projects ofclinical or major social, environmental or heritage importance, for 20 yearsor longer
  • This is what senior university managers REALLY want to avoid. They also want to keep the government happy, and BIS want private sector to be able to build upon public sector data.
  • Point out that although funders like the ESRC have been asking for data for years (apparently since 2001 or something like that), and that until recently these requirements have been poorly enforced (very little incentive for researchers to comply - they still got their grant money!), the funders are now under external pressure to ensure that researchers do provide data. So things are changing and the old 'lax' system is being replaced by one that that is more stringent.
  • IT departments in particular tend to think of data management as primarily a hardware/technical problem. It’s not – the human side is about 80% of the problem.
  • Best place to hide a leaf is in a tree…
  • Consideration of roleswill be explored in the breakouts
  • Whole range of support and services we can offer. There are typically a few things we do with each HEI. The focus depends on what they need.
  • We’ll talk more about some of these shortly, but first we’ll give a snapshot of the current UK ‘scene’…
  • Research infrastructure depends on all of its parts, and a good data management strategyrelies upon coordination and communication for ensuring smooth and accurate interactions

Managing and Sharing Research Data Managing and Sharing Research Data Presentation Transcript

  • MANAGING AND SHARING RESEARCH DATA Martin Donnelly Digital Curation Centre University of Edinburgh White Rose Perspectives on Research Data Management event University of York, 24 May 2012
  • Running orderI. DEFINITIONSII. DRIVERSIII. SOME (IN)EQUATIONSIV. STAKEHOLDERSV. CURRENT WORK
  • I. DEFINITIONS
  • The DCC MissionHelping to build capacity, capabilityand skills in data management and curation across the UK’s higher education research community – DCC Phase 3 Business Plan
  • What is Research Data Management? “the active management and Manage appraisal of data over the lifecycle of scholarly and scientific interest” Share Data management is a part of good research practice
  • Why manage research data?• Enable reuse• Control costs• Research integrity• Research impact – Linking data and publication – Making data citable• Regulatory requirements• Maximising value RLUKRDM -20120416 - Kevin Ashley, DCC, CC-BY
  • Where is the data in research? The six datacentric phases of the research lifecycle
  • http://www.google.co.uk/imgres?q=illumina+bgi&hl=en&client=firefox- Data... a&hs=Jl2&rls=org.mozilla:en-GB:official&biw=1366&bih http://www.flickr.com/photos/think mulejunk/352387473/ http://www.flickr.com/photos/charleswelch/3597432481//http://www.flickr.com/photos/usfsregion5/4546851916// http://www.flickr.com/photos/wasp_barcode/4793484478/
  • “Research Data Management”- The phrase means different things to different people- Researchers may care enormously about their data, so much so that they worry about it going out into the world on its own- Others (e.g. those with responsibility for compliance) may worry about it not going out into the world, or going out when it shouldn’t / underdressed- Some may not even recognise the relevance of ‘data’ in what they do
  • II. DRIVERS
  • The data deluge “Surfing the Tsunami” Science: 11 February 2011
  • • Public good• Preservation• Discovery• Confidentiality• First use• Recognition• Public funding
  • RCUK Policy and Code of Conduct on theGovernance of Good Research ConductUnacceptable research conduct includes mismanagementor inadequate preservation of data and/or primarymaterials, including failure to: – keep clear and accurate records of the research procedures followed and the results obtained, including interim results; – hold records securely in paper or electronic form; – make relevant primary data and research evidence accessible to others for reasonable periods after the completion of the research: data should normally be preserved and accessible for 10 yrs (in some cases 20 yrs or longer); – manage data according to the research funder’s data policy and all relevant legislation; – wherever possible, deposit data permanently within a national collection. Responsibility for proper management and preservation of data and primary materials is shared between the researcher and the research organisation.
  • April 2011 - EPSRC Letter to VCsEPSRC expects all those institutions it funds:- to develop a roadmap that aligns their policies and processes with EPSRC’s expectations by 1st May 2012- to be fully compliant with these expectations by 1st May 2015http://www.epsrc.ac.uk/about/standards/researchdata/Pages/expectations.aspx
  • Institutional Policieshttp://www.dcc.ac.uk/resources/policy-and-legal
  • Data Access as Headline News JISC Legal
  • Government pressure…6.9 The Research Councils expect the researchers they fund to depositpublished articles or conference proceedings in an open accessrepository at or around the time of publication. But this practice isunevenly enforced. Therefore, as an immediate step, we have askedthe Research Councils to ensure the researchers they fund fulfil thecurrent requirements. Additionally, the Research Councils have nowagreed to invest £2 million in the development, by 2013, of a UK‘Gateway to Research’. In the first instance this will allow ready accessto Research Council funded research information and related data butit will be designed so that it can also include research funded by othersin due course. The Research Councils will work with their partners andusers to ensure information is presented in a readily reusableform, using common formats and open standards.http://www.bis.gov.uk/assets/biscore/innovation/docs/i/11-1387-innovation-and-research-strategy-for-growth.pdf
  • III. SOME (IN)EQUATIONS
  • STORAGE ≠MANAGEMENT
  • Greenhouse = storageHorticulture = management
  • MANAGEMENT ≠ SHARING
  • “Data sharing was“While many researchers are more readilypositive about sharing data indiscussed byprinciple, they are almost early careeruniversally reluctant in researchers.”practice. ..... using thesedata to publish results beforeanyone else is theprimary way of gainingprestige in nearly alldisciplines.” INCREMENTAL Project
  • Rule 1. Don’t Keep It All
  • Rule 2. Don’t Share It All• Data Protection Act• Ethical concerns• Commercial interests
  • Open to all? Case studies of openness in researchChoices are made according to context, with degreesof openness reached according to:• The kinds of data to be made available• The stage in the research process• The groups to whom data will be made available• On what terms and conditions it will be providedDefault position of most:• YES to protocols, software, analysis tools, methods and techniques• NO to making research data content freely available to everyone Angus Whyte, RIN/NESTA, 2010
  • Regulation
  • IV. STAKEHOLDERS
  • “The ability to take data -tobe able to understand it, toprocess it, to extract valuefrom it, to visualise it, tocommunicate it -that’sgoing to be a hugelyimportant skill in the nextdecades.” Hal Varian, Chief Economist, Google
  • Implications of “Big Data” and data science for organisations in all sectors Predicts a shortage of 190,000 data scientists by 2019http://www.mckinsey.com/Insights/MGI/Research/Technology_and_Innovation/Big_data_The_next_frontier_for_innovation
  • Position LocationScience Data Librarian StanfordData Management Librarian Oregon StateSocial Sciences Data Librarian BrownData Curation Librarian NortheasternData Librarian New South WalesResearch Data Management SydneyCo-ordinatorResearch Data & Digital CambridgeCuration OfficerData Services Librarian IowaData Analyst ANDSInstitutional Data Scientist Bath
  • Data roles1. Director IS/CIO/University Librarian2. Data librarians /data scientist /liaison/subject/faculty librarians3. Repository managers4. IT/Computing Services5. Research Support/Innovation Office6. Doctoral Training Centres7. PVC Research + Public Engagement Office Liz Lyon, Informatics Transform, IJDC Current Issue, 2012
  • V. CURRENT WORK
  • DCC institutional stakeholders University managers Researchers Research support staff • University library / repository with a role to play in data • IT services management, particularly • Research and innovation those from • Etc
  • Institutional EngagementsWith funding from HEFCE we’re:• Working intensively with 18 HEIs to increase RDM capability – 60 days of effort per HEI drawn from a mix of DCC staff – Deploy DCC and external tools, approaches and best practice• Support varies based on what each institution wants/needs – Institution agrees a schedule of work with the DCC, and each assigns a primary contact / programme manager• Lessons and examples to be shared with the community www.dcc.ac.uk/community/institutional-engagements
  • Some current IE activities Assessing needs Piloting tools e.g. DataFlow RDM roadmaps Policy Policydevelopment implementation
  • Support offered by the DCC InstitutionalAssess data cataloguesneeds Workflow assessment Pilot RDM tools Develop DAF & CARDIO DCC assessments Guidance and support support team training and services RDM policy Advocacy to senior development management Customised Data Make the case Management Plans …and support policy implementation
  • To sum...ALL THE PIECES MATTER
  • CREDITSImages: Slide 3 – http://www.flickr.com/photos/dougbelshaw/ Slide 9 – http://www.flickr.com/photos/chaparral/ Slide 10 – http://www.flickr.com/photos/rpmarks/ Slide 19 – http://www.flickr.com/photos/billburris/ Slide 21 – http://www.flickr.com/photos/mykl/ Slide 28 – http://www.flickr.com/photos/mugley/ Slide 33 – http://www.flickr.com/photos/chiotsrun/ Slide 38 – http://www.treehugger.com/picture-is-worth-sum-car-parts.jpg Thanks to DCC colleagues for their slides: Kevin Ashley, Liz Lyon, Graham Pryor, Sarah Jones
  • QUESTIONS AND CONTACTSFor more information: – Visit http://www.dcc.ac.uk – Email martin.donnelly@ed.ac.uk – Twitter @mkdDCC This work is licensed under a Creative Commons Attribution 2.5 UK: Scotland License.