• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Research data: burden or treasure? (Talk from #fote13)
 

Research data: burden or treasure? (Talk from #fote13)

on

  • 1,162 views

A talk at #fote13 (fote-conference.com) about why we should *all* - as taxpayers - care about reuse of research data

A talk at #fote13 (fote-conference.com) about why we should *all* - as taxpayers - care about reuse of research data

Statistics

Views

Total Views
1,162
Views on SlideShare
1,084
Embed Views
78

Actions

Likes
0
Downloads
3
Comments
0

2 Embeds 78

https://twitter.com 77
http://hayleyofatkinson.wordpress.com 1

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

CC Attribution License

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • Those external pressures include those from funders such as EPSRC. Looming deadlines this year and in 2015 got the attention of senior university management.
  • The expectations that universities need to sign up are listed here – their roadmaps need to demonstrate how they are going to deliver on these expectations by 2015. They include a commitment to keep data for 10 years after its last use – note, not just after the project ends. Some worry that this means they need to keep data for 100 years. I say that if your data is still being used (and cited) 100 years later you should break out the champagne, not worry about paying for it.

Research data: burden or treasure? (Talk from #fote13) Research data: burden or treasure? (Talk from #fote13) Presentation Transcript

  • Research data: burden or treasure? Kevin Ashley Digital Curation Centre www.dcc.ac.uk @kevingashley Kevin.ashley@ed.ac.uk Reusable with attribution: CC-BY The DCC is supported by Jisc & FP7
  • 164 universities in UK* *2011 HESA data 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 2 71 (43%) > 5% research income 115 (70%) > £1m income from research
  • £4.4 billion total research grants 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 3
  • Funders are making demands 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 4
  • 2013-10-11 Kevin Ashley – FOTiE 2013 - CC- BY 5 http://www.epsrc.ac.uk/about/standards/researchdata/Pages/expectations.aspx EPSRC expects all those institutions it funds to develop a roadmap that aligns … with EPSRC’s expectations by 1st May 2012; to be fully compliant … by 1st May 2015.
  • 2012-06-15 Kevin Ashley, DCC; IRWM12, ULCC; CC-BY 6 • Awareness of regulatory environment • Data access statement • Policies and processes • Data storage • Structured metadata descriptions • DOIs for data • Securely preserved for a minimum of 10 years from last use
  • How much data do we have? • Edinburgh – provision for 5 Petabytes • Oxford – guessing 3Pb/year • For comparison – LHC @ CERN – 15 Pb/year • £2m investment in storage not unusual 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 7
  • The Data Deluge is upon us 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 8 Sensor’s ability to produce data outstrips IT’s ability to process it
  • Research Data Centres – the solution! 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 9 MANY AREAS OF RESEARCH HAVE NO DATA CENTRE TO SERVE THEM
  • Cloud – sorted! • Sorry, but it isn’t. • See David Rosenthal’s analysis of the economics of Amazon for preservation “Distributed digital preservation in the cloud” IJDC 8(1), 2013 doi:10.2218/ijdc.v8i1.248 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 10
  • Cost of data for 100 years – local vs Amazon S3 Data from blog.dshr.org/2013/01/talk-at-idcc2013.html © David Rosenthal, used under CC-BY-SA licence 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 11
  • Cost of data for 100 years – local vs Amazon S3 AND Glacier Data from blog.dshr.org/2013/01/talk-at-idcc2013.html © David Rosenthal, used under CC-BY-SA licence 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 12
  • That looks like a problem • Funder requirements exist for a reason: – That data is valuable • Value to funder, society from reuse • Value to the institution is there also 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 13 BIS business case: £1.5m investment in research data services pays back 2.5 times after 5 years
  • Integrity • Not everyone publishes here • Almost all fraud connected to unavailable data • People suffer & die due to research fraud • When your research is reproducible – it gets cited 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 14
  • Citability • Making data available increases citations • Everyone – academic, funder, institution – loves citations • Want evidence? – Alter, Pienta, Lyle – 240%, social sciences * – Piwowar, Vision – 9% (microarray data)† – Henneken, Accomazzi – 20% (astronomy) # 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 15 † Piwowar H, Vision TJ. (2013) Data reuse & the open data citation advantage. PeerJ PrePrints 1:e1v1 http://dx.doi.org/10.7287/peerj.preprints.1v1 * Amy Pienta, George Alter, Jared Lyle, (2010) The Enduring Value of Social Science Research: The Use and Reuse of Primary Research Data. http://hdl.handle.net/2027.42/78307 # Edwin Henneken, Alberto Accomazzi, (2011) Linking to Data - Effect on Citation Rates in Astronomy. http://arxiv.org/abs/1111.3618
  • Value in the institution • New research depends on the old – well managed data resources like well-equipped labs • Teaching more effective when real data from research is used 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 16
  • Wherever it is, it has valueWant a 400% -> 1200% return on your investment? 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 17 Try BADC! http://www.jisc.ac.uk/whatwedo/programmes/di_directions/strategicdirections/badc.aspx
  • Commercial services 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 18
  • Can we find it? • Data must be discoverable to be reused • Alone, or in conjunction with publication • Institutional catalogues, national data registries – JISC is piloting through DCC 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 19
  • 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 20
  • 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 21
  • Jisc – through DCC – can help 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 22
  • http://dataintelligence.3tu.nl/en/home/ Choice of RDM training materials for librarians Up-skilling for data http://datalib.edina.ac.uk/mantra/libtraining.html 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 23
  • 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 24 Idea Develop Fund Plan Record Process Publish Read
  • 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 25 Idea Develop Fund Plan Record Process Publish Read Idea Develop Fund Plan Record Process Publish Read
  • Idea Develop Fund Plan Record Process Publish Read 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 26
  • Data reuse stories • The palaeontologist who saved years of work with archaeological data • The ‘noise’ from research radar that mapped dust from Eyjafjallajökull • The 19th-century logs and photographs that help us model climate change 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 27 Often your data tells stories that your publications do not
  • 3TU treasure chest 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 28
  • Thanks for your attention kevin.ashley@ed.ac.uk www.dcc.ac.uk @kevingashley 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 29
  • DCC ‘institutional engagement’ Assess needs Make the case Develop support and services RDM policy development Customised Data Management Plans DAF & CARDIO assessments Guidance and training Workflow assessment DCC support team Advocacy with senior management Institutional data catalogues Pilot RDM tools …and support policy implementation 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 30