• Like
1630 mon lomond ashley
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

Published

 

Published in Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
511
On SlideShare
0
From Embeds
0
Number of Embeds
0

Actions

Shares
Downloads
7
Comments
0
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide
  • \\not just about opennness – think of seismic, drug industry. Protected data, but needs to be reused in other parts of company, or many years after creation when originators have gone. Need to know what you have & how to use it.

Transcript

  • 1. Because good research needs good data On data (and publications) – who does what? Kevin Ashley With thanks to•High Heid Yin, DCC Director, Liz Lyon director@dcc.ac.uk Director, UKOLN Funded by: CC-BY © Digital Curation Centre, 2009. Licensed under Creative Commons BY-NC-SA 2.5 Scotland: http://creativecommons.org/licenses/by-nc-sa/2.5/scotland/
  • 2. Because good research needs good data “Data is the new oil.” Andreas Weigend, Stanford (ex Amazon) “The future belongs to companies and people that turn data into products” Mike Loukides, O’Reilly Media2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 2
  • 3. Because good research needs good data Overview • Why should we care ? • Things you could do • How you might get there • Things to avoid2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 3
  • 4. Because good research needs good data“Information… has become a saleable commodity like never before” Yet – 33% don’t know Earth orbits the Sun (GB, 1999) Brian Aldiss – “The Secret of This Book (1995)2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 4
  • 5. Because good research needs good data What is data curation ? • “Maintaining, preserving and adding value to research data throughout its lifecycle” • More than preservation: • Active management – dealing with change • Less than preservation: • Lifecycle sometimes involves destruction • Sometimes, not always, about sharing, publication or citation2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 5
  • 6. Because good research needs good data Why care? • Data is expensive – an investment • Reuse: • More research • Teaching & Learning • Planning • Impact – with or without publication • Accountability • Legal & regulatory requirements2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 6
  • 7. Because good research needs good data Without good RDM – BAD THINGS HAPPEN With good RDM – GOOD STUFF HAPPENS2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 7
  • 8. Because good research needs good data EPSRC expects all those institutions it funds •to develop a roadmap that aligns … with EPSRC’s expectations by 1st May 2012; •to be fully compliant … by 1st May 2015. http://www.epsrc.ac.uk/about/standards/researchdata/Pages/expectations.aspx2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 8
  • 9. Because good research needs good data • Awareness of regulatory environment • Data access statement • Policies and processes • Data storage • Structured metadata descriptions • DOIs for data • Securely preserved for a minimum of 10 years from last use2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 9
  • 10. Because good research needs good data “Data is the new oil.” Andreas Weigend, Stanford (ex Amazon) Data is more like soup – its messy and you don’t know what’s in it….2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 10
  • 11. Because good research needs good data2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 11
  • 12. Presentation services: subject, media-specific, data, commercial portals Because good research needs good data Searching , harvesting, Resource embedding Resource Data creation / discovery, linking, discovery, capture / embedding linking, gathering: embedding laboratory experiments, Data analysis, Aggregator Learning object Grids, transformation, services: national, creation, re-use fieldwork, mining, modelling commercial surveys, media Harvesting metadata Learning & Research & Teaching e-Science workflows workflows Repositories : institutional, Institutional e-prints, subject, presentation data, learning objects services: portals, Learning Deposit / self- Deposit / self- Management Validation Systems, u/g, p/g archiving archiving courses, modules Publication Resource discovery, linking, ValidationThe scholarly knowledge cycle. embeddingLiz Lyon, Ariadne, July 2003. Peer-reviewed publications: journals, © Liz Lyon (UKOLN, University of Bath), 2005 Quality assurance bodies conference proceedingsThis work is licensed under a Creative Commons License2012-03-26 Attribution-ShareAlikeKevin 2.0 Ashley, DCC, UKSG Glasgow. CC-BY 12
  • 13. Because good research needs good data (e)-Research Life Cycle view of Data Curation? Formulate hypothesis / ideas, test, (New) knowledge Data processing experiment, observe: data creation, extraction: data collection & capture mining, modelling, analysis, synthesis Data processing Data processing Data management e-Infrastructure storage & validation: Adding value: Data description, deposit, linking, annotation, Open access self-archiving, visualisation, simulation Collaboration preservation, certification Data processing Data processing Scholarly communications: data disclosure, publication, citation, discovery, re-useThis work is licensed under a Creative Commons License Attribution-ShareAlike 2.0 •Liz Lyon December 20052012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 13
  • 14. Because good research needs good data Chris Rusbridge, DCC2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 14
  • 15. Because good research needs good data OAIS2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 15
  • 16. Because good research needs good data MoReq2Model Requirements for Electronic Records Management 2• Records Management Discipline• No mention of DATA• Simple to explain• Easily used to organise and present resources2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 16
  • 17. Because good research needs good data E-Science Curation Report - 2003 • E-science discipline • Appropriate for current focus • Takes integrated look at higher education data curation problems • Granularity on curation activities?2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 17
  • 18. Because good research needs good data InterPARES - 20012012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 18
  • 19. Because good research needs good data2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 19
  • 20. Because good research needs good data RLUK/Mary Auckland: Reskilling for Research 9 areas are skill gaps for subject librariansSheila Corrall: Libraries,Librarians and DataMany action exemplars2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 20
  • 21. Because good research needs good data Some library roles • Leadership – coordinate action • Audit – who has what, where does it go? • Advice on access – data, wherever it is • Preservation – permanance • Citability • Data/publication linking • Promoting data in teaching2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 21
  • 22. Understanding Data Requirements Because good research needs good datahttp://www.dcc.ac.uk/2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 22
  • 23. Because good research needs good dataData management plans2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 23
  • 24. What data to keep Because good research needs good dataHow to cite data2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 24
  • 25. Because good research needs good data Data Licensing • Bespoke licences • Standard licences • Multiple licensing • Licence mechanisms2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 25
  • 26. Because good research needs good data Tools to track impact http://total-impact.org/2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 26
  • 27. Because good research needs good data Findable, citable data has value • Important to link publications to data (and vice versa) • Increases citations – of data & publication • Increases reuse (hence value) • But effects exist even without publication • All benefit – researcher; institution; publisher MORAL: build a data registry2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 27
  • 28. Because good research needs good data How? • Create policy – collaborate with others • Develop existing digital services • Learn about audit tools (DCC & others) • Learn about data & sources • Reskill subject librarians • Learn about your own data • Bridge between publishers & researchers2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 28
  • 29. Because good research needs good data 4. Audit/AssessmentBenefits:Prioritisation of resourcesCapacity development and planningEfficiency savings – move data to more cost-effective storageManage risks associated with data lossRealise value through improved access & re-useScale: Dealing with Data: Rec 4Departments, institutions2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 29
  • 30. Because good research needs good data How? • Create policy – collaborate with others • Develop existing digital services • Learn about audit tools (DCC & others) • Learn about data & sources • Reskill subject librarians • Learn about your own data • Bridge between publishers & researchers2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 30
  • 31. “The role of the Library in data-intensive Because good research needs good data research is important and a strategic repositioning of the Library with respect to research support is now appropriate.” UK i n enough “there are…not 5 l” specialised data librarians yet” n ly - a 8 n t ne 200 O d e ate Ju i pd “Recommendation: The research c clibraryU community in the UK aand research institutes to define ilip “ ? C should work with universities ? properly and to formalise the role of data librarians, and to develop a curriculum that ensures a suitable supply of librarians skilled in data handling.” Dealing with Data : Rec 342012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 31
  • 32. Because good research needs good data How? • Create policy – collaborate with others • Develop existing digital services • Learn about audit tools (DCC & others) • Learn about data & sources • Reskill subject librarians • Learn about your own data • Help promote data literacy • Bridge between publishers & researchers2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 32
  • 33. Because good research needs good data2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 33
  • 34. Because good research needs good data Observations • Role for national & institutional differs • BUILD on existing subject data centers • Datasets aren’t publications • Indistinct boundaries • Continual change • Multi-dimensional • Non-linear2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 34
  • 35. Because good research needs good data “Institutions will try to preserve the problem(s) to which they are the solution” Clay Shirky2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 35
  • 36. Because good research needs good data Original Source Data Data Object A Data Object B Data Object D Data Object C Publication A Publication B Publication C Publication D2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 36
  • 37. Because good research needs good data On Citing Data • Peter Buneman. How to cite curated databases and how to make them citable. In Proceedings of the 18th Conference on Scientific and Statistical Database Management, pages 195-203, July 2006 [or http://homepages.inf.ed.ac.uk/opb/papers/ssdbm2006.pdf ] • Some serious computer science – some for a very general audience2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 37
  • 38. Because good research needs good data Summary • Data not just adjunct to publication • Data is often living – treat it as such (and be ready to kill it) • There’s more to the world than scholarly research • Hidden data is wasted data • Bad things happen without RDM • Great benefits accrue with it2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 38
  • 39. Because good research needs good data Questions • How does data management align with institutional mission? • When is library a coordinator, and when is it a service provider? • What will you do alone, and what will you coordinate with others? • What skills must you acquire? • What do you want from DCC?2012-03-26 Kevin Ashley, DCC, UKSG Glasgow. CC-BY 39