The Informatics Transform:Re-engineering Libraries forthe Data DecadeDr Liz Lyon, Associate Director, UK Digital Curation ...
“Data is the new oil.” Andreas Weigend, Stanford (ex Amazon)“The future belongs tocompanies and people that turndata into ...
http://www.google.co.uk/imgres?q=illumina+bgi&hl=en&client=firefox-                                       Data...         ...
Oceans:                                                                  last unmapped                                    ...
..using personaldata for research
Share your genome data?            • Buy a DTC kit            • Join a project
In a recent 2011 survey, Nature asked its readerswhether they had, or would consider, a genomeanalysis (n=1588)           ...
Consumer data…
One in every nine people on Earth is                             on Facebook30billion pieces of content are shared onFaceb...
…and conversations                     http://www.touchagency.com/free-twitter-infographic/
“Data is the new oil.” Andreas Weigend, Stanford (ex Amazon)Data is more like soup –its messy and you don’tknow what’s in ...
Kyle Machulis“DIY”Humanphysiologydata             http://www.technologyreview.com/biomedicine/37784/
“Herculean” and“Heroic”Particlephysicsdata
“Crowd-sourced”astronomy
Researchers need help tomanage their data.This is a really excitingopportunity for libraries…..
with a bit of re-engineering                               http://www.flickr.com/photos/49397559@N02/5899381202/
1. Leadership(Getting attention…)
Six reasons why you should careabout managing your research data
1. Risk: where is your data? Photo credits: Harvey Rutt http://www.ecs.soton.ac.uk/regenesis/pictures/
2. Reputation : data access, FOI
3. Quality: data gold standardhttp://www.sciencemag.org/content/334/6060/1226.full.html
4. Scale: an explosion of data     http://www.phgfoundation.org/reports/10364/“A single sequencer can now generate in a da...
5.Partnerships           Alzheimer’s Disease Neuroimaging Initiative:            a unique (open) $60M partnership between ...
6. Funding  EPSRC expects all those institutions it funds•to develop a roadmap that aligns their policiesand processes wit...
•   Awareness of regulatory environment•   Data access statement•   Policies and processes•   Data storage•   Structured m...
Sticks                                                         …and Carrotshttp://www.cartoonstock.com/lowres/csl4846l.jpg...
2. Research DataManagement services(Providing tools & support)
Understanding Data Requirementshttp://www.dcc.ac.uk/
Data management plans
• Advocacy & Training  • Informatics: disciplinary    metadata schema, standards,    formats, identifiers, ontologies  • S...
What data to keepHow to cite data
Data Licensing• Bespoke licences• Standard licences• Multiple licensing• Licence mechanisms
Tools to track impact http://total-impact.org/
Research360@Bath• Partnership  approach    • UKOLN-DCC    • Library    • IT services    • Research Support      Office    ...
Partnership approachLibrary & institutional stakeholders•Roles (7 listed)•Responsibilities•Requirements•RelationshipsLiz L...
•       Director IS/CIO/University Librarian•       Data librarians /data scientist        /liaison/subject/faculty librar...
Full mapping : Informatics Transform, Ariadne Issue 68, 2012
3. Developing data informaticscapacity & capability(Acquiring the skills….)
RLUK/Mary Auckland:                             Reskilling for Research                               9 areas are skill ga...
Skill gap                     2-5 years Now  Preserving research outputs   49%          10%  Data management & curation   ...
Pause for reflection….•     Skills shortage for data informatics?•     Reposition LIS curriculum?•     LIS entry requireme...
Play for action….1. Define core components of   data informatics  • Visualisation e.g. VisTrails  • Workflow e.g. Taverna ...
“Very few librarians are likely to have specialist scientific or medical knowledge - if you train as a research scientist ...
Play for action….1. Analyse LIS entry qualifications   & increase STEM entrants Target • Biologists • Chemists • Mathemati...
Let’s get together
Play for action….1. International Data Informatics   Working Group to explore   promotion, recognition & reward • Global a...
Position                       LocationScience Data Librarian         StanfordData Management Librarian      Oregon StateS...
Datajournalist?        Data        artist?
Implications of                                                                    “Big Data” and                         ...
“Big Data”Data scientistData Science Revealedcommunity surveyhttp://www.emc.com/collateral/about/news/emc-data-science-stu...
For a University, research data is akey element of “Big Data”.Managing research data effectivelywill give business advanta...
Data-intensive research                   •   Intelligence                   •   Decision-making                   •   Pla...
CommunityCapabilityModelFrameworkCCMF         • Research Funders         • Institutions         • Research leaders/PIs    ...
“The ability to take data-to be able to understandit, to process it, to extractvalue from it, to visualiseit, to communica...
Libraries are on a data journey-the Informatics Transform isthe first step in a newdirection…
Thank you!Informatics Transform article (in press)http://ariadne.ac.uk/issue68/lyon   use details:Slideshttp://www.ukoln.a...
Upcoming SlideShare
Loading in …5
×

Informatics Transform : Re-engineering Libraries for the Data Decade

1,449 views
1,286 views

Published on

VALA2012, Melbourne, Australia

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
1,449
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
13
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Informatics Transform : Re-engineering Libraries for the Data Decade

  1. 1. The Informatics Transform:Re-engineering Libraries forthe Data DecadeDr Liz Lyon, Associate Director, UK Digital Curation CentreDirector, UKOLN, University of Bath, UKVALA2012, Melbourne, Australia This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0 UKOLN is supported by: www.ukoln.ac.uk A centre of expertise in digital information management
  2. 2. “Data is the new oil.” Andreas Weigend, Stanford (ex Amazon)“The future belongs tocompanies and people that turndata into products”Mike Loukides, O’Reilly Media
  3. 3. http://www.google.co.uk/imgres?q=illumina+bgi&hl=en&client=firefox- Data... a&hs=Jl2&rls=org.mozilla:en-GB:official&biw=1366&bih http://www.flickr.com/photos/think mulejunk/352387473/ http://www.flickr.com/photos/charleswelch/3597432481//http://www.flickr.com/photos/usfsregion5/4546851916// http://www.flickr.com/photos/wasp_barcode/4793484478/
  4. 4. Oceans: last unmapped frontier?http://www.wired.com/wiredscience/2011/09/ocean-sensor-network/ http://bohemianadventures.blogspot.com.au/2010/06/bering-sea- day-1-dutch-harbor.html
  5. 5. ..using personaldata for research
  6. 6. Share your genome data? • Buy a DTC kit • Join a project
  7. 7. In a recent 2011 survey, Nature asked its readerswhether they had, or would consider, a genomeanalysis (n=1588) Have not/would not 15% Not sure Would if 13% given the 54% opportunity 18% Have had genome analysis
  8. 8. Consumer data…
  9. 9. One in every nine people on Earth is on Facebook30billion pieces of content are shared onFacebook each monthPeople upload 3000images to Flickr every minuteGoogle+ has > 25million users From 20 Social Media Statistics (Jeffbullas)
  10. 10. …and conversations http://www.touchagency.com/free-twitter-infographic/
  11. 11. “Data is the new oil.” Andreas Weigend, Stanford (ex Amazon)Data is more like soup –its messy and you don’tknow what’s in it….
  12. 12. Kyle Machulis“DIY”Humanphysiologydata http://www.technologyreview.com/biomedicine/37784/
  13. 13. “Herculean” and“Heroic”Particlephysicsdata
  14. 14. “Crowd-sourced”astronomy
  15. 15. Researchers need help tomanage their data.This is a really excitingopportunity for libraries…..
  16. 16. with a bit of re-engineering http://www.flickr.com/photos/49397559@N02/5899381202/
  17. 17. 1. Leadership(Getting attention…)
  18. 18. Six reasons why you should careabout managing your research data
  19. 19. 1. Risk: where is your data? Photo credits: Harvey Rutt http://www.ecs.soton.ac.uk/regenesis/pictures/
  20. 20. 2. Reputation : data access, FOI
  21. 21. 3. Quality: data gold standardhttp://www.sciencemag.org/content/334/6060/1226.full.html
  22. 22. 4. Scale: an explosion of data http://www.phgfoundation.org/reports/10364/“A single sequencer can now generate in a day what it took 10years to collect for the Human Genome Project”
  23. 23. 5.Partnerships Alzheimer’s Disease Neuroimaging Initiative: a unique (open) $60M partnership between NIH, FDA, universities and drug companies.“It was unbelievable. Its not science the way most of us have practiced in our careers. But we all realised that we would never getbiomarkers unless all of us parked our egos and intellectual property noses outside the door and agreed that all of our data would be public immediately.” Dr John Trojanowski, University of Pennsylvania
  24. 24. 6. Funding EPSRC expects all those institutions it funds•to develop a roadmap that aligns their policiesand processes with EPSRC’s expectations by1st May 2012;•to be fully compliant with these expectationsby 1st May 2015.http://www.epsrc.ac.uk/about/standards/researchdata/Pages/expectations.aspx
  25. 25. • Awareness of regulatory environment• Data access statement• Policies and processes• Data storage• Structured metadata descriptions• DOIs for data• Securely preserved for a minimum of 10 years
  26. 26. Sticks …and Carrotshttp://www.cartoonstock.com/lowres/csl4846l.jpg http://www.flickr.com/photos/darshan-shah/6237564870/
  27. 27. 2. Research DataManagement services(Providing tools & support)
  28. 28. Understanding Data Requirementshttp://www.dcc.ac.uk/
  29. 29. Data management plans
  30. 30. • Advocacy & Training • Informatics: disciplinary metadata schema, standards, formats, identifiers, ontologies • Storage: file-store, cloud, data centres, funder policy • Access: embargoes, FOI
  31. 31. What data to keepHow to cite data
  32. 32. Data Licensing• Bespoke licences• Standard licences• Multiple licensing• Licence mechanisms
  33. 33. Tools to track impact http://total-impact.org/
  34. 34. Research360@Bath• Partnership approach • UKOLN-DCC • Library • IT services • Research Support Office • Doctoral Training Centreshttp://blogs.bath.ac.uk/research360/
  35. 35. Partnership approachLibrary & institutional stakeholders•Roles (7 listed)•Responsibilities•Requirements•RelationshipsLiz Lyon, Informatics Transform, Ariadne Issue 68, 2012
  36. 36. • Director IS/CIO/University Librarian• Data librarians /data scientist /liaison/subject/faculty librarians• Repository managers• IT/Computing Services• Research Support/Innovation Office• Doctoral Training Centres• PVC Research Data roles Liz Lyon, Informatics Transform, Ariadne Issue 68, 2012
  37. 37. Full mapping : Informatics Transform, Ariadne Issue 68, 2012
  38. 38. 3. Developing data informaticscapacity & capability(Acquiring the skills….)
  39. 39. RLUK/Mary Auckland: Reskilling for Research 9 areas are skill gaps for subject librariansSheila Corrall: Libraries,Librarians and DataMany action exemplars2012: Libraries in review
  40. 40. Skill gap 2-5 years Now Preserving research outputs 49% 10% Data management & curation 48% 16% Comply with funder mandates 40% 16% Data manipulation tools 34% 7% Data mining 33% 3% Metadata 29% 10% Preservation of project records 24% 3% Sources of research funding 21% 8% Metadata schema, discipline 16% 2% standards, practicesData from RLUK/Mary Auckland: Reskilling for Research 2012
  41. 41. Pause for reflection….• Skills shortage for data informatics?• Reposition LIS curriculum?• LIS entry requirements?• Get credit for informatics work? Lyon, Informatics Transform, Ariadne 2012
  42. 42. Play for action….1. Define core components of data informatics • Visualisation e.g. VisTrails • Workflow e.g. Taverna • Analysis e.g. RLyon, Informatics Transform, Ariadne 2012
  43. 43. “Very few librarians are likely to have specialist scientific or medical knowledge - if you train as a research scientist or a medic, you probably won’t become a librarian.”RLUK/Mary Auckland: Reskilling for Research 2012
  44. 44. Play for action….1. Analyse LIS entry qualifications & increase STEM entrants Target • Biologists • Chemists • Mathematicians Lyon, Informatics Transform, Ariadne 2012
  45. 45. Let’s get together
  46. 46. Play for action….1. International Data Informatics Working Group to explore promotion, recognition & reward • Global awareness campaign • Career incentives • Benchmark good practice Lyon, Informatics Transform, Ariadne 2012
  47. 47. Position LocationScience Data Librarian StanfordData Management Librarian Oregon StateSocial Sciences Data Librarian BrownData Curation Librarian NortheasternData Librarian New South WalesResearch Data Management SydneyCo-ordinatorResearch Data & Digital CambridgeCuration OfficerData Services Librarian IowaData Analyst ANDSInstitutional Data Scientist Bath
  48. 48. Datajournalist? Data artist?
  49. 49. Implications of “Big Data” and data science for organisations in all sectors Predicts a shortage of 190,000 data scientists by 2019http://www.mckinsey.com/Insights/MGI/Research/Technology_and_Innovation/Big_data_The_next_frontier_for_innovation
  50. 50. “Big Data”Data scientistData Science Revealedcommunity surveyhttp://www.emc.com/collateral/about/news/emc-data-science-study-wp.pdf
  51. 51. For a University, research data is akey element of “Big Data”.Managing research data effectivelywill give business advantage.
  52. 52. Data-intensive research • Intelligence • Decision-making • Planning • Investment • Capacity • Capabilityhttp://communitymodel.sharepoint.com/
  53. 53. CommunityCapabilityModelFrameworkCCMF • Research Funders • Institutions • Research leaders/PIs http://communitymodel.sharepoint.com/
  54. 54. “The ability to take data-to be able to understandit, to process it, to extractvalue from it, to visualiseit, to communicate it-that’s going to be a hugelyimportant skill in the nextdecades.” Hal Varian, Chief Economist, Google
  55. 55. Libraries are on a data journey-the Informatics Transform isthe first step in a newdirection…
  56. 56. Thank you!Informatics Transform article (in press)http://ariadne.ac.uk/issue68/lyon use details:Slideshttp://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/presentations.htmlDCC http://www.dcc.ac.uk

×