SLA webinar: Open research data needs librarians

  1. 1. Open research data: fun, important, and in need of librarians! Heather Piwowar, DataONE postdoc with NESCent and Dryad, @researchremix. sci-tech SLA webinar: October 2011
  13. 13. Find; Organize; Document; Deidentify; Format; Ask; Submit; Answer questions; Worry about mistakes being found; Worry about data being misinterpreted; Worry about being scooped; Forgo money and IP and prestige; ???
  14. 14. not very motivating.
  15. 15. Is archiving data important? Are we sure?
  16. 16. Errors. More than half of all papers contain errors. 5-10% contain errors that change the conclusions. Gore et al 1977, Kantoer and Taylor 1994, McGuigan 1995, Hurlbert and White 1993
  17. 17. Ok, let’s share on request.
  18. 18. Doesn’t work. [Bar chart, 0% to 40%: self-reported denying a request in last 3 years; trainees self-reported denying a request; been denied access to data, materials, code; authors “not able to retrieve raw data”; not willing to release data.] Campbell et al. JAMA. 2002. Kyzas et al. J Natl Cancer Inst. 2005. Vogeli et al. Acad Med. 2006. Reidpath et al. Bioethics 2001.
  19. 19. Don’t get the email. Evangelou et al. FASEB J. 2006. Wren. Bioinformatics 2008. Wren et al. EMBO Rep 2006.
  20. 20. Say no. [Bar chart, 0% to 50%: want to publish more papers first; want exclusive use; ensure data confidentiality; control; avoid cost of preparation.] Hedstrom. Society of Am Archivists Ann Meeting. 2008.
  21. 21. Ask why. “Before I send you the data could I ask what you want it for?” “Can you be more explicit, please, about the analyses you have in mind and what you plan to do with them?” “We’ll have to discuss your request with the other coauthors. Before we do that, I’d like to know your proposed analysis plan.” “We are not finished using the data, but when we are finished with it, we would be open to requests for the data.” “Any use of the data other than for the specific purpose laid down in the contract of collaboration is effectively ruled out.” Reidpath et al. Bioethics 2001.
  22. 22. Not efficient.
  23. 23. Not efficient. Not fair. Not random: - young - productive. Campbell et al. 2000
  24. 24. Has real costs. Survey of doctoral students and postdocs: 28-50% reported negative effects of withholding: • hurt progress of their research, • hurt rate of discovery in their lab/research group, • hurt quality of their relationships with academic scientists, • hurt quality of their education, • hurt level of communication in their lab/research group. Vogeli et al. Acad Med. 2006 Feb; 81(2):128-36
  25. 25. Need help.
  26. 26. Librarians are good at helping.
  27. 27. This issue needs you.
  28. 28. host
  29. 29. By request doesn’t work
  30. 30. On a lab website? No. URLs stop working. Evangelou et al. FASEB J. 2006. Wren. Bioinformatics 2008. Wren et al. EMBO Rep 2006.
  31. 31. In a data repository? Good idea!
  33. 33. Discipline repository, Datatype repository, Journal repository, Institutional repository, ...
  34. 34. What’s best? It depends. We don’t know.
  36. 36. Institutional repositories should join in.
  37. 37. ... but only if committed to trying to do it right
  38. 38. use your experience!
  39. 39. educate
  41. 41. educate repositories (doing repositories well is hard!)
  43. 43. educate researchers
  44. 44. You know your data. We know long-term preservation. Let’s talk.
  45. 45. educate researchers - about benefits
  46. 46. Carrot?
  47. 47. currency of value? Citations. $50! Diamond, Arthur M. What is a Citation Worth? The Journal of Human Resources (1986) vol. 21 (2) pp. 200-215
  48. 48. ~70% in multivariate analysis
  49. 49. “Does anyone want your data? That’s hard to predict […] After all, no one ever knocked on your door asking to buy those figurines collecting dust in your cabinet before you listed them on eBay. Your data, too, may simply be awaiting an effective matchmaker.” Got data? Nature Neuroscience (2007)
  51. 51. educate researchers - about best practices
  52. 52. selection, format, location, metadata, ...
  53. 53. Never know what keywords to suggest for your paper? We know keywords. Let’s talk.
  54. 54. educate researchers - about scientific contribution
  55. 55. Piwowar, Vision, Whitlock (2011) Data archiving is a good investment. Nature 473, 285
  56. 56. 10 * 100 = 1000
  58. 58. [Chart: cumulative # of uses over time; axis ticks and series labels not recoverable]
  59. 59. educate researchers - about getting recognition
  60. 60. include it on CV! in letters, in biosketches
  61. 61. a tool that is emerging from a hackathon-gone-right...
  67. 67. educate admin - contribution to science - institutional benefit - what others are doing
  68. 68. advocate
  69. 69. open up our own stuff + advocate for open - repository stats - reference lists!
  70. 70. policies: - data availability - data citation
  71. 71. reward: - letters of rec - biosketches - tenure packages
  72. 72. more evidence!
  73. 73. host, educate, advocate
  77. 77. thank you: Todd Vision; Jonathan Carlson, Estephanie Sta Maria, Nicholas Weber, Sarah Judson, Valerie Enriquez; Jason Priem and Beyond Impact; Dryad and DataONE teams; The open science online community and those who release their articles, datasets and photos openly. blog: @researchremix My :