Momentum of Open Research Data: now in 5-d!

Presentation by Heather Piwowar at Simon Fraser University in October 2012 at the SFU Research Data Repository Project Launch.

Highlights current state of research data sharing. http://www.lib.sfu.ca/node/11510

Transcript

  • 1. Momentum of open research data: now in 5-D! Heather Piwowar @researchremix. Postdoc with NESCent and Dryad, at Duke and UBC. SFU Research Data Repository Project Launch, October 2012. some photos NC, SA
  • 2. http://www.metmuseum.org/toah/ho/09/euwf/ho_24.45.1.htm
  • 3. http://www.flickr.com/photos/jsmjr/62443357/
  • 4. http://www.flickr.com/photos/camilleharrington/3587294608/
  • 5. http://www.flickr.com/photos/rkuhnau/3318245976/
  • 6. http://www.flickr.com/photos/conformpdx/1796399674/
  • 7. http://www.flickr.com/photos/rkuhnau/3317418699/
  • 8. http://www.flickr.com/photos/zemlinki/261617721/
  • 9. http://www.flickr.com/photos/tracenmatt/3020786491/
  • 10. http://www.flickr.com/photos/the-o/2078239333/
  • 11. http://www.flickr.com/photos/ryanr/142455033/
  • 12. http://www.flickr.com/photos/75166820@N00/5318468/
  • 13. MOMENTUM
  • 14. 5 dimensions
  • 15. - repositories - research - policies - tools - environment
  • 16. - repositories - research - policies - tools - environment
  • 17. Discipline repository, Datatype repository, Journal repository, Institutional repository...
  • 18. Institutional repository: https://circle.ubc.ca/
    Discipline repository: http://datadryad.org/
    Datatype repository: http://www.ncbi.nlm.nih.gov/genbank/ (example: http://www.ncbi.nlm.nih.gov/nuccore/192496?report=genbank )
    Journal supplementary information: http://www.nature.com/nature/journal/v429/n6990/suppinfo/nature02564.html
    Lab website: http://www.bx.psu.edu/~ross/dataset/DatasetHome.html
    "Data paper": http://www.biomedcentral.com/bmcresnotes/
    Catch-all data repository: http://figshare.com/
  • 19. X X X X X X X X X X X X X X X XX X X X X X
  • 20. What’s best? It depends. We don’t know.
  • 21. It depends.
  • 22. http://www.flickr.com/photos/jo-h/2688026447/
  • 23. - repositories - research - policies - tools - environment
  • 24. Citation boost
  • 25. Gleditsch et al. 2003. Posting Your Data: Will You Be Scooped or Will You Be Famous? International Studies Perspectives 4(1): 89–97.
    Piwowar et al. 2007. Sharing Detailed Research Data Is Associated with Increased Citation Rate. PLoS ONE.
    Ioannidis et al. Repeatability of published microarray gene expression analyses. Nature Genetics 41, 149-155.
    Pienta et al. 2010. NSR Social Science Secondary Use. Michigan IR.
    Henneken et al. 2011. Linking to Data – Effect on Citation Rates in Astronomy. ESO.
    Sears 2011. Data Sharing Effect on Article Citation Rate in Paleoceanography. AGU.
  • 26. ~70% in multivariate analysis
  • 27. Amount shared and withheld
  • 28. [Figure: Proportion of articles with shared datasets, by year ("Across time"). y-axis: proportion of articles with datasets found in GEO or ArrayExpress (0.05 to 0.35); x-axis: year article published (2000 to 2009).]
  • 29. 19%. Piwowar and Chapman. Journal of Informetrics 2010
  • 30. [Figure: Multivariate nonlinear regression with interactions. x-axis: odds ratio (0.25 to 4.00). Factors shown: OA journal & previous GEO-AE sharing; Amount of NIH funding; Journal impact factor and policy; Higher Ed in USA; Cancer & humans.]
  • 31. Amount of reuse
  • 32. Type of reuse
  • 33. Cost/benefit
  • 34. 2) more impact per funding dollar
    Traditional research funding: $400k = 16 papers
    At Dryad cost levels, at similar levels of reuse to GEO, $400k would facilitate 1000 reuse papers
    A stellar Scientific ROI is in easy reach.
  • 35. Piwowar, Vision, Whitlock (2011) Data archiving is a good investment. Nature 473, 285. http://researchremix.wordpress.com/2011/05/19/nature-letter/
  • 36. - repositories - research - policies - tools - environment
  • 37. Journal requirements
  • 38. journal data sharing policy: “An inherent principle of publication is that others should be able to replicate and build upon the authors' published claims. Therefore, a condition of publication in a Nature journal is that authors are required to make materials, data and associated protocols available in a publicly accessible database …” http://www.nature.com/authors/editorial_policies/availability.html http://www.nature.com/nature/journal/v453/n7197/index.html
  • 39. JDAP: << Journal >> requires, as a condition for publication, that data supporting the results in the paper should be archived in an appropriate public archive, such as << list of approved archives here >>. Data are important products of the scientific enterprise, and they should be preserved and usable for decades in the future. Authors may elect to have the data publicly available at time of publication, or, if the technology of the archive allows, may opt to embargo access to the data for a period up to a year after publication. Exceptions may be granted at the discretion of the editor, especially for sensitive information such as human subject data or the location of endangered species.
  • 40. High-impact journals tend to have a strong data-sharing policy
  • 41. Articles published in journals with a strong data-sharing policy are more likely to have publicly available datasets
  • 42. NSF data management requirement
  • 43. NSF biosketch
  • 44. [Charts: bars from 0% to 50% over a scale from strongly disagree through neutral to strongly agree. Panels: "Pleased others can build on my work more easily"; "Pleased I will get more citations".] Do not publicize
  • 45. [Charts: same scale, four panels: "Pleased others can build on my work more easily"; "Pleased it will be valued by my funder"; "Pleased I will get more citations"; "Pleased it will be valued by my promotion or tenure committee".] Do not publicize
  • 46. http://www.nsf.gov/pubs/policydocs/pappguide/nsf08_1/gpg_2.jsp
  • 47. NSF Biosketch starting January: Publications to Products
  • 48. - repositories - research - policies - tools - environment
  • 49. DataUp
  • 50. DMP Tool
  • 51. RunMyCode
  • 52. http://www.flickr.com/photos/pixscapes/4331070047
  • 53. In 2009, 116 articles cited ORNL DAAC data. Finding these articles took 70-80 hours across at least 12 resources, all chosen from a deep understanding of this specific research domain; then the full text of all the hits was manually reviewed. Valerie Enriquez interview with James Kidder http://openwetware.org/wiki/DataONE:Notebook/Reuse_of_repository_data
  • 54. http://www.flickr.com/photos/quinnanya/2055471833
  • 55. altmetrics.org/tools: ImpactStory, altmetric.com, PLoS article-level metrics, Reader Meter, Science Card
  • 56. impact flavour CC-BY-NC by maniacyak on flickr http://www.flickr.com/photos/maniacyak/3432589472
  • 57. http://dx.doi.org/10.5061/dryad.18
  • 58. http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
  • 59. http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
  • 60. http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
  • 61. ImpactStory.org
  • 62. - repositories - research - policies - tools - environment
  • 63. Open Access
  • 64. Reproducibility
  • 65. Big Data
  • 66. - repositories - research - policies - tools - environment
  • 67. GET EXCITED and MAKE THINGS
  • 68. Open up your data while you are doing it :) http://www.flickr.com/photos/myklroventine/892446624/
  • 69. thank you!
    Todd Vision: PI of Dryad
    Jason Priem: cofounder of ImpactStory
    Also: Mike Whitlock, Jonathan Carlson, Estephanie Sta Maria
    The open science online community and those who release their articles, datasets and photos openly.
    blog: ResearchRemix.wordpress.com @researchremix
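
The funding comparison on slide 34 can be made explicit with a little arithmetic. The sketch below is illustrative only: the three figures ($400k, 16 papers, 1000 reuse papers) come from the slide, while the function name `cost_per_paper` and the derived ratio are assumptions added here, not part of the talk.

```python
def cost_per_paper(budget_usd: float, papers: int) -> float:
    """Average funding spent per paper (hypothetical helper, not from the talk)."""
    if papers <= 0:
        raise ValueError("papers must be positive")
    return budget_usd / papers


# Figures quoted on slide 34.
BUDGET_USD = 400_000
TRADITIONAL_PAPERS = 16   # papers from traditional research funding
REUSE_PAPERS = 1000       # reuse papers facilitated at Dryad cost levels

traditional = cost_per_paper(BUDGET_USD, TRADITIONAL_PAPERS)
reuse = cost_per_paper(BUDGET_USD, REUSE_PAPERS)

# How many times more papers the same budget supports via archiving.
ratio = REUSE_PAPERS / TRADITIONAL_PAPERS

print(f"Traditional funding: ${traditional:,.0f} per paper")
print(f"Data archiving: ${reuse:,.0f} per reuse paper")
print(f"Papers per dollar: {ratio:.1f}x higher with archiving")
```

Under the slide's numbers this works out to $25,000 per paper for traditional funding against $400 per reuse paper, a ratio of 62.5. The comparison assumes, as the slide does, reuse at levels similar to GEO.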