Momentum of Open Research Data: now in 5-d!

Presentation by Heather Piwowar at Simon Fraser University in October 2012 at the SFU Research Data Repository Project Launch.

Highlights current state of research data sharing. http://www.lib.sfu.ca/node/11510

Published in: Education


Transcript

  • 1. Momentum of open research data: now in 5-D! Heather Piwowar, @researchremix. Postdoc with NESCent and Dryad, at Duke and UBC. SFU Research Data Repository Project Launch, October 2012. Some photos NC, SA.
  • 2. http://www.metmuseum.org/toah/ho/09/euwf/ho_24.45.1.htm
  • 3. http://www.flickr.com/photos/jsmjr/62443357/
  • 4. http://www.flickr.com/photos/camilleharrington/3587294608/
  • 5. http://www.flickr.com/photos/rkuhnau/3318245976/
  • 6. http://www.flickr.com/photos/conformpdx/1796399674/
  • 7. http://www.flickr.com/photos/rkuhnau/3317418699/
  • 8. http://www.flickr.com/photos/zemlinki/261617721/
  • 9. http://www.flickr.com/photos/tracenmatt/3020786491/
  • 10. http://www.flickr.com/photos/the-o/2078239333/
  • 11. http://www.flickr.com/photos/ryanr/142455033/
  • 12. http://www.flickr.com/photos/75166820@N00/5318468/
  • 13. MOMENTUM
  • 14. 5 dimensions
  • 15. repositories, research, policies, tools, environment
  • 16. repositories, research, policies, tools, environment
  • 17. Discipline repository; Datatype repository; Journal repository; Institutional repository; ...
  • 18. Examples of where data can live:
      Institutional repository: https://circle.ubc.ca/
      Discipline repository: http://datadryad.org/
      Datatype repository: http://www.ncbi.nlm.nih.gov/genbank/ (example: http://www.ncbi.nlm.nih.gov/nuccore/192496?report=genbank )
      Journal supplementary information: http://www.nature.com/nature/journal/v429/n6990/suppinfo/nature02564.html
      Lab website: http://www.bx.psu.edu/~ross/dataset/DatasetHome.html
      "Data paper": http://www.biomedcentral.com/bmcresnotes/
      Catch-all data repository: http://figshare.com/
  • 19. [Figure: grid of X marks]
  • 20. What’s best? It depends. We don’t know.
  • 21. It depends.
  • 22. http://www.flickr.com/photos/jo-h/2688026447/
  • 23. repositories, research, policies, tools, environment
  • 24. Citation boost
  • 25. Gleditsch et al. 2003. Posting Your Data: Will You Be Scooped or Will You Be Famous? International Studies Perspectives 4(1): 89–97.
      Piwowar et al. 2007. Sharing detailed research data is associated with increased citation rate. PLoS ONE.
      Ioannidis et al. 2009. Repeatability of published microarray gene expression analyses. Nature Genetics 41: 149–155.
      Pienta et al. 2010. NSR Social Science Secondary Use. Michigan IR.
      Henneken et al. 2011. Linking to Data – Effect on Citation Rates in Astronomy. ESO.
      Sears 2011. Data Sharing Effect on Article Citation Rate in Paleoceanography. AGU.
  • 26. ~70% in multivariate analysis
  • 27. Amount shared and withheld
  • 28. [Figure: Proportion of articles with shared datasets, by year (across time). Y-axis: proportion of articles with datasets found in GEO or ArrayExpress, 0.05 to 0.35. X-axis: year article published, 2000 to 2009.]
  • 29. 19%. Piwowar and Chapman. Journal of Informetrics 2010.
  • 30. [Figure: Multivariate nonlinear regression with interactions. Axis: odds ratio, 0.25 to 4.00. Factors: OA journal & previous GEO-AE sharing; amount of NIH funding; journal impact factor and policy; Higher Ed in USA; cancer & humans.]
  • 31. Amount of reuse
  • 32. Type of reuse
  • 33. Cost/benefit
  • 34. 2) More impact per funding dollar. Traditional research funding: $400k = 16 papers. At Dryad cost levels, at similar levels of reuse to GEO, $400k would facilitate 1000 reuse papers. A stellar scientific ROI is in easy reach.
  • 35. Piwowar, Vision, Whitlock (2011). Data archiving is a good investment. Nature 473, 285. http://researchremix.wordpress.com/2011/05/19/nature-letter/
  • 36. repositories, research, policies, tools, environment
  • 37. Journal requirements
  • 38. Journal data sharing policy: “An inherent principle of publication is that others should be able to replicate and build upon the authors' published claims. Therefore, a condition of publication in a Nature journal is that authors are required to make materials, data and associated protocols available in a publicly accessible database …” http://www.nature.com/authors/editorial_policies/availability.html http://www.nature.com/nature/journal/v453/n7197/index.html
  • 39. JDAP: << Journal >> requires, as a condition for publication, that data supporting the results in the paper should be archived in an appropriate public archive, such as << list of approved archives here >>. Data are important products of the scientific enterprise, and they should be preserved and usable for decades in the future. Authors may elect to have the data publicly available at time of publication, or, if the technology of the archive allows, may opt to embargo access to the data for a period up to a year after publication. Exceptions may be granted at the discretion of the editor, especially for sensitive information such as human subject data or the location of endangered species.
  • 40. High-impact journals tend to have a strong data-sharing policy
  • 41. Articles published in journals with a strong data-sharing policy are more likely to have publicly available datasets
  • 42. NSF data management requirement
  • 43. NSF biosketch
  • 44. [Figure: two bar charts; labels garbled in extraction] Do not publicize
  • 45. [Figure: four bar charts; labels garbled in extraction] Do not publicize
  • 46. http://www.nsf.gov/pubs/policydocs/pappguide/nsf08_1/gpg_2.jsp
  • 47. NSF Biosketch starting January: Publications to Products
  • 48. repositories, research, policies, tools, environment
  • 49. DataUp
  • 50. DMP Tool
  • 51. RunMyCode
  • 52. http://www.flickr.com/photos/pixscapes/4331070047
  • 53. In 2009, 116 articles cited ORNL DAAC data. Finding these articles took 70-80 hours across at least 12 resources, all chosen from a deep understanding of this specific research domain. Then the full text of all the hits was manually reviewed. Valerie Enriquez interview with James Kidder: http://openwetware.org/wiki/DataONE:Notebook/Reuse_of_repository_data
  • 54. http://www.flickr.com/photos/quinnanya/2055471833
  • 55. altmetrics.org/tools: ImpactStory; altmetric.com; PLoS article-level metrics; Reader Meter; Science Card
  • 56. impact flavour CC-BY-NC by maniacyak on flickr http://www.flickr.com/photos/maniacyak/3432589472
  • 57. http://dx.doi.org/10.5061/dryad.18
  • 58. http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
  • 59. http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
  • 60. http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
  • 61. ImpactStory.org
  • 62. repositories, research, policies, tools, environment
  • 63. Open Access
  • 64. Reproducibility
  • 65. Big Data
  • 66. repositories, research, policies, tools, environment
  • 67. GET EXCITED and MAKE THINGS
  • 68. Open up your data while you are doing it :) http://www.flickr.com/photos/myklroventine/892446624/
  • 69. Thank you! Todd Vision: PI of Dryad. Jason Priem: cofounder of ImpactStory. Also: Mike Whitlock, Jonathan Carlson, Estephanie Sta Maria. The open science online community and those who release their articles, datasets and photos openly. Blog: ResearchRemix.wordpress.com, @researchremix
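
The cost comparison on slide 34 can be checked with a few lines of arithmetic. The sketch below uses only the three figures quoted on that slide ($400k, 16 papers, 1000 reuse papers). The function and variable names are illustrative assumptions, and the per-paper costs and the ratio are derived here, not stated in the talk.

```python
# Figures quoted on slide 34 of the transcript.
FUNDING_USD = 400_000        # funding amount used in both scenarios
TRADITIONAL_PAPERS = 16      # papers produced by traditional research funding
REUSE_PAPERS = 1_000         # reuse papers facilitated at Dryad cost levels


def cost_per_paper(funding_usd: float, papers: int) -> float:
    """Return the funding spent per resulting paper (hypothetical helper)."""
    if papers <= 0:
        raise ValueError("papers must be positive")
    return funding_usd / papers


# Derived values: these are not on the slide, only implied by its numbers.
traditional_cost = cost_per_paper(FUNDING_USD, TRADITIONAL_PAPERS)
reuse_cost = cost_per_paper(FUNDING_USD, REUSE_PAPERS)
papers_ratio = REUSE_PAPERS / TRADITIONAL_PAPERS

print(f"Traditional funding: ${traditional_cost:,.0f} per paper")
print(f"Data archiving:      ${reuse_cost:,.0f} per reuse paper")
print(f"Papers per dollar:   {papers_ratio:.1f}x higher with archiving")
```

Under the slide's own assumption (reuse at levels similar to GEO), the same $400k works out to $25,000 per paper for traditional funding against $400 per reuse paper.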