Momentum of                     open research data:                          now in 5-D!                                 H...
http://www.metmuseum.org/toah/ho/09/euwf/ho_24.45.1.htm
http://www.flickr.com/photos/jsmjr/62443357/
http://www.flickr.com/photos/camilleharrington/3587294608/
http://www.flickr.com/photos/rkuhnau/3318245976/
http://www.flickr.com/photos/conformpdx/1796399674/
http://www.flickr.com/photos/rkuhnau/3317418699/
http://www.flickr.com/photos/zemlinki/261617721/
http://www.flickr.com/photos/tracenmatt/3020786491/
http://www.flickr.com/photos/the-o/2078239333/
http://www.flickr.com/photos/ryanr/142455033/
http://www.flickr.com/photos/75166820@N00/5318468/
MOMENTUM
5dimensions
- repositories- research- policies- tools- environment
- repositories- research- policies- tools- environment
Discipline repositoryDatatype repositoryJournal repositoryInstitutional repository...
Institutional repository:https://circle.ubc.ca/Discipline repository:http://datadryad.org/Datatype repository:http://www.n...
X    X            X                           X                                                     X                     ...
What’s best?It depends.We don’t know.
It depends.
http://www.flickr.com/photos/jo-h/2688026447/
- repositories- research- policies- tools- environment
Citation boost
Gleditsch et al. 2003. Posting Your Data: Will You BeScooped or Will You Be Famous?, International StudiesPerspectives 4(1...
~70%in multivariateanalysis
Amount shared andwithheld
Proportion of articles with shared datasets, by year                                                                    0....
19%Piwowar and Chapman. Journal of Informetrics 2010
Multivariate nonlinear regression with interactions                                                 Odds Ratio            ...
Amount of reuse
Type of reuse
Cost/benefit
2) more impact per funding dollarTraditional research funding:$400k = 16 papersAt Dryad cost levels,at similar levels of r...
Piwowar, Vision, Whitlock (2011)  Data archiving is a good investment.  Nature 473, 285http://researchremix.wordpress.com/...
- repositories- research- policies- tools- environment
Journalrequirements
journal	  data	  sharing	  policy              “An inherent principle of publication is that               others should b...
JDAP<< Journal>> requires, as a condition for publication, thatdata supporting the results in the paper should be archived...
High-impact journals     tend to havea strong data-sharing        policy
Articles published in journalswith a strong data-sharing policyare more likely to have publicly        available datasets
NSF datamanagementrequirement
NSF biosketch
9:/54351#,*;5+3#<4-#=82/1#,-#>0#?,+@#>,+5#5432/0#(!"#!"#&!"#%!"#$!"# !"#       )*+,-./0#     6#       6#   758*+4/#    6# ...
9:/54351#,*;5+3#<4-#=82/1#,-#>0#?,+@#>,+5#5432/0#                                     9#:/54351#2*#;2//#<5#=4/851##(!"#   ...
http://www.nsf.gov/pubs/policydocs/pappguide/nsf08_1/gpg_2.jsp
NSF Biosketchstarting January:Publications toProducts
- repositories- research- policies- tools- environment
DataUp
DMP Tool
RunMyCode
http://www.flickr.com/photos/pixscapes/4331070047
In 2009, 116 articles cited ORNL DAAC data.Finding these articles took 70-80 hoursacross at least 12 resourcesall chosen f...
http://www.flickr.com/photos/quinnanya/2055471833
altmetrics.org/tools ImpactStory altmetric.com PLoS article-level metrics Reader Meter Science Card
impact flavour                 CC-BY-NC by maniacyak on flickr                 http://www.flickr.com/photos/maniacyak/3432...
http://dx.doi.org/10.5061/dryad.18
http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
ImpactStory.org
- repositories- research- policies- tools- environment
Open Access
Reproducibility
Big Data
- repositories- research- policies- tools- environment
GET EXCITED    andMAKE THINGS
Open up your datawhile you are doing it :)     http://www.flickr.com/photos/myklroventine/892446624/
thank you!Todd Vision: PI of DryadJason Priem: cofounder of ImpactStory Also: Mike Whitlock, Jonathan Carlson, Estephanie ...
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Momentum of Open Research Data: now in 5-d!
Upcoming SlideShare
Loading in...5
×

Momentum of Open Research Data: now in 5-d!

529

Published on

Presentation by Heather Piwowar at Simon Fraser University in October 2012 at the SFU Research Data Repository Project Launch.

Highlights current state of research data sharing. http://www.lib.sfu.ca/node/11510

Published in: Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
529
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
2
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Momentum of Open Research Data: now in 5-d!

  1. 1. Momentum of open research data: now in 5-D! Heather  Piwowar  @researchremix   Postdoc  with  NESCent  and  Dryad,  at  Duke  and  UBC SFU  Research  Data  Repository  Project  Launch  October  2012  some photos NC, SA
  2. 2. http://www.metmuseum.org/toah/ho/09/euwf/ho_24.45.1.htm
  3. 3. http://www.flickr.com/photos/jsmjr/62443357/
  4. 4. http://www.flickr.com/photos/camilleharrington/3587294608/
  5. 5. http://www.flickr.com/photos/rkuhnau/3318245976/
  6. 6. http://www.flickr.com/photos/conformpdx/1796399674/
  7. 7. http://www.flickr.com/photos/rkuhnau/3317418699/
  8. 8. http://www.flickr.com/photos/zemlinki/261617721/
  9. 9. http://www.flickr.com/photos/tracenmatt/3020786491/
  10. 10. http://www.flickr.com/photos/the-o/2078239333/
  11. 11. http://www.flickr.com/photos/ryanr/142455033/
  12. 12. http://www.flickr.com/photos/75166820@N00/5318468/
  13. 13. MOMENTUM
  14. 14. 5dimensions
  15. 15. - repositories- research- policies- tools- environment
  16. 16. - repositories- research- policies- tools- environment
  17. 17. Discipline repositoryDatatype repositoryJournal repositoryInstitutional repository...
  18. 18. Institutional repository:https://circle.ubc.ca/Discipline repository:http://datadryad.org/Datatype repository:http://www.ncbi.nlm.nih.gov/genbank/(example: http://www.ncbi.nlm.nih.gov/nuccore/192496?report=genbank )Journal supplementary information:http://www.nature.com/nature/journal/v429/n6990/suppinfo/nature02564.htmlLab website:http://www.bx.psu.edu/~ross/dataset/DatasetHome.html"Data paper"http://www.biomedcentral.com/bmcresnotes/Catch-all data repository:http://figshare.com/
  19. 19. X X X X X X X X X X X X X X X XX X X X X X
  20. 20. What’s best?It depends.We don’t know.
  21. 21. It depends.
  22. 22. http://www.flickr.com/photos/jo-h/2688026447/
  23. 23. - repositories- research- policies- tools- environment
  24. 24. Citation boost
  25. 25. Gleditsch et al. 2003. Posting Your Data: Will You BeScooped or Will You Be Famous?, International StudiesPerspectives 4(1): 89–97.Piwowar et al. 2007. Sharing Detailed research data isassociated with increased citation Rate. PLoS ONE.Ioannidis et al. Repeatability of published microarray geneexpression analyses. Nature Genetics 41, 149 - 155Pienta et al. 2010. NSR Social Science Secondary Use.Michigan IR.Henneken et al. 2011. Linking to Data – Effect on CitationRates in Astronomy. ESO.Sears 2011. Data Sharing Effect on Article Citation rate inPaleoceanography. AGU.
  26. 26. ~70%in multivariateanalysis
  27. 27. Amount shared andwithheld
  28. 28. Proportion of articles with shared datasets, by year 0.35Proportion of articles with datasets found in GEO or ArrayExpress 0.30 0.25 0.20 0.15 Across  time 0.10 0.05 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 Year article published
  29. 29. 19%Piwowar and Chapman. Journal of Informetrics 2010
  30. 30. Multivariate nonlinear regression with interactions Odds Ratio 0.25 0.50 1.00 2.00 4.00OA journal & previous GEO-AE sharing Amount of NIH funding 0.95 Journal impact factor and policy Higher Ed in USA Cancer & humans
  31. 31. Amount of reuse
  32. 32. Type of reuse
  33. 33. Cost/benefit
  34. 34. 2) more impact per funding dollarTraditional research funding:$400k = 16 papersAt Dryad cost levels,at similar levels of reuse to GEO,$400k would facilitate 1000 reuse papersA stellar Scientific ROI is in easy reach.
  35. 35. Piwowar, Vision, Whitlock (2011) Data archiving is a good investment. Nature 473, 285http://researchremix.wordpress.com/2011/05/19/nature-letter/
  36. 36. - repositories- research- policies- tools- environment
  37. 37. Journalrequirements
  38. 38. journal  data  sharing  policy “An inherent principle of publication is that others should be able to replicate and build upon the authors published claims. Therefore, a condition of publication in a Nature journal is that authors are required to make materials, data and associated protocols available in a publicly accessible database …” http://www.nature.com/authors/editorial_policies/availability.html http://www.nature.com/nature/journal/v453/n7197/index.html
  39. 39. JDAP<< Journal>> requires, as a condition for publication, thatdata supporting the results in the paper should be archivedin an appropriate public archive, such as << list of approvedarchives here >>. Data are important products of thescientific enterprise, and they should be preserved andusable for decades in the future. Authors may elect to havethe data publicly available at time of publication, or, if thetechnology of the archive allows, may opt to embargoaccess to the data for a period up to a year after publication.Exceptions may be granted at the discretion of the editor,especially for sensitive information such as human subjectdata or the location of endangered species.
  40. 40. High-impact journals tend to havea strong data-sharing policy
  41. 41. Articles published in journalswith a strong data-sharing policyare more likely to have publicly available datasets
  42. 42. NSF datamanagementrequirement
  43. 43. NSF biosketch
  44. 44. 9:/54351#,*;5+3#<4-#=82/1#,-#>0#?,+@#>,+5#5432/0#(!"#!"#&!"#%!"#$!"# !"# )*+,-./0# 6# 6# 758*+4/# 6# 6# )*+,-./0# 1234.+55# 4.+55# 9:/54351#;#<2//#.5*#=,+5#>2*4?,-3##(!"#!"#&!"#%!"#$!"# !"# )*+,-./0# 6# 6# 758*+4/# 6# 6# )*+,-./0# 1234.+55# 4.+55# Do not publicize
  45. 45. 9:/54351#,*;5+3#<4-#=82/1#,-#>0#?,+@#>,+5#5432/0# 9#:/54351#2*#;2//#<5#=4/851##(!"# <0#>0#?8-15+# (!"#!"# !"#&!"# &!"#%!"# %!"#$!"# $!"# !"# !"# )*+,-./0# 6# 6# 758*+4/# 6# 6# )*+,-./0# )*+,-./0# 6# 6# 758*+4/# 6# 6# )*+,-./0# 1234.+55# 4.+55# 1234.+55# 4.+55# 9:/54351#;#<2//#.5*#=,+5#>2*4?,-3## 9#:/54351#2*#;2//#<5#=4/851##(!"# <0#>0#:+,>,?,-#,+#*5-8+5#@,>>2A55# (!"#!"# !"#&!"# &!"#%!"# %!"#$!"# $!"# !"# !"# )*+,-./0# 6# 6# 758*+4/# 6# 6# )*+,-./0# )*+,-./0# 6# 6# 758*+4/# 6# 6# )*+,-./0# 1234.+55# 4.+55# 1234.+55# 4.+55# Do not publicize
  46. 46. http://www.nsf.gov/pubs/policydocs/pappguide/nsf08_1/gpg_2.jsp
  47. 47. NSF Biosketchstarting January:Publications toProducts
  48. 48. - repositories- research- policies- tools- environment
  49. 49. DataUp
  50. 50. DMP Tool
  51. 51. RunMyCode
  52. 52. http://www.flickr.com/photos/pixscapes/4331070047
  53. 53. In 2009, 116 articles cited ORNL DAAC data.Finding these articles took 70-80 hoursacross at least 12 resourcesall chosen from a deep understandingof this specific research domain then the full text of all the hits were manually reviewed Valerie Enriquez interview with James Kidder http://openwetware.org/wiki/DataONE:Notebook/Reuse_of_repository_data
  54. 54. http://www.flickr.com/photos/quinnanya/2055471833
  55. 55. altmetrics.org/tools ImpactStory altmetric.com PLoS article-level metrics Reader Meter Science Card
  56. 56. impact flavour CC-BY-NC by maniacyak on flickr http://www.flickr.com/photos/maniacyak/3432589472
  57. 57. http://dx.doi.org/10.5061/dryad.18
  58. 58. http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
  59. 59. http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
  60. 60. http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
  61. 61. ImpactStory.org
  62. 62. - repositories- research- policies- tools- environment
  63. 63. Open Access
  64. 64. Reproducibility
  65. 65. Big Data
  66. 66. - repositories- research- policies- tools- environment
  67. 67. GET EXCITED andMAKE THINGS
  68. 68. Open up your datawhile you are doing it :) http://www.flickr.com/photos/myklroventine/892446624/
  69. 69. thank you!Todd Vision: PI of DryadJason Priem: cofounder of ImpactStory Also: Mike Whitlock, Jonathan Carlson, Estephanie Sta Maria The open science online community and those who release their articles, datasets and photos openly. blog: ResearchRemix.wordpress.com @researchremix
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×