Data citations: who cares?

1,185 views

Published on

Who cares how research data is attributed and cited? Lots of people. Presented by Heather Piwowar to DataONE summer internship 2010 group on data citatio

Published in: Education
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,185
On SlideShare
0
From Embeds
0
Number of Embeds
4
Actions
Shares
0
Downloads
12
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Data citations: who cares?

  1. 1. Data citation... Who cares? Heather Piwowar DataONE postdoc with Dryad and NESCent DataONE summer internship meeting  July 7, 2010
  2. 2. http://www.metmuseum.org/toah/ho/09/euwf/ho_24.45.1.htm
  3. 3. http://www.flickr.com/photos/jsmjr/62443357/
  4. 4. http://www.flickr.com/photos/camilleharrington/3587294608/
  5. 5. http://www.flickr.com/photos/rkuhnau/3318245976/
  6. 6. http://www.flickr.com/photos/conformpdx/1796399674/
  7. 7. http://www.flickr.com/photos/rkuhnau/3317418699/
  8. 8. http://www.flickr.com/photos/zemlinki/261617721/
  9. 9. http://www.flickr.com/photos/tracenmatt/3020786491/
  10. 10. http://www.flickr.com/photos/the-o/2078239333/
  11. 11. Probably.
  12. 12. In theory.
  13. 13. ?
  14. 14. • Genbank • PDB
  15. 15. http://www.oxfordjournals.org/ nar/database/cap/
  16. 16. http://www.flickr.com/photos/archeon/2941655917/
  17. 17. Data citation...
  18. 18. paper dataset paper dataset dataset paper dataset paper dataset paper dataset paper
  19. 19. • Alas, no unique standard identifier • URL • accession number • DOI • citation to paper • citation to database • reference to supplementary material • search strategy
  20. 20. Example: full-text phrases containing “... accessed”
  21. 21. “submitted”
  22. 22. “downloaded”
  23. 23. • Citations are indexed and machine-extractable
  24. 24. paper dataset paper dataset dataset paper dataset paper dataset paper dataset paper
  25. 25. • understand current practice • articulate the best best-practices
  26. 26. paper dataset paper dataset dataset paper dataset paper dataset paper dataset paper
  27. 27. Who cares?
  28. 28. 1.  Data creators • personal reward • motivation: • “if it really helped” • even esoteric datasets are useful • how prevalent is scooping? • alert to possible misuses • grounded requirements
  29. 29. 2.  Data reusers • clear guidelines are helpful • what has been reused, for what? • what hasnʼt?
  30. 30. 3.  Repository creators, maintainers • funding • how much metadata • how to format • what additional tools are useful • lifecycle of data
  31. 31. 4.  Funders • most, best science for their money • cost/benefit of mandate • inform funding decisions: • what has been extra useful? • what hasnʼt? • what support is needed
  32. 32. 5.  Journals • increasingly called upon to mandate or fund: • how to decide • how to rationalize • another avenue to compete
  33. 33. 6.  Information scientists • extension of citation analysis for studying information behaviour
  34. 34. 6.  Me
  35. 35. Articles published in journals with a strong data-sharing policy are more likely to have publicly available datasets
  36. 36. Reuse estimate • 2703 submissions in 2007 • GSE* in PubMed Central • Exclude author overlap • Exclude data creation • automatically, manually • 139 • 520
  37. 37. 7.  You
  38. 38. 8.  Your mom
  39. 39. 9.  These mice http://www.flickr.com/photos/ryanr/142455033/
  40. 40. 10.  Scientific progress • trace errors, fraud • increase transparency • more efficient and effective
  41. 41. you can not manage  what you do not measure quote: Lord Kelvin http://www.flickr.com/photos/archeon/2941655917/
  42. 42. science about our science
  43. 43. http://www.flickr.com/photos/druclimb/293046352/
  44. 44. Thanks to: NSF, DataONE, NESCent, Dryad UBC Dept of Zoology NLM, U of Pittsburgh Dept of Biomedical Informatics Open science online community and those who release their articles, datasets and photos openly questions?

×