Scott Edmunds: Data Dissemination: Difficulties, Data Citation, DOI's (and GigaScience)

Scott Edmunds announcing BGI's new GigaScience journal at the 1st Earth Microbiome Project meeting in Shenzhen

Published in: Technology

Transcript

  • 1. Scott Edmunds
       Data Dissemination:
       Difficulties, Data Citation, DOIs and,
       (“Mo Data Mo Problems”)
  • 2. The Ecoresponsive Genome of Daphnia pulex
       Colbourne et al., Science, 4 February 2011:
       200 Mb genome, 30,907 genes
       Duplicated genes most responsive to ecological challenges
  • 3. Daphnia Genome Consortium
       wFleaBase: Mar 2006
       Genome release: July 2007
       Genome published: Feb 2011
       >58 companion papers
       https://daphnia.cgb.indiana.edu/Publications
  • 4. Difficulties
       Flickr cc: opensourceway
  • 5. Sequencing cost ($ per Mbp)
       [Chart labels: Moore’s Law; Sequencing; ~100,000X]
       Source: E Lander/Broad
  • 6. Sequencing Output
       Data
       Storage
       Moore’s/Kryder’s Law
  • 7. Sequencing Output
       Data
       Publication
       Dissemination?
  • 8. Potential sequencing capacity
       1 Illumina HiSeq 2000 (+ TruSeq upgrade) = 600 Gb/run (12 days)
       x 128 HiSeq = 6 Tb/day = >2 Pb/year
       = ~2000 human genomes/day
  • 9. SRA Closure
  • 10. Incentives/credit
       Credit where credit is overdue:
       “One option would be to provide researchers who release data to public repositories with a means of accreditation.”
       “An ability to search the literature for all online papers that used a particular data set would enable appropriate attribution for those who share.”
       Nature Biotechnology 27, 579 (2009)
       Prepublication data sharing (Toronto International Data Release Workshop)
       “Data producers benefit from creating a citable reference, as it can later be used to reflect impact of the data sets.”
       Nature 461, 168-170 (2009)
  • 11. Data citation: DataCite and DOIs
       Digital Object Identifiers (DOIs) offer a solution
       • Most widely used identifier for scientific articles
       • Researchers, authors, publishers know how to use them
       • Put datasets on the same playing field as articles
       Dataset
       Yancheva et al (2007). Analyses on sediment of Lake Maar. PANGAEA.
       doi:10.1594/PANGAEA.587840
  • 14. Data citation: DataCite and DOIs
       >1 million DOIs since Dec 2009
       Central metadata repository to link with WoS/ISI
       - finally can track and credit use!
  • 15. Coming soon…
       Large-Scale Data Journal/Database
       In conjunction with:
       Editor-in-Chief: Laurie Goodman, PhD
       Editor: Scott Edmunds, PhD
       Assistant Editor: Alexandra Basford, PhD
       www.gigasciencejournal.com
  • 16. Criteria and Focus of Journal/Database
       • Reproducibility/Reuse
       • Utility/Usability
       • Standards/Searchability/Scale/Sharing
       • Data publishing/DOI
       www.gigasciencejournal.com
  • 20. Use of Data = Importance + Usability
       easier to assess
       subjective?
       www.gigasciencejournal.com
  • 21. Reproducibility/Reuse
       • BGI Cloud Computing resources for handling and analyzing large-scale data.
       • Integrated tools to promote more widespread access, viewing, and analysis of data.
       • Encourage and aid use of workflow systems for methods (e.g. submission of Galaxy XML files).
       www.gigasciencejournal.com
  • 24. Standards/Searchability/Sharing
       • ISA-Tab compatibility to aid and promote best practice in metadata reporting.
       • All supporting data must be publicly available.
       • Ask for MIBBI compliance and use of reporting checklists.
       • Part of the BioSharing network.
       www.gigasciencejournal.com
  • 28. Data publishing/DOI
       • New journal format combines standard manuscript publication with an extensive database to host all associated data.
       • Data hosting will follow standard funding agency and community guidelines.
       • DOI assignment available for submitted data to allow ease of finding and citing datasets, as well as for citation tracking.
       www.gigasciencejournal.com
  • 31. Our first DOI:
       To maximize its utility to the research community and aid those fighting the current epidemic, genomic data is released here into the public domain under a CC0 license. Until the publication of research papers on the assembly and whole-genome analysis of this isolate we would ask you to cite this dataset as:
       Li, D; Xi, F; Zhao, M; Liang, Y; Chen, W; Cao, S; Xu, R; Wang, G; Wang, J; Zhang, Z; Li, Y; Cui, Y; Chang, C; Cui, C; Luo, Y; Qin, J; Li, S; Li, J; Peng, Y; Pu, F; Sun, Y; Chen, Y; Zong, Y; Ma, X; Yang, X; Cen, Z; Zhao, X; Chen, F; Yin, X; Song, Y; Rohde, H; Li, Y; Wang, J; Wang, J and the Escherichia coli O104:H4 TY-2482 isolate genome sequencing consortium (2011) Genomic data from Escherichia coli O104:H4 isolate TY-2482. BGI Shenzhen. doi:10.5524/100001 http://dx.doi.org/10.5524/100001
       To the extent possible under law, BGI Shenzhen has waived all copyright and related or neighboring rights to Genomic Data from the 2011 E. coli outbreak. This work is published from: China.
  • 32. E. coli #crowdsourcing: the first tweenome?
  • 33. Questions?
       scott.edmunds@genomics.org.cn
       editorial@gigasciencejournal.com
       @gigascience
       www.gigasciencejournal.com
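
The capacity figures on slide 8 can be checked with a few lines of arithmetic. The Python sketch below reproduces them. The per-run yield, run length, and instrument count are taken from the slide; the human genome size of roughly 3.2 Gb and the reading of "genomes/day" as 1x-coverage equivalents are assumptions added here, and the function names are purely illustrative.

```python
# Figures taken from slide 8 of the transcript
GB_PER_RUN = 600      # Gb per HiSeq 2000 run (with TruSeq upgrade)
DAYS_PER_RUN = 12     # days per run
N_INSTRUMENTS = 128   # number of HiSeq instruments

# Assumption (not on the slide): human genome of roughly 3.2 Gb
HUMAN_GENOME_GB = 3.2


def daily_output_gb(gb_per_run=GB_PER_RUN, days_per_run=DAYS_PER_RUN,
                    n_instruments=N_INSTRUMENTS):
    """Aggregate output in Gb per day across all instruments."""
    return gb_per_run / days_per_run * n_instruments


def yearly_output_pb(gb_per_day, days_per_year=365):
    """Convert a daily output in Gb to a yearly output in Pb (1 Pb = 1e6 Gb)."""
    return gb_per_day * days_per_year / 1_000_000


def genome_equivalents_per_day(gb_per_day, genome_gb=HUMAN_GENOME_GB):
    """Number of genome-sized (1x coverage) data sets produced per day."""
    return gb_per_day / genome_gb


if __name__ == "__main__":
    per_day = daily_output_gb()
    print(f"{per_day / 1000:.1f} Tb/day")
    print(f"{yearly_output_pb(per_day):.2f} Pb/year")
    print(f"{genome_equivalents_per_day(per_day):.0f} genome equivalents/day")
```

Under these assumptions the results agree with the slide's rounded figures of 6 Tb/day, more than 2 Pb/year, and about 2000 human genomes per day. At realistic sequencing depths (for example 30x) the number of complete genomes per day would be correspondingly lower.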

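
Slides 11 to 14 and 28 to 31 argue that DOIs make datasets as easy to link and cite as articles. As a minimal sketch of that idea, the Python below splits a DOI into its prefix and suffix, builds a resolver link, and formats a dataset citation. The function names are hypothetical, the resolver base is the `http://dx.doi.org/` form shown on slide 31, and the citation layout simply mirrors the "Authors (Year) Title. Publisher. doi:..." pattern of the example on slide 31; it is not an official DataCite formatter.

```python
RESOLVER_BASE = "http://dx.doi.org/"  # resolver form shown on slide 31


def split_doi(doi):
    """Split a DOI such as 'doi:10.5524/100001' into (prefix, suffix)."""
    doi = doi.strip()
    if doi.lower().startswith("doi:"):
        doi = doi[4:]
    # The prefix is everything before the first slash; the suffix may
    # itself contain further slashes.
    prefix, sep, suffix = doi.partition("/")
    if not (prefix.startswith("10.") and sep and suffix):
        raise ValueError(f"not a DOI: {doi!r}")
    return prefix, suffix


def resolver_url(doi, base=RESOLVER_BASE):
    """Build a resolver link for a DOI (no URL-encoding of the suffix)."""
    prefix, suffix = split_doi(doi)
    return f"{base}{prefix}/{suffix}"


def format_data_citation(authors, year, title, publisher, doi):
    """Format a dataset citation in the style used on slide 31."""
    prefix, suffix = split_doi(doi)
    return f"{authors} ({year}) {title}. {publisher}. doi:{prefix}/{suffix}"


if __name__ == "__main__":
    # Dataset example from slide 11
    print(resolver_url("doi:10.1594/PANGAEA.587840"))
    print(format_data_citation(
        "Yancheva et al", 2007, "Analyses on sediment of Lake Maar",
        "PANGAEA", "doi:10.1594/PANGAEA.587840"))
```

The sketch does not URL-encode the suffix, so DOIs containing characters such as `#` or `?` would need extra handling before being used in a link.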