From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Citable Data


Slides from the GigaScience press conference at BGI's Bio-IT APAC meeting, covering the launch of the GigaScience website and the release of the first unpublished animal genomes from its database. The genomes include polar bear, penguin, pigeon and macaque. 6th July 2011

Published in: Technology, Health & Medicine


  1. From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Citable Data
     doi:10.5524/100001
     doi:10.5524/100008
     Scott Edmunds, PhD
     Alexandra Basford, PhD
  2. The “big-data” era
     Source: E Lander/Broad
     Flickr cc: opensourceway
  3. Traditional Form of Data-Release
     Data Production
     Data Analysis
     Peer Review
     Publication
     Months/Years
  4. Incentives/credit
     Credit where credit is overdue:
     “One option would be to provide researchers who release data to public repositories with a means of accreditation.”
     “An ability to search the literature for all online papers that used a particular data set would enable appropriate attribution for those who share.”
     Nature Biotechnology 27, 579 (2009)
     Prepublication data sharing (Toronto International Data Release Workshop)
     “Data producers benefit from creating a citable reference, as it can later be used to reflect impact of the data sets.”
     Nature 461, 168-170 (2009)
  5. Launching today…
     A new open-access open-data journal and database
     Innovative article publishing and data hosting
     … “big and sharable”
     Published by BGI in partnership with BioMed Central
  6. Why DOI®s?
     Clear method for data tracking and data citation, allowing:
     • Increased searchability (and use) of data
     • Credit for data production, making it clear who produced the data and when
     • The ability to track and receive feedback on data usage
     • Credit to original authors for their data’s use
     • A data citation metric potentially rivaling and complementary to the impact factor
     • The potential to publish papers relating to a dataset, while making the data available and receiving credit for it earlier
  7. Our first DOI®:
     To maximize its utility to the research community and aid those fighting the current epidemic, genomic data is released here into the public domain under a CC0 license. Until the publication of research papers on the assembly and whole-genome analysis of this isolate we would ask you to cite this dataset as:
     Li, D; Xi, F; Zhao, M; Liang, Y; Chen, W; Cao, S; Xu, R; Wang, G; Wang, J; Zhang, Z; Li, Y; Cui, Y; Chang, C; Cui, C; Luo, Y; Qin, J; Li, S; Li, J; Peng, Y; Pu, F; Sun, Y; Chen, Y; Zong, Y; Ma, X; Yang, X; Cen, Z; Zhao, X; Chen, F; Yin, X; Song, Y; Rohde, H; Li, Y; Wang, J; Wang, J and the Escherichia coli O104:H4 TY-2482 isolate genome sequencing consortium (2011) Genomic data from Escherichia coli O104:H4 isolate TY-2482. BGI Shenzhen. doi:10.5524/100001
     To the extent possible under law, BGI Shenzhen has waived all copyright and related or neighboring rights to Genomic Data from the 2011 E. coli outbreak. This work is published from: China.
  8. [no text on slide]
  9. G10K Genomes Get DOI®s
     doi:10.5524/100004
  10. About The Journal
     • Open access, online journal optimized for the publication of all types of biological studies that use or create large-scale data sets.
     • New journal format that combines standard manuscript publication with an extensive database developed to host all associated data.
     • Evolving data repository beginning with 'omic'-type data and growing to support other biological large-scale data, such as imaging data, cohort data, metadata, and others.
     • Editorial interaction with the different biological communities to determine the best means of hosting and accessing their type of data.
     • Integrated tools to promote more widespread access, viewing, and analysis of the stored data.
     • BGI Cloud Computing resources for handling and analyzing large-scale data.
     • All data given a DOI to allow ease of finding and citing datasets, as well as for citation tracking.
  11. Launching today…
     Questions?
     Editor-in-Chief: Laurie Goodman, PhD
     Editor: Scott Edmunds, PhD
     Assistant Editor: Alexandra Basford, PhD
     Contact:
     Follow GigaScience on Twitter @GigaScience
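
The slides present DOIs as a way to find and cite datasets. As a minimal sketch of what that looks like in practice, the Python snippet below turns the dataset DOIs quoted on the slides into links via the public doi.org resolver. The function name `doi_to_url` and the loose syntax check are illustrative assumptions, not part of any GigaScience tooling, and the snippet only builds the URL without contacting the resolver.

```python
import re

# Loose shape check for a DOI: "10.", a numeric registrant prefix, "/", a suffix.
# This is an illustrative assumption, not a full validation of DOI syntax.
DOI_SHAPE = re.compile(r"10\.\d{4,9}/\S+")


def doi_to_url(doi: str) -> str:
    """Return a doi.org resolver URL for a DOI, accepting an optional 'doi:' prefix."""
    bare = doi.strip()
    if bare.lower().startswith("doi:"):
        bare = bare[4:].strip()
    if not DOI_SHAPE.fullmatch(bare):
        raise ValueError(f"does not look like a DOI: {doi!r}")
    return "https://doi.org/" + bare


# The three dataset DOIs that appear on the slides.
for slide_doi in ("doi:10.5524/100001", "doi:10.5524/100008", "doi:10.5524/100004"):
    print(slide_doi, "->", doi_to_url(slide_doi))
```

Because the identifier stays fixed while the resolver handles redirection, a citation that uses the DOI does not depend on where the data is hosted.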