Slides from GigaScience press-conference at BGI's Bio-IT APAC meeting on the GigaScience website launch and release of first unpublished animal genomes released from database. Genomes include polar bear, penguin, pigeon and macaque. 6th July 2011
CNIC Information System with Pakdata Cf In Pakistan
From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Citable Data
1. From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Citable Data doi:10.5524/100001 doi:10.5524/100008 Scott Edmunds, PhD Alexandra Basford, PhD
3. Traditional Form of Data-Release Data Production Data Analysis Peer Review Publication Publication Months/Years
4. Incentives/credit Credit where credit is overdue: “One option would be to provide researchers who release data to public repositories with a means of accreditation.” “An ability to search the literature for all online papers that used a particular data set would enable appropriate attribution for those who share. “ Nature Biotechnology 27, 579 (2009) Prepublication data sharing (Toronto International Data Release Workshop) “Data producers benefit from creating a citable reference, as it can later be used to reflect impact of the data sets.” Nature461, 168-170 (2009)
5. Launching today… A new open-access open-data journal and database Innovative article publishing and data hosting … “big and sharable” www.gigasciencejournal.com Published by BGI in partnership with BioMedCentral
6. Why DOI®s? Clear method for data tracking and data citation, allowing: Increased the searchability (and use) of data Credit for data production, making it clear who produced the data and when The ability to track and receive feedback on data usage Credit to original authors for their data’s use A data citation metric potentially rivaling and complementary to the impact factor The potential to publish papers relating to a dataset, while making the data available and receiving credit for it earlier
7. Our first DOI®: To maximize its utility to the research community and aid those fighting the current epidemic, genomic data is released here into the public domain under a CC0 license. Until the publication of research papers on the assembly and whole-genome analysis of this isolate we would ask you to cite this dataset as: Li, D; Xi, F; Zhao, M; Liang, Y; Chen, W; Cao, S; Xu, R; Wang, G; Wang, J; Zhang, Z; Li, Y; Cui, Y; Chang, C; Cui, C; Luo, Y; Qin, J; Li, S; Li, J; Peng, Y; Pu, F; Sun, Y; Chen,Y; Zong, Y; Ma, X; Yang, X; Cen, Z; Zhao, X; Chen, F; Yin, X; Song,Y ; Rohde, H; Li, Y; Wang, J; Wang, J and the Escherichia coli O104:H4 TY-2482 isolate genome sequencing consortium (2011) Genomic data from Escherichia coli O104:H4 isolate TY-2482. BGI Shenzhen. doi:10.5524/100001 http://dx.doi.org/10.5524/100001 To the extent possible under law, BGI Shenzhen has waived all copyright and related or neighboring rights to Genomic Data from the 2011 E. coli outbreak. This work is published from: China.
11. New journal format that combines standard manuscript publication with an extensive database developed to host all associated data.
12. Evolving data repository beginning with 'omic’-type data and growing to support other biological large-scale data, such as imaging data, cohort data, metadata, and others.
13. Editorial interaction with the different biological communities to determine the best means of hosting and accessing their type of data.
14. Integrated tools to promote more widespread access, viewing, and analysis of the stored data.
15. BGI Cloud Computing resources for handling and analyzing large-scale data.