Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

BGI training lecture: Scott Edmunds - Science 2.0, why new developments on the web will make you a better scientist!

Training lecture for BGI staff, June 23rd 2011.

  • Be the first to comment

  • Be the first to like this

BGI training lecture: Scott Edmunds - Science 2.0, why new developments on the web will make you a better scientist!

  1. 1. Scott Edmunds<br />Science 2.0 and beyond:<br />how new developments on the web will make you a better scientist!<br />?<br />(“Everything you wanted to know about social networks but were too afraid to ask…”)<br />
  2. 2. What is?<br /> “Science 2.0 uses the technologies of web 2.0 to conversations between researchers, let them discuss the data and connect it with other data that might be relevant. Blogs, wikis and such permit users to make information available in ways that create a conversation. Web 2.0 permits scientists to create digitized conversations that provide context for the data.”<br />(+ Semantic Web)<br />
  3. 3. Open-Science<br />Open-access (PLoS/BMC)<br />Open-source (github/sourceforge/googlecode)<br />Open-data<br />
  4. 4. Open-Science<br />For<br />Against<br />Allows crowdsourcing.<br />Better for Science.<br />Fairer (public money).<br />More use (=citations)<br />Scooping?<br />Patents/publications?<br />Time/effort.<br />Data deluge?<br />
  5. 5. Open-Science<br />Sharing Detailed Research Data Is Associated with Increased Citation Rate. <br />Piwowar HA, Day RS, Fridsma DB (2007) PLoSONE 2(3): e308. doi:10.1371/journal.pone.0000308<br />
  6. 6. Daphnia Genome Consortium<br />wFleabase: Mar 2006<br />Genome release: July 2007<br />Genome Published: Feb 2011<br />>58 companion papers<br /><br />
  7. 7. Open Lab Notebooks<br />
  8. 8. Faster scientific communication<br />FUTURE<br />PAST<br />
  9. 9. Why is it important for Chinese research?<br />
  10. 10. SPEED<br />
  11. 11. SRA Closure<br />
  12. 12. SRA Closure<br />
  13. 13. Online sources of scientific information<br />Databases/portals of traditional media<br />Blogs (networks/aggregators)<br />Social Networks:<br />Open Notebook Science<br />Wikis<br />Forums/Other<br />
  14. 14. Traditional media 2.0<br />Science databases: general<br />Subject specific<br />Journal content: browse<br />eTOCs<br />RSS<br /><ul><li>Newspapers/television:</li></li></ul><li>Science Blogs<br />Some good examples:<br />Tree of Life (Jonathon Eisen):<br />Bad Science (Ben Goldacre):<br />A Blog around the Clock (Bora Zivkovic):<br />Not Exactly Rocket Science (Ed Yong):<br />Genetic Future (Daniel MacArthur):<br />OMICS, OMICS! (Keith Robison):<br />Bacpathgenomics (Kat Holt):<br />
  15. 15. Science Blogs<br />Group blogs:<br />Open Helix (Genomics news):<br />Genomes Unzipped (Personalized Genomics):<br />Blogging Networks:<br />PLoS Blogs:<br />Nature Network:<br />Scientific American:<br />Discover Blogs:<br />Science Blogs:<br />Occam’s Typewriter:<br />
  16. 16. Science Blogs<br />Blog Aggregators:<br />Science Blogging:<br />Research Blogging:<br />Honorable Mention:<br />NCBI ROFL:<br />Urologe A. 2005 Dec;44(12):1473-5.<br />Inappropriate use of a titanium penile ring. An interdisciplinary challenge for urologists, jewelers, and locksmiths.<br />Wiedemann A, Müller H, Rabs U.<br />Psychol Rep. 2011 Feb;108(1):43-4.<br />National anthems and suicide rates.<br />Lester D, Gunn JF 3rd.<br />
  17. 17. Social Media<br />Good for events<br />Good for networking<br />Good for groups<br />
  18. 18. Using Twitter for Science<br />James Darcy: “Researchers need to get themselves onto Twitter pronto because it is fast becoming the place to find out the breakthroughs in your research field.”<br />Jonathan Eisen:“To do science, you have to know what’s going on…I found Twitter…most useful for becoming informed of what other people are doing in science.” <br />“Twitter and other social networks such as FriendFeedenable real-time highlighting and ranking and tracking of what’s going on in the world of science.” <br />“Twitter is also useful for networking and finding collaborators.”<br /><br />
  19. 19. Using Twitter for Science<br />Twitter is:<br />Microblog: max 140 characters<br /> (“The SMS of the Internet”)<br />Global: 200m users, 190m tweets (1.6b searches)/day.<br />Fast: 2,200 new tweets/s! (can fluctuate 3-4x)<br />Instant: view global trends/keywords with hashtags #<br />
  20. 20. Using Twitter for Science<br />Twitter is good for:<br />Eavesdropping: follow informative people to get information and learn<br />Dialogue: exchange, discuss, and debate information<br />Broadcast: used by news organizations and businesses to inform audience about news or products/services<br />Data collection: e.g. using Tweeting fishermen to monitor fish populations.<br />Accidental journalism: e.g. landing on Hudson river, Michael Jackson death, Japan Earthquake<br />Mindcasting:  following a single story or topic, with links, for a period of time, e.g. like my ongoing coverage of the #Ecoliat @BGI_Events<br />
  21. 21. Using Twitter for Science<br />Twitter is not so good for:<br />
  22. 22. Using Twitter for Science<br />How it works (‘twetiquette’):<br />People will only read your messages if you have followers or RT’s (re-tweets), so:<br />Keep it interesting.<br />Keep it short (<140 characters)<br />Use links and link-shorteners (<br />Keep it interactive (2-way).<br />Use hashtags and twitter ID’s (@xxx)<br />Have regular content (RT’s).<br />Intersperse tweets.<br />Think about timezones (Europe=late afternoon, US=night).<br />
  23. 23. Using Twitter for Science<br />Who to follow:<br />BGI Collaborators<br />Science news/blogs<br />@NatureNews @dgmacarthur<br />@Sciencemagazine @Boraz<br />@Biomedcentral @genomesunzipped<br />@genomeresearch @OpenHelix<br />@PLoS @sciencebase<br />@BioITWorld @edyong209<br />@Metahit<br />@Genome10K<br />@Assemblathon<br />#EMP @gilbertjacka<br />@phylogenomics<br />Scientists<br />EBI/Sanger @ewanbirney@moorejh@timjph @bffo<br />@Alexbateman1 @deannachurch<br />@lenovere@kamounlab<br />@emblebi@JCVenter<br />Hashtags<br />#pm101 = personalized medicine#OA = open access<br />#microbiome #metagenomics<br />#epigenomics #omics #genomics<br />
  24. 24. Using Twitter for Science<br />Conferences<br />ISMB2010 twitter activity<br />Tweets at the ISMB 2010 meeting <br />ISMB2010 comments July 9-13<br />
  25. 25. Using Twitter for Science<br />Conferences<br />Follow the meeting from home:<br />Conference: Twitter Feed: Hashtag:<br />ASHG @geneticssociety #ICHG2011<br />Society for Neuroscience @SfNtweets #SfN10/#SfN11<br />Plant and Animal Genomes @PAGmeeting #PAG<br />ISMB @iscb #ISMB<br />ICSB @ICSB_2011 #ICSB2011HD<br />AACR @AACR #AACR<br />
  26. 26. Using Twitter for Science<br />Conferences<br /><br />
  27. 27. Using Twitter for Science<br />Conferences<br /><br />
  28. 28. Using Twitter for Science<br />Aided by: Feed aggregators/Dashboards<br />
  29. 29. Science Social Networks<br />Genome 10K Networking:<br />Science 3.0:<br />
  30. 30. Forums/Other<br />Forums<br /><br /><br /><br /><br /><br /> (Russian) <br /> (Chinese)<br /><br />Protocols/Workflows/Hubs<br /><br /><br /><br /><br /><br />
  31. 31. Further reading:<br />Twitter:<br />A gentle introduction to Twitter for the apprehensive academic <br /><br />What is Twitter and Why Scientists Need To Use It.<br /><br />Science journalism: Breaking the convention?<br /><br />Analysing the ISMB 2010 meeting using R<br /><br />Sharing slides from a presentation plus how to do this w/ Slideshare<br /><br />Slideshare:<br />
  32. 32. Why is this important to BGI?<br />Flickr cc: opensourceway<br />
  33. 33. We produce data. (LOTS)<br />1 IlluminaHiSeq 2000 (+Truseq upgrade) <br />= 600Gb/run (12 days)<br />X 128 Hiseq= 6Tb/day = >2Pb/year<br />= ~ 2000 Human Genomes/day<br />
  34. 34. Coming soon…<br />Large-Scale Data <br />Journal/Database<br />In conjunction with:<br />Editor-in-Chief: Laurie Goodman, PhD<br /> Editor: Scott Edmunds, PhD<br /> Assistant Editor: Alexandra Basford, PhD<br /><br />
  35. 35. Our first DOI:<br />To maximize its utility to the research community and aid those  fighting the current epidemic, genomic data is released here into the public domain under a CC0 license. Until the publication of research papers on the assembly and whole-genome analysis of this isolate we would ask you to cite this dataset as:<br />Li, D; Xi, F; Zhao, M; Liang, Y; Chen, W; Cao, S; Xu, R; Wang, G; Wang, J; Zhang, Z; Li, Y; Cui, Y; Chang, C; Cui, C; Luo, Y; Qin, J; Li, S; Li, J; Peng, Y; Pu, F; Sun, Y; Chen,Y; Zong, Y; Ma, X; Yang, X; Cen, Z; Zhao, X; Chen, F; Yin, X; Song,Y ; Rohde, H; Li, Y; Wang, J; Wang, J and the Escherichia coli O104:H4 TY-2482 isolate genome sequencing consortium (2011) Genomic data from Escherichia coli O104:H4 isolate TY-2482. BGI Shenzhen. doi:10.5524/100001<br />To the extent possible under law, BGI Shenzhen has waived all copyright and related or neighboring rights to Genomic Data from the 2011 E. coli outbreak. This work is published from: China. <br />
  36. 36. E. Coli #crowdsourcing: the first tweenome?<br />“On 2 June, Chinese scientists announced that they had deciphered the microbe's entire 5.2-million-base-pair genome and immediately made the DNA sequence available for researchers to download. Scores of scientists all over the world started poring over the data, assembling sequence fragments generated by BGI into a coherent genome, and comparing it to reference genomes for E. coli and other bacteria.” <br />“The two announcements came on the second day of a U.K. meeting on applied bioinformatics and public health microbiology. Speakers and other attendees immediately started working on annotating the bacterial sequence provided by BGI. “In less than 24 hours we got the reads, the assembly, and the annotation. A good case study,” blogged Marina Manrique of era7 bioinformatics, a Spanish company that quickly did an automated analysis of the E. coli's genome.“<br />
  37. 37. E. Coli #crowdsourcing: the first tweenome?<br />
  38. 38. E. Coli #crowdsourcing: the first tweenome?<br />
  39. 39. E. Coli #crowdsourcing: the first tweenome?<br />
  40. 40. E. Coli #crowdsourcing: the first tweenome?<br />“The way that the genetic data of the 2011 E. coli strain were disseminated globally suggests a more effective approach for tackling public health problems. Both groups put their sequencing data on the Internet, so scientists the world over could immediately begin their own analysis of the bug's makeup. BGI scientists also are using Twitter to communicate their latest findings.”<br />“German scientists and their colleagues at the Beijing Genomics Institute in China have been working on uncovering secrets of the outbreak. BGI scientists revised their draft genetic sequence of the E. coli strain and have been sharing their data with dozens of scientists around the world as a way to "crowdsource" this data. By publishing their data publicy and freely, these other scientists can have a look at the genetic structure, and try to sort it out for themselves.” <br />
  41. 41. Follow us:<br />@gigascience<br />@BGI_Events<br /><br /><br /><br /><br />@SCEdmunds<br /><br />