Advertisement
Advertisement

More Related Content

Slideshows for you(20)

Similar to Cassava genome hub(20)

Advertisement

More from CIAT(20)

Advertisement

Cassava genome hub

  1. Cassava Genome Hub Updates on cassava genomics big data management and analysis. Anestis Gkanogiannis CIRAD June 24, 2016
  2. Introduction Big Data The Cassava Genome Hub Usecase Table of Contents 1 Introduction 2 Big Data 3 The Cassava Genome Hub Architecture Technologies Data Tools JBrowse SNiPlay GIGWA DiffExDB Genetic Map Querying Tools Galaxy 4 Usecase
  3. Introduction Big Data The Cassava Genome Hub Usecase Who am I? Born and raised in the Greek island of Evia. Physicist, 1998 - 2003, BSc, UOC, Crete, Greece Informatician 2003 -2005, MSc in Information Retrieval, AUEB, Athens 2005 - 2011, PhD in Machine Learning, AUEB, Athens
  4. Introduction Big Data The Cassava Genome Hub Usecase Who am I? Born and raised in the Greek island of Evia. Physicist, 1998 - 2003, BSc, UOC, Crete, Greece Informatician 2003 -2005, MSc in Information Retrieval, AUEB, Athens 2005 - 2011, PhD in Machine Learning, AUEB, Athens 2011 - 2013, Text Analysis, UNB, Fredericton, Canada 2013 - 2015, Bacterial Genomics, Genoscope, Paris, France 2015 - 2016, Plant Genomics, CIRAD, Montpellier, France 2016 - ??
  5. Introduction Big Data The Cassava Genome Hub Usecase Who are we?
  6. Introduction Big Data The Cassava Genome Hub Usecase Table of Contents 1 Introduction 2 Big Data 3 The Cassava Genome Hub Architecture Technologies Data Tools JBrowse SNiPlay GIGWA DiffExDB Genetic Map Querying Tools Galaxy 4 Usecase
  7. Introduction Big Data The Cassava Genome Hub Usecase Definition Everyone is talking about it.
  8. Introduction Big Data The Cassava Genome Hub Usecase Definition Everyone is talking about it. Any combination of
  9. Introduction Big Data The Cassava Genome Hub Usecase Definition Everyone is talking about it. Any combination of Very hot subject in Omics.
  10. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Table of Contents 1 Introduction 2 Big Data 3 The Cassava Genome Hub Architecture Technologies Data Tools JBrowse SNiPlay GIGWA DiffExDB Genetic Map Querying Tools Galaxy 4 Usecase
  11. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools http://www.cassavagenome.org
  12. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Table of Contents 1 Introduction 2 Big Data 3 The Cassava Genome Hub Architecture Technologies Data Tools JBrowse SNiPlay GIGWA DiffExDB Genetic Map Querying Tools Galaxy 4 Usecase
  13. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools
  14. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Technologies
  15. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Data Volume Tens of TB of raw sequence data. Hundreds of GB of processed and analyzed data. Velocity New and improved assemblies and annotation. New sequencing technologies and lower cost. Variety Genomic sequences, RNASeq,RADSeq, etc. Annotation Variants Metabolomic
  16. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Data Public/Private Type Technology Description Publication Samples Public Genomic WGS Assembly and annotation V6 Prochnik et al, 2012 Public Genomic WGS Genetic Variants Bredeson et al, 2016 61 Private Genomic RADSeq Genetic Variants in progress 1100 Private Genomic WGS Genetic Variants in progress 34 Public Transcriptomic RNASeq Response to Xanthomonas Munoz-Bodnar et al, 2014 12(2*6) Public Transcriptomic RNASeq Response to Xanthomonas Cohn et al, 2014 18(3*6) Private Transcriptomic RNASeq Response to White Fly in progress 16(2*8) Table: Resources of data available
  17. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Table of Contents 1 Introduction 2 Big Data 3 The Cassava Genome Hub Architecture Technologies Data Tools JBrowse SNiPlay GIGWA DiffExDB Genetic Map Querying Tools Galaxy 4 Usecase
  18. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools JBrowse A fast, embeddable Genome Browser built completely with JavaScript and HTML5.
  19. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools JBrowse A fast, embeddable Genome Browser built completely with JavaScript and HTML5.
  20. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools JBrowse A fast, embeddable Genome Browser built completely with JavaScript and HTML5.
  21. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools SNiPlay SNiPlay3: a web-based application for exploration and large scale analyses of genomic variations.
  22. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools SNiPlay SNiPlay3: a web-based application for exploration and large scale analyses of genomic variations.
  23. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools SNiPlay SNiPlay3: a web-based application for exploration and large scale analyses of genomic variations.
  24. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools SNiPlay SNiPlay3: a web-based application for exploration and large scale analyses of genomic variations.
  25. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools GIGWA A web-based tool that provides an easy and intuitive way to explore large amounts of genotyping data by filtering it. Data storage relies on MongoDB, which offers good scalability properties. Can handle multiple databases and may be deployed in either single- or multi-user mode, while it provides a wide range of popular export formats.
  26. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools GIGWA A web-based tool that provides an easy and intuitive way to explore large amounts of genotyping data by filtering it. Data storage relies on MongoDB, which offers good scalability properties. Can handle multiple databases and may be deployed in either single- or multi-user mode, while it provides a wide range of popular export formats.
  27. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools DiffExDB Explore differential expression analyses. Visualize heatmap of RPKM expression values.
  28. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools DiffExDB Explore differential expression analyses. Visualize heatmap of RPKM expression values.
  29. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools DiffExDB Explore differential expression analyses. Visualize heatmap of RPKM expression values.
  30. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools CMap A browser-based tool for the visual comparison of various maps (sequence, genetic, etc.).
  31. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools CMap A browser-based tool for the visual comparison of various maps (sequence, genetic, etc.).
  32. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools QuigMap A fast cross-platform genetic map viewer.
  33. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools BLAST blastn : Search nucleotide databases using a nucleotide query. blastp : Search protein databases using a protein query. blastx, tblastn, tblastx
  34. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools BLAST blastn : Search nucleotide databases using a nucleotide query. blastp : Search protein databases using a protein query. blastx, tblastn, tblastx
  35. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Advanced Search Search for genomic features, genomic locations, enzymatic codes, gene ontology terms, etc. Output as nucleotide or translated aminoacid sequences.
  36. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Advanced Search Search for genomic features, genomic locations, enzymatic codes, gene ontology terms, etc. Output as nucleotide or translated aminoacid sequences.
  37. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Pathway Tools Creates a new Pathway/Genome Database (PGDB) containing the predicted metabolic pathways. Supports query, visualization, and analysis of PGDBs.
  38. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Galaxy A scientific workflow, data integration and data analysis platform that aims to make computational biology accessible to research scientists that do not have computer programming experience. Provides means to build multi-step computational analyses. It provides a graphical user interface for specifying what data to operate on, what steps to take, and what order to do them in.
  39. Introduction Big Data The Cassava Genome Hub Usecase Architecture Tools Galaxy A scientific workflow, data integration and data analysis platform that aims to make computational biology accessible to research scientists that do not have computer programming experience. Provides means to build multi-step computational analyses. It provides a graphical user interface for specifying what data to operate on, what steps to take, and what order to do them in.
  40. Introduction Big Data The Cassava Genome Hub Usecase Table of Contents 1 Introduction 2 Big Data 3 The Cassava Genome Hub Architecture Technologies Data Tools JBrowse SNiPlay GIGWA DiffExDB Genetic Map Querying Tools Galaxy 4 Usecase
  41. Introduction Big Data The Cassava Genome Hub Usecase Usecase
  42. Introduction Big Data The Cassava Genome Hub Usecase Usecase
  43. Introduction Big Data The Cassava Genome Hub Usecase Thank you!
Advertisement