João André Carriço, Mário Ramirez
Microbiology Institute and Instituto de Medicina Molecular,
Faculty of Medicine, University of Lisbon
jcarrico@fm.ul.pt twitter: @jacarrico
RAMI-NGS, Hamburg, Germany, 9-11 June 2016
• Moving from typing into High Throughput Sequencing (HTS) genomics:
  • Increase in discrimination
  • Extra information to be extracted from the genome (resistance profiles, virulence factors, genome organization)
  • Global outbreak detection / surveillance
  • Direct application in public health
  • Source attribution -> intervention
Image credits:
1) http://www.iissiidiology.net/en/publications/104-ayfaar-interpersonal-and-true-human-relationship-harmonization-mechanisms
2) http://blog.f1000research.com/2014/04/04/reproducibility-tweetchat-recap/
Data Integration
Harmonization
Reproducibility
Algorithms
Interfaces
Ontologies
Read mapping algorithms
• Bowtie2
• BWA
• SOAP2
• Saruman
• mr/mrsFAST
• … (and a lot more)
Algorithms
Hatem M et al., BMC Bioinformatics 2013; 14:184
DOI: 10.1186/1471-2105-14-184
+ a plethora of parameters for each of them
+ a (proper) choice of reference
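Because the outcome of read mapping depends on the mapper, its parameters and the reference, one way to keep an analysis reproducible is to record all three together. A minimal sketch, assuming a hypothetical `MappingRun` record; the parameter name, version string and reference label below are illustrative only and are not taken from the slides or from any tool's documentation:

```python
import hashlib
import json
from dataclasses import dataclass, field, asdict


@dataclass(frozen=True)
class MappingRun:
    """Hypothetical provenance record for one read-mapping run."""
    mapper: str        # e.g. "BWA" or "Bowtie2"
    version: str       # tool version string as reported by the tool
    reference: str     # identifier of the reference sequence used
    parameters: dict = field(default_factory=dict)  # illustrative key/value pairs

    def fingerprint(self) -> str:
        """Stable hash that matches only when every recorded field matches."""
        payload = json.dumps(asdict(self), sort_keys=True)
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()


run_a = MappingRun("BWA", "x.y.z", "reference_A", {"seed_length": 19})
run_b = MappingRun("BWA", "x.y.z", "reference_A", {"seed_length": 25})
run_c = MappingRun("BWA", "x.y.z", "reference_A", {"seed_length": 19})

# Same mapper, reference and parameters give the same fingerprint.
print(run_a.fingerprint() == run_c.fingerprint())
# Changing a single parameter gives a different fingerprint.
print(run_a.fingerprint() == run_b.fingerprint())
```

Storing such a fingerprint next to each result makes it easy to see whether two sets of calls were produced under the same conditions.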
Gene-by-gene approach allele call algorithms:
• BIGSdb (Jolley, K. A. & Maiden, M. C. J. BMC Bioinf 11, 595 (2010))
• Enterobase (https://enterobase.warwick.ac.uk/)
• GEP (Genome Profiler) (JCM. 2015 May;53(5):1765-7)
• Ridom SeqSphere
• BioNumerics (Applied Maths)
• Mostly assembly based (yes, it is a lot of work…)
• Assembly algorithms have some parameters (mostly k-mer sizes)
• Lots of heuristics for allele definition
Algorithms
• Gene-by-gene approaches:
  • What is a locus?
  • What is an allele?
It depends on the algorithm(s) used!
Algorithms
However, the results are largely congruent!
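To make concrete why "what is an allele?" depends on the algorithm, here is a minimal sketch of exact-match allele calling. The function names, the toy sequences and the rule of giving a new sequence the next integer are assumptions for illustration; none of the tools listed above is claimed to work this way:

```python
from typing import Dict, Optional


def call_allele(sequence: str, known: Dict[str, int]) -> Optional[int]:
    """Return the allele number for an exact sequence match, or None if unseen."""
    return known.get(sequence.upper())


def call_or_assign(sequence: str, known: Dict[str, int]) -> int:
    """Exact match if possible; otherwise register the sequence as a new allele."""
    seq = sequence.upper()
    if seq not in known:
        known[seq] = max(known.values(), default=0) + 1
    return known[seq]


# Toy allele database for one hypothetical locus.
locus_db = {"ATGAAATAA": 1, "ATGAAGTAA": 2}

print(call_allele("atgaaataa", locus_db))     # exact match to a known allele
print(call_allele("ATGAAATGA", locus_db))     # unseen sequence, no call
print(call_or_assign("ATGAAATGA", locus_db))  # registered as a new allele
```

Any extra rule, such as a length tolerance or a requirement for a complete coding sequence, changes which sequences count as new alleles, and therefore changes the resulting profiles.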
Ontologies
Image from http://www.emiliosanfilippo.it/?page_id=1172
• “Formal representation of knowledge as a set of concepts within a domain, and the relationships between those concepts” (Wikipedia)
• Domain modeling: represents all the concepts involved in microbial typing by sequence-based methods
• Provides a shared vocabulary, in which the concepts should be unambiguous
• Enables a machine-readable format that software and algorithms can use to interact automatically with multiple databases
Ontologies
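A minimal sketch of the shared-vocabulary idea, using plain Python tuples as subject-predicate-object statements. The term names (`typing:Allele`, `typing:Locus`, and so on) and the two lab-specific field mappings are hypothetical and are not taken from any published ontology:

```python
from typing import Dict, List, Set, Tuple

Triple = Tuple[str, str, str]

# Concepts and relationships in a toy typing domain (hypothetical terms).
ontology: List[Triple] = [
    ("typing:Allele", "is_variant_of", "typing:Locus"),
    ("typing:Profile", "has_part", "typing:Allele"),
    ("typing:Isolate", "has_profile", "typing:Profile"),
]

# Each database maps its own column names onto the shared vocabulary.
lab_a_fields: Dict[str, str] = {"allele_id": "typing:Allele", "gene": "typing:Locus"}
lab_b_fields: Dict[str, str] = {"variant_no": "typing:Allele", "locus_tag": "typing:Locus"}


def shared_concepts(a: Dict[str, str], b: Dict[str, str]) -> Set[str]:
    """Concepts that both databases expose, whatever they call the columns."""
    return set(a.values()) & set(b.values())


def related(term: str, triples: List[Triple]) -> List[Triple]:
    """All statements in which the term appears as subject or object."""
    return [t for t in triples if term in (t[0], t[2])]


print(sorted(shared_concepts(lab_a_fields, lab_b_fields)))
print(related("typing:Allele", ontology))
```

Software that only knows the shared terms can query both databases without knowing either one's local column names.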
GenEpiO: Combining Different Epi, Lab,
Genomics and Clinical Data Fields.
Lab Analytics: Genomics, PFGE, Serotyping, Phage typing, MLST, AMR
Clinical Data: Patient demographics, Medical History, Comorbidities, Symptoms, Health Status
Reporting: Case/Investigation Status
GenEpiO (Genomic Epidemiology Application Ontology)
See draft version at https://github.com/Public-Health-Bioinformatics/IRIDA_ontology
Original slide from Emma Griffiths
Ontologies
Public Health Surveillance
Case Cluster Analysis
Result Reporting
Infectious Disease Epidemiology (from case to intervention)
Lab Surveillance (from sample to strain typing results)
Evidence Collection & Outbreak Investigation
Sample Collection & Processing
Sequence Data Generation & Processing
Bioinformatics Analysis
Result Reporting
Whole Genome Sequencing (SO, ERO, OBI etc.)
Quality Control (OBI, ERO)
Anatomy (FMA)
Environment (Envo)
Food (FoodOn)
Clinical Sampling (OBI)
Custom LIMS
Quality Control (OBI, ERO)
AMR (ARO)
Virulence (PATO)
Phylogenetic Clustering (EDAM)
Mobile Elements (MobiO)
Quality Control (OBI, ERO)
AMR (ARO) LOINC
Surveillance (SurvO)
Demographics (SIO)
Patient History (SIO)
Symptoms (SYMP)
Exposures (ExO)
Source Attribution (IDO)
Travel (IDO)
Transmission (TRANS)
Food (FoodOn)
Geography (OMRSE)
Outbreak Protocols
Surveillance (SurvO)
Food (FoodOn)
Surveillance (SurvO)
Mobile Elements (MobiO)
Infectious Disease (IDO)
Typing (TypON)
Nomenclature & Taxonomy (NCBItaxon)
Original slide from Emma Griffiths / IRIDA
http://foodontology.github.io/foodon/
(pipeline) NGSOnto
• Provides a machine-readable, web-based interface, i.e., the algorithms (not humans) can:
  • retrieve, submit and update data / analysis results
  • launch analyses / algorithms
Interfaces
http://www.clker.com/cliparts/q/P/V/D/5/R/cog-allgrey-hi.png
• BIGSdb
• Enterobase
Offer a RESTful API for data retrieval, submission and analysis
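As a rough illustration of what a machine-readable interface makes possible, the sketch below builds (but does not send) HTTP requests for the actions listed for such interfaces: retrieve, submit, update and launch. The base URL `https://example.org/api`, the resource paths and the JSON fields are placeholders, not the actual BIGSdb or Enterobase endpoints; each service's own API documentation gives the real routes:

```python
import json
from typing import Optional
from urllib.request import Request

BASE_URL = "https://example.org/api"  # placeholder, not a real typing service


def build_request(method: str, path: str, payload: Optional[dict] = None) -> Request:
    """Build an HTTP request object; nothing is sent over the network."""
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    headers = {"Accept": "application/json"}
    if data is not None:
        headers["Content-Type"] = "application/json"
    return Request(f"{BASE_URL}{path}", data=data, headers=headers, method=method)


# Conventional REST mapping of actions to HTTP methods (paths are hypothetical).
retrieve = build_request("GET", "/isolates/42")
submit = build_request("POST", "/isolates", {"name": "isolate_x"})
update = build_request("PUT", "/isolates/42", {"name": "isolate_y"})
launch = build_request("POST", "/analyses", {"isolate": 42, "method": "typing"})

for req in (retrieve, submit, update, launch):
    print(req.get_method(), req.full_url)
```

Passing any of these objects to `urllib.request.urlopen` would send it; that step is left out here because the endpoints are invented.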
Interfaces
https://online.phyloviz.net/
API:
*account creation
*profile + metadata upload
*running goeBURST
*retrieving a link
Private or Public data sharing
Scalable to thousands of nodes
Tree Analysis tools:
Interactive distance matrix
NLV graph
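The distance matrix and NLV graph can be sketched as follows, taking "NLV" in its usual sense of profiles that differ at no more than N loci. The profiles and names are made up, and this is not the PHYLOViZ Online implementation:

```python
from itertools import combinations
from typing import Dict, List, Tuple

Profile = Tuple[int, ...]


def locus_differences(a: Profile, b: Profile) -> int:
    """Number of loci at which two allelic profiles differ (Hamming distance)."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same loci")
    return sum(x != y for x, y in zip(a, b))


def distance_matrix(profiles: Dict[str, Profile]) -> Dict[Tuple[str, str], int]:
    """Pairwise distances for every unordered pair of profile names."""
    return {
        (p, q): locus_differences(profiles[p], profiles[q])
        for p, q in combinations(sorted(profiles), 2)
    }


def nlv_graph(profiles: Dict[str, Profile], n: int) -> List[Tuple[str, str]]:
    """Edges between profiles that differ at no more than n loci."""
    return [pair for pair, d in distance_matrix(profiles).items() if d <= n]


# Made-up seven-locus profiles.
profiles = {
    "ST1": (1, 1, 1, 1, 1, 1, 1),
    "ST2": (1, 1, 1, 1, 1, 1, 2),  # single-locus variant of ST1
    "ST3": (1, 1, 1, 1, 1, 3, 2),  # double-locus variant of ST1
    "ST4": (5, 6, 7, 8, 9, 3, 2),
}

print(distance_matrix(profiles))
print(nlv_graph(profiles, n=1))
```

Raising `n` adds edges between more distant profiles, so the same data set yields a denser graph at higher thresholds.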
Transparency of analytical methods
Better definition of concepts (Clinical/Lab/Analysis)
Better tool/database interoperability
• Reproducibility of results
• Added value of analysis
• Custom interfaces for non-bioinformatics specialists
• UMMI Members
  • Bruno Gonçalves
  • Mickael Silva
  • Miguel Machado
  • Mário Ramirez
  • José Melo-Cristino
• INESC-ID
  • Alexandre Francisco
  • Cátia Vaz
  • Marta Nascimento
• EFSA INNUENDO Project (https://sites.google.com/site/innuendocon/)
  • Mirko Rossi
• FP7 PathoNGenTrace (http://www.patho-ngen-trace.eu/):
  • Dag Harmsen (Univ. Muenster)
  • Stefan Niemann (Research Center Borstel)
  • Keith Jolley, James Bray and Martin Maiden (Univ. Oxford)
  • Joerg Rothganger (RIDOM)
  • Hannes Pouseele (Applied Maths)
• Genome Canada IRIDA project (www.irida.ca)
  • Franklin Bristow, Thomas Matthews, Aaron Petkau, Morag Graham and Gary Van Domselaar (NML, PHAC)
  • Ed Taboada and Peter Kruczkiewicz (Lab Foodborne Zoonoses, PHAC)
  • Fiona Brinkman (SFU)
  • William Hsiao (BCCDC)
INTEGRATED RAPID INFECTIOUS DISEASE ANALYSIS

Common languages in genomic epidemiology: from ontologies to algorithms

  • 1. JoãoAndré Carriço, Mario Ramirez Microbiology Institute and Instituto de Medicina Molecular, Faculty of Medicine, University of Lisbon jcarrico@fm.ul.pt twitter: @jacarrico RAMI-NGS, Hamburg, Germany, 9-11 June 2016
  • 2.  Moving fromTyping into High Throughput Sequencing (HTS) Genomics :  Increase in discrimination  Extra information to be extracted the genome (resistance profiles, virulence factors, genome organization)  Global Outbreak detection / Surveillance  Direct application in public health  Source attribution -> intervention
  • 3. Image credits: 1) http://www.iissiidiology.net/en/publications/104-ayfaar-interpersonal-and-true-human-relationship-harmonization-mechanisms 2) http://blog.f1000research.com/2014/04/04/reproducibility-tweetchat-recap/ Data Integration Harmonization Reproducibility 1)
  • 5. Read mapping algorithms  Bowtie2  BWA  SOAP2  Saruman  mr/mrsFAST  …. (And a lot more ) Algorithms Hatem M et all BMC Bioinformatics 2013..14:184 DOI: 10.1186/1471-2105-14-184 + a plethora of parameters for each of them + a (proper) choice of reference
  • 6. Gene-by-gene approach allele call algorithms:  BIGSdb ( Jolley, K.A. & Maiden, M. C. J. BMC Bioinf 11, 595 (2010).)  Enterobase (https://enterobase.warwick.ac.uk/)  GEP (Genome Profiler) (JCM. 2015 May;53(5):1765-7)  Ridom Seqsphere  Bionumerics (Applied Maths)  Mostly assembly based (yes it is a lot of work … )  Assembly algorithms have some parameters (mostly k-mer sizes)  Lots of heuristics for allele definition.. Algorithms
  • 7. Algorithms. Gene-by-gene approaches: What is a locus? What is an allele? It depends on the algorithm(s) used! However, the results are largely congruent!
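To make the point of slide 7 concrete, here is a minimal sketch of how two allele-calling rules can disagree on the same input. It is not the method used by BIGSdb, Enterobase or any other tool named in the slides; the function names, the toy sequences and the "complete CDS" rule (ATG start, in-frame length, terminal stop codon) are all assumptions made for illustration.

```python
from typing import Dict, Optional


def call_allele_exact(seq: str, alleles: Dict[str, int]) -> int:
    """Exact-match rule: reuse the number of a known sequence, else assign the next one."""
    seq = seq.upper()
    if seq not in alleles:
        alleles[seq] = max(alleles.values(), default=0) + 1
    return alleles[seq]


def call_allele_cds(seq: str, alleles: Dict[str, int]) -> Optional[int]:
    """Stricter rule (hypothetical): only sequences that look like a complete CDS get a number."""
    seq = seq.upper()
    stops = ("TAA", "TAG", "TGA")
    looks_complete = (
        len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops
    )
    if not looks_complete:
        return None  # no call: the locus is reported as missing or incomplete
    return call_allele_exact(seq, alleles)


# Toy data, invented for this example.
KNOWN = {"ATGAAATAA": 1}
FULL = "ATGAAGTAA"       # new, complete sequence
TRUNCATED = "ATGAAGTA"   # same locus, last base missing (e.g. at a contig end)

exact_full = call_allele_exact(FULL, dict(KNOWN))
exact_truncated = call_allele_exact(TRUNCATED, dict(KNOWN))
cds_full = call_allele_cds(FULL, dict(KNOWN))
cds_truncated = call_allele_cds(TRUNCATED, dict(KNOWN))
```

Under these assumptions both rules give the complete sequence a new allele number, but only the exact-match rule assigns a number to the truncated one, so the resulting profiles differ at that locus.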
  • 9. Ontologies. “Formal representation of knowledge as a set of concepts within a domain, and the relationships between those concepts” (Wikipedia). Domain modeling: represents all the concepts involved in microbial typing by sequence-based methods. Provides a shared vocabulary, where the concepts should be unambiguous. Enables a machine-readable format that software and algorithms can use to interact automatically with multiple databases.
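As a rough illustration of what "concepts and relationships in a machine-readable format" can mean, the sketch below stores a few subject-predicate-object triples and walks the `is_a` relation. The terms and predicates are invented for this example and are not taken from TypON, GenEpiO or any other ontology mentioned in the talk; real ontologies are usually published in formats such as OWL or OBO rather than as Python sets.

```python
from typing import List, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# Hypothetical mini-vocabulary, for illustration only.
TRIPLES: Set[Triple] = {
    ("MLST", "is_a", "sequence-based typing method"),
    ("sequence-based typing method", "is_a", "typing method"),
    ("allele", "variant_of", "locus"),
}


def ancestors(term: str, triples: Set[Triple]) -> List[str]:
    """Return every concept reachable from `term` through `is_a`, nearest first."""
    found: List[str] = []
    frontier = [term]
    while frontier:
        current = frontier.pop(0)
        for subj, pred, obj in sorted(triples):
            if subj == current and pred == "is_a" and obj not in found:
                found.append(obj)
                frontier.append(obj)
    return found


mlst_ancestors = ancestors("MLST", TRIPLES)
```

Because the relationships are explicit data, a program can answer "is MLST a typing method?" without a human reading the documentation, which is the property the slide attributes to ontologies.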
  • 11. Ontologies. GenEpiO (Genomic Epidemiology Application Ontology): Combining Different Epi, Lab, Genomics and Clinical Data Fields. Lab Analytics: Genomics, PFGE, Serotyping, Phage typing, MLST, AMR. Clinical Data: Patient demographics, Medical History, Comorbidities, Symptoms, Health Status. Reporting: Case/Investigation Status. See draft version at https://github.com/Public-Health-Bioinformatics/IRIDA_ontology. Original slide from Emma Griffiths.
  • 12. Public Health Surveillance Case Cluster Analysis Result Reporting. Infectious Disease Epidemiology (from case to intervention). Lab Surveillance (from sample to strain typing results). Evidence Collection & Outbreak Investigation; Sample Collection & Processing; Sequence Data Generation & Processing; Bioinformatics Analysis; Result Reporting. Ontologies shown on the slide: Whole Genome Sequencing (SO, ERO, OBI etc), Quality Control (OBI, ERO), Anatomy (FMA), Environment (Envo), Food (FoodOn), Clinical Sampling (OBI), Custom LIMS, Quality Control (OBI, ERO), AMR (ARO), Virulence (PATO), Phylogenetic Clustering (EDAM), Mobile Elements (MobiO), Quality Control (OBI, ERO), AMR (ARO), LOINC, Surveillance (SurvO), Demographics (SIO), Patient History (SIO), Symptoms (SYMP), Exposures (ExO), Source Attribution (IDO), Travel (IDO), Transmission (TRANS), Food (FoodOn), Geography (OMRSE), Outbreak Protocols, Surveillance (SurvO), Food (FoodOn), Surveillance (SurvO), Mobile Elements (MobiO), Infectious Disease (IDO), Typing (TypON), Nomenclature & Taxonomy (NCBItaxon). Original slide from Emma Griffiths / IRIDA. http://foodontology.github.io/foodon/ (pipeline) NGSOnto
  • 13. Interfaces. Provides a machine-readable, web-based interface, i.e., the algorithms (not humans) can: retrieve, submit, update data / analysis results; launch analysis / algorithms. http://www.clker.com/cliparts/q/P/V/D/5/R/cog-allgrey-hi.png
  • 14. Interfaces. BIGSdb and Enterobase offer a RESTful API for data retrieval, submission and data analysis.
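The sketch below shows, in general terms, what "algorithms, not humans, retrieve data and launch analyses" looks like against a REST-style interface. The base URL, the paths and the JSON fields are hypothetical and do not describe the BIGSdb or Enterobase APIs; consult each service's own documentation for its real routes. The code only builds the requests with Python's standard library and does not send them.

```python
import json
from typing import List
from urllib.parse import quote, urljoin
from urllib.request import Request

# Hypothetical service root, used only for illustration.
BASE_URL = "https://typing.example.org/api/"


def build_profile_query(scheme: str, isolate_id: str) -> Request:
    """Build a GET request asking for one isolate's allelic profile."""
    path = f"schemes/{quote(scheme)}/isolates/{quote(isolate_id)}/profile"
    return Request(
        urljoin(BASE_URL, path),
        headers={"Accept": "application/json"},
        method="GET",
    )


def build_analysis_job(scheme: str, isolate_ids: List[str]) -> Request:
    """Build a POST request that would launch an analysis on several isolates."""
    payload = json.dumps({"scheme": scheme, "isolates": isolate_ids}).encode("utf-8")
    return Request(
        urljoin(BASE_URL, "jobs"),
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


query = build_profile_query("mlst", "iso1")
job = build_analysis_job("mlst", ["iso1", "iso2"])
```

Passing either object to `urllib.request.urlopen` would send it; against a real service the response would then be parsed as JSON and handed to the next tool in the pipeline.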
  • 16. Interfaces. https://online.phyloviz.net/ API: account creation; profile + metadata upload; running goeBURST; retrieving a link. Private or public data sharing. Scalable to thousands of nodes. Tree analysis tools: interactive distance matrix; NLV graph.
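The distance matrix and NLV (N-locus variant) graph mentioned on slide 16 can be sketched as follows, assuming that the distance between two allelic profiles is the number of loci at which they differ and that an NLV graph links every pair of profiles differing at no more than N loci. This is an illustrative sketch, not PHYLOViZ Online's implementation, and it is not goeBURST, which applies additional tie-break rules to build a tree. The profile names and allele numbers are invented.

```python
from itertools import combinations
from typing import Dict, List, Sequence, Tuple


def hamming(a: Sequence[int], b: Sequence[int]) -> int:
    """Number of loci at which two allelic profiles differ."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same loci")
    return sum(x != y for x, y in zip(a, b))


def distance_matrix(profiles: Dict[str, List[int]]) -> Dict[Tuple[str, str], int]:
    """Pairwise distances, keyed by (name, name) pairs in sorted order."""
    return {
        (p, q): hamming(profiles[p], profiles[q])
        for p, q in combinations(sorted(profiles), 2)
    }


def nlv_edges(profiles: Dict[str, List[int]], n: int) -> List[Tuple[str, str]]:
    """Edges of the NLV graph: pairs of profiles differing at no more than n loci."""
    return [pair for pair, d in distance_matrix(profiles).items() if d <= n]


# Invented seven-locus profiles.
PROFILES = {
    "ST1": [1, 1, 1, 1, 1, 1, 1],
    "ST2": [1, 1, 1, 1, 1, 1, 2],
    "ST3": [1, 1, 1, 1, 1, 3, 2],
    "ST4": [5, 6, 7, 8, 9, 3, 2],
}

matrix = distance_matrix(PROFILES)
slv_graph = nlv_edges(PROFILES, 1)
```

With `n=1` the graph contains only single-locus-variant links (ST1 to ST2 and ST2 to ST3 in this toy data); raising `n` adds progressively more distant pairs.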
  • 17. Transparency of analytical methods; better definition of concepts (Clinical/Lab/Analysis); better tool/database interoperability. Outcomes: reproducibility of results; added value of analysis; custom interfaces for non-bioinformatics specialists.
  • 18.
  • 19. UMMI Members: Bruno Gonçalves, Mickael Silva, Miguel Machado, Mário Ramirez, José Melo-Cristino. INESC-ID: Alexandre Francisco, Cátia Vaz, Marta Nascimento. EFSA INNUENDO Project (https://sites.google.com/site/innuendocon/): Mirko Rossi. FP7 PathoNGenTrace (http://www.patho-ngen-trace.eu/): Dag Harmsen (Univ. Muenster), Stefan Niemann (Research Center Borstel), Keith Jolley, James Bray and Martin Maiden (Univ. Oxford), Joerg Rothganger (RIDOM), Hannes Pouseele (Applied Maths). Genome Canada IRIDA project (Integrated Rapid Infectious Disease Analysis, www.irida.ca): Franklin Bristow, Thomas Matthews, Aaron Petkau, Morag Graham and Gary Van Domselaar (NLM, PHAC), Ed Taboada and Peter Kruczkiewicz (Lab Foodborne Zoonoses, PHAC), Fiona Brinkman (SFU), William Hsiao (BCCDC).