Context is Everything: Integrating Genomics, Epidemiological and Clinical Data Using GenEpiO

Context is Everything: Integrating
Genomics, Epidemiological and Clinical
Data Using GenEpiO
Emma Griffiths
Brinkman Lab
Simon Fraser University, Burnaby, Canada
On behalf of the GenEpiO Development Team
Will Hsiao & Damion Dooley (BC Public Health Lab),
Emma Griffiths & Fiona Brinkman (SFU)
Sept 15, 2016
1
Each year, one in eight Canadians (or 4 million people)
get sick with a domestically acquired food-borne illness.
http://www.phac-aspc.gc.ca/efwd-emoha/efbi-emoa-eng.php
Slide courtesy of Will Hsiao
Genomic sequences don’t mean much without contextual information.
• Must be combined with contextual
information to make sense of the data…
FAST!
Contextual info = Lab, Clinical, Epi Data
What is the pathogen?
Where did the exposures occur?
When were the exposures?
Who is at risk?
What virulence factors or
drug resistance
determinants are present?
How severe is the disease?
Why did the
contamination happen?
Bacterial Genome 1
Bacterial Genome 2
Bacterial Genome 3
Bacterial Genome 4
Bacterial Genome 5
Bacterial Genome 6
Bacterial Genome 7
Bacterial Genome 8
Bacterial Genome 9
OUTBREAK?
Whole genome sequencing is increasingly used for diagnostics in
foodborne outbreaks and surveillance.
3
The
of Contextual Information
Isn’t
STANDARDIZED
•Free text, short hand, granularity, misspelling, paper format 4
• Ontologies help resolve issues of
taxonomy, granularity and specificity
• Reduces time consuming manual
processing and mining
Leafy Greens “has_disposition” Transmission Vehicle (E.coli)
Spinach Lettuce
Endive “is_a” Lettuce TypeIcebergSpinacia
oleracea
Amaranthus
hybridus
S. Africa
5
Ontology, A Way of Structuring Information
• Standardized, well-defined hierarchy
of terms
• Interconnected with logical
relationships e.g. Endive “is_a”
Lettuce type
Integrates Epi Exposure data
6
GenEpiO: Genomic Epidemiology Application Ontology
Standardize fields/terms used to describe genomics, lab, clinical and
epidemiological data critical for food-borne pathogen outbreak investigations
Provincial National
Public Health Agency Public Health Agency
(Hsiao, co-PI) (Van Domselaar, co-PI)
Academic/Public
Brinkman (PI)
www.irida.ca
GenEpiO: Combining Different Epi, Lab, Genomics and Clinical Data Fields
Lab Analytics
Genomics, PFGE
Serotyping, Phage typing
MLST, AMR
Sample Metadata
Isolation Source (Food, Host
Body Product,
Environmental), BioSample
Epidemiology Investigation
Exposures
Clinical Data
Patient demographics, Medical
History, Comorbidities,
Symptoms, Health Status
Reporting
Case/Investigation Status
GenEpiO
(Genomic Epidemiology
Application Ontology)
7
Current Ontology Development Focuses On 3 Key Areas
Food Antimicrobial
Resistance
Disease
Surveillance
8
(FoodON)
(SurvO) (ARO)
See draft version at https://github.com/GenEpiO/genepio/wiki
Ontologies are commonly encoded using OWL (Web Ontology Language).
• Markup language for sharing ontologies on the web
• Machine and human-readable
• OWL statements written in RDF
• Expressed in XML syntax
• Protégé (editor)
• Ontology lookup services:
9
Use ontology to
identify common
exposures, symptoms
etc among genomics
clusters
Example: Automating Case Definition generation
Correlate Genomics Salmonella Cluster A cases between 01 Mar 2014- 15 Mar 2014 with
High-Risk Food Types  Chicken Nuggets and Geographical Location of Vancouver
XXXXXXXXXXXXXX
GenEpiO Will Help Integrate Genomics and
Epidemiological Data.
10
Genomic Epidemiology Ontology is like instrumentation for
your contextual information…
it needs maintenance and continual improvement
To achieve consensus and uptake  International GenEpiO Consortium
Join us!
E-mail: IRIDA-mail@sfu.ca
11
(>60 members from 15 countries)
GenEpiO facilitates
data sharing!
Context is everything in foodborne outbreak investigations.
GenEpiO helps fit the data together.
12
13
Project Leaders
Fiona Brinkman – SFU
Will Hsiao – PHMRL
Gary Van Domselaar – NML
Simon Fraser University (SFU)
Emma Griffiths
Geoff Winsor
Julie Shay
Matthew Laird
Bhav Dhillon
McMaster University
Andrew McArthur
Daim Sardar
European Food Safety Agency
Leibana Criado Ernesto
Vernazza Francesco
Rizzi Valentina
IRIDA-mail@sfu.ca
National Microbiology Laboratory (NML)
Franklin Bristow
Aaron Petkau
Thomas Matthews
Josh Adam
Adam Olsen
Tara Lynch
Shaun Tyler
Philip Mabon
Philip Au
Celine Nadon
Matthew Stuart-Edwards
Morag Graham
Chrystal Berry
Lorelee Tschetter
Eduardo Toboada
Peter Kruczkiewicz
Chad Laing
Vic Gannon
Matthew Whiteside
Ross Duncan
Steven Mutschall
University of Lisbon
Joᾶo Carriҫo
European Bioinformatics Institute
Melanie Courtot
Helen Parkinson
BC Public Laboratory and
BC Centre for Disease Control
(BCCDC)
Damion Dooley
Judy Isaac-Renton
Patrick Tang
Natalie Prystajecky
Jennifer Gardy
Linda Hoang
Kim MacDonald
Yin Chang
Eleni Galanis
Marsha Taylor
Jennifer Law
University of Maryland
Lynn Schriml
Canadian Food Inspection Agency
(CFIA)
Adam Koziol
Burton Blais
Catherine Carrillo
Dalhousie University
Rob Beiko
Alex Keddy
www.irida.ca
1 of 13

More Related Content

What's hot(20)

Gen epio immem_griffithsGen epio immem_griffiths
Gen epio immem_griffiths
IRIDA_community179 views
Ewald (J.Z.) Groenewald - Opening PlenaryEwald (J.Z.) Groenewald - Opening Plenary
Ewald (J.Z.) Groenewald - Opening Plenary
Consortium for the Barcode of Life (CBOL)496 views
Scientists find way toScientists find way to
Scientists find way to
rosbincomputerlit90 views
FruitBreedomics general presentationFruitBreedomics general presentation
FruitBreedomics general presentation
fruitbreedomics502 views

Viewers also liked(15)

Dormitorio menino 1Dormitorio menino 1
Dormitorio menino 1
Maurício Aurvalle405 views
bloodhoundsbloodhounds
bloodhounds
kailers325 views
LeadershipLeadership
Leadership
Suva Rastafarian279 views
Prokochuk_Irina_font design_calligraphyProkochuk_Irina_font design_calligraphy
Prokochuk_Irina_font design_calligraphy
Ira Prokopchuk121 views
RefactoringRefactoring
Refactoring
Sway Wang266 views
Présentation mémoire KOUAO Ekra MathieuPrésentation mémoire KOUAO Ekra Mathieu
Présentation mémoire KOUAO Ekra Mathieu
Mathieu Ekra Kouao572 views
Formation ecommerceFormation ecommerce
Formation ecommerce
Jérôme Avner MAMAN905 views
New Jersey: Your Dream Home DestinationNew Jersey: Your Dream Home Destination
New Jersey: Your Dream Home Destination
Donnelly Real Estate282 views
Israel Powerpoint CountryIsrael Powerpoint Country
Israel Powerpoint Country
Andrew Schwartz9.1K views

Similar to Context is Everything: Integrating Genomics, Epidemiological and Clinical Data Using GenEpiO(20)

Deep phenotyping for everyoneDeep phenotyping for everyone
Deep phenotyping for everyone
mhaendel2.8K views
Biocuration gen epio_posterBiocuration gen epio_poster
Biocuration gen epio_poster
IRIDA_community262 views
Teresa Coque  Hospital Universitario Ramón y Cajal. Teresa Coque  Hospital Universitario Ramón y Cajal.
Teresa Coque Hospital Universitario Ramón y Cajal.
Fundación Ramón Areces 628 views
apcp Deeksha  Bhartiyaapcp Deeksha  Bhartiya
apcp Deeksha Bhartiya
Ravi Mehrotra MD,PhD, FRCPath, FAMS163 views
UNC Water and Health Conference 2011: Professor Glenn Morris, University of F...UNC Water and Health Conference 2011: Professor Glenn Morris, University of F...
UNC Water and Health Conference 2011: Professor Glenn Morris, University of F...
SHARE (Sanitation and Hygiene Applied Research for Equity)468 views

Recently uploaded(20)

Chromatography ppt.pptxChromatography ppt.pptx
Chromatography ppt.pptx
varshachandgudesvpm14 views
plasmidsplasmids
plasmids
scribddarkened3527 views
Isolating mechanism.pptxIsolating mechanism.pptx
Isolating mechanism.pptx
JagadishaTV23 views
Radioactivity.pptxRadioactivity.pptx
Radioactivity.pptx
Rachana Choudhary5 views
domestic waste_100013.pptxdomestic waste_100013.pptx
domestic waste_100013.pptx
padmasriv2510 views
Water-bath Water-bath
Water-bath
zolajoneslabtronuk8 views
Pollination By Nagapradheesh.M.pptxPollination By Nagapradheesh.M.pptx
Pollination By Nagapradheesh.M.pptx
MNAGAPRADHEESH11 views
Batrachospermum.pptxBatrachospermum.pptx
Batrachospermum.pptx
nisarahmad63231614 views
miscellaneous compound.pdfmiscellaneous compound.pdf
miscellaneous compound.pdf
manjusha kareppa14 views
Matthias Beller ChemAI 231116.pptxMatthias Beller ChemAI 231116.pptx
Matthias Beller ChemAI 231116.pptx
Marco Tibaldi79 views
SANJAY HPLC.pptxSANJAY HPLC.pptx
SANJAY HPLC.pptx
sanjayudps2016115 views
  Class 2 (12 july).pdf  Class 2 (12 july).pdf
Class 2 (12 july).pdf
climber99778 views

Context is Everything: Integrating Genomics, Epidemiological and Clinical Data Using GenEpiO

  • 1. Context is Everything: Integrating Genomics, Epidemiological and Clinical Data Using GenEpiO Emma Griffiths Brinkman Lab Simon Fraser University, Burnaby, Canada On behalf of the GenEpiO Development Team Will Hsiao & Damion Dooley (BC Public Health Lab), Emma Griffiths & Fiona Brinkman (SFU) Sept 15, 2016 1
  • 2. Each year, one in eight Canadians (or 4 million people) get sick with a domestically acquired food-borne illness. http://www.phac-aspc.gc.ca/efwd-emoha/efbi-emoa-eng.php Slide courtesy of Will Hsiao
  • 3. Genomic sequences don’t mean much without contextual information. • Must be combined with contextual information to make sense of the data… FAST! Contextual info = Lab, Clinical, Epi Data What is the pathogen? Where did the exposures occur? When were the exposures? Who is at risk? What virulence factors or drug resistance determinants are present? How severe is the disease? Why did the contamination happen? Bacterial Genome 1 Bacterial Genome 2 Bacterial Genome 3 Bacterial Genome 4 Bacterial Genome 5 Bacterial Genome 6 Bacterial Genome 7 Bacterial Genome 8 Bacterial Genome 9 OUTBREAK? Whole genome sequencing is increasingly used for diagnostics in foodborne outbreaks and surveillance. 3
  • 4. The of Contextual Information Isn’t STANDARDIZED •Free text, short hand, granularity, misspelling, paper format 4
  • 5. • Ontologies help resolve issues of taxonomy, granularity and specificity • Reduces time consuming manual processing and mining Leafy Greens “has_disposition” Transmission Vehicle (E.coli) Spinach Lettuce Endive “is_a” Lettuce TypeIcebergSpinacia oleracea Amaranthus hybridus S. Africa 5 Ontology, A Way of Structuring Information • Standardized, well-defined hierarchy of terms • Interconnected with logical relationships e.g. Endive “is_a” Lettuce type Integrates Epi Exposure data
  • 6. 6 GenEpiO: Genomic Epidemiology Application Ontology Standardize fields/terms used to describe genomics, lab, clinical and epidemiological data critical for food-borne pathogen outbreak investigations Provincial National Public Health Agency Public Health Agency (Hsiao, co-PI) (Van Domselaar, co-PI) Academic/Public Brinkman (PI) www.irida.ca
  • 7. GenEpiO: Combining Different Epi, Lab, Genomics and Clinical Data Fields Lab Analytics Genomics, PFGE Serotyping, Phage typing MLST, AMR Sample Metadata Isolation Source (Food, Host Body Product, Environmental), BioSample Epidemiology Investigation Exposures Clinical Data Patient demographics, Medical History, Comorbidities, Symptoms, Health Status Reporting Case/Investigation Status GenEpiO (Genomic Epidemiology Application Ontology) 7
  • 8. Current Ontology Development Focuses On 3 Key Areas Food Antimicrobial Resistance Disease Surveillance 8 (FoodON) (SurvO) (ARO) See draft version at https://github.com/GenEpiO/genepio/wiki
  • 9. Ontologies are commonly encoded using OWL (Web Ontology Language). • Markup language for sharing ontologies on the web • Machine and human-readable • OWL statements written in RDF • Expressed in XML syntax • Protégé (editor) • Ontology lookup services: 9
  • 10. Use ontology to identify common exposures, symptoms etc among genomics clusters Example: Automating Case Definition generation Correlate Genomics Salmonella Cluster A cases between 01 Mar 2014- 15 Mar 2014 with High-Risk Food Types  Chicken Nuggets and Geographical Location of Vancouver XXXXXXXXXXXXXX GenEpiO Will Help Integrate Genomics and Epidemiological Data. 10
  • 11. Genomic Epidemiology Ontology is like instrumentation for your contextual information… it needs maintenance and continual improvement To achieve consensus and uptake  International GenEpiO Consortium Join us! E-mail: IRIDA-mail@sfu.ca 11 (>60 members from 15 countries) GenEpiO facilitates data sharing!
  • 12. Context is everything in foodborne outbreak investigations. GenEpiO helps fit the data together. 12
  • 13. 13 Project Leaders Fiona Brinkman – SFU Will Hsiao – PHMRL Gary Van Domselaar – NML Simon Fraser University (SFU) Emma Griffiths Geoff Winsor Julie Shay Matthew Laird Bhav Dhillon McMaster University Andrew McArthur Daim Sardar European Food Safety Agency Leibana Criado Ernesto Vernazza Francesco Rizzi Valentina IRIDA-mail@sfu.ca National Microbiology Laboratory (NML) Franklin Bristow Aaron Petkau Thomas Matthews Josh Adam Adam Olsen Tara Lynch Shaun Tyler Philip Mabon Philip Au Celine Nadon Matthew Stuart-Edwards Morag Graham Chrystal Berry Lorelee Tschetter Eduardo Toboada Peter Kruczkiewicz Chad Laing Vic Gannon Matthew Whiteside Ross Duncan Steven Mutschall University of Lisbon Joᾶo Carriҫo European Bioinformatics Institute Melanie Courtot Helen Parkinson BC Public Laboratory and BC Centre for Disease Control (BCCDC) Damion Dooley Judy Isaac-Renton Patrick Tang Natalie Prystajecky Jennifer Gardy Linda Hoang Kim MacDonald Yin Chang Eleni Galanis Marsha Taylor Jennifer Law University of Maryland Lynn Schriml Canadian Food Inspection Agency (CFIA) Adam Koziol Burton Blais Catherine Carrillo Dalhousie University Rob Beiko Alex Keddy www.irida.ca

Editor's Notes

  1. So why do we care about foodborne diseases? Despite our high standard in food safety, each year 1 in eight Canadian get food poisoning, costing the economy $4 billion dollars. It is important to track the source and spread of the disease to prevent further sickness
  2. Consortium key to help develop consensus and wide adoption, international input and expertise
  3. “Person, place, time” Exposure, food items, geographical information, symptoms, onset of symptoms Created (manually in excel) on ad hoc basis per investigation Need to be shared between stakeholders, but data governance is an issue
  4. Create a smaller core (Lab, Epi exposure, and Food) ontology for line-list testing Create a consortium for group to take on different domains of Genomic Epidemiology Application Ontology Pursuing longer term funding for ontology