SlideShare a Scribd company logo
Genomic CDS
An example of a complex ontology for
pharmacogenetics and clinical decision support
Matthias Samwald
Medical University of Vienna
W3C Semantic Web for Healthcare and Life Science Interest Group
Drug efficacy and toxicity can vary drastically between
patients with different genetic profiles
Up to 100,000 deaths and 2 million hospitalizations are caused by adverse drug
reactions per year in the United States alone.
Goal: Solving this problem by creating a barrier-free
system for storing and interpreting personal
pharmacogenomic information
The backend of this systems:
OWL 2 for pharmacogenomic knowledge
representation and clinical decision support
Simplified example for a single gene
Some variants are necessary; some variants and
variant combinations are necessary and sufficient for
calling an allele.
Of course, some of the genes in the ontology are far
larger than that
This is how it actually looks in the ontology
1 Class: 'human with CYP2C9*3'
2 EquivalentTo:
3 has some rs1057910_C
4 SubClassOf:
5 has some 'CYP2C9 *3',
6 (has some rs1057910_C) and
7 (has some rs1057911_A) and
8 (has some rs1799853_C) and
9 (has some rs2256871_A) and
10 (has some rs28371685_C) and
11 (has some rs72558188_AGAAATGGAA) and
12 (has some rs72558189_G) and
13 (has some rs9332239_C)
...
This is how it actually looks in the ontology
1 Class: 'human with CYP2C9*18'
2 EquivalentTo:
3 (has some rs1057910_C) and
4 (has some rs1057911_T) and
5 (has some rs72558193_C)
6 SubClassOf:
7 has some 'CYP2C9 *18',
8 (has some rs1057910_C) and
9 (has some rs1057911_T) and
10 (has some rs1799853_C) and
11 (has some rs2256871_A) and
12 (has some rs28371685_C) and
...
The trouble with some cheap genetic tests is that we
don‘t know on which strand each specifc observed
variant is located, we only know that they are on one
of the strands
In some cases, these test results are somewhat
ambiguous, which can lead to logical inconsistencies
Three alleles match, but only two are allowed! The
reasoner flags the ontology as inconsistent.
Class: 'human with GENE1*3'
SubClassOf: human
EquivalentTo:
(has some rs1_A) and
(has some rs2_G),
has some GENE1*3
Class: 'human with GENE1*4'
[…]
Allele definitions in OWL 2
Class: 'human with GENE1*3'
SubClassOf: human
EquivalentTo:
(has some (rs1_A that (taken_by some GENE1*3))) and
(has some (rs2_G that (taken_by some GENE1*3))),
has some GENE1*3
Class: 'human with GENE1*4'
[…]
ObjectProperty: 'taken by'
Characteristics: Functional
Class: GENE1
EquivalentTo:
GENE1_star_3
or GENE1_star_4
or GENE1_star_5
Experimental extension: telling the reasoner that each
variant can only be used to call a single allele
With these additions, the only possible combination is
inferred and the ontology becomes consistent
Inferred by HermiT, but not by TrOWL
See file genomic-cds-haplotype-resolution-demo.owl
Dosing guideline from an FDA drug label
Dosing guideline from an FDA drug label
1 Class: 'human triggering CDS rule 9'
2 Annotations:
3 CDS_message "0.5-2 mg warfarin per day should be considered
4 as a starting dose range for a patient with this genotype
5 according to the warfarin drug label."
6 EquivalentTo:
7 (has some 'CYP2C9 *1') and
8 (has some 'CYP2C9 *3') and
9 (has exactly 2 rs9923231_T)
Describing an individual patient in OWL
1 Individual: example_patient
2 Types:
3 human,
4 (has some rs1208_A) and (has some rs1208_G),
5 (has some rs8192709_C) and (has some rs8192709_T),
6 (has some rs9934438_A) and (has some rs9934438_G),
7 has exactly 2 rs10264272_C,
8 has exactly 2 rs9923231_T,
9 has exactly 2 rs12720461_C,
10 (has some ‘CYP2C9 *1’) and (has some ‘CYP2C9 *3’),
11 has exactly 2 ‘CYP2C19 *1’,
12 (has exactly 3 CYP2D6) and (has exactly 2 ‘CYP2D6 *1’)
13 and (has exactly 1 ‘CYP2D6 *2’)
...
heterozygous
SNP variants
homozygous
SNP variants
allelic variants
and CNV
Describing an individual patient in OWL
1 Individual: example_patient
2 Types:
3 human,
4 (has some rs1208_A) and (has some rs1208_G),
5 (has some rs8192709_C) and (has some rs8192709_T),
6 (has some rs9934438_A) and (has some rs9934438_G),
7 has exactly 2 rs10264272_C,
8 has exactly 2 rs9923231_T,
9 has exactly 2 rs12720461_C,
10 (has some ‘CYP2C9 *1’) and (has some ‘CYP2C9 *3’),
11 has exactly 2 ‘CYP2C19 *1’,
12 (has exactly 3 CYP2D6) and (has exactly 2 CYP2D6_star_1)
13 and (has exactly 1 CYP2D6_star_2)
...
heterozygous
SNP variants
homozygous
SNP variants
allelic variants
"0.5 - 2 mg warfarin per day
should be considered as a
starting dose range for a patient
with this genotype according to
the warfarin drug label."
OWL Reasoner
Definitions can become quite complex –
OWL reasoning helps identify inconsistencies and lacking definitions
Definitions can become quite complex –
OWL reasoning helps identify inconsistencies and lacking definitions
Qualified cardinality restriction (exactly 2 alleles, all of them decreased-activity)
Negation / 0 cardinality
OWL 2 DL reasoning (especially realization /
inferring types of patient individual) needs to
be fast!!
TrOWL is massively more performant than the HermiT
reasoner in classifying our demo ontology
These ‘demo’ ontologies also include the genetic profile of a single patient.
The ‘light’ version contains only necessary&sufficient variants for alleles, full ontology also contains
necessary variants for alleles
Ontologies have ALCQ expressivity.
Tested with reasoner plugins in Protégé 4.3, Running on an Amazon EC2 “High-Memory Extra
Large Instance” virtual machine, Microsoft Windows Server 2008, 17.1 GB of memory, 64-bit
platform, two virtual cores with 3.25 EC2 compute units each
HermiT 1.3.8 TrOWL 1.1 MORe JFact 0.1.5 MORe Hermit 0.1.5
genomic-cds-light-
demo.owl
(2150 classes,
9500 axioms)
3 h 48 m 1.5 s did not terminate
within 1 h
did not terminate
within 1 h
genomic-cds-demo.owl
(2300 classes,
11000 axioms)
detected
inconsistencies and
terminated without
making inferences
5.8 s did not terminate
within 1 h
did not terminate
within 1 h
(did not find an owl version, sorry)
Conclusions
Conclusions
• TrOWL performs best, but only provides partial results
• Need a better documentation of what inferences TrOWL can
do / can not do. Could it be enhanced to reliably cover this
use-case?
• Create reasoner optimized for ALCQ with qualified cardinality
restrictions (with cardinalities >1)?
• There seems to be opportunity for (automated, semi-
automated) modularization
• Reasoner should try to make inferences even when ‘local’
inconsistencies in a module are found (e.g., unresolvable
ambiguity for a specific gene)…
Conclusions
• ‘Annoying’ limitation inherent to OWL 2D DL: no
cardinality restrictions on transitive properties or
property paths). Would make modelling much
simpler. E.g.: there are two baskets in the room, one
basket contains exactly two eggs, the other contains
exactly four eggs. Does the room contain at least
three eggs?
Thanks
W3C collaborators:
Michel Dumontier (Carleton University)
Robert R. Freimuth (Mayo Clinic)
Richard Boyce (University of Pittsburgh)
Simon Lin (Marshfield Clinic)
Robert L. Powers (Predictive Medicine, Inc.)
Joanne S. Luciano (Rensselaer Polytechnic Institute)
Eric Prud’hommeaux (W3C)
M. Scott Marshall (MAASTRO Clinic)
Funding:
Austrian Science Fund (FWF): [PP 25608-N15]
http://www.genomic-cds.org/
http://safety-code.org/
You probably don‘t want to see them
Additional slides
Closing the world
Class: human
SubClassOf:
has exactly 2 rs1,
has exactly 2 rs2,
has exactly 2 rs3,
has exactly 2 GENE1
DisjointClasses:
rs1, rs2, rs3
DisjointClasses:
GENE1_star_1, GENE1_star_2,
GENE1_star_3, …

More Related Content

Viewers also liked

Executive Summary DM and Pharmacogenetics
Executive Summary DM and PharmacogeneticsExecutive Summary DM and Pharmacogenetics
Executive Summary DM and Pharmacogenetics
Armie Pacheco
 
clinica maternal happy baby
clinica maternal happy baby clinica maternal happy baby
clinica maternal happy baby
natinicole
 
Genetic Profiling Debate
Genetic Profiling DebateGenetic Profiling Debate
Genetic Profiling Debate
Shaza El Nemr
 
Pharmacogenetics
PharmacogeneticsPharmacogenetics
Pharmacogenetics
Dr. Almas A
 
Ecuaciones con mas de una funcion trigonometrica
Ecuaciones con mas de una funcion trigonometricaEcuaciones con mas de una funcion trigonometrica
Ecuaciones con mas de una funcion trigonometrica
Cipriano Arboleda
 

Viewers also liked (18)

Executive Summary DM and Pharmacogenetics
Executive Summary DM and PharmacogeneticsExecutive Summary DM and Pharmacogenetics
Executive Summary DM and Pharmacogenetics
 
Presentation1
Presentation1Presentation1
Presentation1
 
CPIC Guidelines
CPIC GuidelinesCPIC Guidelines
CPIC Guidelines
 
clinica maternal happy baby
clinica maternal happy baby clinica maternal happy baby
clinica maternal happy baby
 
Clinical application of cardiovascular pharmacogenetics
Clinical application of cardiovascular pharmacogeneticsClinical application of cardiovascular pharmacogenetics
Clinical application of cardiovascular pharmacogenetics
 
FINAL MCP - PROJECT VEGGIES
FINAL MCP - PROJECT VEGGIESFINAL MCP - PROJECT VEGGIES
FINAL MCP - PROJECT VEGGIES
 
Genetic Profiling Debate
Genetic Profiling DebateGenetic Profiling Debate
Genetic Profiling Debate
 
Dr. Chasman on Pharmacogenetics of Statin Therapies
Dr. Chasman on Pharmacogenetics of Statin TherapiesDr. Chasman on Pharmacogenetics of Statin Therapies
Dr. Chasman on Pharmacogenetics of Statin Therapies
 
WHMS PGx Presentation
WHMS PGx PresentationWHMS PGx Presentation
WHMS PGx Presentation
 
Pharmacogenetics
PharmacogeneticsPharmacogenetics
Pharmacogenetics
 
Pharmacogenetics of antipsychotic and antidepressent
Pharmacogenetics of antipsychotic and antidepressentPharmacogenetics of antipsychotic and antidepressent
Pharmacogenetics of antipsychotic and antidepressent
 
Nz pep lecture_jan2016
Nz pep lecture_jan2016Nz pep lecture_jan2016
Nz pep lecture_jan2016
 
Pharmacogenetics
PharmacogeneticsPharmacogenetics
Pharmacogenetics
 
Pharmacogenomics, Pharmacogenetics and Pharmacokinetics
Pharmacogenomics, Pharmacogenetics and Pharmacokinetics Pharmacogenomics, Pharmacogenetics and Pharmacokinetics
Pharmacogenomics, Pharmacogenetics and Pharmacokinetics
 
Ecuaciones con mas de una funcion trigonometrica
Ecuaciones con mas de una funcion trigonometricaEcuaciones con mas de una funcion trigonometrica
Ecuaciones con mas de una funcion trigonometrica
 
Pharmacogenetics & Teratogenicity
Pharmacogenetics & TeratogenicityPharmacogenetics & Teratogenicity
Pharmacogenetics & Teratogenicity
 
Pharmacogenetics devang
Pharmacogenetics devangPharmacogenetics devang
Pharmacogenetics devang
 
Snp
SnpSnp
Snp
 

Similar to Genomic CDS: an example of a complex ontology for pharmacogenetics and clinical decision support

Abdulla, ICBO2011, Composite annotation for heart development
Abdulla, ICBO2011, Composite annotation for heart developmentAbdulla, ICBO2011, Composite annotation for heart development
Abdulla, ICBO2011, Composite annotation for heart development
Tariq Abdulla
 
On the All or Half Law of Recombinant DNA, Lentivus Transduction and some oth...
On the All or Half Law of Recombinant DNA, Lentivus Transduction and some oth...On the All or Half Law of Recombinant DNA, Lentivus Transduction and some oth...
On the All or Half Law of Recombinant DNA, Lentivus Transduction and some oth...
Gang Zhang
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNA
Ulises Urzua
 
Knowledge extraction and visualisation using rule-based machine learning
Knowledge extraction and visualisation using rule-based machine learningKnowledge extraction and visualisation using rule-based machine learning
Knowledge extraction and visualisation using rule-based machine learning
jaumebp
 

Similar to Genomic CDS: an example of a complex ontology for pharmacogenetics and clinical decision support (20)

2016 Presentation at the University of Hawaii Cancer Center
2016 Presentation at the University of Hawaii Cancer Center2016 Presentation at the University of Hawaii Cancer Center
2016 Presentation at the University of Hawaii Cancer Center
 
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
 
Abdulla, ICBO2011, Composite annotation for heart development
Abdulla, ICBO2011, Composite annotation for heart developmentAbdulla, ICBO2011, Composite annotation for heart development
Abdulla, ICBO2011, Composite annotation for heart development
 
Clinical molecular diagnostics for drug guidance
Clinical molecular diagnostics for drug guidanceClinical molecular diagnostics for drug guidance
Clinical molecular diagnostics for drug guidance
 
Mason abrf single_cell_2017
Mason abrf single_cell_2017Mason abrf single_cell_2017
Mason abrf single_cell_2017
 
Samwald ore 2014
Samwald   ore 2014Samwald   ore 2014
Samwald ore 2014
 
Prediction of transcription factor binding to DNA using rule induction methods
Prediction of transcription factor binding to DNA using rule induction methodsPrediction of transcription factor binding to DNA using rule induction methods
Prediction of transcription factor binding to DNA using rule induction methods
 
Bioinformatics final
Bioinformatics finalBioinformatics final
Bioinformatics final
 
Image analysis; Spinocellular carcinoma; Melanoma; Basal cell carcinoma; Art...
 Image analysis; Spinocellular carcinoma; Melanoma; Basal cell carcinoma; Art... Image analysis; Spinocellular carcinoma; Melanoma; Basal cell carcinoma; Art...
Image analysis; Spinocellular carcinoma; Melanoma; Basal cell carcinoma; Art...
 
Cornell Pbsb 20090126 Nets
Cornell Pbsb 20090126 NetsCornell Pbsb 20090126 Nets
Cornell Pbsb 20090126 Nets
 
PAGOdA paper
PAGOdA paperPAGOdA paper
PAGOdA paper
 
2 2. hyung min chung ppt
2 2. hyung min chung ppt2 2. hyung min chung ppt
2 2. hyung min chung ppt
 
Reiter lecture 11.11.14
Reiter lecture 11.11.14Reiter lecture 11.11.14
Reiter lecture 11.11.14
 
Literature Mining and Systems Biology
Literature Mining and Systems BiologyLiterature Mining and Systems Biology
Literature Mining and Systems Biology
 
On the All or Half Law of Recombinant DNA, Lentivus Transduction and some oth...
On the All or Half Law of Recombinant DNA, Lentivus Transduction and some oth...On the All or Half Law of Recombinant DNA, Lentivus Transduction and some oth...
On the All or Half Law of Recombinant DNA, Lentivus Transduction and some oth...
 
Beiko dcsi2013
Beiko dcsi2013Beiko dcsi2013
Beiko dcsi2013
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNA
 
Knowledge extraction and visualisation using rule-based machine learning
Knowledge extraction and visualisation using rule-based machine learningKnowledge extraction and visualisation using rule-based machine learning
Knowledge extraction and visualisation using rule-based machine learning
 
Zoology Second Year Important Question | Exam Tips and Tricks
Zoology Second Year Important Question | Exam Tips and TricksZoology Second Year Important Question | Exam Tips and Tricks
Zoology Second Year Important Question | Exam Tips and Tricks
 
From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...
From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...
From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...
 

More from Matthias Samwald

Towards Transformative Artificial Intelligence in Life Science and Health Care
Towards Transformative Artificial Intelligence in Life Science and Health CareTowards Transformative Artificial Intelligence in Life Science and Health Care
Towards Transformative Artificial Intelligence in Life Science and Health Care
Matthias Samwald
 
Hcls call september 2013 clinical pharmacogenomics
Hcls call september 2013   clinical pharmacogenomicsHcls call september 2013   clinical pharmacogenomics
Hcls call september 2013 clinical pharmacogenomics
Matthias Samwald
 

More from Matthias Samwald (9)

Towards Transformative Artificial Intelligence in Life Science and Health Care
Towards Transformative Artificial Intelligence in Life Science and Health CareTowards Transformative Artificial Intelligence in Life Science and Health Care
Towards Transformative Artificial Intelligence in Life Science and Health Care
 
Bridging theory and practice: Clinical decision support systems for personali...
Bridging theory and practice: Clinical decision support systems for personali...Bridging theory and practice: Clinical decision support systems for personali...
Bridging theory and practice: Clinical decision support systems for personali...
 
CRISPR as a potential tool for malaria eradication
CRISPR as a potential tool for malaria eradicationCRISPR as a potential tool for malaria eradication
CRISPR as a potential tool for malaria eradication
 
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
 
VO Taxonomie und Ontologie (SS 2016)
VO Taxonomie und Ontologie (SS 2016)VO Taxonomie und Ontologie (SS 2016)
VO Taxonomie und Ontologie (SS 2016)
 
6 Worlds Collide
6 Worlds Collide6 Worlds Collide
6 Worlds Collide
 
The FindMeEvidence project: An open-source, mobile-friendly search engine for...
The FindMeEvidence project: An open-source, mobile-friendly search engine for...The FindMeEvidence project: An open-source, mobile-friendly search engine for...
The FindMeEvidence project: An open-source, mobile-friendly search engine for...
 
The Medication Safety Code initiative: Towards a global IT system for persona...
The Medication Safety Code initiative: Towards a global IT system for persona...The Medication Safety Code initiative: Towards a global IT system for persona...
The Medication Safety Code initiative: Towards a global IT system for persona...
 
Hcls call september 2013 clinical pharmacogenomics
Hcls call september 2013   clinical pharmacogenomicsHcls call september 2013   clinical pharmacogenomics
Hcls call september 2013 clinical pharmacogenomics
 

Recently uploaded

Recently uploaded (20)

JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 

Genomic CDS: an example of a complex ontology for pharmacogenetics and clinical decision support

  • 1. Genomic CDS An example of a complex ontology for pharmacogenetics and clinical decision support Matthias Samwald Medical University of Vienna W3C Semantic Web for Healthcare and Life Science Interest Group
  • 2. Drug efficacy and toxicity can vary drastically between patients with different genetic profiles Up to 100,000 deaths and 2 million hospitalizations are caused by adverse drug reactions per year in the United States alone.
  • 3. Goal: Solving this problem by creating a barrier-free system for storing and interpreting personal pharmacogenomic information
  • 4. The backend of this systems: OWL 2 for pharmacogenomic knowledge representation and clinical decision support
  • 5.
  • 6. Simplified example for a single gene
  • 7. Some variants are necessary; some variants and variant combinations are necessary and sufficient for calling an allele.
  • 8. Of course, some of the genes in the ontology are far larger than that
  • 9. This is how it actually looks in the ontology 1 Class: 'human with CYP2C9*3' 2 EquivalentTo: 3 has some rs1057910_C 4 SubClassOf: 5 has some 'CYP2C9 *3', 6 (has some rs1057910_C) and 7 (has some rs1057911_A) and 8 (has some rs1799853_C) and 9 (has some rs2256871_A) and 10 (has some rs28371685_C) and 11 (has some rs72558188_AGAAATGGAA) and 12 (has some rs72558189_G) and 13 (has some rs9332239_C) ...
  • 10. This is how it actually looks in the ontology 1 Class: 'human with CYP2C9*18' 2 EquivalentTo: 3 (has some rs1057910_C) and 4 (has some rs1057911_T) and 5 (has some rs72558193_C) 6 SubClassOf: 7 has some 'CYP2C9 *18', 8 (has some rs1057910_C) and 9 (has some rs1057911_T) and 10 (has some rs1799853_C) and 11 (has some rs2256871_A) and 12 (has some rs28371685_C) and ...
  • 11. The trouble with some cheap genetic tests is that we don‘t know on which strand each specifc observed variant is located, we only know that they are on one of the strands
  • 12.
  • 13.
  • 14. In some cases, these test results are somewhat ambiguous, which can lead to logical inconsistencies Three alleles match, but only two are allowed! The reasoner flags the ontology as inconsistent.
  • 15. Class: 'human with GENE1*3' SubClassOf: human EquivalentTo: (has some rs1_A) and (has some rs2_G), has some GENE1*3 Class: 'human with GENE1*4' […] Allele definitions in OWL 2
  • 16. Class: 'human with GENE1*3' SubClassOf: human EquivalentTo: (has some (rs1_A that (taken_by some GENE1*3))) and (has some (rs2_G that (taken_by some GENE1*3))), has some GENE1*3 Class: 'human with GENE1*4' […] ObjectProperty: 'taken by' Characteristics: Functional Class: GENE1 EquivalentTo: GENE1_star_3 or GENE1_star_4 or GENE1_star_5 Experimental extension: telling the reasoner that each variant can only be used to call a single allele
  • 17. With these additions, the only possible combination is inferred and the ontology becomes consistent Inferred by HermiT, but not by TrOWL See file genomic-cds-haplotype-resolution-demo.owl
  • 18. Dosing guideline from an FDA drug label
  • 19. Dosing guideline from an FDA drug label 1 Class: 'human triggering CDS rule 9' 2 Annotations: 3 CDS_message "0.5-2 mg warfarin per day should be considered 4 as a starting dose range for a patient with this genotype 5 according to the warfarin drug label." 6 EquivalentTo: 7 (has some 'CYP2C9 *1') and 8 (has some 'CYP2C9 *3') and 9 (has exactly 2 rs9923231_T)
  • 20. Describing an individual patient in OWL 1 Individual: example_patient 2 Types: 3 human, 4 (has some rs1208_A) and (has some rs1208_G), 5 (has some rs8192709_C) and (has some rs8192709_T), 6 (has some rs9934438_A) and (has some rs9934438_G), 7 has exactly 2 rs10264272_C, 8 has exactly 2 rs9923231_T, 9 has exactly 2 rs12720461_C, 10 (has some ‘CYP2C9 *1’) and (has some ‘CYP2C9 *3’), 11 has exactly 2 ‘CYP2C19 *1’, 12 (has exactly 3 CYP2D6) and (has exactly 2 ‘CYP2D6 *1’) 13 and (has exactly 1 ‘CYP2D6 *2’) ... heterozygous SNP variants homozygous SNP variants allelic variants and CNV
  • 21. Describing an individual patient in OWL 1 Individual: example_patient 2 Types: 3 human, 4 (has some rs1208_A) and (has some rs1208_G), 5 (has some rs8192709_C) and (has some rs8192709_T), 6 (has some rs9934438_A) and (has some rs9934438_G), 7 has exactly 2 rs10264272_C, 8 has exactly 2 rs9923231_T, 9 has exactly 2 rs12720461_C, 10 (has some ‘CYP2C9 *1’) and (has some ‘CYP2C9 *3’), 11 has exactly 2 ‘CYP2C19 *1’, 12 (has exactly 3 CYP2D6) and (has exactly 2 CYP2D6_star_1) 13 and (has exactly 1 CYP2D6_star_2) ... heterozygous SNP variants homozygous SNP variants allelic variants "0.5 - 2 mg warfarin per day should be considered as a starting dose range for a patient with this genotype according to the warfarin drug label." OWL Reasoner
  • 22. Definitions can become quite complex – OWL reasoning helps identify inconsistencies and lacking definitions
  • 23. Definitions can become quite complex – OWL reasoning helps identify inconsistencies and lacking definitions Qualified cardinality restriction (exactly 2 alleles, all of them decreased-activity) Negation / 0 cardinality
  • 24. OWL 2 DL reasoning (especially realization / inferring types of patient individual) needs to be fast!!
  • 25. TrOWL is massively more performant than the HermiT reasoner in classifying our demo ontology These ‘demo’ ontologies also include the genetic profile of a single patient. The ‘light’ version contains only necessary&sufficient variants for alleles, full ontology also contains necessary variants for alleles Ontologies have ALCQ expressivity. Tested with reasoner plugins in Protégé 4.3, Running on an Amazon EC2 “High-Memory Extra Large Instance” virtual machine, Microsoft Windows Server 2008, 17.1 GB of memory, 64-bit platform, two virtual cores with 3.25 EC2 compute units each HermiT 1.3.8 TrOWL 1.1 MORe JFact 0.1.5 MORe Hermit 0.1.5 genomic-cds-light- demo.owl (2150 classes, 9500 axioms) 3 h 48 m 1.5 s did not terminate within 1 h did not terminate within 1 h genomic-cds-demo.owl (2300 classes, 11000 axioms) detected inconsistencies and terminated without making inferences 5.8 s did not terminate within 1 h did not terminate within 1 h
  • 26. (did not find an owl version, sorry) Conclusions
  • 27. Conclusions • TrOWL performs best, but only provides partial results • Need a better documentation of what inferences TrOWL can do / can not do. Could it be enhanced to reliably cover this use-case? • Create reasoner optimized for ALCQ with qualified cardinality restrictions (with cardinalities >1)? • There seems to be opportunity for (automated, semi- automated) modularization • Reasoner should try to make inferences even when ‘local’ inconsistencies in a module are found (e.g., unresolvable ambiguity for a specific gene)…
  • 28. Conclusions • ‘Annoying’ limitation inherent to OWL 2D DL: no cardinality restrictions on transitive properties or property paths). Would make modelling much simpler. E.g.: there are two baskets in the room, one basket contains exactly two eggs, the other contains exactly four eggs. Does the room contain at least three eggs?
  • 29. Thanks W3C collaborators: Michel Dumontier (Carleton University) Robert R. Freimuth (Mayo Clinic) Richard Boyce (University of Pittsburgh) Simon Lin (Marshfield Clinic) Robert L. Powers (Predictive Medicine, Inc.) Joanne S. Luciano (Rensselaer Polytechnic Institute) Eric Prud’hommeaux (W3C) M. Scott Marshall (MAASTRO Clinic) Funding: Austrian Science Fund (FWF): [PP 25608-N15] http://www.genomic-cds.org/ http://safety-code.org/
  • 30. You probably don‘t want to see them Additional slides
  • 31. Closing the world Class: human SubClassOf: has exactly 2 rs1, has exactly 2 rs2, has exactly 2 rs3, has exactly 2 GENE1 DisjointClasses: rs1, rs2, rs3 DisjointClasses: GENE1_star_1, GENE1_star_2, GENE1_star_3, …