SlideShare a Scribd company logo
???
Monarch is supported generously by:
a NIH Office of the Director Grant #5R24OD011883 as well as by
NCI/Leidos #15X143, BD2K U54HG007990-S2 (Haussler) &
BD2K PA-15-144-U01 (Kesselman)
info@monarchinitiative.org @monarchinit
The Problem: Human genome is poorly annotated
A better understanding of human gene function and disease
mechanisms is critical for diagnosis, precision medicine, and
targeted therapies
The Approach: Monarch cross-species
G2P Integration Pipeline
Ontologies Data Standards Curation and
Data Modeling
Algorithms Tools
The Solution: Leverage all the species data
Solve the cross-species language divide
www.monarchinitiative.org/sources
Acknowledgements and Contact Info
Palmoplantar
hyperkeratosis
Thick hand skin
Ulcerated
paws
MONARCH TEAM MAINTAINS
MONARCH TEAM CONTRIBUTES
LEGEND
Data source Ontology
Bridging
Ontology
PHENOTYPESDISEASES
MODEL
ORGNISMHUMAN
Community Ontology Term Phenotype
ANATOMY
ClinVar
Coriell
CTD
Elem of Morph
Gene Reviews
GWAS
HPOA
OMIMdb
Orphanet
KEGG
AnimalQTLDB
FlyBase
IMPC
MGI
MPD
OMIA
RGD
WormBase
ZFIN
MeSH
MedGen
OMIM
HP
EFO
ORDO
VT
FBcv
ZP
WP
MP
MONDO
UPheno
MA
ZFA
UBERON
FBbt
WA
CL
EMAPA
MODEL
ORGNISM
HUMAN
PROBLEM
Phenotypic language differs by organism
and also by community, thus impeding
integration
SOLUTION SOLUTION
Monarch integrates the data sources
through bridging ontologies
PROBLEM
SOLUTION
PROBLEM
SOLUTION
SOLUTION
SOLUTION
SOLUTION
SOLUTION
The phenotypes are
associated with very
different aspects of the
genotype in each data
source.
The Challenge: Fragmented, heterogeneous G2P data
Mus
mgdmgd
mmrrcmmrrc
mgimgi
animalqtldbanimalqtldb
Homo
cgdcgd
clinvarclinvar
gwascataloggwascatalog
hpoahpoa
keggkegg
omimomim
orphanetorphanet
coriellcoriell
omiaomia
monarchmonarch-curated
Canis
Macaca
Panthera
Equus
Ovis
Danio
zfinzfin
Gallus
Sula
Vulpes
Anas
Coturnix
Peromyscus
Tragelaphus
other
>100
SPECIES
Bos
Sus
0%
40%
60%
80%
100%
Human
only
Human +
other
20%
The phenotypic consequences of
mutation for the human coding
genome are <20%; inclusion of
orthologs from other species boosts
this number to over 80%
We learn about different
phenotypes from different
species, and want to use
all this data
Improve data quality and interoperability
Evidence and provenance for G2P associations is
incomplete, not computable, and frequently conflated.
This hampers integration and pathogenicity
determination.
Disentangle these concepts, and model data
to make it computable.
PROBLEMS SOLUTIONS
https://mme.monarchinitiative.org
github.com/ga4gh/schemas
Diagnosing rare diseases requires
identifying similar patients and
models Monarch integrated
cross-species data available on pa-
tient matchmaker exchange.
Data models for modeling any bio-
logical database source expecially
G2P sources are highly heterogene-
ous.
Data are insufficiently described to
understand what they are or how
they were produced.
Monarch integrated cross-
species data available on
patient matchmaker exchange
Monarch is contributing GA4GH
Schemas to bridge the heterogeneous
G2P sources
HCLS provides a guide to indicate what
are the essential metadata, and how to
express it. Monarch was a key contributor
toward this community effort and is testing
the model for all sources in its corpus
Compute over diseases, phenotypes, modes
to diagnose diseases
PhenoGrid
http://www.sanger.ac.uk/science/tools/exomiser
http://patientarchive.org/
Exomiser
https://www.npmjs.com/package/phenogrid
Whole exome
Remove off-target
and common
variants
Variant score from allele
freq and pathogenicity
Phenotype score from phenotypic
similarity
PHIVE score to give final candidates
Mendelianfilters
Combine genotype and phenotype data for variant prioritization
Visualize phenotype profile comparisons
Between patients and...
- Other patients
- Known diseases
- Models
Embeddable 3rd party widget for data resources
PhenoTua / Noctua
Uniquely identify a model or disease
Check organism/genotype nomenclature
Choose terms from any phenotype ontology
Provide evidence
Edit collaboratively, group sharing
View in two modalities:
- Ontology smart spreadsheet
- Graphical Causal Networks
HPO Pubmed Browser
Curate causal networks between genes, genotypes,
phenotypes, diseases, using organism-agnostic
standardized owl models
http://create.monarchinitiative.org/
Check Annotation Sufficiency
Automated extraction of Human
Phenotype Ontology concepts from
free text clinical summaries.
Intuitive visualization of patient
phenotype profiles and diagnoses.
Immediate visual feed-back on
phenotype profiles using the
Monarch annotation sufficiency score.
Fine-grained patient sharing access control.
Encrypted patient sensitive data - yet with
the possibility of searching over this data.
Visualize and Browse Relationships
Finding literature relevant to a set of phenotypes
should be easy.
http://pubmed-browser.human-phenotype-ontology.org/
Zemojtel, T. et al. Effective diagnosis of genetic disease by computation-
al phenotype analysis of the disease-associated genome. Science Trans-
lational Medicine Vol. 6, Issue 252, pp. 252ra123 (11 diagnosed fami-
lies)
Pippucci, T. et al. A novel null homozygous mutation confirms CAC-
NA2D2 as a gene mutated in epileptic encephalopathy. PLoS One 8,
e82154 (2013). (1 diagnosed family)
Requena, T. et al. Identification of two novel mutations in FAM136A and
DTNA genes in autosomal-dominant familial Meniereʼs disease. Human
Molecular Genetics. 24, 1119–26 (2015). (2 diagnosed families)
Bone, W. et al. Computational evaluation of exome sequence data using
human and model organism phenotypes improves diagnostic efficiency.
Genetics in Medicine. In press (2015). doi:10.1038/gim.2015.137 (4
diagnosed families)
18PublishedDiagnoses
www.monarchinitiative.org
www.owlsim.org
Patient X
Disease Y Model Z
Make causal relationships computable:
Improve modeling of evidence and provenance
owlsim
http://brcaexchange.org/
Providence Evidence Claim
- Data (eg: images, sequences)
- Evidence codes
- Publications
- Statistical confidence (p-val, z-score)
- Summary figures
- Conclusions from previous studies
- Tacit knowledge of a domain expert
- types of assay/technique/study or
instances thereof
- agent(s) who produced evidence
- agent(s) who asserted the claim
- time and place
- materials (e.g. models systems,
reagents, instruments)
Process history
Key participants in process Outputs of process
http://tinyurl.com/brca-g2p
http://tinyurl.com/acmg-guidelines
- Causal relationships, hypothesized
relationships, coorelations etc.
Fuzzy matching between patients, phenotypes, and diseases
Problem: It is difficult to prioritize candidate genes for
diagnosis, or identifying model that best capitulates a disease
Compute similarity of phenotypic profiles
Graph-based semantic similarity
PROBLEM SOLUTION
Researchers donʼt know when their
phenotyping is sufficient to be useful
beyond their specialized community
Clinicians donʼt know when their phe-
notyping is sufficient for diagnosis
Compare patient or organism phenotypic
profile against all known diesases and
genotypes. Get feedback in real time.
http://tinyurl.com/phenotypesufficiency
https://monarchinitiative.org/page/services
patient
archive
? ? ? ? ?
patient
archive
PROBLEMS SOLUTIONS
Problems with identifier design and provision
result in link rot and content drift therefore com-
promising the flow and integrity of information.
Identifiers must resolve, and when referenced in
the same context must not collide. Prefixes play a
critical role in these two goals; however, due to
confusion and inconsistency about prefixes, a
single identifier can be referenced multiple differ-
ent ways: 12345, MGI:12345, MGI:MGI:12345,
MGI:MGI_12345, thus complicating determina-
tions of equivalence and data integration.
Moreover prefixes used in the same context can
conflict (eg. GEO).
Monarch is a key contributor to
identifier standards for big data
integration
10 Simple Rules for Design and
Provision of Life Science Database
Identifiers for the Web
Monarch is leading a community
effort to coordinate prefixes
between the eight active prefix
registries
JDDCP
prefix commons
zenodo.org/record/31765
github.com/prefixcommons
health care &
life sciences
w3.org/TR/hcls-dataset/
MENDELIAN DISEASES
3,462
OMIM ?
47,964
VARIANTS
CLINVAR
with no known genetic basis with no known diseases
1 Oregon Health & Sciences University; Portland, OR • 2 Lawrence Berkeley National Lab, Berkeley, CA • 3 University of Pittsburgh, Pittsburgh, PA • 4 University of California San Diego, San Diego, CA • 5 Garvan Institute, Sydney, Australia • 6 Sanger Center, Hinxton, UK • 7 Charite
From Model Mechanism to Precision Medicine:
an Open Science Integrated Genotype-Phenotype Platform
Nicole Vasilevsky1, Nicole Washington2, Chuck Borromeo3, Matthew Brush1, Seth Carbon2, Michael Davis3, Nathan Dunn2, Mark Englestad1, Jeremy Espino3, Shahim Essaid1, Jeffrey Grethe4, Tudor Groza5, Harry Hochheiser3, Sebastian Köhler6, Suzanna Lewis2,
Julie McMurry1, Craig McNamara5, Chris Mungall2, Jeremy Nguyen Xuan2, Peter Robinson7, Kent Shefchek1, Damian Smedley6, Zhou Yuan3, Edwin Zhang5, Melissa Haendel1,
Human Disease:
HADZISELIMOVIC
SYNDROME
mouse model:
b2b1035Clo
(aka Blue Meanie)
tricuspid
valve atresia
MP:0006123
prenatal growth
retardation
MP:0010865
persistent truncus
arteriosis
MP:0002633
cleft palate
MP:0000111
1
Ventricular
hypertrophy
HP:0001714
High-arched
palate
HP:0000156
Failure to thrive
HP:0001508
Pulmonary
artery atresia
HP:0004935
Renal
hypoplasia
HP:0000089
abnormal
kidney
morphology
abnormal
palate
morphology
growth
deficiency
Malformation
of the heart
and great
vessels
abnormal
heart and
great artery
attachment
duplex kidney
MP:0004017
common
(UPheno)

More Related Content

What's hot

On the frontier of genotype-2-phenotype data integration
On the frontier of genotype-2-phenotype data integrationOn the frontier of genotype-2-phenotype data integration
On the frontier of genotype-2-phenotype data integration
mhaendel
 
GA4GH Phenotype Ontologies Task team update
GA4GH Phenotype Ontologies Task team updateGA4GH Phenotype Ontologies Task team update
GA4GH Phenotype Ontologies Task team update
mhaendel
 
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discoveryMaking the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Michel Dumontier
 
Deep phenotyping to aid identification of coding & non-coding rare disease v...
Deep phenotyping to aid identification  of coding & non-coding rare disease v...Deep phenotyping to aid identification  of coding & non-coding rare disease v...
Deep phenotyping to aid identification of coding & non-coding rare disease v...
mhaendel
 
The Monarch Initiative: An integrated genotype-phenotype platform for disease...
The Monarch Initiative: An integrated genotype-phenotype platform for disease...The Monarch Initiative: An integrated genotype-phenotype platform for disease...
The Monarch Initiative: An integrated genotype-phenotype platform for disease...
mhaendel
 
GA4GH Monarch Driver Project Introduction
GA4GH Monarch Driver Project IntroductionGA4GH Monarch Driver Project Introduction
GA4GH Monarch Driver Project Introduction
mhaendel
 
Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
mhaendel
 
Semantics for rare disease phenotyping, diagnostics, and discovery
Semantics for rare disease phenotyping, diagnostics, and discoverySemantics for rare disease phenotyping, diagnostics, and discovery
Semantics for rare disease phenotyping, diagnostics, and discovery
mhaendel
 
Use of semantic phenotyping to aid disease diagnosis
Use of semantic phenotyping to aid disease diagnosisUse of semantic phenotyping to aid disease diagnosis
Use of semantic phenotyping to aid disease diagnosis
mhaendel
 
Integrating clinical and model organism G2P data for disease discovery
Integrating clinical and model organism G2P data for disease discoveryIntegrating clinical and model organism G2P data for disease discovery
Integrating clinical and model organism G2P data for disease discovery
mhaendel
 
Data Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the EyeData Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the Eye
Nils Gehlenborg
 
Enhancing the Human Phenotype Ontology for Use by the Layperson
Enhancing the Human Phenotype Ontology for Use by the LaypersonEnhancing the Human Phenotype Ontology for Use by the Layperson
Enhancing the Human Phenotype Ontology for Use by the Layperson
Nicole Vasilevsky
 
Resazurin Cell Viability Assay
Resazurin Cell Viability AssayResazurin Cell Viability Assay
Resazurin Cell Viability Assay
creativebioarray22
 
Gardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in MedicineGardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in MedicineAmanda Natalizio
 
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and ChallengesSingle-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
inside-BigData.com
 
Cell authentication by str profile
Cell authentication by str profileCell authentication by str profile
Cell authentication by str profile
Bennie George
 
Guided visual exploration of patient stratifications in cancer genomics
Guided visual exploration of patient stratifications in cancer genomicsGuided visual exploration of patient stratifications in cancer genomics
Guided visual exploration of patient stratifications in cancer genomics
Nils Gehlenborg
 
Visual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationVisual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient Stratification
Nils Gehlenborg
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The Clinic
Joaquin Dopazo
 
From reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingFrom reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene finding
Joaquin Dopazo
 

What's hot (20)

On the frontier of genotype-2-phenotype data integration
On the frontier of genotype-2-phenotype data integrationOn the frontier of genotype-2-phenotype data integration
On the frontier of genotype-2-phenotype data integration
 
GA4GH Phenotype Ontologies Task team update
GA4GH Phenotype Ontologies Task team updateGA4GH Phenotype Ontologies Task team update
GA4GH Phenotype Ontologies Task team update
 
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discoveryMaking the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discovery
 
Deep phenotyping to aid identification of coding & non-coding rare disease v...
Deep phenotyping to aid identification  of coding & non-coding rare disease v...Deep phenotyping to aid identification  of coding & non-coding rare disease v...
Deep phenotyping to aid identification of coding & non-coding rare disease v...
 
The Monarch Initiative: An integrated genotype-phenotype platform for disease...
The Monarch Initiative: An integrated genotype-phenotype platform for disease...The Monarch Initiative: An integrated genotype-phenotype platform for disease...
The Monarch Initiative: An integrated genotype-phenotype platform for disease...
 
GA4GH Monarch Driver Project Introduction
GA4GH Monarch Driver Project IntroductionGA4GH Monarch Driver Project Introduction
GA4GH Monarch Driver Project Introduction
 
Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
Patient-led deep phenotyping using a lay-friendly version of the Human Phenot...
 
Semantics for rare disease phenotyping, diagnostics, and discovery
Semantics for rare disease phenotyping, diagnostics, and discoverySemantics for rare disease phenotyping, diagnostics, and discovery
Semantics for rare disease phenotyping, diagnostics, and discovery
 
Use of semantic phenotyping to aid disease diagnosis
Use of semantic phenotyping to aid disease diagnosisUse of semantic phenotyping to aid disease diagnosis
Use of semantic phenotyping to aid disease diagnosis
 
Integrating clinical and model organism G2P data for disease discovery
Integrating clinical and model organism G2P data for disease discoveryIntegrating clinical and model organism G2P data for disease discovery
Integrating clinical and model organism G2P data for disease discovery
 
Data Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the EyeData Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the Eye
 
Enhancing the Human Phenotype Ontology for Use by the Layperson
Enhancing the Human Phenotype Ontology for Use by the LaypersonEnhancing the Human Phenotype Ontology for Use by the Layperson
Enhancing the Human Phenotype Ontology for Use by the Layperson
 
Resazurin Cell Viability Assay
Resazurin Cell Viability AssayResazurin Cell Viability Assay
Resazurin Cell Viability Assay
 
Gardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in MedicineGardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in Medicine
 
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and ChallengesSingle-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
 
Cell authentication by str profile
Cell authentication by str profileCell authentication by str profile
Cell authentication by str profile
 
Guided visual exploration of patient stratifications in cancer genomics
Guided visual exploration of patient stratifications in cancer genomicsGuided visual exploration of patient stratifications in cancer genomics
Guided visual exploration of patient stratifications in cancer genomics
 
Visual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationVisual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient Stratification
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The Clinic
 
From reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingFrom reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene finding
 

Similar to The Monarch Initiative: From Model Organism to Precision Medicine

Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Jeremy Yang
 
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
Gabe Rudy
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision
 
Phenotypes and models portal at the rat genome database
Phenotypes and models portal at the rat genome databasePhenotypes and models portal at the rat genome database
Phenotypes and models portal at the rat genome databaseJennifer Smith
 
Update on the Druggable Proteome
Update on the Druggable ProteomeUpdate on the Druggable Proteome
Update on the Druggable Proteome
Chris Southan
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14mhaendel
 
provenance of microarray experiments
provenance of microarray experimentsprovenance of microarray experiments
provenance of microarray experimentsHelena Deus
 
Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
inscit2006
 
Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015
Chris Mungall
 
Friend harvard 2013-01-30
Friend harvard 2013-01-30Friend harvard 2013-01-30
Friend harvard 2013-01-30
Sage Base
 
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
Antoaneta Vladimirova
 
Using Public Access Clinical Databases to Interpret NGS Variants
Using Public Access Clinical Databases to Interpret NGS VariantsUsing Public Access Clinical Databases to Interpret NGS Variants
Using Public Access Clinical Databases to Interpret NGS Variants
Golden Helix Inc
 
Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Rick Silva
 
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
dkNET
 
Setting Normative Function: Identification of areas where Standards and Guide...
Setting Normative Function: Identification of areas where Standards and Guide...Setting Normative Function: Identification of areas where Standards and Guide...
Setting Normative Function: Identification of areas where Standards and Guide...
Human Variome Project
 
Identify Disease-Associated Genetic Variants Via 3D Genomics Structure and Re...
Identify Disease-Associated Genetic Variants Via 3D Genomics Structure and Re...Identify Disease-Associated Genetic Variants Via 3D Genomics Structure and Re...
Identify Disease-Associated Genetic Variants Via 3D Genomics Structure and Re...
Databricks
 
The Transforming Genetic Medicine Initiative (TGMI)
The Transforming Genetic Medicine Initiative (TGMI)The Transforming Genetic Medicine Initiative (TGMI)
The Transforming Genetic Medicine Initiative (TGMI)
Genome Reference Consortium
 

Similar to The Monarch Initiative: From Model Organism to Precision Medicine (20)

Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
 
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
Phenotypes and models portal at the rat genome database
Phenotypes and models portal at the rat genome databasePhenotypes and models portal at the rat genome database
Phenotypes and models portal at the rat genome database
 
Update on the Druggable Proteome
Update on the Druggable ProteomeUpdate on the Druggable Proteome
Update on the Druggable Proteome
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14
 
provenance of microarray experiments
provenance of microarray experimentsprovenance of microarray experiments
provenance of microarray experiments
 
Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
 
Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015
 
Friend harvard 2013-01-30
Friend harvard 2013-01-30Friend harvard 2013-01-30
Friend harvard 2013-01-30
 
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
 
Duzkale_2013_Variant Interpretation_
Duzkale_2013_Variant Interpretation_Duzkale_2013_Variant Interpretation_
Duzkale_2013_Variant Interpretation_
 
Using Public Access Clinical Databases to Interpret NGS Variants
Using Public Access Clinical Databases to Interpret NGS VariantsUsing Public Access Clinical Databases to Interpret NGS Variants
Using Public Access Clinical Databases to Interpret NGS Variants
 
Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...
 
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
 
Setting Normative Function: Identification of areas where Standards and Guide...
Setting Normative Function: Identification of areas where Standards and Guide...Setting Normative Function: Identification of areas where Standards and Guide...
Setting Normative Function: Identification of areas where Standards and Guide...
 
Identify Disease-Associated Genetic Variants Via 3D Genomics Structure and Re...
Identify Disease-Associated Genetic Variants Via 3D Genomics Structure and Re...Identify Disease-Associated Genetic Variants Via 3D Genomics Structure and Re...
Identify Disease-Associated Genetic Variants Via 3D Genomics Structure and Re...
 
03 Guerra, Rudy
03 Guerra, Rudy03 Guerra, Rudy
03 Guerra, Rudy
 
The Transforming Genetic Medicine Initiative (TGMI)
The Transforming Genetic Medicine Initiative (TGMI)The Transforming Genetic Medicine Initiative (TGMI)
The Transforming Genetic Medicine Initiative (TGMI)
 
MLGG_for_linkedIn
MLGG_for_linkedInMLGG_for_linkedIn
MLGG_for_linkedIn
 

More from mhaendel

The Software and Data Licensing Solution: Not Your Dad’s UBMTA
The Software and Data Licensing Solution: Not Your Dad’s UBMTA The Software and Data Licensing Solution: Not Your Dad’s UBMTA
The Software and Data Licensing Solution: Not Your Dad’s UBMTA
mhaendel
 
Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholder
mhaendel
 
Building (and traveling) the data-brick road: A report from the front lines ...
Building (and traveling) the data-brick road:  A report from the front lines ...Building (and traveling) the data-brick road:  A report from the front lines ...
Building (and traveling) the data-brick road: A report from the front lines ...
mhaendel
 
Reusable data for biomedicine: A data licensing odyssey
Reusable data for biomedicine:  A data licensing odysseyReusable data for biomedicine:  A data licensing odyssey
Reusable data for biomedicine: A data licensing odyssey
mhaendel
 
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease DiscoveryData Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
mhaendel
 
How open is open? An evaluation rubric for public knowledgebases
How open is open?  An evaluation rubric for public knowledgebasesHow open is open?  An evaluation rubric for public knowledgebases
How open is open? An evaluation rubric for public knowledgebases
mhaendel
 
Science in the open, what does it take?
Science in the open, what does it take?Science in the open, what does it take?
Science in the open, what does it take?
mhaendel
 
Credit where credit is due: acknowledging all types of contributions
Credit where credit is due: acknowledging all types of contributionsCredit where credit is due: acknowledging all types of contributions
Credit where credit is due: acknowledging all types of contributions
mhaendel
 
Getting (and giving) credit for all that we do
Getting (and giving) credit for all that we doGetting (and giving) credit for all that we do
Getting (and giving) credit for all that we do
mhaendel
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscape
mhaendel
 
Semantic phenotyping for disease diagnosis and discovery
Semantic phenotyping for disease diagnosis and discovery Semantic phenotyping for disease diagnosis and discovery
Semantic phenotyping for disease diagnosis and discovery
mhaendel
 
Dataset description using the W3C HCLS standard
Dataset description using the W3C HCLS standardDataset description using the W3C HCLS standard
Dataset description using the W3C HCLS standard
mhaendel
 
On the nature of Credit
On the nature of CreditOn the nature of Credit
On the nature of Credit
mhaendel
 
Standardizing scholarly output with the VIVO ontology
Standardizing scholarly output with the VIVO ontologyStandardizing scholarly output with the VIVO ontology
Standardizing scholarly output with the VIVO ontology
mhaendel
 

More from mhaendel (14)

The Software and Data Licensing Solution: Not Your Dad’s UBMTA
The Software and Data Licensing Solution: Not Your Dad’s UBMTA The Software and Data Licensing Solution: Not Your Dad’s UBMTA
The Software and Data Licensing Solution: Not Your Dad’s UBMTA
 
Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholder
 
Building (and traveling) the data-brick road: A report from the front lines ...
Building (and traveling) the data-brick road:  A report from the front lines ...Building (and traveling) the data-brick road:  A report from the front lines ...
Building (and traveling) the data-brick road: A report from the front lines ...
 
Reusable data for biomedicine: A data licensing odyssey
Reusable data for biomedicine:  A data licensing odysseyReusable data for biomedicine:  A data licensing odyssey
Reusable data for biomedicine: A data licensing odyssey
 
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease DiscoveryData Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
 
How open is open? An evaluation rubric for public knowledgebases
How open is open?  An evaluation rubric for public knowledgebasesHow open is open?  An evaluation rubric for public knowledgebases
How open is open? An evaluation rubric for public knowledgebases
 
Science in the open, what does it take?
Science in the open, what does it take?Science in the open, what does it take?
Science in the open, what does it take?
 
Credit where credit is due: acknowledging all types of contributions
Credit where credit is due: acknowledging all types of contributionsCredit where credit is due: acknowledging all types of contributions
Credit where credit is due: acknowledging all types of contributions
 
Getting (and giving) credit for all that we do
Getting (and giving) credit for all that we doGetting (and giving) credit for all that we do
Getting (and giving) credit for all that we do
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscape
 
Semantic phenotyping for disease diagnosis and discovery
Semantic phenotyping for disease diagnosis and discovery Semantic phenotyping for disease diagnosis and discovery
Semantic phenotyping for disease diagnosis and discovery
 
Dataset description using the W3C HCLS standard
Dataset description using the W3C HCLS standardDataset description using the W3C HCLS standard
Dataset description using the W3C HCLS standard
 
On the nature of Credit
On the nature of CreditOn the nature of Credit
On the nature of Credit
 
Standardizing scholarly output with the VIVO ontology
Standardizing scholarly output with the VIVO ontologyStandardizing scholarly output with the VIVO ontology
Standardizing scholarly output with the VIVO ontology
 

Recently uploaded

POST OPERATIVE OLIGURIA and its management
POST OPERATIVE OLIGURIA and its managementPOST OPERATIVE OLIGURIA and its management
POST OPERATIVE OLIGURIA and its management
touseefaziz1
 
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdfBENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
DR SETH JOTHAM
 
The Normal Electrocardiogram - Part I of II
The Normal Electrocardiogram - Part I of IIThe Normal Electrocardiogram - Part I of II
The Normal Electrocardiogram - Part I of II
MedicoseAcademics
 
Surgical Site Infections, pathophysiology, and prevention.pptx
Surgical Site Infections, pathophysiology, and prevention.pptxSurgical Site Infections, pathophysiology, and prevention.pptx
Surgical Site Infections, pathophysiology, and prevention.pptx
jval Landero
 
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
bkling
 
Couples presenting to the infertility clinic- Do they really have infertility...
Couples presenting to the infertility clinic- Do they really have infertility...Couples presenting to the infertility clinic- Do they really have infertility...
Couples presenting to the infertility clinic- Do they really have infertility...
Sujoy Dasgupta
 
Factory Supply Best Quality Pmk Oil CAS 28578–16–7 PMK Powder in Stock
Factory Supply Best Quality Pmk Oil CAS 28578–16–7 PMK Powder in StockFactory Supply Best Quality Pmk Oil CAS 28578–16–7 PMK Powder in Stock
Factory Supply Best Quality Pmk Oil CAS 28578–16–7 PMK Powder in Stock
rebeccabio
 
ARTIFICIAL INTELLIGENCE IN HEALTHCARE.pdf
ARTIFICIAL INTELLIGENCE IN  HEALTHCARE.pdfARTIFICIAL INTELLIGENCE IN  HEALTHCARE.pdf
ARTIFICIAL INTELLIGENCE IN HEALTHCARE.pdf
Anujkumaranit
 
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
i3 Health
 
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptxMaxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Dr. Rabia Inam Gandapore
 
KDIGO 2024 guidelines for diabetologists
KDIGO 2024 guidelines for diabetologistsKDIGO 2024 guidelines for diabetologists
KDIGO 2024 guidelines for diabetologists
د.محمود نجيب
 
heat stroke and heat exhaustion in children
heat stroke and heat exhaustion in childrenheat stroke and heat exhaustion in children
heat stroke and heat exhaustion in children
SumeraAhmad5
 
24 Upakrama.pptx class ppt useful in all
24 Upakrama.pptx class ppt useful in all24 Upakrama.pptx class ppt useful in all
24 Upakrama.pptx class ppt useful in all
DrSathishMS1
 
The hemodynamic and autonomic determinants of elevated blood pressure in obes...
The hemodynamic and autonomic determinants of elevated blood pressure in obes...The hemodynamic and autonomic determinants of elevated blood pressure in obes...
The hemodynamic and autonomic determinants of elevated blood pressure in obes...
Catherine Liao
 
Evaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animalsEvaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animals
Shweta
 
Cervical & Brachial Plexus By Dr. RIG.pptx
Cervical & Brachial Plexus By Dr. RIG.pptxCervical & Brachial Plexus By Dr. RIG.pptx
Cervical & Brachial Plexus By Dr. RIG.pptx
Dr. Rabia Inam Gandapore
 
ARTHROLOGY PPT NCISM SYLLABUS AYURVEDA STUDENTS
ARTHROLOGY PPT NCISM SYLLABUS AYURVEDA STUDENTSARTHROLOGY PPT NCISM SYLLABUS AYURVEDA STUDENTS
ARTHROLOGY PPT NCISM SYLLABUS AYURVEDA STUDENTS
Dr. Vinay Pareek
 
Physiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of TastePhysiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of Taste
MedicoseAcademics
 
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdfAlcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
Dr Jeenal Mistry
 
Non-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdfNon-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdf
MedicoseAcademics
 

Recently uploaded (20)

POST OPERATIVE OLIGURIA and its management
POST OPERATIVE OLIGURIA and its managementPOST OPERATIVE OLIGURIA and its management
POST OPERATIVE OLIGURIA and its management
 
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdfBENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
 
The Normal Electrocardiogram - Part I of II
The Normal Electrocardiogram - Part I of IIThe Normal Electrocardiogram - Part I of II
The Normal Electrocardiogram - Part I of II
 
Surgical Site Infections, pathophysiology, and prevention.pptx
Surgical Site Infections, pathophysiology, and prevention.pptxSurgical Site Infections, pathophysiology, and prevention.pptx
Surgical Site Infections, pathophysiology, and prevention.pptx
 
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
 
Couples presenting to the infertility clinic- Do they really have infertility...
Couples presenting to the infertility clinic- Do they really have infertility...Couples presenting to the infertility clinic- Do they really have infertility...
Couples presenting to the infertility clinic- Do they really have infertility...
 
Factory Supply Best Quality Pmk Oil CAS 28578–16–7 PMK Powder in Stock
Factory Supply Best Quality Pmk Oil CAS 28578–16–7 PMK Powder in StockFactory Supply Best Quality Pmk Oil CAS 28578–16–7 PMK Powder in Stock
Factory Supply Best Quality Pmk Oil CAS 28578–16–7 PMK Powder in Stock
 
ARTIFICIAL INTELLIGENCE IN HEALTHCARE.pdf
ARTIFICIAL INTELLIGENCE IN  HEALTHCARE.pdfARTIFICIAL INTELLIGENCE IN  HEALTHCARE.pdf
ARTIFICIAL INTELLIGENCE IN HEALTHCARE.pdf
 
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
 
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptxMaxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
 
KDIGO 2024 guidelines for diabetologists
KDIGO 2024 guidelines for diabetologistsKDIGO 2024 guidelines for diabetologists
KDIGO 2024 guidelines for diabetologists
 
heat stroke and heat exhaustion in children
heat stroke and heat exhaustion in childrenheat stroke and heat exhaustion in children
heat stroke and heat exhaustion in children
 
24 Upakrama.pptx class ppt useful in all
24 Upakrama.pptx class ppt useful in all24 Upakrama.pptx class ppt useful in all
24 Upakrama.pptx class ppt useful in all
 
The hemodynamic and autonomic determinants of elevated blood pressure in obes...
The hemodynamic and autonomic determinants of elevated blood pressure in obes...The hemodynamic and autonomic determinants of elevated blood pressure in obes...
The hemodynamic and autonomic determinants of elevated blood pressure in obes...
 
Evaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animalsEvaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animals
 
Cervical & Brachial Plexus By Dr. RIG.pptx
Cervical & Brachial Plexus By Dr. RIG.pptxCervical & Brachial Plexus By Dr. RIG.pptx
Cervical & Brachial Plexus By Dr. RIG.pptx
 
ARTHROLOGY PPT NCISM SYLLABUS AYURVEDA STUDENTS
ARTHROLOGY PPT NCISM SYLLABUS AYURVEDA STUDENTSARTHROLOGY PPT NCISM SYLLABUS AYURVEDA STUDENTS
ARTHROLOGY PPT NCISM SYLLABUS AYURVEDA STUDENTS
 
Physiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of TastePhysiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of Taste
 
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdfAlcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
 
Non-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdfNon-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdf
 

The Monarch Initiative: From Model Organism to Precision Medicine

  • 1. ??? Monarch is supported generously by: a NIH Office of the Director Grant #5R24OD011883 as well as by NCI/Leidos #15X143, BD2K U54HG007990-S2 (Haussler) & BD2K PA-15-144-U01 (Kesselman) info@monarchinitiative.org @monarchinit The Problem: Human genome is poorly annotated A better understanding of human gene function and disease mechanisms is critical for diagnosis, precision medicine, and targeted therapies The Approach: Monarch cross-species G2P Integration Pipeline Ontologies Data Standards Curation and Data Modeling Algorithms Tools The Solution: Leverage all the species data Solve the cross-species language divide www.monarchinitiative.org/sources Acknowledgements and Contact Info Palmoplantar hyperkeratosis Thick hand skin Ulcerated paws MONARCH TEAM MAINTAINS MONARCH TEAM CONTRIBUTES LEGEND Data source Ontology Bridging Ontology PHENOTYPESDISEASES MODEL ORGNISMHUMAN Community Ontology Term Phenotype ANATOMY ClinVar Coriell CTD Elem of Morph Gene Reviews GWAS HPOA OMIMdb Orphanet KEGG AnimalQTLDB FlyBase IMPC MGI MPD OMIA RGD WormBase ZFIN MeSH MedGen OMIM HP EFO ORDO VT FBcv ZP WP MP MONDO UPheno MA ZFA UBERON FBbt WA CL EMAPA MODEL ORGNISM HUMAN PROBLEM Phenotypic language differs by organism and also by community, thus impeding integration SOLUTION SOLUTION Monarch integrates the data sources through bridging ontologies PROBLEM SOLUTION PROBLEM SOLUTION SOLUTION SOLUTION SOLUTION SOLUTION The phenotypes are associated with very different aspects of the genotype in each data source. The Challenge: Fragmented, heterogeneous G2P data Mus mgdmgd mmrrcmmrrc mgimgi animalqtldbanimalqtldb Homo cgdcgd clinvarclinvar gwascataloggwascatalog hpoahpoa keggkegg omimomim orphanetorphanet coriellcoriell omiaomia monarchmonarch-curated Canis Macaca Panthera Equus Ovis Danio zfinzfin Gallus Sula Vulpes Anas Coturnix Peromyscus Tragelaphus other >100 SPECIES Bos Sus 0% 40% 60% 80% 100% Human only Human + other 20% The phenotypic consequences of mutation for the human coding genome are <20%; inclusion of orthologs from other species boosts this number to over 80% We learn about different phenotypes from different species, and want to use all this data Improve data quality and interoperability Evidence and provenance for G2P associations is incomplete, not computable, and frequently conflated. This hampers integration and pathogenicity determination. Disentangle these concepts, and model data to make it computable. PROBLEMS SOLUTIONS https://mme.monarchinitiative.org github.com/ga4gh/schemas Diagnosing rare diseases requires identifying similar patients and models Monarch integrated cross-species data available on pa- tient matchmaker exchange. Data models for modeling any bio- logical database source expecially G2P sources are highly heterogene- ous. Data are insufficiently described to understand what they are or how they were produced. Monarch integrated cross- species data available on patient matchmaker exchange Monarch is contributing GA4GH Schemas to bridge the heterogeneous G2P sources HCLS provides a guide to indicate what are the essential metadata, and how to express it. Monarch was a key contributor toward this community effort and is testing the model for all sources in its corpus Compute over diseases, phenotypes, modes to diagnose diseases PhenoGrid http://www.sanger.ac.uk/science/tools/exomiser http://patientarchive.org/ Exomiser https://www.npmjs.com/package/phenogrid Whole exome Remove off-target and common variants Variant score from allele freq and pathogenicity Phenotype score from phenotypic similarity PHIVE score to give final candidates Mendelianfilters Combine genotype and phenotype data for variant prioritization Visualize phenotype profile comparisons Between patients and... - Other patients - Known diseases - Models Embeddable 3rd party widget for data resources PhenoTua / Noctua Uniquely identify a model or disease Check organism/genotype nomenclature Choose terms from any phenotype ontology Provide evidence Edit collaboratively, group sharing View in two modalities: - Ontology smart spreadsheet - Graphical Causal Networks HPO Pubmed Browser Curate causal networks between genes, genotypes, phenotypes, diseases, using organism-agnostic standardized owl models http://create.monarchinitiative.org/ Check Annotation Sufficiency Automated extraction of Human Phenotype Ontology concepts from free text clinical summaries. Intuitive visualization of patient phenotype profiles and diagnoses. Immediate visual feed-back on phenotype profiles using the Monarch annotation sufficiency score. Fine-grained patient sharing access control. Encrypted patient sensitive data - yet with the possibility of searching over this data. Visualize and Browse Relationships Finding literature relevant to a set of phenotypes should be easy. http://pubmed-browser.human-phenotype-ontology.org/ Zemojtel, T. et al. Effective diagnosis of genetic disease by computation- al phenotype analysis of the disease-associated genome. Science Trans- lational Medicine Vol. 6, Issue 252, pp. 252ra123 (11 diagnosed fami- lies) Pippucci, T. et al. A novel null homozygous mutation confirms CAC- NA2D2 as a gene mutated in epileptic encephalopathy. PLoS One 8, e82154 (2013). (1 diagnosed family) Requena, T. et al. Identification of two novel mutations in FAM136A and DTNA genes in autosomal-dominant familial Meniereʼs disease. Human Molecular Genetics. 24, 1119–26 (2015). (2 diagnosed families) Bone, W. et al. Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency. Genetics in Medicine. In press (2015). doi:10.1038/gim.2015.137 (4 diagnosed families) 18PublishedDiagnoses www.monarchinitiative.org www.owlsim.org Patient X Disease Y Model Z Make causal relationships computable: Improve modeling of evidence and provenance owlsim http://brcaexchange.org/ Providence Evidence Claim - Data (eg: images, sequences) - Evidence codes - Publications - Statistical confidence (p-val, z-score) - Summary figures - Conclusions from previous studies - Tacit knowledge of a domain expert - types of assay/technique/study or instances thereof - agent(s) who produced evidence - agent(s) who asserted the claim - time and place - materials (e.g. models systems, reagents, instruments) Process history Key participants in process Outputs of process http://tinyurl.com/brca-g2p http://tinyurl.com/acmg-guidelines - Causal relationships, hypothesized relationships, coorelations etc. Fuzzy matching between patients, phenotypes, and diseases Problem: It is difficult to prioritize candidate genes for diagnosis, or identifying model that best capitulates a disease Compute similarity of phenotypic profiles Graph-based semantic similarity PROBLEM SOLUTION Researchers donʼt know when their phenotyping is sufficient to be useful beyond their specialized community Clinicians donʼt know when their phe- notyping is sufficient for diagnosis Compare patient or organism phenotypic profile against all known diesases and genotypes. Get feedback in real time. http://tinyurl.com/phenotypesufficiency https://monarchinitiative.org/page/services patient archive ? ? ? ? ? patient archive PROBLEMS SOLUTIONS Problems with identifier design and provision result in link rot and content drift therefore com- promising the flow and integrity of information. Identifiers must resolve, and when referenced in the same context must not collide. Prefixes play a critical role in these two goals; however, due to confusion and inconsistency about prefixes, a single identifier can be referenced multiple differ- ent ways: 12345, MGI:12345, MGI:MGI:12345, MGI:MGI_12345, thus complicating determina- tions of equivalence and data integration. Moreover prefixes used in the same context can conflict (eg. GEO). Monarch is a key contributor to identifier standards for big data integration 10 Simple Rules for Design and Provision of Life Science Database Identifiers for the Web Monarch is leading a community effort to coordinate prefixes between the eight active prefix registries JDDCP prefix commons zenodo.org/record/31765 github.com/prefixcommons health care & life sciences w3.org/TR/hcls-dataset/ MENDELIAN DISEASES 3,462 OMIM ? 47,964 VARIANTS CLINVAR with no known genetic basis with no known diseases 1 Oregon Health & Sciences University; Portland, OR • 2 Lawrence Berkeley National Lab, Berkeley, CA • 3 University of Pittsburgh, Pittsburgh, PA • 4 University of California San Diego, San Diego, CA • 5 Garvan Institute, Sydney, Australia • 6 Sanger Center, Hinxton, UK • 7 Charite From Model Mechanism to Precision Medicine: an Open Science Integrated Genotype-Phenotype Platform Nicole Vasilevsky1, Nicole Washington2, Chuck Borromeo3, Matthew Brush1, Seth Carbon2, Michael Davis3, Nathan Dunn2, Mark Englestad1, Jeremy Espino3, Shahim Essaid1, Jeffrey Grethe4, Tudor Groza5, Harry Hochheiser3, Sebastian Köhler6, Suzanna Lewis2, Julie McMurry1, Craig McNamara5, Chris Mungall2, Jeremy Nguyen Xuan2, Peter Robinson7, Kent Shefchek1, Damian Smedley6, Zhou Yuan3, Edwin Zhang5, Melissa Haendel1, Human Disease: HADZISELIMOVIC SYNDROME mouse model: b2b1035Clo (aka Blue Meanie) tricuspid valve atresia MP:0006123 prenatal growth retardation MP:0010865 persistent truncus arteriosis MP:0002633 cleft palate MP:0000111 1 Ventricular hypertrophy HP:0001714 High-arched palate HP:0000156 Failure to thrive HP:0001508 Pulmonary artery atresia HP:0004935 Renal hypoplasia HP:0000089 abnormal kidney morphology abnormal palate morphology growth deficiency Malformation of the heart and great vessels abnormal heart and great artery attachment duplex kidney MP:0004017 common (UPheno)