SlideShare a Scribd company logo
1 of 74
Open Targets: integrating
genetics and genomics for
disease biology and
translational medicine
What is the Open Targets
Partnership?
How to navigate the
Open Targets Platform?
Aims
Are there other Open
Targets tools?
Where do I get
help?
Source: PhRMA adaptation based on Tufts CSDD & School of Medicine, and FDA
Lengthy, costly, low success rate, HIGH ATTRITION RATES
Drug discovery: some challenges
Public databases for drug discovery
• EMBL-EBI (European Bioinformatics Institute)
• Elsewhere
Fit everything together
• Time consuming
• Possible lack of resources or expertise
• …
I wish I did not have to go to all
those different places to get the
information I’m after.
I know. If we only had a one-stop
shop with as much data as possible,
plus new analyses and links to the
original source for my own
assessment.
A resource that is
comprehensive, trustworthy,
up-to-date, sustainable,
easy-to-use and free.
Open Targets is all you need!
Our Vision
A partnership to transform drug discovery
through the systematic identification and
prioritisation of targets
https://www.opentargets.org
2014 2016 2017 2018
Data
generation
Therapeutic
hypothesis
Public data
Data
integration
targetvalidation.org
Experimental projects Bioinformatic projects
Virtuous cycle in Open Targets
www.opentargets.org/projects
Concurrent
Open Targets generates data
www.opentargets.org/science/
• EMBL-EBI and Sanger Institute
• > 1,000 cancer cell lines + drug sensitivity data
• RNASeq, CRISPR screens
• Sanger Institute and GSK
• Genome wide knockouts in gut epithelium
• Organoids, metagenomics
• Alzheimer’s and Parkinson’s
• CRISPR screens, iPS cells
• Sanger Institute, Biogen, Gurdon
Open Targets integrates data*
• EMBL-EBI, Biogen, GSK
• Associations between targets and diseases
• Germline variants
• Somatic mutations
• Drug information
• RNA expression
• Animal models
• Text mining
* Publicly available resources
In addition to upcoming Open Targets experimental data
Open Targets Platform
https://www.targetvalidation.org
• Associations between targets and diseases…
• Ensembl Gene IDs e.g. ENSGXXXXXXXXXXX
• UniProt IDs e.g P15056
• HGNC names e.g. DMD
• Also non-coding RNA genes
Our targets  genes or proteins
• Modified version of Experimental Factor Ontology (EFO)
• Controlled vocabulary (Alzheimers versus Alzheimer’s)
• Hierarchy (relationships)
Our diseases
• Promotes consistency
• Increases the richness of annotation
• Allow for easier and automatic integration
Evidence for our T-D associations
https://docs.targetvalidation.org/data-sources/data-sources
Data sources grouped into data types
Genetic
Associations
Somatic
Mutations
Drugs
Affected
Pathways
Differentia
l RNA
expression
Animal
Models
Text
Mining
EVA
GWAS
Catalog
PheWA
S
Cancer Gene
Census
EVA
Expression Atlas PhenoDigm
Europe
PMC
G2P
How the data* flows
JSON
summary
document
Validator Association
score
calculation
Target Profile
Disease profile
* e.g. genetic variants from NHGRI-GWAS catalog
JSON summary document
* IDs (gene, disease, papers) + curation (e.g. manual) + evidence + source + stats for the score
JSON
Association score
Which targets have more
evidence for an association?
What is the relative weight of the
evidence for different targets?
Statistical integration, aggregation and scoring
Four-tier scoring framework
https://docs.targetvalidation.org/getting-started/scoring
A) per evidence (e.g. one SNP from a GWAS paper)
B) per data source (e.g. SNPs from the GWAS catalog)
C) per data type (e.g. Genetic associations)
D) overall
EVA
UniProt
Gene2Phenotype
GWAS catalog
Cancer Gene Census
EVA (somatic)
IntOGen
ChEMBL
Reactome
Expression Atlas
Europe PMC
PhenoDigm
Genetic associations
Somatic mutations
RNA expression
Animal models
Affected pathways
Text mining
Drugs
*1.0
*1.0
*1.0
*1.0
*1.0
*1.0
*1.0
*1.0
*1.0
*0.5
*0.2
*0.2
Association
S1 + S2/22
+ S3/32
+ S4/42
+ Si/i2
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
Genomics England
PhEWAS catalog
*1.0
*1.0
ΣH
ΣH
Calculated at 4
levels:
•Evidence
•Data source
•Data type
•Overall
Score: 0 to 1 (max)
weight factor
Aggregation with
(harmonic sum)
ΣH
Note: Each data set has
its own scoring and
ranking scheme
Aggregating data  harmonic sum
f = sample size (cases versus controls)
s = predicted functional consequence (VEP)
c = p value reported in the paper
Factors affecting the relative strength of an evidence
e.g. GWAS Catalog
S = f * s * c
f, relative occurrence of a target-disease evidence
s, strength of the effect described by the evidence
c, confidence of the observation for the target-disease evidence
https://docs.targetvalidation.org/getting-started/scoring
Ranking target-disease association
Association score: the overall score across all data types
• Based on the data sources
• Different weight applied:
genetic association = drugs = mutations = pathways > RNA expression > animal models = text mining
https://www.targetvalidation.org/
Demo 1: Disease centric workflow
What is the evidence for the
association between a target
and a disease?
Which targets are
associated with a disease?
Pages 7 - 30
In addition to T-D associations
• Everything you wanted to know about…
… but were afraid to ask.
Disease
profile page
Target profile
page
Target profile page*
Protein Drugs
Pathways
interactions
RNA and
protein
baseline
expression
Variants,
isoforms
and
genomic
context
Mouse
phenotypes
Bibliography
Description
Synonyms
Gene Ontology
Protein Structure
Protein Interactions
Similar
Targets
Expression
Atlas
Library/LINK
Extra, extra, extra!
Cancer hallmarks and cancer biomarkers
Gene tree
* e.g. http://www.targetvalidation.org/target/ENSG00000141510
Classification Drugs
Similar
diseases
Bibliography
Open Targets
Library/LINK
Disease profile page*
* e.g. http://www.targetvalidation.org/disease/Orphanet_262
Modes of data access
We have a list of 26 possible
targets for inflammatory
bowel disease?
https://tinyurl.com/batch-video
Demo 2: Batch search
Are these targets represented
in other diseases?
Which pathways are
represented in this set of
targets?
https://api.opentargets.io/v3/platform/public/evidence/filter?
target=ENSG00000141867&disease=EFO_0000565&datatype=expression_atla
s&size=100&format=json
REST API calls: some examples*
https://api.opentargets.io/v3/platform/public/search?q=EFO_0003767
https://api.opentargets.io/v3/platform/public/association/filter?
target=ENSG00000110324&direct=false&fields=is_direct&fields=disease.efo_info.la
bel&size=100
* blog.opentargets.org/tag/api/
https://api.opentargets.io/v3/platform/public/search?q=asthma
Breaking down the URLs
https://api.opentargets.io/v3/platform/ public/association/filter
?target=ENSG00000163914&size=10000&fields=target.id&fields=disease.id
Server
Endpoint parameters
Parameters
https://api.opentargets.io/v3/platform/public/association/filter?
target=ENSG00000163914&size=10000&fields=target.id&fields=disease.id
http://api.opentargets.io/v3/platform/docs#
The documentation
Private: methods used by the UI to serve external data.
Subject to change without notice
http://api.opentargets.io/v3/platform/docs#
How to search
http://api.opentargets.io/v3/platform/docs
REST API: some use cases
How to get all diseases
associated with a target
How to get the association score
for a target – disease pair
How to get the evidence for a
target – disease association
How to search
https://api.opentargets.io/v3/platform/public/search?q=PTEN
http://api.opentargets.io/v3/platform
/public/association/filter?
target=ENSG00000171862
&direct=true
How to get all diseases associated with a target
http://api.opentargets.io/v3/platform/
public/association?id=
ENSG00000171862-EFO_0000616
How to get the score for a target – disease pair
How to get the evidence for an association
http://api.opentargets.io/v3/platform/
public/evidence/filter
?target=ENSG00000171862
http://api.opentargets.io/v3/platform
/public/evidence/filter?
target=ENSG00000171862
&disease=Orphanet_2563
Introduction: REST API webinar
https://youtu.be/KQbfhwpeEvc
How to run our REST endpoints
* http://opentargets.readthedocs.io/en/stable/index.html
• Paste the URL in the location bar in a browser
• Use the terminal window (e.g. with CURL command)
• Use our free clients (i.e. Python* and R**)
• Call them from your own application/workflow
** no longer supported by Open Targets; feel free to make PRs
Paste the URL in the a location bar
Command line e.g. CURL –X GET
Python and R clients for the REST API
http://opentargets.readthedocs.iohttp://opentargets.readthedocs.io
No longer supported by
Open Targets
No longer supported by
Open Targets
Can we change the way the
associations are scored? Perhaps
to increase the weight on text
mining data?
Yes, you can with the Open Targets Python client!
Data downloads
Open Targets toolkit: LINK
• LINK: Literature coNcept Knowledgebase
• Subject / predicate / object structured relations
From PubMed abstracts
Proof of Concept
Further developement
http://link.opentargets.io/
Addressing text mining shortcomings
• Entities: genes, diseases, drugs
• Concepts extracted via NLP
(Natural Language Processing)
• 28 M documents, 500 M relations
• http://blog.opentargets.org/link/
Open Targets toolkit: DoRothEA
• Candidate TF-drug interactions in cancer
• 1000 cancer cell lines
• 265 anti-cancer compounds
• 127 transcription factors
http://cancerres.aacrjournals.org/content/early/2017/12/09/0008-5472.CAN-17-1679
dorothea.opentargets.io
Example: Rapamycin
• ~ 1000 cancer cell lines
• 265 anti-cancer compounds
• 127 transcription factors
• Resource of integrated multiomics data
• Added value (e.g. score) and links to original sources
• Graphical web interface: easy to use
April 2018 release
Open Targets Platform
21K
targets
9.7K
diseases
2.3 M
associations
6.1 M
evidence
Data
generation
Therapeutic
hypothesis
Public data
Data
integration
targetvalidation.org
Experimental projects Bioinformatic projects
Virtuous cycle in Open Targets
www.opentargets.org/projects
Concurrent
We support decision-making
Which targets are
associated with a
disease?
Are there FDA drugs
for this association?
…
Can I find out about the
mechanisms of the
disease?
How to access the Platform
Core bioinformatics
pipelines
www.opentargets.org/projects
Experimental
projects
Generate new evidence
CRISPR/Cas9
Organoids and IPS cells
(cellular models for disease)
Integration of available data
Web interface
Batch search tool
REST API
Data dumps
Main data store
Elasticsearch
Angular JS
Web App*
Public
Access
REST
API**
* UI: first released in December 2015
** API first release in April 2016
https://www.targetvalidation.org
https://api.opentargets.io
Our breakthrough paper
http://nar.oxfordjournals.org/content/45/D1/D985.long
http://www.narbreakthrough.com/
blog.opentargets.org/
@targetvalidate
support@targetvalidation.org
http://tinyurl.com/opentargets-in
Help!
https://tinyurl.com/opentargets-youtube
https://docs.targetvalidation.org/
Acknowledgements
Data sources: GWAS catalog
• Genome Wide Association Studies
• Array-based chips  genotyping 100,000 SNPs genomewide
Details on data sources to associate
targets and diseases
Extra slides
Data sources: UniProt
• Protein: sequence, annotation, function
• Manual curation of coding variants in patients
EMBL-EBI train online
• Variants, genes, phenotypes in rare diseases
• Literature curation  consultant clinical geneticists in the UK
Data sources: Gene2Phenotype
Data sources: UniProt
• Protein: sequence, annotation, function
• Manual curation of coding variants in patients
EMBL-EBI train online
Data sources: PheWAS
• Phenome Wide Association Studies
• A variant associated with multiple phenotypes
• Clinical phenotypes derived from EMR-linked biobank BioVU
• ICD9 codes mapped to EFO
Data sources: GE PanelApp
• Aid clinical interpretation of genomes for the 100K project
• We include ‘green genes’ from version 1+ and phenotypes
Data sources: EVA
• With ClinVar information for rare diseases
• Clinical significance: pathogenic, protective
EMBL-EBI train online
Data sources: The Cancer Gene Census
• Genes with mutations causally implicated in cancer
• Gene associated with a cancer plus other cancers associated
with that gene
Data sources: IntOGen
• Genes and somatic (driver) mutations, 28 cancer types
• Involvement in cancer biology
• Rubio-Perez et al. 2015
Data sources: ChEMBL
• Known drugs linked to a disease and a known target
• FDA approved for clinical trials or marketing
EMBL-EBI train online
Data sources: Reactome
• Biochemical reactions and pathways
• Manual curation of pathways affected by mutations
EMBL-EBI train online
Data sources: SLAPenrich
• 374 pathways curated and mapped to cancer hallmarks
• Divergence of the total number of cancer samples with
genomic alterations
• Mutational burden and total exonic block length of genes
Data sources: PROGENy
• Comparison of pathway activities between normal and primary
samples from The Cancer Genome Atlas
• Inferred from RNA-seq: 9,250 tumour and 741 normal samples
• EGFR, hypoxia, JAK.STAT, MAPK, NFkB, PI3K, TGFb, TNFa,
Trail, VEGF, and p53
Data sources: Expression Atlas
• Baseline expression for human genes
- target profile page
• Differential mRNA expression (healthy versus diseased):
- target-disease associations
EMBL-EBI train online
Data sources: Europe PMC
• Mining titles, abstracts, full text in research articles
• Target and disease co-occurrence in the same sentence
• Dictionary (not NLP)
EMBL-EBI train online
Data sources: PhenoDigm
• Semantic approach to associate mouse models with diseases
Aggregating scores across the data
• Using a mathematical function, the harmonic sum*
where S1,S2,...,Si are the individual sorted evidence scores in descending order
* PMID: 19107201, PMID: 20118918
• Advantages:
A) account for replication
B) deflate the effect of large amounts of data e.g. text mining
Target-Disease Association Score
EuropePMC
(Text Mining)
UniProt
(Manual Curation)
ChEMBL
(Manual Curation)
Overall
VERY simplified diagram

More Related Content

What's hot

ENCODE project: brief summary of main findings
ENCODE project: brief summary of main findingsENCODE project: brief summary of main findings
ENCODE project: brief summary of main findingsMaté Ongenaert
 
New insights into the human genome by ENCODE project
New insights into the human genome by ENCODE project New insights into the human genome by ENCODE project
New insights into the human genome by ENCODE project Senthil Natesan
 
P4 Medicine: A Vision For Your Molecular Health
P4 Medicine: A Vision For Your Molecular HealthP4 Medicine: A Vision For Your Molecular Health
P4 Medicine: A Vision For Your Molecular HealthSachin Rawat
 
The Present and Future of Personal Health Record and Artificial Intelligence ...
The Present and Future of Personal Health Record and Artificial Intelligence ...The Present and Future of Personal Health Record and Artificial Intelligence ...
The Present and Future of Personal Health Record and Artificial Intelligence ...Hyung Jin Choi
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionBiotech Online
 
Genetics & Genomic Testing
Genetics & Genomic Testing Genetics & Genomic Testing
Genetics & Genomic Testing CHC Connecticut
 
Human genome
Human genomeHuman genome
Human genomeDansfera
 
Overview of epigenetics and its role in disease
Overview of epigenetics and its role in diseaseOverview of epigenetics and its role in disease
Overview of epigenetics and its role in diseaseGarry D. Lasaga
 
Precision Medicine - The Future of Healthcare
Precision Medicine - The Future of HealthcarePrecision Medicine - The Future of Healthcare
Precision Medicine - The Future of HealthcareData Science Thailand
 
Personalized Medicine Overview
Personalized Medicine OverviewPersonalized Medicine Overview
Personalized Medicine Overviewahelicher
 
Applications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahuApplications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahuKAUSHAL SAHU
 
Artificial intelligence in health care
Artificial intelligence in health careArtificial intelligence in health care
Artificial intelligence in health careSonja Aits
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyMelanie Courtot
 
Human genome project [autosaved]
Human genome project [autosaved]Human genome project [autosaved]
Human genome project [autosaved]keerthi samuel
 
Prof. Dr. Vladimir Trajkovski - Epigenetics of ASD-10.05.2019
Prof. Dr. Vladimir Trajkovski - Epigenetics of ASD-10.05.2019Prof. Dr. Vladimir Trajkovski - Epigenetics of ASD-10.05.2019
Prof. Dr. Vladimir Trajkovski - Epigenetics of ASD-10.05.2019Vladimir Trajkovski
 
Brief Introduction of Protein-Protein Interactions (PPIs)
Brief Introduction of Protein-Protein Interactions (PPIs)Brief Introduction of Protein-Protein Interactions (PPIs)
Brief Introduction of Protein-Protein Interactions (PPIs)Creative Proteomics
 
Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...tuxette
 

What's hot (20)

ENCODE project: brief summary of main findings
ENCODE project: brief summary of main findingsENCODE project: brief summary of main findings
ENCODE project: brief summary of main findings
 
New insights into the human genome by ENCODE project
New insights into the human genome by ENCODE project New insights into the human genome by ENCODE project
New insights into the human genome by ENCODE project
 
P4 Medicine: A Vision For Your Molecular Health
P4 Medicine: A Vision For Your Molecular HealthP4 Medicine: A Vision For Your Molecular Health
P4 Medicine: A Vision For Your Molecular Health
 
The Present and Future of Personal Health Record and Artificial Intelligence ...
The Present and Future of Personal Health Record and Artificial Intelligence ...The Present and Future of Personal Health Record and Artificial Intelligence ...
The Present and Future of Personal Health Record and Artificial Intelligence ...
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Genetics & Genomic Testing
Genetics & Genomic Testing Genetics & Genomic Testing
Genetics & Genomic Testing
 
Human genome
Human genomeHuman genome
Human genome
 
Overview of epigenetics and its role in disease
Overview of epigenetics and its role in diseaseOverview of epigenetics and its role in disease
Overview of epigenetics and its role in disease
 
Precision Medicine - The Future of Healthcare
Precision Medicine - The Future of HealthcarePrecision Medicine - The Future of Healthcare
Precision Medicine - The Future of Healthcare
 
Personalized Medicine Overview
Personalized Medicine OverviewPersonalized Medicine Overview
Personalized Medicine Overview
 
Applications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahuApplications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahu
 
Artificial intelligence in health care
Artificial intelligence in health careArtificial intelligence in health care
Artificial intelligence in health care
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontology
 
Human genome project [autosaved]
Human genome project [autosaved]Human genome project [autosaved]
Human genome project [autosaved]
 
Prof. Dr. Vladimir Trajkovski - Epigenetics of ASD-10.05.2019
Prof. Dr. Vladimir Trajkovski - Epigenetics of ASD-10.05.2019Prof. Dr. Vladimir Trajkovski - Epigenetics of ASD-10.05.2019
Prof. Dr. Vladimir Trajkovski - Epigenetics of ASD-10.05.2019
 
Brief Introduction of Protein-Protein Interactions (PPIs)
Brief Introduction of Protein-Protein Interactions (PPIs)Brief Introduction of Protein-Protein Interactions (PPIs)
Brief Introduction of Protein-Protein Interactions (PPIs)
 
Complex Diseases
Complex DiseasesComplex Diseases
Complex Diseases
 
Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...
 
Epigenetics diet and cancer
Epigenetics diet and cancerEpigenetics diet and cancer
Epigenetics diet and cancer
 

Similar to Open Targets, identifying targets for drug development in the treatment of diseases.

Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholdermhaendel
 
Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1BigData_Europe
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiChris Evelo
 
provenance of microarray experiments
provenance of microarray experimentsprovenance of microarray experiments
provenance of microarray experimentsHelena Deus
 
Text Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesText Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesDan Sullivan, Ph.D.
 
Wikidata for biomedical knowledge integration and curation
Wikidata for biomedical knowledge integration and curationWikidata for biomedical knowledge integration and curation
Wikidata for biomedical knowledge integration and curationGregory Stupp
 
Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Ann-Marie Roche
 
Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...Kees van Bochove
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europeopen_phacts
 
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Frederik van den Broek
 
NSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data WorkshopNSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data WorkshopChirag Patel
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biologyPranavathiyani G
 
Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Ben Gardner
 
Research Statement Chien-Wei Lin
Research Statement Chien-Wei LinResearch Statement Chien-Wei Lin
Research Statement Chien-Wei LinChien-Wei Lin
 
The Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision MedicineThe Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision Medicinemhaendel
 
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...Perficient, Inc.
 
Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...Maté Ongenaert
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...David Peyruc
 

Similar to Open Targets, identifying targets for drug development in the treatment of diseases. (20)

Open Targets workshop at C4X in 2019
Open Targets workshop at C4X in 2019Open Targets workshop at C4X in 2019
Open Targets workshop at C4X in 2019
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholder
 
Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs api
 
provenance of microarray experiments
provenance of microarray experimentsprovenance of microarray experiments
provenance of microarray experiments
 
Text Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesText Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious Diseases
 
Wikidata for biomedical knowledge integration and curation
Wikidata for biomedical knowledge integration and curationWikidata for biomedical knowledge integration and curation
Wikidata for biomedical knowledge integration and curation
 
Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1
 
Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
 
NSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data WorkshopNSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data Workshop
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biology
 
Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)
 
Research Statement Chien-Wei Lin
Research Statement Chien-Wei LinResearch Statement Chien-Wei Lin
Research Statement Chien-Wei Lin
 
The Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision MedicineThe Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision Medicine
 
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
 
Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
 

More from Denise Carvalho-Silva, PhD

EMBL-EBI at Plant and Animal Genome conference
EMBL-EBI at Plant and Animal Genome conference EMBL-EBI at Plant and Animal Genome conference
EMBL-EBI at Plant and Animal Genome conference Denise Carvalho-Silva, PhD
 
Genes and Transcripts: Ensembl Online Webinar series
Genes and Transcripts: Ensembl Online Webinar series Genes and Transcripts: Ensembl Online Webinar series
Genes and Transcripts: Ensembl Online Webinar series Denise Carvalho-Silva, PhD
 
Variation and the VEP: Ensembl Online Webinar series
Variation and the VEP: Ensembl Online Webinar seriesVariation and the VEP: Ensembl Online Webinar series
Variation and the VEP: Ensembl Online Webinar seriesDenise Carvalho-Silva, PhD
 
Browsing Genes, Variation and Regulation data with Ensembl
Browsing Genes, Variation and Regulation data with EnsemblBrowsing Genes, Variation and Regulation data with Ensembl
Browsing Genes, Variation and Regulation data with EnsemblDenise Carvalho-Silva, PhD
 

More from Denise Carvalho-Silva, PhD (6)

Open Targets talk at the BSPR 2019 meeting
Open Targets talk at the BSPR 2019 meetingOpen Targets talk at the BSPR 2019 meeting
Open Targets talk at the BSPR 2019 meeting
 
Ensembl Browser Workshop
Ensembl Browser WorkshopEnsembl Browser Workshop
Ensembl Browser Workshop
 
EMBL-EBI at Plant and Animal Genome conference
EMBL-EBI at Plant and Animal Genome conference EMBL-EBI at Plant and Animal Genome conference
EMBL-EBI at Plant and Animal Genome conference
 
Genes and Transcripts: Ensembl Online Webinar series
Genes and Transcripts: Ensembl Online Webinar series Genes and Transcripts: Ensembl Online Webinar series
Genes and Transcripts: Ensembl Online Webinar series
 
Variation and the VEP: Ensembl Online Webinar series
Variation and the VEP: Ensembl Online Webinar seriesVariation and the VEP: Ensembl Online Webinar series
Variation and the VEP: Ensembl Online Webinar series
 
Browsing Genes, Variation and Regulation data with Ensembl
Browsing Genes, Variation and Regulation data with EnsemblBrowsing Genes, Variation and Regulation data with Ensembl
Browsing Genes, Variation and Regulation data with Ensembl
 

Recently uploaded

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxjana861314
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 

Recently uploaded (20)

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 

Open Targets, identifying targets for drug development in the treatment of diseases.

  • 1. Open Targets: integrating genetics and genomics for disease biology and translational medicine
  • 2. What is the Open Targets Partnership? How to navigate the Open Targets Platform? Aims Are there other Open Targets tools? Where do I get help?
  • 3. Source: PhRMA adaptation based on Tufts CSDD & School of Medicine, and FDA Lengthy, costly, low success rate, HIGH ATTRITION RATES Drug discovery: some challenges
  • 4. Public databases for drug discovery • EMBL-EBI (European Bioinformatics Institute) • Elsewhere
  • 5. Fit everything together • Time consuming • Possible lack of resources or expertise • …
  • 6. I wish I did not have to go to all those different places to get the information I’m after. I know. If we only had a one-stop shop with as much data as possible, plus new analyses and links to the original source for my own assessment. A resource that is comprehensive, trustworthy, up-to-date, sustainable, easy-to-use and free. Open Targets is all you need!
  • 7. Our Vision A partnership to transform drug discovery through the systematic identification and prioritisation of targets https://www.opentargets.org 2014 2016 2017 2018
  • 8. Data generation Therapeutic hypothesis Public data Data integration targetvalidation.org Experimental projects Bioinformatic projects Virtuous cycle in Open Targets www.opentargets.org/projects Concurrent
  • 9. Open Targets generates data www.opentargets.org/science/ • EMBL-EBI and Sanger Institute • > 1,000 cancer cell lines + drug sensitivity data • RNASeq, CRISPR screens • Sanger Institute and GSK • Genome wide knockouts in gut epithelium • Organoids, metagenomics • Alzheimer’s and Parkinson’s • CRISPR screens, iPS cells • Sanger Institute, Biogen, Gurdon
  • 10. Open Targets integrates data* • EMBL-EBI, Biogen, GSK • Associations between targets and diseases • Germline variants • Somatic mutations • Drug information • RNA expression • Animal models • Text mining * Publicly available resources In addition to upcoming Open Targets experimental data
  • 11. Open Targets Platform https://www.targetvalidation.org • Associations between targets and diseases…
  • 12. • Ensembl Gene IDs e.g. ENSGXXXXXXXXXXX • UniProt IDs e.g P15056 • HGNC names e.g. DMD • Also non-coding RNA genes Our targets  genes or proteins
  • 13. • Modified version of Experimental Factor Ontology (EFO) • Controlled vocabulary (Alzheimers versus Alzheimer’s) • Hierarchy (relationships) Our diseases • Promotes consistency • Increases the richness of annotation • Allow for easier and automatic integration
  • 14. Evidence for our T-D associations https://docs.targetvalidation.org/data-sources/data-sources
  • 15. Data sources grouped into data types Genetic Associations Somatic Mutations Drugs Affected Pathways Differentia l RNA expression Animal Models Text Mining EVA GWAS Catalog PheWA S Cancer Gene Census EVA Expression Atlas PhenoDigm Europe PMC G2P
  • 16. How the data* flows JSON summary document Validator Association score calculation Target Profile Disease profile * e.g. genetic variants from NHGRI-GWAS catalog
  • 17. JSON summary document * IDs (gene, disease, papers) + curation (e.g. manual) + evidence + source + stats for the score JSON
  • 18. Association score Which targets have more evidence for an association? What is the relative weight of the evidence for different targets?
  • 19. Statistical integration, aggregation and scoring Four-tier scoring framework https://docs.targetvalidation.org/getting-started/scoring A) per evidence (e.g. one SNP from a GWAS paper) B) per data source (e.g. SNPs from the GWAS catalog) C) per data type (e.g. Genetic associations) D) overall
  • 20. EVA UniProt Gene2Phenotype GWAS catalog Cancer Gene Census EVA (somatic) IntOGen ChEMBL Reactome Expression Atlas Europe PMC PhenoDigm Genetic associations Somatic mutations RNA expression Animal models Affected pathways Text mining Drugs *1.0 *1.0 *1.0 *1.0 *1.0 *1.0 *1.0 *1.0 *1.0 *0.5 *0.2 *0.2 Association S1 + S2/22 + S3/32 + S4/42 + Si/i2 ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH Genomics England PhEWAS catalog *1.0 *1.0 ΣH ΣH Calculated at 4 levels: •Evidence •Data source •Data type •Overall Score: 0 to 1 (max) weight factor Aggregation with (harmonic sum) ΣH Note: Each data set has its own scoring and ranking scheme Aggregating data  harmonic sum
  • 21. f = sample size (cases versus controls) s = predicted functional consequence (VEP) c = p value reported in the paper Factors affecting the relative strength of an evidence e.g. GWAS Catalog S = f * s * c f, relative occurrence of a target-disease evidence s, strength of the effect described by the evidence c, confidence of the observation for the target-disease evidence https://docs.targetvalidation.org/getting-started/scoring
  • 22. Ranking target-disease association Association score: the overall score across all data types • Based on the data sources • Different weight applied: genetic association = drugs = mutations = pathways > RNA expression > animal models = text mining
  • 23. https://www.targetvalidation.org/ Demo 1: Disease centric workflow What is the evidence for the association between a target and a disease? Which targets are associated with a disease? Pages 7 - 30
  • 24. In addition to T-D associations • Everything you wanted to know about… … but were afraid to ask. Disease profile page Target profile page
  • 25. Target profile page* Protein Drugs Pathways interactions RNA and protein baseline expression Variants, isoforms and genomic context Mouse phenotypes Bibliography Description Synonyms Gene Ontology Protein Structure Protein Interactions Similar Targets Expression Atlas Library/LINK Extra, extra, extra! Cancer hallmarks and cancer biomarkers Gene tree * e.g. http://www.targetvalidation.org/target/ENSG00000141510
  • 26. Classification Drugs Similar diseases Bibliography Open Targets Library/LINK Disease profile page* * e.g. http://www.targetvalidation.org/disease/Orphanet_262
  • 27. Modes of data access
  • 28. We have a list of 26 possible targets for inflammatory bowel disease? https://tinyurl.com/batch-video Demo 2: Batch search Are these targets represented in other diseases? Which pathways are represented in this set of targets?
  • 29. https://api.opentargets.io/v3/platform/public/evidence/filter? target=ENSG00000141867&disease=EFO_0000565&datatype=expression_atla s&size=100&format=json REST API calls: some examples* https://api.opentargets.io/v3/platform/public/search?q=EFO_0003767 https://api.opentargets.io/v3/platform/public/association/filter? target=ENSG00000110324&direct=false&fields=is_direct&fields=disease.efo_info.la bel&size=100 * blog.opentargets.org/tag/api/ https://api.opentargets.io/v3/platform/public/search?q=asthma
  • 30. Breaking down the URLs https://api.opentargets.io/v3/platform/ public/association/filter ?target=ENSG00000163914&size=10000&fields=target.id&fields=disease.id Server Endpoint parameters Parameters https://api.opentargets.io/v3/platform/public/association/filter? target=ENSG00000163914&size=10000&fields=target.id&fields=disease.id
  • 31. http://api.opentargets.io/v3/platform/docs# The documentation Private: methods used by the UI to serve external data. Subject to change without notice http://api.opentargets.io/v3/platform/docs#
  • 32. How to search http://api.opentargets.io/v3/platform/docs REST API: some use cases How to get all diseases associated with a target How to get the association score for a target – disease pair How to get the evidence for a target – disease association
  • 36. How to get the evidence for an association http://api.opentargets.io/v3/platform/ public/evidence/filter ?target=ENSG00000171862 http://api.opentargets.io/v3/platform /public/evidence/filter? target=ENSG00000171862 &disease=Orphanet_2563
  • 37. Introduction: REST API webinar https://youtu.be/KQbfhwpeEvc
  • 38. How to run our REST endpoints * http://opentargets.readthedocs.io/en/stable/index.html • Paste the URL in the location bar in a browser • Use the terminal window (e.g. with CURL command) • Use our free clients (i.e. Python* and R**) • Call them from your own application/workflow ** no longer supported by Open Targets; feel free to make PRs
  • 39. Paste the URL in the a location bar
  • 40.
  • 41. Command line e.g. CURL –X GET
  • 42. Python and R clients for the REST API http://opentargets.readthedocs.iohttp://opentargets.readthedocs.io No longer supported by Open Targets No longer supported by Open Targets
  • 43. Can we change the way the associations are scored? Perhaps to increase the weight on text mining data? Yes, you can with the Open Targets Python client!
  • 45. Open Targets toolkit: LINK • LINK: Literature coNcept Knowledgebase • Subject / predicate / object structured relations From PubMed abstracts Proof of Concept Further developement http://link.opentargets.io/
  • 46. Addressing text mining shortcomings • Entities: genes, diseases, drugs • Concepts extracted via NLP (Natural Language Processing) • 28 M documents, 500 M relations • http://blog.opentargets.org/link/
  • 47. Open Targets toolkit: DoRothEA • Candidate TF-drug interactions in cancer • 1000 cancer cell lines • 265 anti-cancer compounds • 127 transcription factors http://cancerres.aacrjournals.org/content/early/2017/12/09/0008-5472.CAN-17-1679 dorothea.opentargets.io
  • 48. Example: Rapamycin • ~ 1000 cancer cell lines • 265 anti-cancer compounds • 127 transcription factors
  • 49. • Resource of integrated multiomics data • Added value (e.g. score) and links to original sources • Graphical web interface: easy to use April 2018 release Open Targets Platform 21K targets 9.7K diseases 2.3 M associations 6.1 M evidence
  • 50. Data generation Therapeutic hypothesis Public data Data integration targetvalidation.org Experimental projects Bioinformatic projects Virtuous cycle in Open Targets www.opentargets.org/projects Concurrent
  • 51. We support decision-making Which targets are associated with a disease? Are there FDA drugs for this association? … Can I find out about the mechanisms of the disease?
  • 52. How to access the Platform Core bioinformatics pipelines www.opentargets.org/projects Experimental projects Generate new evidence CRISPR/Cas9 Organoids and IPS cells (cellular models for disease) Integration of available data Web interface Batch search tool REST API Data dumps Main data store Elasticsearch Angular JS Web App* Public Access REST API** * UI: first released in December 2015 ** API first release in April 2016 https://www.targetvalidation.org https://api.opentargets.io
  • 56. Data sources: GWAS catalog • Genome Wide Association Studies • Array-based chips  genotyping 100,000 SNPs genomewide
  • 57. Details on data sources to associate targets and diseases Extra slides
  • 58. Data sources: UniProt • Protein: sequence, annotation, function • Manual curation of coding variants in patients EMBL-EBI train online
  • 59. • Variants, genes, phenotypes in rare diseases • Literature curation  consultant clinical geneticists in the UK Data sources: Gene2Phenotype
  • 60. Data sources: UniProt • Protein: sequence, annotation, function • Manual curation of coding variants in patients EMBL-EBI train online
  • 61. Data sources: PheWAS • Phenome Wide Association Studies • A variant associated with multiple phenotypes • Clinical phenotypes derived from EMR-linked biobank BioVU • ICD9 codes mapped to EFO
  • 62. Data sources: GE PanelApp • Aid clinical interpretation of genomes for the 100K project • We include ‘green genes’ from version 1+ and phenotypes
  • 63. Data sources: EVA • With ClinVar information for rare diseases • Clinical significance: pathogenic, protective EMBL-EBI train online
  • 64. Data sources: The Cancer Gene Census • Genes with mutations causally implicated in cancer • Gene associated with a cancer plus other cancers associated with that gene
  • 65. Data sources: IntOGen • Genes and somatic (driver) mutations, 28 cancer types • Involvement in cancer biology • Rubio-Perez et al. 2015
  • 66. Data sources: ChEMBL • Known drugs linked to a disease and a known target • FDA approved for clinical trials or marketing EMBL-EBI train online
  • 67. Data sources: Reactome • Biochemical reactions and pathways • Manual curation of pathways affected by mutations EMBL-EBI train online
  • 68. Data sources: SLAPenrich • 374 pathways curated and mapped to cancer hallmarks • Divergence of the total number of cancer samples with genomic alterations • Mutational burden and total exonic block length of genes
  • 69. Data sources: PROGENy • Comparison of pathway activities between normal and primary samples from The Cancer Genome Atlas • Inferred from RNA-seq: 9,250 tumour and 741 normal samples • EGFR, hypoxia, JAK.STAT, MAPK, NFkB, PI3K, TGFb, TNFa, Trail, VEGF, and p53
  • 70. Data sources: Expression Atlas • Baseline expression for human genes - target profile page • Differential mRNA expression (healthy versus diseased): - target-disease associations EMBL-EBI train online
  • 71. Data sources: Europe PMC • Mining titles, abstracts, full text in research articles • Target and disease co-occurrence in the same sentence • Dictionary (not NLP) EMBL-EBI train online
  • 72. Data sources: PhenoDigm • Semantic approach to associate mouse models with diseases
  • 73. Aggregating scores across the data • Using a mathematical function, the harmonic sum* where S1,S2,...,Si are the individual sorted evidence scores in descending order * PMID: 19107201, PMID: 20118918 • Advantages: A) account for replication B) deflate the effect of large amounts of data e.g. text mining
  • 74. Target-Disease Association Score EuropePMC (Text Mining) UniProt (Manual Curation) ChEMBL (Manual Curation) Overall VERY simplified diagram

Editor's Notes

  1. PhRMA ( Lengthy: Drug discovery from start (idea) to finish (market) can take up to 20 years. Costly: specially from clinical trials when human subjects are tested (volunteers or suffering from the condition) Compounds drop out along the drug discovery journey  low success, high attrition
  2. If you work in early stages of drug discovery, which databases or resources do you rely on? Some of the DBs out there.
  3. Fit everything to generate therapeutic hypothesis. Going through those resources takes time, my group may not have allocated resources (computing) or expertise (bioinformatics, ML, NLP), we are working on our own (in one specific lab) so it’s not multidisciplinary. Can workshop attendees list others?
  4. A shared vision to create a partnership to transform drug discovery through systematics target ID and prioritisation. From that idea Open Targets (formerly CTTV) was started to combine the world class functional genomics expertise at Sanger with the EBI‘s excellence in bioinformatics and computational biology and the industry expertise of GSK and now Biogen as well with the aim to find out all that‘s important about a target before starting the drug development pipeline. Thereby the pipeline would be filled with validated and valid targets with a much improved odds ratio for success.
  5. Target ID and prioritisation knowledge cycle. Our research focusses on data generation, data integration, and enabled data access Our research is connected as we provide the tools to generate hypothesis and explore them experimentally We focus primarily on oncology, immunology (including IBD) neurodegeneration with some experiments in other areas or across diseases
  6. The choice of our disease ontology was EFO. It is cross-referenced against DO, OMIM, HP, MP, MONDO as well
  7. Evidence: sequence (SNPs, mutations, germline and somatic curated at the DNA or protein level), differential mRNA expression, sentences from research papers, drug information, pathways affected by pathogenic mutation, animal models (https://github.com/opentargets/json_schema/blob/master/doc/instructions.md)
  8. Blueprint data included in baseline expression
  9. Highlight the REST
  10. http://api.opentargets.io/v3/platform/public/association?id=ENSG00000171862-EFO_0000616
  11. http://api.opentargets.io/v3/platform/public/association?id=ENSG00000171862-EFO_0000616 http://api.opentargets.io/v3/platform/public/association/filter?target=ENSG00000171862
  12. http://api.opentargets.io/v3/platform /public/evidece/filter? target=ENSG00000171862 &disease=Orphanet_2563 http://api.opentargets.io/v3/platform/public/association?id=ENSG00000171862-EFO_0000616 http://api.opentargets.io/v3/platform/public/association/filter?target=ENSG00000171862
  13. Target ID and prioritisation knowledge cycle. Our research focusses on data generation, data integration, and enabled data access Our research is connected as we provide the tools to generate hypothesis and explore them experimentally We focus primarily on oncology, immunology (including IBD) neurodegeneration with some experiments in other areas or across diseases
  14. Somatic mutations from cancer drivers genes across 28 tumor types. We have imported data (2014.12) from 6,792 samples spanning 28 cancer types (Rubio-Perez et al)."
  15. Linear regression model from 14 tumour types.
  16. Back up