SlideShare a Scribd company logo
1 of 24
Download to read offline
© Copyright 2018 Galapagos NVCONFIDENTIAL
Unleash transcriptomics to gain insights in
disease mechanisms: integration in the
Galapagos knowledge platform “Hydra”
Maté Ongenaert, PhD
Senior Scientist Bioinformatics
László Kupcsik, Gabriele Sclauzero, Markus Wyss, Joeri Samson, Jan Van Der Schueren
August 30 2018
Leveraging Therapeutic Discovery with Curated Transcriptomic Data - Novartis Campus, Basel, Switzerland
2
Outline
Galapagos in a nutshell
Public / private data integration
Introduction – transcriptomics at Galapagos
StudyExplorer: cross-species / cross- platform /
cross-study integration and multi-level exploration
Hydra: knowledge platform
disease portal application
3
Galapagos
In a nutshell
• Galapagos is a clinical-stage biotechnology company, specialized in the
discovery and development of small molecule medicines with novel modes
of action
• “Flagship” program: filgotinib (JAK-1 inhibitor) in inflammation (Phase 3
studies in Rheumatoid arthritis, Crohn’s disease and ulcerative colitis)
• Advanced programs in lung fibrosis, cystic fibrosis, osteoarthritis and
atopic dermatitis
4
Transcriptomics at Galapagos
Targets, disease and compounds
Target
discovery
Lead
discovery
Lead
optimization
Pre-clinical
development
Clinical
Phase I
Clinical
Phase II
Target Disease Compound
• Target discovery and prioritisation
• Disease mechanism elucidation
• Cellular systems and primary cell disease models characterisation
• Animal models and disease mechanisms
• Compound mode of action (in-vitro, cellular systems, in-vivo)
• Biomarkers
5
Public/private data integration
Curation and integration
Ontologies
Expert
curation
Robust
processing
Internal data
Key assay assets
Primary cell assays
Animal models
Compounds
Compound MoA
Public data
Complementary
data & knowledge
- Disease biomarkers
and biology
- Drug / reference
compounds mode of
actions and disease
relevance
- Drug response
Public and private data integration with proper curation and
ontologies and robust processing and data management is
the foundation for all that is next…
6
Next level of integration
Meta-analysis across platforms
Ontologies
Expert
curation
Robust
processing
Internal data Public data
• Meta-analysis layer
 Integration at the single disease / disease area level for scientific
experts in that area
 Across studies, across platforms
 Apples vs. pears
 Careful selection of appropriate studies
 Careful curation and annotation of the comparisons / contrasts
 Challenges vs. opportunities
 Loss of the power of the original study
 Focus on still accurate and robust results across studies
7
Next level of integration
Meta-analysis across platforms
• Meta-analysis results
 MetaIntegrator R/BioConductor package
 Hedges’ g effect size for each gene in each
dataset
 Summary effect size using a random effect
making use of variance of a gene within a given
dataset and the inter-dataset variation
 Main outputs: effect size and False Discovery
Rate (FDR)
 Example in IPF (idiopathic pulmonary fibrosis)
 10 studies (private and public studies)
 5 different platforms
 > 500 samples
8
Next level of integration
Meta-analysis across platforms - layers
• Meta-analysis: adding levels to the exploration
 Meta-analysis level
 Across studies, across platform
 Loss of accuracy at the original study level
 Focussed on global performance, robustness
 Differential expression level
 Single study level
 Analysis statistics as originally performed
 Expression levels in groups
 Sample/group individual data
 Assessment of heterogeneity, subgroups
StudyExplorer
R/Shiny dynamic
reporting
interfaces
Flexible user-
interfaces for
exploration across
all these data
layers
Suitable graphics
for each datalayer
9
Next level of integration
Meta-analysis across platforms - implementation
Ontologies
Expert
curation
Robust
processing
Manual dataset selection and curation, ontology linking
Aided by Genevestigator frontend
Genevestigator API
Data export / calculation
Meta-analysis
Interface
10
Next level of integration
StudyExplorer application
• StudyExplorer – meta-analysis level
11
Next level of integration
StudyExplorer application
• StudyExplorer – meta-analysis level > study-level > sample level
12
Next level of integration
StudyExplorer application
• StudyExplorer – flexible platform
 Cross-species, human patient data and animal models side by side
13
Next level of integration
StudyExplorer application
• StudyExplorer – flexible platform
 Time-series and other visualisations
14
Next level of integration
StudyExplorer application
• StudyExplorer – flexible platform
 Time-series and other visualisations
15
Next level of integration
StudyExplorer application
• StudyExplorer – flexible platform
 Molecular signatures
 Layering at the ‘gene’ level: exploration at signature level and performance of underlying genes
16
Disease 1
Transcriptomics
Scientific literature
Animal models
Screening data
Disease 2
• “Hydra” knowledge platform
 Integration
 Across diseases and therapeutic areas
 Across different datasources
 Knowledge capturing and sharing
 Modular architecture
Knowledge generation
Hydra platform principles
Transcriptomics
Scientific literature
Animal models
Screening data
• Hydra application: disease portal
 Global overview of relationships
between targets and diseases
 Layered exploration: overviews and
details
 Knowledge capturing and sharing
17
Hydra
Architecture
API
BioPortal
API
Ontology
API
GeneCards
API
Players/genes
TextMining
TenWise
Transcriptomics
GeneVestigator
API
Knowledge Building
Services
Disease Portal
18
Knowledge generation
Hydra - Disease Portal
• Get started! Ontology powered searching and autocompletion
19
Knowledge generation
Hydra - Disease Portal
• Disease portal summary overview pages (target/disease matrix)
20
Knowledge generation
Hydra - Disease Portal
• Disease portal – transcriptomics aspects
21
Knowledge generation
Hydra - Disease Portal
• Disease portal – transcriptomics aspects
22
Knowledge generation
Hydra - Disease Portal
• Disease portal – knowledge generation by combining datasources
23
Public-private data integration: expert annotation and curation are crucial
Flexible framework for integrations – layered exploration benefits from curation
and annotation and powerful API components
Hydra knowledge platform: infrastructure build for hypothesis exploration and
knowledge generation
24
Many thanks to all involved Galapagos
colleagues, Nebion and XAOP

More Related Content

What's hot

What's hot (12)

IUPHAR Guide to IMMUNOPHARMACOLOGY
IUPHAR Guide to IMMUNOPHARMACOLOGYIUPHAR Guide to IMMUNOPHARMACOLOGY
IUPHAR Guide to IMMUNOPHARMACOLOGY
 
JPROT-TargetedProteomics-CallforPapers
JPROT-TargetedProteomics-CallforPapersJPROT-TargetedProteomics-CallforPapers
JPROT-TargetedProteomics-CallforPapers
 
IUPHAR Guide to IMMUNOPHARMACOLOGY Poster - Pharmacology 2016
IUPHAR Guide to IMMUNOPHARMACOLOGY Poster - Pharmacology 2016IUPHAR Guide to IMMUNOPHARMACOLOGY Poster - Pharmacology 2016
IUPHAR Guide to IMMUNOPHARMACOLOGY Poster - Pharmacology 2016
 
Results of the 2015 survey on WGS capacity in EU/EEA Member States
Results of the 2015 survey on WGS capacity in EU/EEA Member StatesResults of the 2015 survey on WGS capacity in EU/EEA Member States
Results of the 2015 survey on WGS capacity in EU/EEA Member States
 
2016 11-11_TSRI News Release
2016 11-11_TSRI News Release2016 11-11_TSRI News Release
2016 11-11_TSRI News Release
 
GtoImmPdb_flash_poster_presentation
GtoImmPdb_flash_poster_presentationGtoImmPdb_flash_poster_presentation
GtoImmPdb_flash_poster_presentation
 
GtoImmuPdb_2017
GtoImmuPdb_2017GtoImmuPdb_2017
GtoImmuPdb_2017
 
cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)
 
Bradley Research Sept 2007
Bradley Research Sept 2007Bradley Research Sept 2007
Bradley Research Sept 2007
 
CETR - Rutgers NJ Medical School
CETR - Rutgers NJ Medical SchoolCETR - Rutgers NJ Medical School
CETR - Rutgers NJ Medical School
 
Demonstration of ECDC web interface platform for molecular and genomic epidem...
Demonstration of ECDC web interface platform for molecular and genomic epidem...Demonstration of ECDC web interface platform for molecular and genomic epidem...
Demonstration of ECDC web interface platform for molecular and genomic epidem...
 
Ontology based phenotype database and mining tool
Ontology based phenotype database and mining toolOntology based phenotype database and mining tool
Ontology based phenotype database and mining tool
 

Similar to Unleash transcriptomics to gain insights in disease mechanisms: integration in the Galapagos knowledge platform “Hydra”

Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015IRIDA_community
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William HsiaoWilliam Hsiao
 
IRIDA: Canada’s federated platform for genomic epidemiology
IRIDA: Canada’s federated platform for genomic epidemiology IRIDA: Canada’s federated platform for genomic epidemiology
IRIDA: Canada’s federated platform for genomic epidemiology William Hsiao
 
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA_community
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsCarole Goble
 
Leveraging Publicly Accessible Clinical Trails Data Sharing, Dissemination an...
Leveraging Publicly Accessible Clinical Trails Data Sharing, Dissemination an...Leveraging Publicly Accessible Clinical Trails Data Sharing, Dissemination an...
Leveraging Publicly Accessible Clinical Trails Data Sharing, Dissemination an...Vaticle
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...David Peyruc
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europeopen_phacts
 
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.caGenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.cafionabrinkman
 
Community Finding with Applications on Phylogenetic Networks [Thesis]
Community Finding with Applications on Phylogenetic Networks [Thesis]Community Finding with Applications on Phylogenetic Networks [Thesis]
Community Finding with Applications on Phylogenetic Networks [Thesis]Luís Rita
 
Workshop 4 - "Presentation of the RD Platform fact finding study on the trend...
Workshop 4 - "Presentation of the RD Platform fact finding study on the trend...Workshop 4 - "Presentation of the RD Platform fact finding study on the trend...
Workshop 4 - "Presentation of the RD Platform fact finding study on the trend...EURORDIS - Rare Diseases Europe
 
2022-11-23 DTL Future of data-driven life sciences, Utrecht, Alain van Gool.pdf
2022-11-23 DTL Future of data-driven life sciences, Utrecht, Alain van Gool.pdf2022-11-23 DTL Future of data-driven life sciences, Utrecht, Alain van Gool.pdf
2022-11-23 DTL Future of data-driven life sciences, Utrecht, Alain van Gool.pdfAlain van Gool
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biologyPranavathiyani G
 
Data Integration vs Transparency: Tackling the tension
Data Integration vs Transparency: Tackling the tensionData Integration vs Transparency: Tackling the tension
Data Integration vs Transparency: Tackling the tensionPaul Groth
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiChris Evelo
 
Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Ann-Marie Roche
 
Whole Genome Sequencing (WGS) for food safety management in France: Example...
Whole Genome Sequencing (WGS)  for food safety management in France:  Example...Whole Genome Sequencing (WGS)  for food safety management in France:  Example...
Whole Genome Sequencing (WGS) for food safety management in France: Example...ExternalEvents
 
Gen epio immem_griffiths
Gen epio immem_griffithsGen epio immem_griffiths
Gen epio immem_griffithsIRIDA_community
 
IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...
IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...
IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...Emma Griffiths
 

Similar to Unleash transcriptomics to gain insights in disease mechanisms: integration in the Galapagos knowledge platform “Hydra” (20)

Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
 
IRIDA: Canada’s federated platform for genomic epidemiology
IRIDA: Canada’s federated platform for genomic epidemiology IRIDA: Canada’s federated platform for genomic epidemiology
IRIDA: Canada’s federated platform for genomic epidemiology
 
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
 
Leveraging Publicly Accessible Clinical Trails Data Sharing, Dissemination an...
Leveraging Publicly Accessible Clinical Trails Data Sharing, Dissemination an...Leveraging Publicly Accessible Clinical Trails Data Sharing, Dissemination an...
Leveraging Publicly Accessible Clinical Trails Data Sharing, Dissemination an...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
disgenet2r: The DisGeNET R package
disgenet2r: The DisGeNET R packagedisgenet2r: The DisGeNET R package
disgenet2r: The DisGeNET R package
 
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.caGenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
 
Community Finding with Applications on Phylogenetic Networks [Thesis]
Community Finding with Applications on Phylogenetic Networks [Thesis]Community Finding with Applications on Phylogenetic Networks [Thesis]
Community Finding with Applications on Phylogenetic Networks [Thesis]
 
Workshop 4 - "Presentation of the RD Platform fact finding study on the trend...
Workshop 4 - "Presentation of the RD Platform fact finding study on the trend...Workshop 4 - "Presentation of the RD Platform fact finding study on the trend...
Workshop 4 - "Presentation of the RD Platform fact finding study on the trend...
 
2022-11-23 DTL Future of data-driven life sciences, Utrecht, Alain van Gool.pdf
2022-11-23 DTL Future of data-driven life sciences, Utrecht, Alain van Gool.pdf2022-11-23 DTL Future of data-driven life sciences, Utrecht, Alain van Gool.pdf
2022-11-23 DTL Future of data-driven life sciences, Utrecht, Alain van Gool.pdf
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biology
 
Data Integration vs Transparency: Tackling the tension
Data Integration vs Transparency: Tackling the tensionData Integration vs Transparency: Tackling the tension
Data Integration vs Transparency: Tackling the tension
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs api
 
Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1
 
Whole Genome Sequencing (WGS) for food safety management in France: Example...
Whole Genome Sequencing (WGS)  for food safety management in France:  Example...Whole Genome Sequencing (WGS)  for food safety management in France:  Example...
Whole Genome Sequencing (WGS) for food safety management in France: Example...
 
Gen epio immem_griffiths
Gen epio immem_griffithsGen epio immem_griffiths
Gen epio immem_griffiths
 
IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...
IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...
IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...
 

More from Maté Ongenaert

Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...Maté Ongenaert
 
Ecobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis LokerenEcobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis LokerenMaté Ongenaert
 
Workshop NGS data analysis - 3
Workshop NGS data analysis - 3Workshop NGS data analysis - 3
Workshop NGS data analysis - 3Maté Ongenaert
 
ENCODE project: brief summary of main findings
ENCODE project: brief summary of main findingsENCODE project: brief summary of main findings
ENCODE project: brief summary of main findingsMaté Ongenaert
 
Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Maté Ongenaert
 
Workshop NGS data analysis - 1
Workshop NGS data analysis - 1Workshop NGS data analysis - 1
Workshop NGS data analysis - 1Maté Ongenaert
 
Exploring the neuroblastoma epigenome: perspectives for improved prognosis
Exploring the neuroblastoma epigenome: perspectives for improved prognosisExploring the neuroblastoma epigenome: perspectives for improved prognosis
Exploring the neuroblastoma epigenome: perspectives for improved prognosisMaté Ongenaert
 
High-throughput proteomics: from understanding data to predicting them
High-throughput proteomics: from understanding data to predicting themHigh-throughput proteomics: from understanding data to predicting them
High-throughput proteomics: from understanding data to predicting themMaté Ongenaert
 
Microarray data and pathway analysis: example from the bench
Microarray data and pathway analysis: example from the benchMicroarray data and pathway analysis: example from the bench
Microarray data and pathway analysis: example from the benchMaté Ongenaert
 
Large scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyLarge scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyMaté Ongenaert
 
Integrative transcriptomics to study non-coding RNA functions
Integrative transcriptomics to study non-coding RNA functionsIntegrative transcriptomics to study non-coding RNA functions
Integrative transcriptomics to study non-coding RNA functionsMaté Ongenaert
 
Race against the sequencing machine: processing of raw DNA sequence data at t...
Race against the sequencing machine: processing of raw DNA sequence data at t...Race against the sequencing machine: processing of raw DNA sequence data at t...
Race against the sequencing machine: processing of raw DNA sequence data at t...Maté Ongenaert
 
Bringing the data back to the researchers
Bringing the data back to the researchersBringing the data back to the researchers
Bringing the data back to the researchersMaté Ongenaert
 
The post-genomic era: epigenetic sequencing applications and data integration
The post-genomic era: epigenetic sequencing applications and data integrationThe post-genomic era: epigenetic sequencing applications and data integration
The post-genomic era: epigenetic sequencing applications and data integrationMaté Ongenaert
 
Literature managment training
Literature managment trainingLiterature managment training
Literature managment trainingMaté Ongenaert
 
Scientific literature managment - exercises
Scientific literature managment - exercisesScientific literature managment - exercises
Scientific literature managment - exercisesMaté Ongenaert
 

More from Maté Ongenaert (18)

Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
 
Ecobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis LokerenEcobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis Lokeren
 
Workshop NGS data analysis - 3
Workshop NGS data analysis - 3Workshop NGS data analysis - 3
Workshop NGS data analysis - 3
 
ENCODE project: brief summary of main findings
ENCODE project: brief summary of main findingsENCODE project: brief summary of main findings
ENCODE project: brief summary of main findings
 
Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Workshop NGS data analysis - 2
Workshop NGS data analysis - 2
 
Workshop NGS data analysis - 1
Workshop NGS data analysis - 1Workshop NGS data analysis - 1
Workshop NGS data analysis - 1
 
Bots & spiders
Bots & spidersBots & spiders
Bots & spiders
 
Exploring the neuroblastoma epigenome: perspectives for improved prognosis
Exploring the neuroblastoma epigenome: perspectives for improved prognosisExploring the neuroblastoma epigenome: perspectives for improved prognosis
Exploring the neuroblastoma epigenome: perspectives for improved prognosis
 
High-throughput proteomics: from understanding data to predicting them
High-throughput proteomics: from understanding data to predicting themHigh-throughput proteomics: from understanding data to predicting them
High-throughput proteomics: from understanding data to predicting them
 
Microarray data and pathway analysis: example from the bench
Microarray data and pathway analysis: example from the benchMicroarray data and pathway analysis: example from the bench
Microarray data and pathway analysis: example from the bench
 
Large scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyLarge scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biology
 
Integrative transcriptomics to study non-coding RNA functions
Integrative transcriptomics to study non-coding RNA functionsIntegrative transcriptomics to study non-coding RNA functions
Integrative transcriptomics to study non-coding RNA functions
 
Race against the sequencing machine: processing of raw DNA sequence data at t...
Race against the sequencing machine: processing of raw DNA sequence data at t...Race against the sequencing machine: processing of raw DNA sequence data at t...
Race against the sequencing machine: processing of raw DNA sequence data at t...
 
Bringing the data back to the researchers
Bringing the data back to the researchersBringing the data back to the researchers
Bringing the data back to the researchers
 
The post-genomic era: epigenetic sequencing applications and data integration
The post-genomic era: epigenetic sequencing applications and data integrationThe post-genomic era: epigenetic sequencing applications and data integration
The post-genomic era: epigenetic sequencing applications and data integration
 
Introduction
IntroductionIntroduction
Introduction
 
Literature managment training
Literature managment trainingLiterature managment training
Literature managment training
 
Scientific literature managment - exercises
Scientific literature managment - exercisesScientific literature managment - exercises
Scientific literature managment - exercises
 

Recently uploaded

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 

Recently uploaded (20)

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 

Unleash transcriptomics to gain insights in disease mechanisms: integration in the Galapagos knowledge platform “Hydra”

  • 1. © Copyright 2018 Galapagos NVCONFIDENTIAL Unleash transcriptomics to gain insights in disease mechanisms: integration in the Galapagos knowledge platform “Hydra” Maté Ongenaert, PhD Senior Scientist Bioinformatics László Kupcsik, Gabriele Sclauzero, Markus Wyss, Joeri Samson, Jan Van Der Schueren August 30 2018 Leveraging Therapeutic Discovery with Curated Transcriptomic Data - Novartis Campus, Basel, Switzerland
  • 2. 2 Outline Galapagos in a nutshell Public / private data integration Introduction – transcriptomics at Galapagos StudyExplorer: cross-species / cross- platform / cross-study integration and multi-level exploration Hydra: knowledge platform disease portal application
  • 3. 3 Galapagos In a nutshell • Galapagos is a clinical-stage biotechnology company, specialized in the discovery and development of small molecule medicines with novel modes of action • “Flagship” program: filgotinib (JAK-1 inhibitor) in inflammation (Phase 3 studies in Rheumatoid arthritis, Crohn’s disease and ulcerative colitis) • Advanced programs in lung fibrosis, cystic fibrosis, osteoarthritis and atopic dermatitis
  • 4. 4 Transcriptomics at Galapagos Targets, disease and compounds Target discovery Lead discovery Lead optimization Pre-clinical development Clinical Phase I Clinical Phase II Target Disease Compound • Target discovery and prioritisation • Disease mechanism elucidation • Cellular systems and primary cell disease models characterisation • Animal models and disease mechanisms • Compound mode of action (in-vitro, cellular systems, in-vivo) • Biomarkers
  • 5. 5 Public/private data integration Curation and integration Ontologies Expert curation Robust processing Internal data Key assay assets Primary cell assays Animal models Compounds Compound MoA Public data Complementary data & knowledge - Disease biomarkers and biology - Drug / reference compounds mode of actions and disease relevance - Drug response Public and private data integration with proper curation and ontologies and robust processing and data management is the foundation for all that is next…
  • 6. 6 Next level of integration Meta-analysis across platforms Ontologies Expert curation Robust processing Internal data Public data • Meta-analysis layer  Integration at the single disease / disease area level for scientific experts in that area  Across studies, across platforms  Apples vs. pears  Careful selection of appropriate studies  Careful curation and annotation of the comparisons / contrasts  Challenges vs. opportunities  Loss of the power of the original study  Focus on still accurate and robust results across studies
  • 7. 7 Next level of integration Meta-analysis across platforms • Meta-analysis results  MetaIntegrator R/BioConductor package  Hedges’ g effect size for each gene in each dataset  Summary effect size using a random effect making use of variance of a gene within a given dataset and the inter-dataset variation  Main outputs: effect size and False Discovery Rate (FDR)  Example in IPF (idiopathic pulmonary fibrosis)  10 studies (private and public studies)  5 different platforms  > 500 samples
  • 8. 8 Next level of integration Meta-analysis across platforms - layers • Meta-analysis: adding levels to the exploration  Meta-analysis level  Across studies, across platform  Loss of accuracy at the original study level  Focussed on global performance, robustness  Differential expression level  Single study level  Analysis statistics as originally performed  Expression levels in groups  Sample/group individual data  Assessment of heterogeneity, subgroups StudyExplorer R/Shiny dynamic reporting interfaces Flexible user- interfaces for exploration across all these data layers Suitable graphics for each datalayer
  • 9. 9 Next level of integration Meta-analysis across platforms - implementation Ontologies Expert curation Robust processing Manual dataset selection and curation, ontology linking Aided by Genevestigator frontend Genevestigator API Data export / calculation Meta-analysis Interface
  • 10. 10 Next level of integration StudyExplorer application • StudyExplorer – meta-analysis level
  • 11. 11 Next level of integration StudyExplorer application • StudyExplorer – meta-analysis level > study-level > sample level
  • 12. 12 Next level of integration StudyExplorer application • StudyExplorer – flexible platform  Cross-species, human patient data and animal models side by side
  • 13. 13 Next level of integration StudyExplorer application • StudyExplorer – flexible platform  Time-series and other visualisations
  • 14. 14 Next level of integration StudyExplorer application • StudyExplorer – flexible platform  Time-series and other visualisations
  • 15. 15 Next level of integration StudyExplorer application • StudyExplorer – flexible platform  Molecular signatures  Layering at the ‘gene’ level: exploration at signature level and performance of underlying genes
  • 16. 16 Disease 1 Transcriptomics Scientific literature Animal models Screening data Disease 2 • “Hydra” knowledge platform  Integration  Across diseases and therapeutic areas  Across different datasources  Knowledge capturing and sharing  Modular architecture Knowledge generation Hydra platform principles Transcriptomics Scientific literature Animal models Screening data • Hydra application: disease portal  Global overview of relationships between targets and diseases  Layered exploration: overviews and details  Knowledge capturing and sharing
  • 18. 18 Knowledge generation Hydra - Disease Portal • Get started! Ontology powered searching and autocompletion
  • 19. 19 Knowledge generation Hydra - Disease Portal • Disease portal summary overview pages (target/disease matrix)
  • 20. 20 Knowledge generation Hydra - Disease Portal • Disease portal – transcriptomics aspects
  • 21. 21 Knowledge generation Hydra - Disease Portal • Disease portal – transcriptomics aspects
  • 22. 22 Knowledge generation Hydra - Disease Portal • Disease portal – knowledge generation by combining datasources
  • 23. 23 Public-private data integration: expert annotation and curation are crucial Flexible framework for integrations – layered exploration benefits from curation and annotation and powerful API components Hydra knowledge platform: infrastructure build for hypothesis exploration and knowledge generation
  • 24. 24 Many thanks to all involved Galapagos colleagues, Nebion and XAOP