SlideShare a Scribd company logo
1 of 41
Download to read offline
Knowledge graphs and semantic models
for drug discovery and healthcare
on Thurs, 23rd April 2020 at 16:00 BST
Hosted by: Ian Harrow, Pistoia Alliance
Speaker: Ilaria Maresi, The Hyve
FAIR/OM projects Community of Interest webinar series
This webinar is being recorded
Audience Q&A
Please use the questions box
©PistoiaAlliance
Ilaria Maresi
• Ilaria is a Data Engineer at The Hyve, specialising
in Semantic Modelling and Knowledge Graphs with
applications in healthcare and drug discovery.
• As a mathematician by training, Ilaria came to the
bioinformatics field through her interest in the
intersection of biology, mathematics and
engineering.
• In her free time, she tries to get away from her
computer, and enjoys cooking and spending time
outside.
©PistoiaAlliance
Knowledge graphs and semantic models for
drug discovery and healthcare
Ilaria Maresi, Data Engineer
ilaria@thehyve.nl
©PistoiaAlliance
Agenda
● About me
● The problem
● What are semantic models and knowledge graphs?
○ Creating
○ Querying
● Semantic models in action
○ Clinical trials
○ Drug discovery landscape
● Wrap up
● Q&A
©PistoiaAlliance
We enable open science by developing
and implementing open source solutions
and FAIRifying data in life sciences
©PistoiaAlliance
About me
Data Engineer at The Hyve
● Semantic modelling
● Knowledge graphs
● ETL pipelines
● FAIR data services
Utrecht, The Netherlands
©PistoiaAlliance
The problem
©PistoiaAlliance
Drug discovery process
Source: PhRMA Biopharmaceutical R&D: The Process Behind New Medicines https://www.phrma.org/en/Report/Biopharmaceutical-R-
and-D-The-Process-Behind-New-Medicines
©PistoiaAlliance
Differing and unlinked identifiers
Source: PhRMA Biopharmaceutical R&D: The Process Behind New Medicines https://www.phrma.org/en/Report/Biopharmaceutical-R-
and-D-The-Process-Behind-New-Medicines
CMP102401
acetylsalicylic
acid
Aspirin
Query
I want to see the
trajectory of
compound xxx
©PistoiaAlliance
Differing terminology
Source: PhRMA Biopharmaceutical R&D: The Process Behind New Medicines https://www.phrma.org/en/Report/Biopharmaceutical-R-
and-D-The-Process-Behind-New-Medicines
omics
experiment
lab test
genomics
assay
Query
I want to see all
the genomics
data related to
compound xxx
©PistoiaAlliance
Differing terminology
Source: PhRMA Biopharmaceutical R&D: The Process Behind New Medicines https://www.phrma.org/en/Report/Biopharmaceutical-R-
and-D-The-Process-Behind-New-Medicines
non-small cell
lung cancer
NSCLC
non-small cell
malignant
neoplasm of
lung
Query
I want to see all
terminated studies
for non-small cell
lung cancer that
study compound
xxx
terminated
recruiting
terminated
©PistoiaAlliance
What are semantic models and
Knowledge Graphs?
©PistoiaAlliance
Semantic models are ...
“Semantic models of data sources represent the implicit meaning of
the data by specifying the concepts and the relationships within the
data. Such models are the key ingredients to automatically publish
the data into knowledge graphs”
– USC semantic modelling
©PistoiaAlliance
What about knowledge graphs?
A knowledge graph is several things:
● Database: contains actual data
● Graph: data items and concepts are connected via relationships
(i.e nodes and edges)
● Semantic: the meaning of the data is encoded in the graph,
allowing for meaning to be inferred
● Alive: constantly refreshed with new data & can be extended
and revised as new data comes in
©PistoiaAlliance
The power of a knowledge graph
● Linking concepts that are same or similar
○ Model level – entities
○ Data (instance) level – identifiers
● Harmonising data sources without transforming or forcing a
common standard on the data
● The more diverse your data, the more powerful your KG will be
○ Google Knowledge Graph
○ Diffbot
○ Wikidata
©PistoiaAlliance
A knowledge graph example
DBPedia:
● Community effort to extract
structured content from the
information created in
various Wikimedia projects
● Data coming in various
formats and from various
sources
● KG unifies data and enables
querying
©PistoiaAlliance
Using RDF to create semantic models
● Resource Description Framework (RDF)
● Information encoded in triples
● Almost everything needs a Uniform Resource Identifier (URI)
Subject Object
predicate
schema:Patient schema:MedicalCondition
schema:healthCondition
patient_1231 AnginaPectoris
schema:healthCondition
©PistoiaAlliance
Using RDF to create semantic models
patient_1231 AnginaPectoris
schema:healthCondition
Heart
schema:associatedAnatomy
AnatomicalStructure
rdf:type
12/12/1950
schema:birthDate
Ranolazine
schema:drug
Ranexa
schema:nonProprietaryName
©PistoiaAlliance
Querying a knowledge graph
● SPARQL Protocol and RDF Query Language (SPARQL)
©PistoiaAlliance
Querying a knowledge graph
For a single gene (CDX2) these are all its associated identifiers in wikidata:
Ensembl ID
Entrez ID
NCBI Gene ID
©PistoiaAlliance
Linked data is FAIR(er) data
● Findable: uniform resource identifiers that persist across an
organization
● Accessible: triple store access can be open or only to select users
● Interoperable: RDF gives meaning to data to both humans and
machines (include community standard ontologies!)
● Reusable: RDF is independent of tools/systems and can encode
provenance of (meta)data
©PistoiaAlliance
Semantic models in action
©PistoiaAlliance
A semantic model of clinical studies
Problem
○ Data across stages of
clinical studies exists
mostly in silos and is not
harmonised
○ Cross domain analytics
is currently burdensome
○ Unclear what processes
data has gone through
Goals
○ Data Conformance
Layer to semantically
represent source data
on clinical trials
○ Leverage external data
○ Represent provenance
in model
©PistoiaAlliance
The solution
● 1300+ triples
● 65 classes
● 153 properties
● 13 ontologies
©PistoiaAlliance
Ontologies and interoperability
● Use ontologies that fit
domain and data
● BioPortal
● It’s possible to use
multiple ontologies in
one model!
● FAIR principles I2 & I3
©PistoiaAlliance
The solution
● Harmonising terms (clinical trial
vs. clinical study vs. medical trial)
● Metadata annotation (alt labels
for compound)
● Controlled vocabularies
(indication)
● Provenance (what activity
generated the dataset and
when)
©PistoiaAlliance
Knowledge graph: from R&D to market
Problem
○ “How much data do we
have and how is it
connected?”
○ Data across stages of R&D
is unlinked
Goals
○ Understand quantity
and qualitative details
of existing data assets
○ Map relationships
between data assets
©PistoiaAlliance
The solution
● Semantic model representing data flows and major entities
● Encode provenance
Compound
registry
Electronic
Lab
Notebook
Data
Warehouse
Outside
vendor
compound_id
assay_results
assay_results
©PistoiaAlliance
The solution
● Instantiate semantic model with (meta)data
● Queryable knowledge graph with ~14M triples
● Different “views”
● Evolving model
©PistoiaAlliance
The solution
Query: how many
genomics assays exist
across all sectors of
my organisation?
RNA Seq
experiment
Single-cell
RNA-seq
assay
Whole
genome
sequencing
WGS
experiment
RSQ_12965
AssayRNA_31
SC4050_EM
8019240201G
Exp_WGS_4128019240202G
©PistoiaAlliance
The solution
Query: how many
genomics assays exist
across all sectors of
my organisation?
RNA Seq
experiment
Single-cell
RNA-seq
assay
Whole
genome
sequencing
Genomics
Assay
WGS
experiment
RSQ_12965
AssayRNA_31
SC4050_EM 8019240201G Exp_WGS_412
owl:sameAs
rdf:type rdf:type rdf:type rdf:type
rdfs:subClassOf rdfs:subClassOf
rdfs:subClassOf rdfs:subClassOf
owl:equivalentClass
8019240202G
rdf:type
©PistoiaAlliance
Wrap up
©PistoiaAlliance
Potential pitfalls
● RDF is flexible – triples could contradict your model and inference could easily
propagate wrong information in graph
○ Use SHACL (Shapes Constraint Language) validation
● Overloading triple store = slow query times
○ Restrict triples to important information
● URI schema needs to be maintained
○ Automated URI generation and URI update
● Sometimes Knowledge Graph is overkill
○ Semantic models or common data model may suffice
©PistoiaAlliance
Why you should consider KGs
● Connecting data and representing as a KG enables:
○ Querying across an organisation
○ Querying across stages of drug discovery
○ Adoption of common terminologies
○ Cleaner data
○ Clear provenance of data
● Linked data is FAIR(er) data!
©PistoiaAlliance
Why you should consider KGs
● Machine learning and knowledge graphs
○ ML can be used to develop a KG
○ Conversely: KG can be used for ML algorithm:
“...knowledge graphs are a step towards enabling machines to more
deeply understand data ... that don’t fit neatly into the rows and columns
of a relational database” [1]
[1] https://www.forbes.com/sites/bernardmarr/2019/06/26/knowledge-graphs-and-machine-learning-the-future-of-ai-analytics/#7b825f0c3a36
©PistoiaAlliance
Materials
Data engineering tools for knowledge graphs
● Data Discovery Toolkit
● Model Repository
● Knowledge Base
● Data Mapping Framework
● Data Visualization Framework
FAIR data governance is like a fractal
● Purpose and scope
○ Metrics & KPIs
○ New insights
● Retrospective vs prospective
A Data Engineer’s Guide to Semantic Models (coming soon!)
● Getting started on semantic models
● RDF, SPARQL, SHACL
Common Data Models for FAIR biomedical data
● Choosing data models
● OMOP CDM
Leveraging the OMOP Common Data Model for
Clinical Trials
● Repurposing OMOP for clinical trials
©PistoiaAlliance
Audience Q&A
Please use the questions box
Semantics of data matrices
and the STATO ontology
Join us for the next FAIR/OM CoI webinar:
Speaker: Philippe Rocca-Serra, Oxford University
Thurs 28th May at 16:00 BST
info@pistoiaalliance.org @pistoiaalliance www.pistoiaalliance.org
Thanks for your attention

More Related Content

What's hot

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Knowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchKnowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchNeo4j
 
Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4 Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4 TigerGraph
 
The future of FAIR
The future of FAIRThe future of FAIR
The future of FAIRSarah Jones
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data EngineeringDurga Gadiraju
 
3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine LearningNeo4j
 
The Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcareThe Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcarePerficient, Inc.
 
The path to success with Graph Database and Graph Data Science
The path to success with Graph Database and Graph Data ScienceThe path to success with Graph Database and Graph Data Science
The path to success with Graph Database and Graph Data ScienceNeo4j
 
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...Neo4j
 
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...Neo4j
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data EngineeringHadi Fadlallah
 
A Universe of Knowledge Graphs
A Universe of Knowledge GraphsA Universe of Knowledge Graphs
A Universe of Knowledge GraphsNeo4j
 
Neo4j: The path to success with Graph Database and Graph Data Science
Neo4j: The path to success with Graph Database and Graph Data ScienceNeo4j: The path to success with Graph Database and Graph Data Science
Neo4j: The path to success with Graph Database and Graph Data ScienceNeo4j
 
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceGet Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceNeo4j
 
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptxNeo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptxNeo4j
 
Workshop - Build a Graph Solution
Workshop - Build a Graph SolutionWorkshop - Build a Graph Solution
Workshop - Build a Graph SolutionNeo4j
 
Intro to Neo4j
Intro to Neo4jIntro to Neo4j
Intro to Neo4jNeo4j
 
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...Neo4j
 
Modern Data Architecture
Modern Data Architecture Modern Data Architecture
Modern Data Architecture Mark Hewitt
 

What's hot (20)

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Knowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchKnowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based Search
 
Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4 Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4
 
Data Mesh 101
Data Mesh 101Data Mesh 101
Data Mesh 101
 
The future of FAIR
The future of FAIRThe future of FAIR
The future of FAIR
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning
 
The Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcareThe Role of Data Lakes in Healthcare
The Role of Data Lakes in Healthcare
 
The path to success with Graph Database and Graph Data Science
The path to success with Graph Database and Graph Data ScienceThe path to success with Graph Database and Graph Data Science
The path to success with Graph Database and Graph Data Science
 
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
 
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
A Universe of Knowledge Graphs
A Universe of Knowledge GraphsA Universe of Knowledge Graphs
A Universe of Knowledge Graphs
 
Neo4j: The path to success with Graph Database and Graph Data Science
Neo4j: The path to success with Graph Database and Graph Data ScienceNeo4j: The path to success with Graph Database and Graph Data Science
Neo4j: The path to success with Graph Database and Graph Data Science
 
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceGet Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
 
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptxNeo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
 
Workshop - Build a Graph Solution
Workshop - Build a Graph SolutionWorkshop - Build a Graph Solution
Workshop - Build a Graph Solution
 
Intro to Neo4j
Intro to Neo4jIntro to Neo4j
Intro to Neo4j
 
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
 
Modern Data Architecture
Modern Data Architecture Modern Data Architecture
Modern Data Architecture
 

Similar to Knowledge graphs ilaria maresi the hyve 23apr2020

The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfAlan Morrison
 
Melissa Informatics - Data Quality and AI
Melissa Informatics - Data Quality and AIMelissa Informatics - Data Quality and AI
Melissa Informatics - Data Quality and AImelissadata
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeTom Plasterer
 
SKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategiesSKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategiesSemantic Web Company
 
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014KDZ - Zentrum für Verwaltungsforschung
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsTom Plasterer
 
Automating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge BaseAutomating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge BaseVaticle
 
Knowledge Management in the AI Driven Scintific System
Knowledge Management in the AI Driven Scintific SystemKnowledge Management in the AI Driven Scintific System
Knowledge Management in the AI Driven Scintific SystemSubhasis Dasgupta
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...William Gunn
 
ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...
ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...
ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...Neo4j
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...CILIP MDG
 
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...Tore Hoel
 
Reference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptxReference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptxChimezie Ogbuji
 
dissertation proposal writing service
dissertation proposal writing servicedissertation proposal writing service
dissertation proposal writing servicePhd Assistance
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Kees van Bochove
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Denodo
 

Similar to Knowledge graphs ilaria maresi the hyve 23apr2020 (20)

The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdf
 
Melissa Informatics - Data Quality and AI
Melissa Informatics - Data Quality and AIMelissa Informatics - Data Quality and AI
Melissa Informatics - Data Quality and AI
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to Practice
 
SKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategiesSKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategies
 
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge Graphs
 
Automating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge BaseAutomating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge Base
 
Knowledge Management in the AI Driven Scintific System
Knowledge Management in the AI Driven Scintific SystemKnowledge Management in the AI Driven Scintific System
Knowledge Management in the AI Driven Scintific System
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...
ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...
ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...
 
BIG-DATAPPTFINAL.ppt
BIG-DATAPPTFINAL.pptBIG-DATAPPTFINAL.ppt
BIG-DATAPPTFINAL.ppt
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
 
Big Data for Library Services (2017)
Big Data for Library Services (2017)Big Data for Library Services (2017)
Big Data for Library Services (2017)
 
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...
Introduction to Learner Analytics Session at Oslo Open Forum Conferences prio...
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
Reference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptxReference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptx
 
Data science guide
Data science guideData science guide
Data science guide
 
dissertation proposal writing service
dissertation proposal writing servicedissertation proposal writing service
dissertation proposal writing service
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
 

More from Pistoia Alliance

Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesPistoia Alliance
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck finalPistoia Alliance
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiPistoia Alliance
 
2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinarPistoia Alliance
 
Data market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRData market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRPistoia Alliance
 
AI in translational medicine webinar
AI in translational medicine webinarAI in translational medicine webinar
AI in translational medicine webinarPistoia Alliance
 
CEDAR work bench for metadata management
CEDAR work bench for metadata managementCEDAR work bench for metadata management
CEDAR work bench for metadata managementPistoia Alliance
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIPistoia Alliance
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Pistoia Alliance
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesPistoia Alliance
 
Implementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcareImplementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcarePistoia Alliance
 
Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Pistoia Alliance
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the futurePistoia Alliance
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences Pistoia Alliance
 
AI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEAI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEPistoia Alliance
 
Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Pistoia Alliance
 
Blockchain and IOT and the GxP Lab Slides
Blockchain and IOT and the GxP Lab SlidesBlockchain and IOT and the GxP Lab Slides
Blockchain and IOT and the GxP Lab SlidesPistoia Alliance
 

More from Pistoia Alliance (20)

Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
MPS webinar master deck
MPS webinar master deckMPS webinar master deck
MPS webinar master deck
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck final
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirti
 
Fair by design
Fair by designFair by design
Fair by design
 
2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar
 
Data market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRData market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIR
 
AI in translational medicine webinar
AI in translational medicine webinarAI in translational medicine webinar
AI in translational medicine webinar
 
CEDAR work bench for metadata management
CEDAR work bench for metadata managementCEDAR work bench for metadata management
CEDAR work bench for metadata management
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBI
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
 
Implementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcareImplementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcare
 
Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the future
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences
 
AI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEAI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoE
 
Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019
 
Blockchain and IOT and the GxP Lab Slides
Blockchain and IOT and the GxP Lab SlidesBlockchain and IOT and the GxP Lab Slides
Blockchain and IOT and the GxP Lab Slides
 

Recently uploaded

Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableNehru place Escorts
 
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowSonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowRiya Pathan
 
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...narwatsonia7
 
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Hemostasis Physiology and Clinical correlations by Dr Faiza.pdf
Hemostasis Physiology and Clinical correlations by Dr Faiza.pdfHemostasis Physiology and Clinical correlations by Dr Faiza.pdf
Hemostasis Physiology and Clinical correlations by Dr Faiza.pdfMedicoseAcademics
 
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...narwatsonia7
 
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...narwatsonia7
 
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiCall Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiNehru place Escorts
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photosnarwatsonia7
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...narwatsonia7
 
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hosur Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...Miss joya
 
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking ModelsMumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Modelssonalikaur4
 
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service LucknowCall Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknownarwatsonia7
 

Recently uploaded (20)

Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
 
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowSonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
 
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
 
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCREscort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
 
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
 
Hemostasis Physiology and Clinical correlations by Dr Faiza.pdf
Hemostasis Physiology and Clinical correlations by Dr Faiza.pdfHemostasis Physiology and Clinical correlations by Dr Faiza.pdf
Hemostasis Physiology and Clinical correlations by Dr Faiza.pdf
 
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
 
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
 
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiCall Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
 
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hosur Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service Available
 
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Servicesauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
 
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
 
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
 
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking ModelsMumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
 
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
 
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service LucknowCall Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
 

Knowledge graphs ilaria maresi the hyve 23apr2020

  • 1. Knowledge graphs and semantic models for drug discovery and healthcare on Thurs, 23rd April 2020 at 16:00 BST Hosted by: Ian Harrow, Pistoia Alliance Speaker: Ilaria Maresi, The Hyve FAIR/OM projects Community of Interest webinar series
  • 2. This webinar is being recorded
  • 3. Audience Q&A Please use the questions box
  • 4. ©PistoiaAlliance Ilaria Maresi • Ilaria is a Data Engineer at The Hyve, specialising in Semantic Modelling and Knowledge Graphs with applications in healthcare and drug discovery. • As a mathematician by training, Ilaria came to the bioinformatics field through her interest in the intersection of biology, mathematics and engineering. • In her free time, she tries to get away from her computer, and enjoys cooking and spending time outside.
  • 5. ©PistoiaAlliance Knowledge graphs and semantic models for drug discovery and healthcare Ilaria Maresi, Data Engineer ilaria@thehyve.nl
  • 6. ©PistoiaAlliance Agenda ● About me ● The problem ● What are semantic models and knowledge graphs? ○ Creating ○ Querying ● Semantic models in action ○ Clinical trials ○ Drug discovery landscape ● Wrap up ● Q&A
  • 7. ©PistoiaAlliance We enable open science by developing and implementing open source solutions and FAIRifying data in life sciences
  • 8. ©PistoiaAlliance About me Data Engineer at The Hyve ● Semantic modelling ● Knowledge graphs ● ETL pipelines ● FAIR data services Utrecht, The Netherlands
  • 10. ©PistoiaAlliance Drug discovery process Source: PhRMA Biopharmaceutical R&D: The Process Behind New Medicines https://www.phrma.org/en/Report/Biopharmaceutical-R- and-D-The-Process-Behind-New-Medicines
  • 11. ©PistoiaAlliance Differing and unlinked identifiers Source: PhRMA Biopharmaceutical R&D: The Process Behind New Medicines https://www.phrma.org/en/Report/Biopharmaceutical-R- and-D-The-Process-Behind-New-Medicines CMP102401 acetylsalicylic acid Aspirin Query I want to see the trajectory of compound xxx
  • 12. ©PistoiaAlliance Differing terminology Source: PhRMA Biopharmaceutical R&D: The Process Behind New Medicines https://www.phrma.org/en/Report/Biopharmaceutical-R- and-D-The-Process-Behind-New-Medicines omics experiment lab test genomics assay Query I want to see all the genomics data related to compound xxx
  • 13. ©PistoiaAlliance Differing terminology Source: PhRMA Biopharmaceutical R&D: The Process Behind New Medicines https://www.phrma.org/en/Report/Biopharmaceutical-R- and-D-The-Process-Behind-New-Medicines non-small cell lung cancer NSCLC non-small cell malignant neoplasm of lung Query I want to see all terminated studies for non-small cell lung cancer that study compound xxx terminated recruiting terminated
  • 14. ©PistoiaAlliance What are semantic models and Knowledge Graphs?
  • 15. ©PistoiaAlliance Semantic models are ... “Semantic models of data sources represent the implicit meaning of the data by specifying the concepts and the relationships within the data. Such models are the key ingredients to automatically publish the data into knowledge graphs” – USC semantic modelling
  • 16. ©PistoiaAlliance What about knowledge graphs? A knowledge graph is several things: ● Database: contains actual data ● Graph: data items and concepts are connected via relationships (i.e nodes and edges) ● Semantic: the meaning of the data is encoded in the graph, allowing for meaning to be inferred ● Alive: constantly refreshed with new data & can be extended and revised as new data comes in
  • 17. ©PistoiaAlliance The power of a knowledge graph ● Linking concepts that are same or similar ○ Model level – entities ○ Data (instance) level – identifiers ● Harmonising data sources without transforming or forcing a common standard on the data ● The more diverse your data, the more powerful your KG will be ○ Google Knowledge Graph ○ Diffbot ○ Wikidata
  • 18. ©PistoiaAlliance A knowledge graph example DBPedia: ● Community effort to extract structured content from the information created in various Wikimedia projects ● Data coming in various formats and from various sources ● KG unifies data and enables querying
  • 19. ©PistoiaAlliance Using RDF to create semantic models ● Resource Description Framework (RDF) ● Information encoded in triples ● Almost everything needs a Uniform Resource Identifier (URI) Subject Object predicate schema:Patient schema:MedicalCondition schema:healthCondition patient_1231 AnginaPectoris schema:healthCondition
  • 20. ©PistoiaAlliance Using RDF to create semantic models patient_1231 AnginaPectoris schema:healthCondition Heart schema:associatedAnatomy AnatomicalStructure rdf:type 12/12/1950 schema:birthDate Ranolazine schema:drug Ranexa schema:nonProprietaryName
  • 21. ©PistoiaAlliance Querying a knowledge graph ● SPARQL Protocol and RDF Query Language (SPARQL)
  • 22. ©PistoiaAlliance Querying a knowledge graph For a single gene (CDX2) these are all its associated identifiers in wikidata: Ensembl ID Entrez ID NCBI Gene ID
  • 23. ©PistoiaAlliance Linked data is FAIR(er) data ● Findable: uniform resource identifiers that persist across an organization ● Accessible: triple store access can be open or only to select users ● Interoperable: RDF gives meaning to data to both humans and machines (include community standard ontologies!) ● Reusable: RDF is independent of tools/systems and can encode provenance of (meta)data
  • 25. ©PistoiaAlliance A semantic model of clinical studies Problem ○ Data across stages of clinical studies exists mostly in silos and is not harmonised ○ Cross domain analytics is currently burdensome ○ Unclear what processes data has gone through Goals ○ Data Conformance Layer to semantically represent source data on clinical trials ○ Leverage external data ○ Represent provenance in model
  • 26. ©PistoiaAlliance The solution ● 1300+ triples ● 65 classes ● 153 properties ● 13 ontologies
  • 27. ©PistoiaAlliance Ontologies and interoperability ● Use ontologies that fit domain and data ● BioPortal ● It’s possible to use multiple ontologies in one model! ● FAIR principles I2 & I3
  • 28. ©PistoiaAlliance The solution ● Harmonising terms (clinical trial vs. clinical study vs. medical trial) ● Metadata annotation (alt labels for compound) ● Controlled vocabularies (indication) ● Provenance (what activity generated the dataset and when)
  • 29. ©PistoiaAlliance Knowledge graph: from R&D to market Problem ○ “How much data do we have and how is it connected?” ○ Data across stages of R&D is unlinked Goals ○ Understand quantity and qualitative details of existing data assets ○ Map relationships between data assets
  • 30. ©PistoiaAlliance The solution ● Semantic model representing data flows and major entities ● Encode provenance Compound registry Electronic Lab Notebook Data Warehouse Outside vendor compound_id assay_results assay_results
  • 31. ©PistoiaAlliance The solution ● Instantiate semantic model with (meta)data ● Queryable knowledge graph with ~14M triples ● Different “views” ● Evolving model
  • 32. ©PistoiaAlliance The solution Query: how many genomics assays exist across all sectors of my organisation? RNA Seq experiment Single-cell RNA-seq assay Whole genome sequencing WGS experiment RSQ_12965 AssayRNA_31 SC4050_EM 8019240201G Exp_WGS_4128019240202G
  • 33. ©PistoiaAlliance The solution Query: how many genomics assays exist across all sectors of my organisation? RNA Seq experiment Single-cell RNA-seq assay Whole genome sequencing Genomics Assay WGS experiment RSQ_12965 AssayRNA_31 SC4050_EM 8019240201G Exp_WGS_412 owl:sameAs rdf:type rdf:type rdf:type rdf:type rdfs:subClassOf rdfs:subClassOf rdfs:subClassOf rdfs:subClassOf owl:equivalentClass 8019240202G rdf:type
  • 35. ©PistoiaAlliance Potential pitfalls ● RDF is flexible – triples could contradict your model and inference could easily propagate wrong information in graph ○ Use SHACL (Shapes Constraint Language) validation ● Overloading triple store = slow query times ○ Restrict triples to important information ● URI schema needs to be maintained ○ Automated URI generation and URI update ● Sometimes Knowledge Graph is overkill ○ Semantic models or common data model may suffice
  • 36. ©PistoiaAlliance Why you should consider KGs ● Connecting data and representing as a KG enables: ○ Querying across an organisation ○ Querying across stages of drug discovery ○ Adoption of common terminologies ○ Cleaner data ○ Clear provenance of data ● Linked data is FAIR(er) data!
  • 37. ©PistoiaAlliance Why you should consider KGs ● Machine learning and knowledge graphs ○ ML can be used to develop a KG ○ Conversely: KG can be used for ML algorithm: “...knowledge graphs are a step towards enabling machines to more deeply understand data ... that don’t fit neatly into the rows and columns of a relational database” [1] [1] https://www.forbes.com/sites/bernardmarr/2019/06/26/knowledge-graphs-and-machine-learning-the-future-of-ai-analytics/#7b825f0c3a36
  • 38. ©PistoiaAlliance Materials Data engineering tools for knowledge graphs ● Data Discovery Toolkit ● Model Repository ● Knowledge Base ● Data Mapping Framework ● Data Visualization Framework FAIR data governance is like a fractal ● Purpose and scope ○ Metrics & KPIs ○ New insights ● Retrospective vs prospective A Data Engineer’s Guide to Semantic Models (coming soon!) ● Getting started on semantic models ● RDF, SPARQL, SHACL Common Data Models for FAIR biomedical data ● Choosing data models ● OMOP CDM Leveraging the OMOP Common Data Model for Clinical Trials ● Repurposing OMOP for clinical trials
  • 40. Semantics of data matrices and the STATO ontology Join us for the next FAIR/OM CoI webinar: Speaker: Philippe Rocca-Serra, Oxford University Thurs 28th May at 16:00 BST