Graph Summit | London | February 22nd, 2024
Knowledge Graphs powering a fast-
moving global life sciences organisation
Our experience building a knowledge
graph platform and service to power the
next generation of insights and analytics at
AstraZeneca
Varun Bhandary
Senior Solutions Architect
Enterprise Data & AI Architecture
IGNITE (AZ)
Antonio Fabregat, PhD
Knowledge Graph Lead
Enterprise Data Office
IGNITE (AZ)
Agenda
1. Connected Data โค๏ธ Lifesciences
2. Our Challenges and Plan ๐Ÿš
3. Introducing AZโ€™s โ€œKnowledge Graph Service" ๐Ÿ“ฃ
4. A glimpse into the future of Graphs in AZ ๐Ÿ”ญ
5. Talking to your Graphs ๐Ÿ—ฃ๏ธ๐ŸŽ™๏ธ
6. Graphs are Stronger Together ๏ธ
2
AstraZeneca in UK
3
Reference : https://www.astrazeneca.co.uk/about-us/economic
4
Connected Data โค๏ธ
Lifesciences
1
Why Knowledge Graphs at Lifesciences?
6
Integration of Diverse
Data Sources
A unified framework for
connecting heterogeneous
data, enabling researchers and
decision-makers to gain
comprehensive insights across
disparate data silos.
Complexity of Biomedical
Knowledge
Facilitate advanced analytics,
hypothesis generation, and
decision support for drug
discovery, development, and
clinical research.
Semantic Search and
Discovery
Enable semantic search and
discovery by encoding
relationships between entities,
concepts, and attributes in a
graph-based data model
Data-driven Insights and
Decision Making
A powerful foundation for
advanced analytics, machine
learning etc enabling
researchers to uncover
hidden patterns
Use-Cases
7
Drug Discovery
Regulatory
Affairs Patient Study
Compounds
CRM (Engagement
& Reach) Competitive
Insights
Supply Chain
Quality
Planning
Real World
Evidence
Many moreโ€ฆ.
Knowledge Graphs representation alternatives
8
* Adapted from documentation at W3C https://www.w3.org/
Two ways of representing/storing a Knowledge Graph
RDF-star (Resource Description Framework)
Semantic Web: Good for common standards and data exchange
Data model based on 3 parts: subject, predicate and objects
Nodesโ€™ properties added as predicates. Edges with properties are โ€œtriple-resourcesโ€ (like โ€œmeta-nodesโ€)
Storage: โ€œTriple/Quad Storesโ€ Graph Databases
Any type of real-world information, can be represented in a Knowledge Graph
18 nodes (5 instances, 4 classes, 8 literals, 1 triple-resource)
19 relationships (triples)
Knowledge Graph is a way of organizing data & information in the form of a graph
A collection of interlinked concepts, entities, events that represent a network of real-world entities, the relationships between them.
LPG (Labelled-Property Graph)
Good for highly dynamic, transactional use cases
Data organized as nodes, labels, relationships and properties
Both nodes and edges can have properties
Storage: Native Graph Databases
5 nodes (5 ids, 4 Labels, 8 properties)
4 relationships (2 properties)
Our Challenges and
Plan ๐Ÿš
2
Challenges
10
Decouple & Specialise Integrate & Standardise Abstract & Automate
๏ƒผ Use the right tools for the job
Data Lake? Data Warehouse? Graph
Database? LPG? RDF? No-SQL?
๏ƒผ Modular Design with Security in
Mind
Build a component-based
architecture with coherent and
practical principles.
๏ƒผ Think of data as a product
Push and Pull Vs Serve and Consumer
๏ƒผ Make it easy to work with data
across platforms.
Searching and moving data is costly.
Move to an ELT model, leverage
first-party connectors, and
document to promote the most
optimal options.
๏ƒผ Standardise
Apply FAIR principles
๏ƒผ Document and Promote
Patterns
Data Movement, Loading,
Transformation.
๏ƒผ Template and Accelerate
Teams should be able to spend more
time analysing data and deriving
insights than managing infra.
๏ƒผ Automate
Leverage IaC, and automation
pipelines to achieve consistent
deployments.
The Plan
Data Platform
Unified Data Store
Snowflake
External Tables
Snowflake Internal
Table Storage
Unified Data Compute
Snowflake Virtual
Warehouse
Snowflake
Snowpark
SnowPipe
User Defined
Functions
Unistore
Time-Travel
Data-Lake Compute
SQL Cluster
General Purpose
Cluster
Data Lake Store
Raw Layer
Work Layer
Publish Layer
Glue Hive
Metastore
Knowledge Graph Service
Graph Data Store
LPG Storage
Composite
Utilities
Graph DS Libraries
Cypher / APOC
Graph Compute
Graph Build and
Exploration
Graph Analytics
Machine
Learning Studio
Model Build &
Train
Deploy and
Govern
Graph Exploration
Query Client
Data Browser
Graph Data
Visualization
External Data, RWE &
Partnerships
Structured Data
MDM/RDM, Ontologies,
Vocab., Dictionaries
Semi-Structured
Content & Files
Un-Structured
Content & Files
User Input
Data Acquisition
Data Sources Ingestion &
Integration
IoT &
Streaming
API
Management
Event
Store
Queue
MuleSoft
CDC
Database
API
Streaming
Compute
External Data
Transfer
DDTS
Enterprise Platforms
(i.e. SAP)
Decreasing Volume of Content
Increasing Quality of Content
Introducing AZโ€™s
โ€œKnowledge Graph
Service" ๐Ÿ“ฃ
3
Why Knowledge Graphs? and why a Service?
13
โ€ข Data management and analysis
โ€ข Overcoming data silos and integration challenges
Growing importance of knowledge graphs
โ€ข Hosting and development support for knowledge graphs
โ€ข Robust and scalable solutions
โ€ข Enhanced data-driven decision-making
Need for efficient and reliable services
โ€ข Improved data accessibility and insights
โ€ข Streamlined collaboration and innovation
Benefits for businesses and organizations
14
Why using the
Knowledge
Graph Service?
15
Why using the
Knowledge
Graph Service?
16
Why using the
Knowledge
Graph Service?
A Glimpse
into the Future
of Graphs at AZ ๐Ÿ”ญ
4
Biology | Market Strategy | Logistics | Environmental targets
18
Biological Insights
Knowledge Graph
Graph machine learning to help scientists
make faster & better drug discovery decisions
Competitive Intelligence
Knowledge Graph
One-stop-shop for competitive intelligence,
transforming a manual system into a rich service
Supply Chain
Knowledge Graph
Insights into the companyโ€™s supply chain,
streamlining processes to enhance decision-making
Sustainability
Initiative
Decision-making support system aiming to
reduce the companyโ€™s carbon footprint
Compounds
19
Compounds Synthesis
& Management
(CSMKG)
Combine several databases
Transforms operational data into business
insights to drive continuous improvements
in storage, logistics and delivery
High Throughput
Screening
(HTSKG)
Contains ยฃM worth of data
Increases the quality and efficiency
of future HTS screens
Compounds
& Fragments
(CFKG)
Creates a view of the chemical space
like a medicinal or computation chemist.
Contains all internal and selected external
libraries and allows users to modify a
search and receive feedback โ€˜liveโ€™
PharmaSci
20
Formulation
Knowledge Graph
Pre-clinical formulation design process
Leading to quicker, more effective
scientific developments
Boston Formulation
Knowledge Graph
Improves the understanding of our data
Enhances collaboration by breaking down
silos and connecting disparate data sources
Lipid Nano Particles
Knowledge Graph
Machine learning models
Predicts in-vivo activity from in-vitro
data for intra-cellular drug delivery
and LNP formulation design
Talking to your graphs
๐Ÿ—ฃ๏ธ๐ŸŽ™๏ธ
5
Have you ever thought to
have a graph expert with
you 24/7?
GenAI is here to help!
22
AZ Insights Chat
Future Evolutions of the Insights Chat
Knowledge Mesh?
23
Unified Rule, Behavior &
Meta Graph Store
User
User
Knowledge Discovery
Interface
Unified LLM
Integration
(AI Portal)
1
2
3
Domain Specific Knowledge Graphs Domain Specific Knowledge Graphs
Meta Graph Meta Graph Meta Graph
Graphs are Stronger
Together ๏ธ
Why query federation is a
key to unlocking even more
cross-functional use-cases
6
Siloed data looks likeโ€ฆ
25
26
Letโ€™s build bridges to connect โ€œsiloesโ€ of interestโ€ฆ
Query federation describes a collection of
features that enable users and systems to
run queries against multiple siloed data
sources without needing to migrate all data
to a unified system.
Federated Queries
are these BRIDGES
27
Letโ€™s build bridges to connect โ€œsiloesโ€ of interestโ€ฆ
The diagram shows the resulting subgraph for
the federated query that answers the question
โ€œFind all genes in BIKG linked with a specific disease, and then
all trials in CIKG that are testing drugs targeting those genesโ€
Biological Insights
Knowledge Graph
Competitive Intelligence
Knowledge Graph
CIKG
Acknowledgments
โ€ข Aaron Holt
โ€ข Nicolas Mervaillie
โ€ข Joe Depeau
โ€ข Job Maelane
โ€ข Yuen Leung Tang
โ€ข Jesus Barrasa
โ€ข Morgan Senechal
โ€ข Lauren Eardley
โ€ข Cinthia Willaman
โ€ข Taylan Sahin
โ€ข Melanie Hardiman
โ€ข Daniel Addison
โ€ข Delyan Ivanov
โ€ข Suzy Jones
โ€ข Andriy Nikolov
โ€ข Cristina Mihetiu
โ€ข Michaรซl Ughetto
โ€ข Karen Roberts
โ€ข Wolfgang Klute
โ€ข Michael Lainchbury
โ€ข Justin Morley
โ€ข Andy Stafford-Hughes
โ€ข Nikil Kunnappallil
โ€ข Anthony Puleo
โ€ข Ivan Figueroa
โ€ข Koushik Srinivasan
โ€ข Nick Iles
โ€ข Lena Becciolini
Enterprise Data Office | IGNITE
Enterprise Knowledge Graph
Robert Hernandez
Knowledge Engineering
Lead
Sandra Carrasco
Senior Knowledge
Graph Engineer
Antonio Fabregat
Knowledge Graph Lead
Vishal Kumar
DevOps & Data
Engineer
Preetha Mutharasu
Knowledge Graph
Engineer
Ronnie Mubayiwa
Senior DevOps Engineer
Varun Bhandary
Senior Solution Architect
Sree Balasubramanyam
Senior IT Project Manager
Prem Oliver Vincent
Scrum Master
Sangeetha Natarajan
Testing Manager
Miquel Monge
Knowledge Graph
Engineer
Pascual Lorente
Senior Knowledge
Graph Engineer
Santanu Biswas
Senior Datalake Engineer
Tarik Sidi-Mammar
Data Ops Platforms
Service Lead
Lauren Eardley
Enterprise Head of Data
Engineering Services

K7 Ultimate Security Crack FREE latest version 2025

  • 1.
    Graph Summit |London | February 22nd, 2024 Knowledge Graphs powering a fast- moving global life sciences organisation Our experience building a knowledge graph platform and service to power the next generation of insights and analytics at AstraZeneca Varun Bhandary Senior Solutions Architect Enterprise Data & AI Architecture IGNITE (AZ) Antonio Fabregat, PhD Knowledge Graph Lead Enterprise Data Office IGNITE (AZ)
  • 2.
    Agenda 1. Connected Dataโค๏ธ Lifesciences 2. Our Challenges and Plan ๐Ÿš 3. Introducing AZโ€™s โ€œKnowledge Graph Service" ๐Ÿ“ฃ 4. A glimpse into the future of Graphs in AZ ๐Ÿ”ญ 5. Talking to your Graphs ๐Ÿ—ฃ๏ธ๐ŸŽ™๏ธ 6. Graphs are Stronger Together ๏ธ 2
  • 3.
    AstraZeneca in UK 3 Reference: https://www.astrazeneca.co.uk/about-us/economic
  • 4.
  • 5.
  • 6.
    Why Knowledge Graphsat Lifesciences? 6 Integration of Diverse Data Sources A unified framework for connecting heterogeneous data, enabling researchers and decision-makers to gain comprehensive insights across disparate data silos. Complexity of Biomedical Knowledge Facilitate advanced analytics, hypothesis generation, and decision support for drug discovery, development, and clinical research. Semantic Search and Discovery Enable semantic search and discovery by encoding relationships between entities, concepts, and attributes in a graph-based data model Data-driven Insights and Decision Making A powerful foundation for advanced analytics, machine learning etc enabling researchers to uncover hidden patterns
  • 7.
    Use-Cases 7 Drug Discovery Regulatory Affairs PatientStudy Compounds CRM (Engagement & Reach) Competitive Insights Supply Chain Quality Planning Real World Evidence Many moreโ€ฆ.
  • 8.
    Knowledge Graphs representationalternatives 8 * Adapted from documentation at W3C https://www.w3.org/ Two ways of representing/storing a Knowledge Graph RDF-star (Resource Description Framework) Semantic Web: Good for common standards and data exchange Data model based on 3 parts: subject, predicate and objects Nodesโ€™ properties added as predicates. Edges with properties are โ€œtriple-resourcesโ€ (like โ€œmeta-nodesโ€) Storage: โ€œTriple/Quad Storesโ€ Graph Databases Any type of real-world information, can be represented in a Knowledge Graph 18 nodes (5 instances, 4 classes, 8 literals, 1 triple-resource) 19 relationships (triples) Knowledge Graph is a way of organizing data & information in the form of a graph A collection of interlinked concepts, entities, events that represent a network of real-world entities, the relationships between them. LPG (Labelled-Property Graph) Good for highly dynamic, transactional use cases Data organized as nodes, labels, relationships and properties Both nodes and edges can have properties Storage: Native Graph Databases 5 nodes (5 ids, 4 Labels, 8 properties) 4 relationships (2 properties)
  • 9.
  • 10.
    Challenges 10 Decouple & SpecialiseIntegrate & Standardise Abstract & Automate ๏ƒผ Use the right tools for the job Data Lake? Data Warehouse? Graph Database? LPG? RDF? No-SQL? ๏ƒผ Modular Design with Security in Mind Build a component-based architecture with coherent and practical principles. ๏ƒผ Think of data as a product Push and Pull Vs Serve and Consumer ๏ƒผ Make it easy to work with data across platforms. Searching and moving data is costly. Move to an ELT model, leverage first-party connectors, and document to promote the most optimal options. ๏ƒผ Standardise Apply FAIR principles ๏ƒผ Document and Promote Patterns Data Movement, Loading, Transformation. ๏ƒผ Template and Accelerate Teams should be able to spend more time analysing data and deriving insights than managing infra. ๏ƒผ Automate Leverage IaC, and automation pipelines to achieve consistent deployments.
  • 11.
    The Plan Data Platform UnifiedData Store Snowflake External Tables Snowflake Internal Table Storage Unified Data Compute Snowflake Virtual Warehouse Snowflake Snowpark SnowPipe User Defined Functions Unistore Time-Travel Data-Lake Compute SQL Cluster General Purpose Cluster Data Lake Store Raw Layer Work Layer Publish Layer Glue Hive Metastore Knowledge Graph Service Graph Data Store LPG Storage Composite Utilities Graph DS Libraries Cypher / APOC Graph Compute Graph Build and Exploration Graph Analytics Machine Learning Studio Model Build & Train Deploy and Govern Graph Exploration Query Client Data Browser Graph Data Visualization External Data, RWE & Partnerships Structured Data MDM/RDM, Ontologies, Vocab., Dictionaries Semi-Structured Content & Files Un-Structured Content & Files User Input Data Acquisition Data Sources Ingestion & Integration IoT & Streaming API Management Event Store Queue MuleSoft CDC Database API Streaming Compute External Data Transfer DDTS Enterprise Platforms (i.e. SAP) Decreasing Volume of Content Increasing Quality of Content
  • 12.
  • 13.
    Why Knowledge Graphs?and why a Service? 13 โ€ข Data management and analysis โ€ข Overcoming data silos and integration challenges Growing importance of knowledge graphs โ€ข Hosting and development support for knowledge graphs โ€ข Robust and scalable solutions โ€ข Enhanced data-driven decision-making Need for efficient and reliable services โ€ข Improved data accessibility and insights โ€ข Streamlined collaboration and innovation Benefits for businesses and organizations
  • 14.
  • 15.
  • 16.
  • 17.
    A Glimpse into theFuture of Graphs at AZ ๐Ÿ”ญ 4
  • 18.
    Biology | MarketStrategy | Logistics | Environmental targets 18 Biological Insights Knowledge Graph Graph machine learning to help scientists make faster & better drug discovery decisions Competitive Intelligence Knowledge Graph One-stop-shop for competitive intelligence, transforming a manual system into a rich service Supply Chain Knowledge Graph Insights into the companyโ€™s supply chain, streamlining processes to enhance decision-making Sustainability Initiative Decision-making support system aiming to reduce the companyโ€™s carbon footprint
  • 19.
    Compounds 19 Compounds Synthesis & Management (CSMKG) Combineseveral databases Transforms operational data into business insights to drive continuous improvements in storage, logistics and delivery High Throughput Screening (HTSKG) Contains ยฃM worth of data Increases the quality and efficiency of future HTS screens Compounds & Fragments (CFKG) Creates a view of the chemical space like a medicinal or computation chemist. Contains all internal and selected external libraries and allows users to modify a search and receive feedback โ€˜liveโ€™
  • 20.
    PharmaSci 20 Formulation Knowledge Graph Pre-clinical formulationdesign process Leading to quicker, more effective scientific developments Boston Formulation Knowledge Graph Improves the understanding of our data Enhances collaboration by breaking down silos and connecting disparate data sources Lipid Nano Particles Knowledge Graph Machine learning models Predicts in-vivo activity from in-vitro data for intra-cellular drug delivery and LNP formulation design
  • 21.
    Talking to yourgraphs ๐Ÿ—ฃ๏ธ๐ŸŽ™๏ธ 5 Have you ever thought to have a graph expert with you 24/7? GenAI is here to help!
  • 22.
  • 23.
    Future Evolutions ofthe Insights Chat Knowledge Mesh? 23 Unified Rule, Behavior & Meta Graph Store User User Knowledge Discovery Interface Unified LLM Integration (AI Portal) 1 2 3 Domain Specific Knowledge Graphs Domain Specific Knowledge Graphs Meta Graph Meta Graph Meta Graph
  • 24.
    Graphs are Stronger Together๏ธ Why query federation is a key to unlocking even more cross-functional use-cases 6
  • 25.
    Siloed data lookslikeโ€ฆ 25
  • 26.
    26 Letโ€™s build bridgesto connect โ€œsiloesโ€ of interestโ€ฆ Query federation describes a collection of features that enable users and systems to run queries against multiple siloed data sources without needing to migrate all data to a unified system. Federated Queries are these BRIDGES
  • 27.
    27 Letโ€™s build bridgesto connect โ€œsiloesโ€ of interestโ€ฆ The diagram shows the resulting subgraph for the federated query that answers the question โ€œFind all genes in BIKG linked with a specific disease, and then all trials in CIKG that are testing drugs targeting those genesโ€ Biological Insights Knowledge Graph Competitive Intelligence Knowledge Graph CIKG
  • 28.
    Acknowledgments โ€ข Aaron Holt โ€ขNicolas Mervaillie โ€ข Joe Depeau โ€ข Job Maelane โ€ข Yuen Leung Tang โ€ข Jesus Barrasa โ€ข Morgan Senechal โ€ข Lauren Eardley โ€ข Cinthia Willaman โ€ข Taylan Sahin โ€ข Melanie Hardiman โ€ข Daniel Addison โ€ข Delyan Ivanov โ€ข Suzy Jones โ€ข Andriy Nikolov โ€ข Cristina Mihetiu โ€ข Michaรซl Ughetto โ€ข Karen Roberts โ€ข Wolfgang Klute โ€ข Michael Lainchbury โ€ข Justin Morley โ€ข Andy Stafford-Hughes โ€ข Nikil Kunnappallil โ€ข Anthony Puleo โ€ข Ivan Figueroa โ€ข Koushik Srinivasan โ€ข Nick Iles โ€ข Lena Becciolini
  • 29.
    Enterprise Data Office| IGNITE Enterprise Knowledge Graph Robert Hernandez Knowledge Engineering Lead Sandra Carrasco Senior Knowledge Graph Engineer Antonio Fabregat Knowledge Graph Lead Vishal Kumar DevOps & Data Engineer Preetha Mutharasu Knowledge Graph Engineer Ronnie Mubayiwa Senior DevOps Engineer Varun Bhandary Senior Solution Architect Sree Balasubramanyam Senior IT Project Manager Prem Oliver Vincent Scrum Master Sangeetha Natarajan Testing Manager Miquel Monge Knowledge Graph Engineer Pascual Lorente Senior Knowledge Graph Engineer Santanu Biswas Senior Datalake Engineer Tarik Sidi-Mammar Data Ops Platforms Service Lead Lauren Eardley Enterprise Head of Data Engineering Services