SlideShare a Scribd company logo
1 of 24
GraphSummit | London | November 14th, 2023
Empowering AZ’s
Data Connectivity
Building an Internal Knowledge Graph Service
to foster Knowledge Graph projects, enhancing
Data Reusability with Federated Queries, and
harvesting LLM power to talk to the Graphs
Antonio Fabregat, PhD
Knowledge Graph Lead
Enterprise Data Office, IGNITE (AZ)
Presentation
Outline
• Revolutionizing Data Management
• Internal deployment of the Graph as a Service capability
• A view of the current AZ´s Knowledge Graph project landscape
• Queries Federation across several Knowledge Graphs
• Harvesting LLM power to talk to the graphs
2
Revolutionizing
Data Management
3
Tackling Data Growth with Knowledge Graphs
4
• Drug Discovery and Research Data
• Genomic and proteomic data, high-throughput screening results, clinical trial data, etc.
• Real-World Evidence and Patient Data
• Electronic health records (EHRs), wearable devices and remote monitoring, patient-reported outcomes, etc.
• And more…
Rapid data growth from various sources
• Data management
• Analysis tools
Increasing need for efficient
• Organize and structure data
• Facilitate easier access and analysis
Knowledge Graphs as a solution
Knowledge Graphs vs Traditional Data Management Systems
5
•Struggle with complex relationships and semantics
•Limited in capturing meaning and context
Traditional data management systems
•Excel at representing relationships and semantics
•Understand meaning and context of data
•Enable more effective analysis and insights
Knowledge graphs advantages
AI and Machine Learning with Knowledge Graphs
6
Increasing use of AI and
machine learning
in Data Analysis
•Need for efficient data representation
and processing
Knowledge Graphs
as a Solution
•Enriched data model
•Enhance AI and machine learning
algorithms' understanding
•Improve data analysis capabilities
Data Integration and Interconnectivity
7
Seamless integration of data from various sources
• Creation of a unified view of information
Overcoming Data Silos
•Interconnectivity of Knowledge Graphs
Increased value from Data Assets
•Enhanced Data Analysis and Insights
•Improved Decision-Making
Enhanced Search and Discovery
8
Improved Search and
Discovery Capabilities
More intuitive Query Languages
Uncovering hidden
relationships within data
Accurate and Relevant
Search Results
Enhanced user experience
Increased productivity
Personalisation and Recommendation
9
• Rich data representation and relationship mapping
Understanding user preferences, behaviour, and context
• Relevant content based on user interests
Personalized and targeted recommendations
• Improved customer satisfaction and retention
Increased user engagement
Knowledge Graphs representation alternatives
10
* Adapted from documentation at W3C https://www.w3.org/
Two ways of representing/storing a Knowledge Graph
RDF-star (Resource Description Framework)
Semantic Web: Good for common standards and data exchange
Data model based on 3 parts: subject, predicate and objects
Nodes’ properties added as predicates. Edges with properties are “triple-resources” (like “meta-nodes”)
Storage: “Triple/Quad Stores” Graph Databases
Any type of real-world information, can be represented in a Knowledge Graph
18 nodes (5 instances, 4 classes, 8 literals, 1 triple-resource)
19 relationships (triples)
Knowledge Graph is a way of organizing data & information in the form of a graph
A collection of interlinked concepts, entities, events that represent a network of real-world entities, the relationships between them.
LPG (Labelled-Property Graph)
Good for highly dynamic, transactional use cases
Data organized as nodes, labels, relationships and properties
Both nodes and edges can have properties
Storage: Native Graph Databases
5 nodes (5 ids, 4 Labels, 8 properties)
4 relationships (2 properties)
Internal
Graph as a Service
capability
11
Why Knowledge Graphs? and why a Service?
12
• Data management and analysis
• Overcoming data silos and integration challenges
Growing importance of knowledge graphs
• Hosting and development support for knowledge graphs
• Robust and scalable solutions
• Enhanced data-driven decision-making
Need for efficient and reliable services
• Improved data accessibility and insights
• Streamlined collaboration and innovation
Benefits for businesses and organizations
A view of the AZ´s
Knowledge Graph
Project Landscape
13
Biology | Market Strategy | Logistics | Environmental targets
14
Biological Insights
Knowledge Graph
Graph machine learning to help scientists
make faster & better drug discovery decisions
Competitive Intelligence
Knowledge Graph
One-stop-shop for competitive intelligence,
transforming a manual system into a rich service
Supply Chain
Knowledge Graph
Insights into the company’s supply chain,
streamlining processes to enhance decision-making
Sustainability
Initiative
Decision-making support system aiming to
reduce the company’s carbon footprint
Compounds
15
Compounds Synthesis
& Management
(CSMKG)
Combine several databases
Transforms operational data into business
insights to drive continuous improvements
in storage, logistics and delivery
High Throughput
Screening
(HTSKG)
Contains >£45 million worth of data
Increases the quality and efficiency
of future HTS screens
Compounds
& Fragments
(CFKG)
Creates a view of the chemical space
like a medicinal or computation chemist.
Contains all internal and selected external
libraries and allows users to modify a
search and receive feedback ‘live’
PharmaSci
16
Formulation
Knowledge Graph
Pre-clinical formulation design process
Leading to quicker, more effective
scientific developments
Boston Formulation
Knowledge Graph
Improves the understanding of our data
Enhances collaboration by breaking down
silos and connecting disparate data sources
Lipid Nano Particles
Knowledge Graph
Machine learning models
Predicts in-vivo activity from in-vitro
data for intra-cellular drug delivery
and LNP formulation design
Queries
Federation
across several
Knowledge
Graphs
17
Siloed data looks like…
18
19
Let’s build bridges to connect “siloes” of interest…
Query federation describes a collection of
features that enable users and systems to
run queries against multiple siloed data
sources without needing to migrate all data
to a unified system.
Federated Queries
are these BRIDGES
20
Let’s build bridges to connect “siloes” of interest…
The diagram shows the resulting subgraph for
the federated query that answers the question
“Find all genes in BIKG linked with a specific disease, and then
all trials in CIKG that are testing drugs targeting those genes”
Biological Insights
Knowledge Graph
Competitive Intelligence
Knowledge Graph
CIKG
Harvesting
LLM power to
talk to the
graphs
21
22
AZ Insights Chat
Acknowledgments
• Aaron Holt
• Nicolas Mervaillie
• Joe Depeau
• Job Maelane
• Yuen Leung Tang
• Jesus Barrasa
• Daniel Addison
• Delyan Ivanov
• Suzy Jones
• Wolfgang Klute
• Michael Lainchbury
• Andriy Nikolov
• Nishank Mahore
• Cristina Mihetiu
• Justin Morley
• Michaël Ughetto
• Lauren Eardley
• Karen Roberts
• Anthony Puleo
• Cinthia Willaman
• Ivan Figueroa
• Carlos Mercado
• Jorge Gutierrez
• Koushik Srinivasan
Enterprise Data Office | IGNITE
Enterprise Knowledge Graph Service
Robert Hernandez
Knowledge Engineering
Lead
Sandra Carrasco
Senior Knowledge
Graph Engineer
Antonio Fabregat
Knowledge Graph Lead
Ronnie Mubayiwa
Senior DevOps Engineer
Varun Bhandary
Senior Solution Architect
Sree Balasubramanyam
Senior IT Project Manager
Vishal Kumar
DevOps Engineer
Preetha Mutharasu
Knowledge Graph
Engineer
Prem Oliver Vincent
Scrum Master
Andy Stafford-Hughes
Testing Manager
Umapathy Boopathy
Cloud Solution Architect
Pascual Lorente
Senior Knowledge
Graph Engineer

More Related Content

What's hot

ENEL Electricity Topology Network on Neo4j Graph DB
ENEL Electricity Topology Network on Neo4j Graph DBENEL Electricity Topology Network on Neo4j Graph DB
ENEL Electricity Topology Network on Neo4j Graph DB
Neo4j
 
Intro to Graphs and Neo4j
Intro to Graphs and Neo4jIntro to Graphs and Neo4j
Intro to Graphs and Neo4j
jexp
 

What's hot (20)

SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...
SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...
SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...
 
Workshop - Build a Graph Solution
Workshop - Build a Graph SolutionWorkshop - Build a Graph Solution
Workshop - Build a Graph Solution
 
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptxNeo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
 
Optimizing Your Supply Chain with the Neo4j Graph
Optimizing Your Supply Chain with the Neo4j GraphOptimizing Your Supply Chain with the Neo4j Graph
Optimizing Your Supply Chain with the Neo4j Graph
 
Neanex - Semantic Construction with Graphs
Neanex - Semantic Construction with GraphsNeanex - Semantic Construction with Graphs
Neanex - Semantic Construction with Graphs
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph Technology
 
ENEL Electricity Topology Network on Neo4j Graph DB
ENEL Electricity Topology Network on Neo4j Graph DBENEL Electricity Topology Network on Neo4j Graph DB
ENEL Electricity Topology Network on Neo4j Graph DB
 
AstraZeneca - Re-imagining the Data Landscape in Compound Synthesis & Management
AstraZeneca - Re-imagining the Data Landscape in Compound Synthesis & ManagementAstraZeneca - Re-imagining the Data Landscape in Compound Synthesis & Management
AstraZeneca - Re-imagining the Data Landscape in Compound Synthesis & Management
 
Workshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data ScienceWorkshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data Science
 
Introduction to Knowledge Graphs: Data Summit 2020
Introduction to Knowledge Graphs: Data Summit 2020Introduction to Knowledge Graphs: Data Summit 2020
Introduction to Knowledge Graphs: Data Summit 2020
 
Building social network with Neo4j and Python
Building social network with Neo4j and PythonBuilding social network with Neo4j and Python
Building social network with Neo4j and Python
 
Evotec - How can Knowledge Graphs support Druh Discovery
Evotec - How can Knowledge Graphs support Druh DiscoveryEvotec - How can Knowledge Graphs support Druh Discovery
Evotec - How can Knowledge Graphs support Druh Discovery
 
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?
 
Knowledge Graphs and Generative AI
Knowledge Graphs and Generative AIKnowledge Graphs and Generative AI
Knowledge Graphs and Generative AI
 
Data Modeling with Neo4j
Data Modeling with Neo4jData Modeling with Neo4j
Data Modeling with Neo4j
 
Neo4j: The path to success with Graph Database and Graph Data Science
Neo4j: The path to success with Graph Database and Graph Data ScienceNeo4j: The path to success with Graph Database and Graph Data Science
Neo4j: The path to success with Graph Database and Graph Data Science
 
Intro to Graphs and Neo4j
Intro to Graphs and Neo4jIntro to Graphs and Neo4j
Intro to Graphs and Neo4j
 
Risk Signature Profiles in Health Care Claims(Risk_Signature_Profiles)_.pptx
Risk Signature Profiles in Health Care Claims(Risk_Signature_Profiles)_.pptxRisk Signature Profiles in Health Care Claims(Risk_Signature_Profiles)_.pptx
Risk Signature Profiles in Health Care Claims(Risk_Signature_Profiles)_.pptx
 
Adobe Behance Scales to Millions of Users at Lower TCO with Neo4j
Adobe Behance Scales to Millions of Users at Lower TCO with Neo4jAdobe Behance Scales to Millions of Users at Lower TCO with Neo4j
Adobe Behance Scales to Millions of Users at Lower TCO with Neo4j
 
Supply Chain Twin Demo - Companion Deck
Supply Chain Twin Demo - Companion DeckSupply Chain Twin Demo - Companion Deck
Supply Chain Twin Demo - Companion Deck
 

Similar to AstraZeneca at Neo4j GraphSummit London 14Nov23.pptx

The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdf
Alan Morrison
 

Similar to AstraZeneca at Neo4j GraphSummit London 14Nov23.pptx (20)

ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...
ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...
ASTRAZENECA. Knowledge Graphs Powering a Fast-moving Global Life Sciences Org...
 
Solving the Disconnected Data Problem in Healthcare Using MongoDB
Solving the Disconnected Data Problem in Healthcare Using MongoDBSolving the Disconnected Data Problem in Healthcare Using MongoDB
Solving the Disconnected Data Problem in Healthcare Using MongoDB
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
 
The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdf
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for Healthcare
 
Neo4j for Healthcare & Life Sciences
Neo4j for Healthcare & Life SciencesNeo4j for Healthcare & Life Sciences
Neo4j for Healthcare & Life Sciences
 
Distributed Trust Architecture: The New Reality of ML-based Systems
Distributed Trust Architecture: The New Reality of ML-based SystemsDistributed Trust Architecture: The New Reality of ML-based Systems
Distributed Trust Architecture: The New Reality of ML-based Systems
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
Data literacy
Data literacyData literacy
Data literacy
 
McGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and ScalingMcGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and Scaling
 
Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharing
 
Data Visualization in Health
Data Visualization in HealthData Visualization in Health
Data Visualization in Health
 
DataSpryng Overview
DataSpryng OverviewDataSpryng Overview
DataSpryng Overview
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
Foundational Strategies for Trust in Big Data Part 2: Understanding Your DataFoundational Strategies for Trust in Big Data Part 2: Understanding Your Data
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
 
Leveraging Graphs for AI and ML - Alicia Frame, Neo4j
Leveraging Graphs for AI and ML - Alicia Frame, Neo4jLeveraging Graphs for AI and ML - Alicia Frame, Neo4j
Leveraging Graphs for AI and ML - Alicia Frame, Neo4j
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 View
 
Data Library Services In The Data Stewardship Lifecycle
Data Library Services In The Data Stewardship LifecycleData Library Services In The Data Stewardship Lifecycle
Data Library Services In The Data Stewardship Lifecycle
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 

More from Neo4j

More from Neo4j (20)

Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit MilanWorkshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
 
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
 
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
 
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4jGraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
 
GraphSummit Milan - Neo4j: The Art of the Possible with Graph
GraphSummit Milan - Neo4j: The Art of the Possible with GraphGraphSummit Milan - Neo4j: The Art of the Possible with Graph
GraphSummit Milan - Neo4j: The Art of the Possible with Graph
 
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
 
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
 
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMs
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 

Recently uploaded

Recently uploaded (20)

WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open SourceWSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
 
Evolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI EraEvolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI Era
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
WSO2Con2024 - Facilitating Broadband Switching Services for UK Telecoms Provi...
WSO2Con2024 - Facilitating Broadband Switching Services for UK Telecoms Provi...WSO2Con2024 - Facilitating Broadband Switching Services for UK Telecoms Provi...
WSO2Con2024 - Facilitating Broadband Switching Services for UK Telecoms Provi...
 
WSO2Con2024 - Software Delivery in Hybrid Environments
WSO2Con2024 - Software Delivery in Hybrid EnvironmentsWSO2Con2024 - Software Delivery in Hybrid Environments
WSO2Con2024 - Software Delivery in Hybrid Environments
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptx
 
WSO2CON 2024 Slides - Unlocking Value with AI
WSO2CON 2024 Slides - Unlocking Value with AIWSO2CON 2024 Slides - Unlocking Value with AI
WSO2CON 2024 Slides - Unlocking Value with AI
 
WSO2CON 2024 - Lessons from the Field: Legacy Platforms – It's Time to Let Go...
WSO2CON 2024 - Lessons from the Field: Legacy Platforms – It's Time to Let Go...WSO2CON 2024 - Lessons from the Field: Legacy Platforms – It's Time to Let Go...
WSO2CON 2024 - Lessons from the Field: Legacy Platforms – It's Time to Let Go...
 
WSO2Con2024 - Unleashing the Financial Potential of 13 Million People
WSO2Con2024 - Unleashing the Financial Potential of 13 Million PeopleWSO2Con2024 - Unleashing the Financial Potential of 13 Million People
WSO2Con2024 - Unleashing the Financial Potential of 13 Million People
 
WSO2Con2024 - Organization Management: The Revolution in B2B CIAM
WSO2Con2024 - Organization Management: The Revolution in B2B CIAMWSO2Con2024 - Organization Management: The Revolution in B2B CIAM
WSO2Con2024 - Organization Management: The Revolution in B2B CIAM
 
WSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & Innovation
WSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & InnovationWSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & Innovation
WSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & Innovation
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
WSO2Con2024 - Simplified Integration: Unveiling the Latest Features in WSO2 L...
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
 
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
 
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public AdministrationWSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
WSO2CON 2024 - How CSI Piemonte Is Apifying the Public Administration
 

AstraZeneca at Neo4j GraphSummit London 14Nov23.pptx

  • 1. GraphSummit | London | November 14th, 2023 Empowering AZ’s Data Connectivity Building an Internal Knowledge Graph Service to foster Knowledge Graph projects, enhancing Data Reusability with Federated Queries, and harvesting LLM power to talk to the Graphs Antonio Fabregat, PhD Knowledge Graph Lead Enterprise Data Office, IGNITE (AZ)
  • 2. Presentation Outline • Revolutionizing Data Management • Internal deployment of the Graph as a Service capability • A view of the current AZ´s Knowledge Graph project landscape • Queries Federation across several Knowledge Graphs • Harvesting LLM power to talk to the graphs 2
  • 4. Tackling Data Growth with Knowledge Graphs 4 • Drug Discovery and Research Data • Genomic and proteomic data, high-throughput screening results, clinical trial data, etc. • Real-World Evidence and Patient Data • Electronic health records (EHRs), wearable devices and remote monitoring, patient-reported outcomes, etc. • And more… Rapid data growth from various sources • Data management • Analysis tools Increasing need for efficient • Organize and structure data • Facilitate easier access and analysis Knowledge Graphs as a solution
  • 5. Knowledge Graphs vs Traditional Data Management Systems 5 •Struggle with complex relationships and semantics •Limited in capturing meaning and context Traditional data management systems •Excel at representing relationships and semantics •Understand meaning and context of data •Enable more effective analysis and insights Knowledge graphs advantages
  • 6. AI and Machine Learning with Knowledge Graphs 6 Increasing use of AI and machine learning in Data Analysis •Need for efficient data representation and processing Knowledge Graphs as a Solution •Enriched data model •Enhance AI and machine learning algorithms' understanding •Improve data analysis capabilities
  • 7. Data Integration and Interconnectivity 7 Seamless integration of data from various sources • Creation of a unified view of information Overcoming Data Silos •Interconnectivity of Knowledge Graphs Increased value from Data Assets •Enhanced Data Analysis and Insights •Improved Decision-Making
  • 8. Enhanced Search and Discovery 8 Improved Search and Discovery Capabilities More intuitive Query Languages Uncovering hidden relationships within data Accurate and Relevant Search Results Enhanced user experience Increased productivity
  • 9. Personalisation and Recommendation 9 • Rich data representation and relationship mapping Understanding user preferences, behaviour, and context • Relevant content based on user interests Personalized and targeted recommendations • Improved customer satisfaction and retention Increased user engagement
  • 10. Knowledge Graphs representation alternatives 10 * Adapted from documentation at W3C https://www.w3.org/ Two ways of representing/storing a Knowledge Graph RDF-star (Resource Description Framework) Semantic Web: Good for common standards and data exchange Data model based on 3 parts: subject, predicate and objects Nodes’ properties added as predicates. Edges with properties are “triple-resources” (like “meta-nodes”) Storage: “Triple/Quad Stores” Graph Databases Any type of real-world information, can be represented in a Knowledge Graph 18 nodes (5 instances, 4 classes, 8 literals, 1 triple-resource) 19 relationships (triples) Knowledge Graph is a way of organizing data & information in the form of a graph A collection of interlinked concepts, entities, events that represent a network of real-world entities, the relationships between them. LPG (Labelled-Property Graph) Good for highly dynamic, transactional use cases Data organized as nodes, labels, relationships and properties Both nodes and edges can have properties Storage: Native Graph Databases 5 nodes (5 ids, 4 Labels, 8 properties) 4 relationships (2 properties)
  • 11. Internal Graph as a Service capability 11
  • 12. Why Knowledge Graphs? and why a Service? 12 • Data management and analysis • Overcoming data silos and integration challenges Growing importance of knowledge graphs • Hosting and development support for knowledge graphs • Robust and scalable solutions • Enhanced data-driven decision-making Need for efficient and reliable services • Improved data accessibility and insights • Streamlined collaboration and innovation Benefits for businesses and organizations
  • 13. A view of the AZ´s Knowledge Graph Project Landscape 13
  • 14. Biology | Market Strategy | Logistics | Environmental targets 14 Biological Insights Knowledge Graph Graph machine learning to help scientists make faster & better drug discovery decisions Competitive Intelligence Knowledge Graph One-stop-shop for competitive intelligence, transforming a manual system into a rich service Supply Chain Knowledge Graph Insights into the company’s supply chain, streamlining processes to enhance decision-making Sustainability Initiative Decision-making support system aiming to reduce the company’s carbon footprint
  • 15. Compounds 15 Compounds Synthesis & Management (CSMKG) Combine several databases Transforms operational data into business insights to drive continuous improvements in storage, logistics and delivery High Throughput Screening (HTSKG) Contains >£45 million worth of data Increases the quality and efficiency of future HTS screens Compounds & Fragments (CFKG) Creates a view of the chemical space like a medicinal or computation chemist. Contains all internal and selected external libraries and allows users to modify a search and receive feedback ‘live’
  • 16. PharmaSci 16 Formulation Knowledge Graph Pre-clinical formulation design process Leading to quicker, more effective scientific developments Boston Formulation Knowledge Graph Improves the understanding of our data Enhances collaboration by breaking down silos and connecting disparate data sources Lipid Nano Particles Knowledge Graph Machine learning models Predicts in-vivo activity from in-vitro data for intra-cellular drug delivery and LNP formulation design
  • 18. Siloed data looks like… 18
  • 19. 19 Let’s build bridges to connect “siloes” of interest… Query federation describes a collection of features that enable users and systems to run queries against multiple siloed data sources without needing to migrate all data to a unified system. Federated Queries are these BRIDGES
  • 20. 20 Let’s build bridges to connect “siloes” of interest… The diagram shows the resulting subgraph for the federated query that answers the question “Find all genes in BIKG linked with a specific disease, and then all trials in CIKG that are testing drugs targeting those genes” Biological Insights Knowledge Graph Competitive Intelligence Knowledge Graph CIKG
  • 21. Harvesting LLM power to talk to the graphs 21
  • 23. Acknowledgments • Aaron Holt • Nicolas Mervaillie • Joe Depeau • Job Maelane • Yuen Leung Tang • Jesus Barrasa • Daniel Addison • Delyan Ivanov • Suzy Jones • Wolfgang Klute • Michael Lainchbury • Andriy Nikolov • Nishank Mahore • Cristina Mihetiu • Justin Morley • Michaël Ughetto • Lauren Eardley • Karen Roberts • Anthony Puleo • Cinthia Willaman • Ivan Figueroa • Carlos Mercado • Jorge Gutierrez • Koushik Srinivasan
  • 24. Enterprise Data Office | IGNITE Enterprise Knowledge Graph Service Robert Hernandez Knowledge Engineering Lead Sandra Carrasco Senior Knowledge Graph Engineer Antonio Fabregat Knowledge Graph Lead Ronnie Mubayiwa Senior DevOps Engineer Varun Bhandary Senior Solution Architect Sree Balasubramanyam Senior IT Project Manager Vishal Kumar DevOps Engineer Preetha Mutharasu Knowledge Graph Engineer Prem Oliver Vincent Scrum Master Andy Stafford-Hughes Testing Manager Umapathy Boopathy Cloud Solution Architect Pascual Lorente Senior Knowledge Graph Engineer

Editor's Notes

  1. The rapid growth of data generated from various sources, including IoT devices, social media, and business applications, has led to an increasing need for efficient data management and analysis tools. Knowledge graphs help to organise and structure this vast amount of data, making it easier to access and analyse.
  2. Traditional data management systems often struggle to capture the complex relationships and semantics within data. Knowledge graphs excel at representing and understanding the meaning and context of data, allowing for more effective analysis and insights.
  3. The increasing use of AI and machine learning in data analysis requires more efficient ways to represent and process data. Knowledge graphs provide an enriched data model that allows AI and machine learning algorithms to better understand and analyse data.
  4. Knowledge graphs enable seamless integration of data from various sources, creating a unified view of information. This interconnectivity helps overcome data silos, allowing organizations to derive more value from their data assets.
  5. Knowledge graphs improve search and discovery capabilities by enabling more intuitive query languages and uncovering hidden relationships within data. This results in more accurate and relevant search results, enhancing user experience and productivity.
  6. Knowledge graphs support personalized and targeted recommendations by understanding user preferences, behaviour, and the context of their interactions. This leads to more relevant recommendations and increased user engagement.
  7. Growing importance of knowledge graphs in data management and analysis. There is need for an efficient and reliable service to support both hosting and development of knowledge graphs. AZ is investing in this area to create a robust and scalable service.
  8. When we talk about multiple siloed databases, we could imagine an archipelago. At the first glance, visiting all islands, doesn't seem an easy task!
  9. With the right infrastructure, multiple islands can be connected, and visiting them, suddenly, becomes way easier. Federating queries, across siloed databases, is like building bridges between islands. This allows running queries against multiple siloed data sources, without needing to migrate all data to a unified system.