This document discusses using knowledge graphs to ground large language models (LLMs) and improve the quality of their responses. It begins with an overview of generative AI and LLMs, noting their opportunities but also challenges like outdated knowledge and the inability to verify sources. The document then proposes using a knowledge graph like Neo4j to provide context and ground LLMs, describing how graphs can be enriched with algorithms, embeddings, and other data. Finally, it demonstrates how contextual searches and responses can be improved by retrieving relevant information from the knowledge graph to augment LLM responses.
LLMs are obviously the hot new topic, and they fall within the larger exciting field of Generative AI
LLMs focus on generating LANGUAGE specifically, whether that is natural language such as in summarizing texts or driving chat interactions, or more specialized language scenarios like code.
I’m pretty sure that most of us in this room have tested them out in some capacity - and it is really amazing how well they draft summaries or even write Python and Cypher snippets :-)
Last week I used ChatGPT to name this presentation - I gave it an outline and asked it to come up with a catchy title
there are so many ways these models can assist our work and lives
LLMs are probabilistic models that take a lot of data, and a lot of time and resources, to train
This results in some limitations - because training takes so much effort, you can’t keep constantly adding data, so some of the models are years out of date
Also this means that rather than giving you a factual answer, they will give you the most PROBABLE answer
That answer depends heavily on what the model has been exposed to - which opens the door to bias
The training data is also largely general knowledge, as opposed to the specific expertise of your organization, and the answers are not easily auditable or explainable
For example - if you are a financial analyst and ask a question like "which managers own a particular stock", it will give you back a list of people rather than understanding that a manager in this case is an institutional investment manager
So what do we do?
Not using LLMs will mean being left in the past, but how do we get LLMs to attend more math classes as Sudhir said, and be less creative when we need them to be factual?
Adding a Neo4j knowledge graph to your LLMs helps improve the relevance and explainability of answers by grounding the LLM and ensuring its responses are underpinned with facts.
This process is called Retrieval Augmented Generation. This approach combines the creative power of your generative model with the stored data of your knowledge graph for more accurate responses.
User asks a question
LLM directed to look for information in Neo4j
Response generated based on the trusted content that is curated by the organization
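To make that concrete, here is a minimal sketch of the retrieval step in Python using the official neo4j driver. The (:Manager)-[:OWNS]->(:Stock) model, credentials, and example question are all hypothetical; the augmented prompt would then be sent to whichever LLM you are using.

```python
# Minimal retrieval sketch, assuming a hypothetical (:Manager)-[:OWNS]->(:Stock) model
# and a local Neo4j instance; adapt labels and credentials to your own graph.
from neo4j import GraphDatabase

driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

def retrieve_facts(ticker: str) -> str:
    # Pull curated facts from the graph instead of relying on the LLM's training data.
    query = """
    MATCH (m:Manager)-[o:OWNS]->(s:Stock {ticker: $ticker})
    RETURN m.name AS manager, o.shares AS shares
    ORDER BY o.shares DESC LIMIT 10
    """
    with driver.session() as session:
        rows = session.run(query, ticker=ticker)
        return "\n".join(f"{r['manager']} holds {r['shares']} shares" for r in rows)

question = "Which managers own NVDA?"
context = retrieve_facts("NVDA")
# The augmented prompt is what actually goes to the LLM, grounding its answer in graph facts.
prompt = f"Answer using only the facts below.\n\nFacts:\n{context}\n\nQuestion: {question}"
```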
Why specifically use Neo4j to ground your LLMs?
Graphs inherently capture CONTEXT
When you are looking for information about an individual or organization, just identifying them isn’t enough - you also want to know your complete relationship with that person or company, and how their interactions with you compare to those of their peers.
Graph data science allows you to enrich your data - you can infer new information about those entities based on their relationships -
like the risk associated with an entity, or identify other similar entities
So you have all this wonderful information - how do you get it to consumers?
When we combine graphs and LLMs - any user, regardless of graph skill, can access reliable, and relevant information
Let’s walk through these steps
I’ll go through this step quickly because I think you’ve had exposure to this before
Building a knowledge graph, we start with data in its native form, identifying all of the sources that have information related to your business problem. In most cases, this information is siloed across a wide variety of sources and data formats
We then ingest all of these sources into a graph, where your unstructured and structured data become connected.
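As a rough illustration of that ingest step, here a hypothetical Company record from a structured source is connected to the Document that mentions it; the labels and credentials are assumptions for the sketch.

```python
# Illustrative ingestion sketch (hypothetical labels): connect a structured record
# with an entity mentioned in an unstructured document, so both live in one graph.
from neo4j import GraphDatabase

driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

with driver.session() as session:
    session.run("""
        MERGE (c:Company {name: $name})
        MERGE (d:Document {id: $doc_id})
        MERGE (d)-[:MENTIONS]->(c)
    """, name="Acme Corp", doc_id="ticket-123")
```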
This allows us to start thinking about step 2, enrichment, where we can layer in semantics and use graph algorithms and queries to derive implicit relationships and capture even more context.
But just this first step gives us so much information - by making the natural relationships explicit, we can start answering questions that would otherwise take hours of preprocessing and data joining with tabular data. Things like
What is the overall fraud risk of an account across channels, including risk implications that are based on more distant relationships with flagged accounts.
Or tracing how a drug is made - its form, active ingredients, additional ingredients, and more - through to how the drug reacts with different genes, which would normally require pulling together information from many sources
And comparing patterns of user activity to see which actions most commonly lead to purchase behaviors
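For instance, the first question above might look roughly like the following Cypher, run through the same driver as in the earlier sketch; the Account and Identifier labels, the relationship types, and the flagged property are illustrative assumptions, not a prescribed model.

```python
# Hedged Cypher sketch of a cross-channel fraud-risk question; names are illustrative.
fraud_risk_query = """
MATCH (a:Account {id: $account_id})
OPTIONAL MATCH (a)-[:USES|TRANSACTS_WITH*1..4]-(f:Account {flagged: true})
OPTIONAL MATCH (a)-[:USES]->(id:Identifier)<-[:USES]-(f)
RETURN a.id AS account,
       count(DISTINCT f)  AS flagged_accounts_within_4_hops,
       count(DISTINCT id) AS identifiers_shared_with_flagged_accounts
"""
# Run it with the neo4j driver from the earlier sketch:
#   session.run(fraud_risk_query, account_id="acct-42")
```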
Once you have the building blocks, you can expand the graph to capture the heterogeneous entities in your business.
Life sciences - manufacturing a drug is not simple. What is a drug? Tylenol has active ingredients plus other ingredients that make up the pill; a lot of research has been done on how the drug reacts with a gene, how that gene relates to other genes and diseases, and how the drug is made. The graph captures all of these aspects of the business and completes the connection from genes to diseases to targets - which would take a TON of preprocessing without a graph, but in the graph is just traversing relationships.
Questions that would otherwise be very difficult become trivial, answered by looking at relationships alone.
Graph data science is also a science-driven approach, but in this world data scientists use relationships to answer questions. And the good news is it’s not EITHER/OR - it’s BOTH: we can leverage queries and data science together, since the relationships are already in your data.
A few different use cases and applications. With a few banks attending - at a very high level, a generalized model looks something like this: account holder, bank, PII such as SSN, phone number, address, and so on (transactions are not represented here). You can also see people sharing information. If you want to ask a question, you query this, and because of the way the data is stored in a graph database, you traverse - multiple hops.
If you want to see dependencies between drugs, you can easily be looking at more than 4 hops - better to have a graph than joins.
Retail is a top use case for graph databases - similarity between customers based on behavior, traversing to find similarity between customers and between products, correlations between products purchased together - powering marketing, recommendations, and search; adopted by most top retailers.
Looking at local pattern matches with Cypher - in finserv, given an account holder, how many flagged accounts are out there, via links to metadata such as bank accounts and mobile apps? If you know what the data model looks like - the diagram is made up of entities and the semantic relationships connecting them - going 4 hops out makes sense. It is more a semantic statement than a pure traversal statement.
Old —
So a lot of times when we start talking to prospects, they’re really excited about using graph data science, and we’re very excited that they want to use it, but we always want to start off as simple as possible and see what they can accomplish with just cypher. Sometimes just changing your data structure from relational to graph and realizing those relationships that are hidden in your data can answer your questions without requiring algorithms or ML.
That said, a lot of problems DO require machine learning, it’s just about using the right tools for the problem at hand.
For example, in finance and identifying fraud, you might want to evaluate fraud risk for a particular applicant. With relational data you might be able to see that this person shares an SSN with another account one hop, or one relationship, away - but you wouldn’t have visibility into how many flagged accounts are 4 hops out, and how many common identifiers are shared with those nodes. So just with Cypher queries you can get a lot of important information that might influence your decision to approve a request or flag an account as risky for further follow-up. And it can be helpful for this to be just Cypher, because you want those interactions to be very fast, at that millisecond response level.
Life sciences also has some really cool applications, traversing the graph to understand what connects genes to diseases to targets, and understanding those more distant relationships to help improve drug repurposing studies.
And then we have a lot of common marketing and recommendations applications with cypher queries. So if two customers buy the same product, can we compare their other purchases to what else they are likely to buy. Once we start moving away from just traversing relationships and trying to compare user patterns at the network level, that’s when we want to start bringing in graph data science.
And keep in mind, this is not an either or situation, you can use BOTH cypher queries AND graph machine learning and algorithms because we start with relationships already realized in the data.
What may seem like complex questions become simple queries, so we’re already seeing value - but there’s still more information hidden in the patterns of relationships, waiting to be pulled out.
We can tease out that enrichment in several ways
Creating derived relationships with queries and simple pattern matches
Algorithms to capture things within the larger context of the graph, like relative importance - or influence in a social network - or community information
And finally we can capture the broader context of a node by capturing its neighborhood and representing it as a fixed-length vector, or embedding.
Node embeddings capture the broader neighborhood around an entity as a vector
They allow us to identify similar entities based on their social network or activities, like in a customer journey
Katie and Phani are more similar than Emil
they will also become important when we start thinking about our final step of integrating with LLMs
All of these can directly surface information in the graph, or be used as input features to help enrich ML Pipelines - so for example in a fraud use case we may
create direct relationships between users with shared identifiers
generate influence scores as well as community statistics
and then capture their interactions with users multiple hops away as an embedding to predict fraud
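A sketch of what that enrichment might look like with the Neo4j Graph Data Science Python client (graphdatascience); the Account label, TRANSACTS_WITH relationship, and property names are assumptions, and exact parameters may vary by GDS version.

```python
# Enrichment sketch with the Neo4j Graph Data Science Python client
# (`pip install graphdatascience`); labels and property names are illustrative.
from graphdatascience import GraphDataScience

gds = GraphDataScience("bolt://localhost:7687", auth=("neo4j", "password"))

# Project the transaction network into an in-memory graph for the algorithms.
G, _ = gds.graph.project("accounts", ["Account"], ["TRANSACTS_WITH"])

gds.pageRank.write(G, writeProperty="influence")    # relative importance / influence
gds.louvain.write(G, writeProperty="community")     # community membership
gds.fastRP.write(G, embeddingDimension=128,
                 writeProperty="embedding")         # neighborhood captured as a vector

G.drop()  # release the in-memory projection
```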
The types of insights and enrichment we can derive from the use of graph algorithms are quite varied, because the underlying approaches themselves are varied and map onto traditional ML concepts
we have unsupervised and supervised approaches
Unsupervised - clustering and association based on similarity/distance in network
Centrality - how important a node is
Dimensionality reduction - capturing the local network in embeddings
Also supervised approaches ….
Choosing the right category is all about your use case
Centrality is about finding influencers or most important nodes in your graph, and understanding how they impact the network. For example key bridge points between subnetworks can highlight risk points or vulnerabilities in supply chains. Or Influencers in a social network
Within Pathfinding - shortest path can link drug targets to the most likely outcomes or side effects, or find optimal paths in routing scenarios within supply chain or energy sectors.
Community detection enables more targeted recommendations, customer segmentation, and entity resolution
Similarity algorithms are also key to associating similar nodes (e.g. Jaccard), and enable what-if analysis and disaster recovery scenarios
Embeddings - as I mentioned earlier - help us capture higher-dimensional signals and use them in predictive pipelines or to evaluate similarity - in this case, similarity from the perspective of network features
And finally link prediction can be used to enrich the graph and handle data quality challenges as well as find the next best recommendation or action for an individual
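Continuing the earlier enrichment sketch, one hedged way to surface similar entities is k-nearest-neighbours over the stored embeddings, writing SIMILAR_TO relationships back to the graph; again, every name here is illustrative and parameters may vary by GDS version.

```python
# Similarity sketch: kNN over the FastRP embeddings written earlier (hypothetical names).
from graphdatascience import GraphDataScience

gds = GraphDataScience("bolt://localhost:7687", auth=("neo4j", "password"))

# Project the accounts again, this time carrying the stored embedding property.
G, _ = gds.graph.project("accounts-knn",
                         {"Account": {"properties": ["embedding"]}},
                         ["TRANSACTS_WITH"])

gds.knn.write(G, nodeProperties=["embedding"], topK=5,
              writeRelationshipType="SIMILAR_TO", writeProperty="similarity")

G.drop()
```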
So far we have our graph, we’ve enriched it with algorithms, and we can use vectors to query it, so we’re ready for LLMs
Same is true of powering semantic search
Vector similarity is just the first step - with a particular search query you can find the most relevant mentions or documents in your data
As you layer in graph queries then you start to understand the more complete context, who authored that particular document, what job title do they have, who else is on their team, and who else contributed to or has access to that information?
With graph algorithms we can further inform the author’s influence in the organization or who has similar interests, and so on
These last two steps are critical in supplying context and helping to filter responses to the most relevant information.
They also play an important role in improving search where text and other document focused references are sparse.
Example: using search to find people who have the knowledge relevant to help solve a bug you’ve encountered
you may search for FAQ or customer support tickets using vector search, and look at the authors.
But then that will limit you to only people who have taken the time to document their knowledge, and those people may no longer even be in a relevant role or with the company.
With the graph, you can take another step out in context and ask - what skills did the author have, and who else has those skills and is in a relevant role currently?
Based on their influence - How trusted are their responses?
You can increase the breadth and the relevance of your search to make sure you find the right people to solve the problem.
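A hedged sketch of that "vector search first, then widen with the graph" pattern, assuming a Neo4j version with vector indexes (db.index.vector.queryNodes) and a hypothetical ticket/author/skill model; the index name, labels, and placeholder embedding are all assumptions.

```python
# Expert-finding sketch: vector search over support tickets, then graph hops to people.
from neo4j import GraphDatabase

driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

# In practice this vector comes from the same embedding model used to index the tickets;
# the zero vector is just a runnable placeholder.
question_embedding = [0.0] * 1536

expert_search = """
CALL db.index.vector.queryNodes('ticket_embeddings', 5, $embedding)
YIELD node AS ticket, score
MATCH (ticket)<-[:AUTHORED]-(author:Person)-[:HAS_SKILL]->(skill:Skill)
MATCH (skill)<-[:HAS_SKILL]-(colleague:Person)
WHERE colleague.active = true
RETURN ticket.title AS ticket, score, author.name AS author,
       collect(DISTINCT colleague.name) AS others_with_same_skills
ORDER BY score DESC
"""

with driver.session() as session:
    for row in session.run(expert_search, embedding=question_embedding):
        print(row["ticket"], row["author"], row["others_with_same_skills"])
```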
This type of maturity in approach enables you to tackle those challenges we talked about at the beginning
By taking advantage of both semantic embeddings provided by LLMs, AND the rich, human readable data in a knowledge graph, you can reduce hallucinations and get domain specific responses.
Now, Let's dive a little deeper into the process. How exactly do we achieve this?
We start by connecting to Neo4j and getting the graph schema we want to understand.
We provide that as context to the LLM, along with a few example queries, which primes it with custom domain knowledge.
We can also optionally fine-tune the model with even more domain-specific examples
And then, given its knowledge about the database, it generates a query to interact with the database and summarizes the resulting data
We are now able to answer our questions using natural language.
This grounding is called retrieval augmented generation
(explain diagram)
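A minimal sketch of that flow with LangChain (which we use in the demo), assuming a version where Neo4jGraph and GraphCypherQAChain are available; the model name, credentials, and question are placeholders, and recent LangChain versions also require explicitly opting in via allow_dangerous_requests.

```python
# Text-to-Cypher sketch with LangChain; names and credentials are placeholders.
from langchain_community.graphs import Neo4jGraph
from langchain_openai import ChatOpenAI
from langchain.chains import GraphCypherQAChain

graph = Neo4jGraph(url="bolt://localhost:7687", username="neo4j", password="password")
llm = ChatOpenAI(model="gpt-4o", temperature=0)  # low temperature: less creative, more factual

# The chain reads the graph schema, asks the LLM to write Cypher, runs the query,
# and summarizes the result in natural language.
chain = GraphCypherQAChain.from_llm(
    llm=llm,
    graph=graph,
    verbose=True,
    allow_dangerous_requests=True,  # required by newer LangChain versions
)
chain.invoke({"query": "Which business tasks are impacted by critical vulnerabilities?"})
```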
Knowledge Graphs enable linking of structured and unstructured data for most accurate & relevant results
Ultimately - whether you’re using GenAI or other ML - graphs play a key role in getting the full value out of your data
Graphs pull out the relationships that natively exist in your data
We can then gain more context and valuable information using graph algorithms
Interpret and communicate results with analysts using Bloom
Integrate with a wide variety of technologies including large language models and ai platforms to further extend the value of your data.
Excited to see how applications of graphs and genai develop, and come chat with me during the reception, i’d love to hear what you’re working on!
Let’s see this in practice
We’re going to use a business intelligence data model
In this use case it’s important to bring together information across the organization
connecting critical vulnerabilities, software applications, data centers, and other IT assets, so we can understand how each vulnerability - be it cybersecurity or physical - will impact your business processes
This is a simplified version, but under the covers it is much more than a few joins, it’s a network of information - each one of these icons has its own web of hierarchies and relationships
So this is what the actual data model looks like, where we have all of those different domains of data, people, locations, vulnerabilities, and are able to surface those connections to easily understand how a particular vulnerability may impact software applications which therefore impacts business tasks.
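The kind of impact query behind that demo might look roughly like this; the Vulnerability, Application, DataCenter, and BusinessTask labels and the relationship types are assumptions about the simplified model on the slide.

```python
# Hedged impact-analysis sketch over a hypothetical BI data model.
impact_query = """
MATCH (v:Vulnerability {severity: 'CRITICAL'})-[:AFFECTS]->(app:Application)
MATCH (app)-[:RUNS_IN]->(dc:DataCenter), (app)-[:SUPPORTS]->(task:BusinessTask)
RETURN v.name AS vulnerability, app.name AS application,
       dc.name AS data_center, collect(task.name) AS impacted_tasks
"""
# Run it with the neo4j driver as in the earlier sketches: session.run(impact_query)
```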
That’s what we’ll go through in the demo
A little bit on what’s happening under the hood before I jump into the demo
We’re using LangChain to orchestrate interactions between the database and the selected LLM
Few-shot prompting - just providing a few examples
Maruti showed you a bit of the behind the scenes details for what a prompt looks like
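A few-shot prompt for Cypher generation can be as simple as a template with a couple of hand-written question/query pairs; the examples below are hypothetical and would be adapted to your own schema.

```python
# Illustrative few-shot prompt template: the hand-written pairs steer the LLM toward
# the schema's labels and relationship types (examples are hypothetical).
FEW_SHOT_PROMPT = """You translate questions into Cypher for the schema below.

Example 1
Question: Which applications run in the Frankfurt data center?
Cypher: MATCH (a:Application)-[:RUNS_IN]->(d:DataCenter {{name: 'Frankfurt'}}) RETURN a.name

Example 2
Question: How many critical vulnerabilities affect each application?
Cypher: MATCH (v:Vulnerability {{severity: 'CRITICAL'}})-[:AFFECTS]->(a:Application)
        RETURN a.name, count(v)

Question: {question}
Cypher:"""

prompt = FEW_SHOT_PROMPT.format(question="Which business tasks depend on flagged software?")
```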
Before I leave you - I want to highlight where Neo4j sits more broadly in the Generative AI ecosystem.
LLMs can help capture knowledge and accelerate the data ingest process, particularly with unstructured data (we didn’t focus on this topic today, but thankfully Maruti highlighted it in his presentation - if you have questions, come find me)
Then, in the graph, we can enrich the factual data of your organization using graph queries, algorithms, and data science workflows
Finally - leverage LLMs to enable chatbots and semantic search to power applications - you can increase the accuracy and relevance of recommendations, facilitate sourcing of institutional knowledge in customer service and ticketing use cases - there are so many ways you can apply this technology