SlideShare a Scribd company logo
1 of 40
Download to read offline
Data Engineering
for RAG
ABK of Neo4j
(Andreas Kollegger)
Generative AI
Neo4j Inc. All rights reserved 2024
2
Generative AI
● Learns random sentences
from random people
● Talks like a person but doesn't really
understand what it's saying
● Occasionally speaks absolute nonsense
● Sensitive to question phrasing
● Answers reflect the person asking
● Can't explain or verify answers
● Limited to public "knowledge"
Neo4j Inc. All rights reserved 2024
3
Generative AI
● Learns random sentences
from random people
● Talks like a person but doesn't really
understand what it's saying
● Occasionally speaks absolute nonsense
● Sensitive to question phrasing
● Answers reflect the person asking
● Can't explain or verify answers
● Limited to public "knowledge"
Neo4j Inc. All rights reserved 2024
4
How do we
integrate
with the alien
technology?
Neo4j Inc. All rights reserved 2024
5
Everything
starts with
practical work,
using RAG…
Neo4j Inc. All rights reserved 2024
6
Retrieval Augmented
Generation (RAG)
RAG is a software design pattern for
integrating GenAI Apps with custom data
sources, like a database.
Neo4j Inc. All rights reserved 2024
7
A Generative AI application
uses an LLM
to provide responses
to user prompts
(aka ChatGPT)
Neo4j Inc. All rights reserved 2024
8
8
User
Prompt
Complete
Response
GenAI
Application
LLM
User Prompt
Response
RAG augments the LLM by
intercepting a user's prompt,
then making a query to a database,
then using the query results as
context for the user's prompt,
creating a new prompt that is passed
to the LLM
for a complete, curated response
Neo4j Inc. All rights reserved 2024
9
9
Database
GenAI
Application
Complete
Response
User
Prompt
LLM
User Prompt
+ Context
Response
User Prompt
Context
1 2
1 2
This sets up a knowledge stack…
the user knows something about the
question they're asking
the application knows something
about the user
the database knows about particular
information and data
the LLM knows about whatever it
found on the internet
Neo4j Inc. All rights reserved 2024
10
10
User Knowledge
App Knowledge
Database Knowledge
LLM Knowledge
Knowledge Stack
This sets up a knowledge stack…
the user knows something about the
question they're asking
the application knows something
about the user
the database knows about particular
information and data
the LLM knows about whatever it
found on the internet
Neo4j Inc. All rights reserved 2024
11
11
User Knowledge
App Knowledge
Database Knowledge
LLM Knowledge
Knowledge Stack
Knowledge you control,
in the app and the database.
Three Sources of Data
for RAG
Each with different access patterns,
supporting different kinds of questions.
Neo4j Inc. All rights reserved 2023
12
Neo4j Inc. All rights reserved 2024
13
Pure Text
Neo4j Inc. All rights reserved 2024
14
Pure Text
Unstructured data in PDFs,
plain text files, or images
Information search: “What is Apple's primary business?”
Answer with: Implicit knowledge derived from text.
Neo4j Inc. All rights reserved 2024
15
Pure Text
Unstructured data in PDFs,
plain text files, or images
Neo4j Inc. All rights reserved 2024
16
Pure Data
Pure Text
Neo4j Inc. All rights reserved 2024
17
Pure Data
Structured data
in a database
Pure Text
Neo4j Inc. All rights reserved 2024
18
Pure Data
Structured data
in a database
Pure Text
Information query: “How many iPhones did Apple sell this quarter?”
Answer with: Explicit facts from a database query.
Neo4j Inc. All rights reserved 2024
19
Pure Text Pure Data
Mixed
Text + Data
Neo4j Inc. All rights reserved 2024
20
Pure Text Pure Data
Mixed
Text + Data
Structured data together
with long-form text
Neo4j Inc. All rights reserved 2024
21
Pure Text Pure Data
Mixed
Text + Data
Structured data together
with long-form text
Information discovery: “Which investors will be impacted by a chip shortage?”
Answer with: Combined search and data query.
Neo4j Inc. All rights reserved 2024
22
Pure Text Pure Data
Mixed
Text + Data
Neo4j Inc. All rights reserved 2024
23
Pure Text Pure Data
Mixed
Text + Data
A Knowledge Graph:
Information architecture for data, organized using graph structures,
which places data within context.
Neo4j Inc. All rights reserved 2024
24
Pure Text Pure Data
Mixed
Text + Data
A Knowledge Graph:
Information architecture for data, organized using graph structures,
which places data within context.
Graph RAG:
Supports multiple modes of information retrieval, including
information search, information query, and information discovery.
Neo4j Inc. All rights reserved 2024
25
Pure Text Pure Data
Mixed
Text + Data
Vector Search Search + Pattern Matching Graph Queries
Find relevant documents
plus context for
information search
Expand context
and rank the relevance for
information discovery
Directly query the
knowledge graph for
information query
GenAI Example:
SEC Edgar
Financial Forms
Neo4j Inc. All rights reserved 2024
26
SEC Edgar Financial Data
The EDGAR database provides free public
access to company information, allowing
research about public company financial
information and operations through the filings
they submit to the SEC.
There are two forms that we'll look at today:
1. Form 10K-filings from publicly traded
companies
2. Form 13 -filings from institutional
investment management firms
Neo4j Inc. All rights reserved 2024
27
Data Modeling Strategy
Start with a Minimum Viable Graph (MVG)
Create, Enhance, Connect then repeat to grow the graph
1. Create-identify interesting information, create records
2. Enhance-supercharge the data by enhancing some dimension
3. Connect-connect information to expand context and reveal knowledge
Neo4j Inc. All rights reserved 2024
28
Form
10k
Chunk
Chunk
Chunk
Chunk
Create -Form 10K text chunks
exercitation ullamco
laboris nisi ut aliquip
enim ad minim veniam,
quis nostrud
incididunt ut labore et
dolore magna aliqua. Ut
adipiscing elit, sed do
eiusmod tempor
Lorem ipsum dolor sit
amet, consectetur
Chunk
Neo4j Inc. All rights reserved 2024
29
2. Split Text
1. Source - Form 10K 3. Create Nodes
Form
10k
Chunk
Chunk
Chunk
Chunk
Enhance -Text with an embedding
exercitation ullamco
laboris nisi ut aliquip
enim ad minim veniam,
quis nostrud
incididunt ut labore et
dolore magna aliqua. Ut
adipiscing elit, sed do
eiusmod tempor
Lorem ipsum dolor sit
amet, consectetur
[0.6,0.2,0.1,0.7]
[0.5,0.2,0.1,0.7]
[0.4,0.2,0.1,0.7]
[0.3,0.2,0.1,0.5]
[0.2,0.2,0.1,0.7]
1. Source - Chunks
Chunk
Neo4j Inc. All rights reserved 2024
Vector Index
30
4. Add Embedding
Form
10k
Chunk
Chunk
Chunk
Chunk
Connect -Connect chunks into a list
exercitation ullamco
laboris nisi ut aliquip
enim ad minim veniam,
quis nostrud
incididunt ut labore et
dolore magna aliqua. Ut
adipiscing elit, sed do
eiusmod tempor
Lorem ipsum dolor sit
amet, consectetur
[0.6,0.2,0.1,0.7]
[0.5,0.2,0.1,0.7]
[0.4,0.2,0.1,0.7]
[0.3,0.2,0.1,0.5]
[0.2,0.2,0.1,0.7] Chunk
NEXT
1. Connect Chunks
Chunk
Chunk
Neo4j Inc. All rights reserved 2024
31
Form
10k
Chunk
Chunk
Chunk
Chunk
Create, Enhance, Connect Form 10K
exercitation ullamco
laboris nisi ut aliquip
enim ad minim veniam,
quis nostrud
incididunt ut labore et
dolore magna aliqua. Ut
adipiscing elit, sed do
eiusmod tempor
Lorem ipsum dolor sit
amet, consectetur
2. Split Text
[0.6,0.2,0.1,0.7]
[0.5,0.2,0.1,0.7]
[0.4,0.2,0.1,0.7]
[0.3,0.2,0.1,0.5]
[0.2,0.2,0.1,0.7]
4. Add Embedding
1. Source - Form 10K 3. Create Nodes
Chunk
NEXT
5. Connect
Chunk
Chunk
Extract Enhance Expand
Neo4j Inc. All rights reserved 2024
32
Benefits:
● vector similarity search to find
relevant text
● expand context window with
previous/next chunks
● enable paging through text
Neo4j Inc. All rights reserved 2024
33
Chunk
NEXT
Minimum Viable Graph
formId: string
chunkId: string
text: string
textEmbedding: float[]
vector index
Linked List of Text
Create-create separate Form nodes
for each Form 10K. Add summary.
Enhance-vector index of summary.
Connect-connect from Form to first
node in linked list. Then from each
chunk back to the Form Node.
Benefits:
● expand context of chunk with
summary text
● navigate from form to text
Neo4j Inc. All rights reserved 2024
34
Chunk
NEXT
Improve Context
cusip6: string
formId: string
summary: string
summaryEmbedding: float[]
vector index
Hierarchical Summary
Form
PART_OF
SECTION
Add Form 13
Neo4j Inc. All rights reserved 2024
35
Company
Manager
OWNS_STOCK_IN
Create-create Manager and Company
nodes
Enhance-full-text index of names
Connect-connect Manager nodes to
Company nodes through investments
Benefits:
● pattern-matching queries
● search names by text similarity
(Apple and Apple Inc)
rather than conceptual similarity
(Apple and Banana)
name: string
address: string
full-text index
shares: integer
value: float
name: string
address: string
full-text index
Structured Data
Company
Manager
OWNS_STOCK_IN
Address
L
O
C
A
T
E
D
_
A
T
L
O
C
A
T
E
D
_
A
T
Located at Address
Neo4j Inc. All rights reserved 2024
36
Create-create Address nodes
Enhance-geospatial index of address
Connect-connect Manager and
Company nodes to Address
Benefits:
● pattern-based location queries
● distance-based calculations,
search companies within radius or
bounding box
city: string
state: string
country: string
location: Point
geospatial index
Geospatial Search
Combine Graphs
Neo4j Inc. All rights reserved 2024
37
Connect-connect Company nodes to
the Form they filed
Benefits:
● expanded context for
vector-based search
● refine search results by location
● expanded pattern matches
Mixed Text & Data
Chunk
Company
FILED
Form
PART_OF
SECTION
Manager
OWNS_STOCK_IN
NEXT
Address
L
O
C
A
T
E
D
_
A
T
L
O
C
A
T
E
D
_
A
T
Create, Enhance, Connect SEC Financial Forms
Sections from a Form Form 10K Nodes Public Companies Management Firms Addresses
Source Form 10K json files (:Chunk) Form 13 CSV Form 13 CSV (:Company), (:Manager)
1. Create (:Chunk) (:Form) (:Company) (:Manager) (:Address)
2. Enhance Vector embedding Vector embedding Full-text index Full-text index Geospatial index
3. Connect (Chunk)
-[NEXT]->(Chunk)
(Chunk)
-[PART_OF]->(Form)
(Company)
-[FILED]->(Form)
(Manager)
-[OWNS_STOCK_IN]->(Company)
(Company|Manager)
-[LOCATED_AT]->(Address)
You can continue to grow the knowledge graph…
● cross-link Companies that mention each other
● add People, Places, Topics extracted from text (named entity recognition)
● add more Form data, or other related sources
● add User information to keep history, refine relevance and enable feedback
Neo4j Inc. All rights reserved 2024
38
Resources & Next Steps
Neo4j Inc. All rights reserved 2024
39
Code
github.com/neo4j-examples/sec-edgar-notebooks
Get Started with Neo4j -Aura Free
neo4j.com/cloud/aura-free/
GenAI Ecosystem & Free Learning Resources
neo4j.com/labs/genai-ecosystem/
graphacademy.neo4j.com/categories/llms/
Thank you!
andreas.kollegger@neo4j.com
Neo4j Inc. All rights reserved 2023
40

More Related Content

What's hot

Kubernetes on AWS with Amazon EKS - MAD301 - New York AWS Summit
Kubernetes on AWS with Amazon EKS - MAD301 - New York AWS SummitKubernetes on AWS with Amazon EKS - MAD301 - New York AWS Summit
Kubernetes on AWS with Amazon EKS - MAD301 - New York AWS SummitAmazon Web Services
 
Top Trends in Application Architecture That Enable.pdf
Top Trends in Application Architecture That Enable.pdfTop Trends in Application Architecture That Enable.pdf
Top Trends in Application Architecture That Enable.pdfMantoshKumarSingh7
 
Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...
Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...
Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...HostedbyConfluent
 
Neo4j 4 Overview
Neo4j 4 OverviewNeo4j 4 Overview
Neo4j 4 OverviewNeo4j
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scaleMaxim Salnikov
 
Generative AI for the rest of us
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of usMassimo Ferre'
 
Building a Modern Data Platform on AWS
Building a Modern Data Platform on AWSBuilding a Modern Data Platform on AWS
Building a Modern Data Platform on AWSAmazon Web Services
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN FrameworkKeymate.AI
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
 
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)Naoki (Neo) SATO
 
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITYGENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITYAndre Muscat
 
The Five Levels of Generative AI for Games
The Five Levels of Generative AI for GamesThe Five Levels of Generative AI for Games
The Five Levels of Generative AI for GamesJon Radoff
 
Visualizing Big Data Insights with Amazon QuickSight
Visualizing Big Data Insights with Amazon QuickSightVisualizing Big Data Insights with Amazon QuickSight
Visualizing Big Data Insights with Amazon QuickSightAmazon Web Services
 
Amazon EKS - Elastic Container Service for Kubernetes
Amazon EKS - Elastic Container Service for KubernetesAmazon EKS - Elastic Container Service for Kubernetes
Amazon EKS - Elastic Container Service for KubernetesAmazon Web Services
 
Leveraging Generative AI to Accelerate Graph Innovation for National Security...
Leveraging Generative AI to Accelerate Graph Innovation for National Security...Leveraging Generative AI to Accelerate Graph Innovation for National Security...
Leveraging Generative AI to Accelerate Graph Innovation for National Security...Neo4j
 
AWS Compute Evolved Week: High Performance Computing on AWS
AWS Compute Evolved Week: High Performance Computing on AWSAWS Compute Evolved Week: High Performance Computing on AWS
AWS Compute Evolved Week: High Performance Computing on AWSAmazon Web Services
 
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?Neo4j
 
Accelerating Your Portfolio Migration to AWS Using AWS Migration Hub - ENT321...
Accelerating Your Portfolio Migration to AWS Using AWS Migration Hub - ENT321...Accelerating Your Portfolio Migration to AWS Using AWS Migration Hub - ENT321...
Accelerating Your Portfolio Migration to AWS Using AWS Migration Hub - ENT321...Amazon Web Services
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks FundamentalsDalibor Wijas
 
[Giovanni Galloro] How to use machine learning on Google Cloud Platform
[Giovanni Galloro] How to use machine learning on Google Cloud Platform[Giovanni Galloro] How to use machine learning on Google Cloud Platform
[Giovanni Galloro] How to use machine learning on Google Cloud PlatformMeetupDataScienceRoma
 

What's hot (20)

Kubernetes on AWS with Amazon EKS - MAD301 - New York AWS Summit
Kubernetes on AWS with Amazon EKS - MAD301 - New York AWS SummitKubernetes on AWS with Amazon EKS - MAD301 - New York AWS Summit
Kubernetes on AWS with Amazon EKS - MAD301 - New York AWS Summit
 
Top Trends in Application Architecture That Enable.pdf
Top Trends in Application Architecture That Enable.pdfTop Trends in Application Architecture That Enable.pdf
Top Trends in Application Architecture That Enable.pdf
 
Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...
Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...
Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...
 
Neo4j 4 Overview
Neo4j 4 OverviewNeo4j 4 Overview
Neo4j 4 Overview
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scale
 
Generative AI for the rest of us
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of us
 
Building a Modern Data Platform on AWS
Building a Modern Data Platform on AWSBuilding a Modern Data Platform on AWS
Building a Modern Data Platform on AWS
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN Framework
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
 
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITYGENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
 
The Five Levels of Generative AI for Games
The Five Levels of Generative AI for GamesThe Five Levels of Generative AI for Games
The Five Levels of Generative AI for Games
 
Visualizing Big Data Insights with Amazon QuickSight
Visualizing Big Data Insights with Amazon QuickSightVisualizing Big Data Insights with Amazon QuickSight
Visualizing Big Data Insights with Amazon QuickSight
 
Amazon EKS - Elastic Container Service for Kubernetes
Amazon EKS - Elastic Container Service for KubernetesAmazon EKS - Elastic Container Service for Kubernetes
Amazon EKS - Elastic Container Service for Kubernetes
 
Leveraging Generative AI to Accelerate Graph Innovation for National Security...
Leveraging Generative AI to Accelerate Graph Innovation for National Security...Leveraging Generative AI to Accelerate Graph Innovation for National Security...
Leveraging Generative AI to Accelerate Graph Innovation for National Security...
 
AWS Compute Evolved Week: High Performance Computing on AWS
AWS Compute Evolved Week: High Performance Computing on AWSAWS Compute Evolved Week: High Performance Computing on AWS
AWS Compute Evolved Week: High Performance Computing on AWS
 
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?
 
Accelerating Your Portfolio Migration to AWS Using AWS Migration Hub - ENT321...
Accelerating Your Portfolio Migration to AWS Using AWS Migration Hub - ENT321...Accelerating Your Portfolio Migration to AWS Using AWS Migration Hub - ENT321...
Accelerating Your Portfolio Migration to AWS Using AWS Migration Hub - ENT321...
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks Fundamentals
 
[Giovanni Galloro] How to use machine learning on Google Cloud Platform
[Giovanni Galloro] How to use machine learning on Google Cloud Platform[Giovanni Galloro] How to use machine learning on Google Cloud Platform
[Giovanni Galloro] How to use machine learning on Google Cloud Platform
 

Similar to Neo4j: Data Engineering for RAG (retrieval augmented generation)

Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j
 
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...Neo4j
 
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...Neo4j
 
Knowledge Graphs and Generative AI
Knowledge Graphs and Generative AIKnowledge Graphs and Generative AI
Knowledge Graphs and Generative AINeo4j
 
Neo4j Generative AI workshop at GraphSummit London 14 Nov 2023.pdf
Neo4j Generative AI workshop at GraphSummit London 14 Nov 2023.pdfNeo4j Generative AI workshop at GraphSummit London 14 Nov 2023.pdf
Neo4j Generative AI workshop at GraphSummit London 14 Nov 2023.pdfNeo4j
 
Modeling Cybersecurity with Neo4j, Based on Real-Life Data Insights
Modeling Cybersecurity with Neo4j, Based on Real-Life Data InsightsModeling Cybersecurity with Neo4j, Based on Real-Life Data Insights
Modeling Cybersecurity with Neo4j, Based on Real-Life Data InsightsNeo4j
 
El Arte de lo Possible
El Arte de lo PossibleEl Arte de lo Possible
El Arte de lo PossibleNeo4j
 
The Data Platform for Today's Intelligent Applications.pdf
The Data Platform for Today's Intelligent Applications.pdfThe Data Platform for Today's Intelligent Applications.pdf
The Data Platform for Today's Intelligent Applications.pdfNeo4j
 
The path to success with graph database and graph data science_ Neo4j GraphSu...
The path to success with graph database and graph data science_ Neo4j GraphSu...The path to success with graph database and graph data science_ Neo4j GraphSu...
The path to success with graph database and graph data science_ Neo4j GraphSu...Neo4j
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Neo4j
 
The Path To Success With Graph Database and Analytics
The Path To Success With Graph Database and AnalyticsThe Path To Success With Graph Database and Analytics
The Path To Success With Graph Database and AnalyticsNeo4j
 
EIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdf
EIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdfEIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdf
EIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdfEarley Information Science
 
La strada verso il successo con i database a grafo, la Graph Data Science e l...
La strada verso il successo con i database a grafo, la Graph Data Science e l...La strada verso il successo con i database a grafo, la Graph Data Science e l...
La strada verso il successo con i database a grafo, la Graph Data Science e l...Neo4j
 
The Art of the Possible with Graph by Dr Jim Webber Neo4j.pptx
The Art of the Possible with Graph by Dr Jim Webber Neo4j.pptxThe Art of the Possible with Graph by Dr Jim Webber Neo4j.pptx
The Art of the Possible with Graph by Dr Jim Webber Neo4j.pptxNeo4j
 
Knowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptx
Knowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptxKnowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptx
Knowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptxNeo4j
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j
 
Age of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide DiscoveryAge of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide DiscoveryInside Analysis
 
The path to success with Graph Database and Graph Data Science
The path to success with Graph Database and Graph Data ScienceThe path to success with Graph Database and Graph Data Science
The path to success with Graph Database and Graph Data ScienceNeo4j
 
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...Neo4j
 

Similar to Neo4j: Data Engineering for RAG (retrieval augmented generation) (20)

Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
 
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...
 
Knowledge Graphs and Generative AI
Knowledge Graphs and Generative AIKnowledge Graphs and Generative AI
Knowledge Graphs and Generative AI
 
Neo4j Generative AI workshop at GraphSummit London 14 Nov 2023.pdf
Neo4j Generative AI workshop at GraphSummit London 14 Nov 2023.pdfNeo4j Generative AI workshop at GraphSummit London 14 Nov 2023.pdf
Neo4j Generative AI workshop at GraphSummit London 14 Nov 2023.pdf
 
Modeling Cybersecurity with Neo4j, Based on Real-Life Data Insights
Modeling Cybersecurity with Neo4j, Based on Real-Life Data InsightsModeling Cybersecurity with Neo4j, Based on Real-Life Data Insights
Modeling Cybersecurity with Neo4j, Based on Real-Life Data Insights
 
El Arte de lo Possible
El Arte de lo PossibleEl Arte de lo Possible
El Arte de lo Possible
 
The Data Platform for Today's Intelligent Applications.pdf
The Data Platform for Today's Intelligent Applications.pdfThe Data Platform for Today's Intelligent Applications.pdf
The Data Platform for Today's Intelligent Applications.pdf
 
The path to success with graph database and graph data science_ Neo4j GraphSu...
The path to success with graph database and graph data science_ Neo4j GraphSu...The path to success with graph database and graph data science_ Neo4j GraphSu...
The path to success with graph database and graph data science_ Neo4j GraphSu...
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
The Path To Success With Graph Database and Analytics
The Path To Success With Graph Database and AnalyticsThe Path To Success With Graph Database and Analytics
The Path To Success With Graph Database and Analytics
 
EIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdf
EIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdfEIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdf
EIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdf
 
La strada verso il successo con i database a grafo, la Graph Data Science e l...
La strada verso il successo con i database a grafo, la Graph Data Science e l...La strada verso il successo con i database a grafo, la Graph Data Science e l...
La strada verso il successo con i database a grafo, la Graph Data Science e l...
 
The Art of the Possible with Graph by Dr Jim Webber Neo4j.pptx
The Art of the Possible with Graph by Dr Jim Webber Neo4j.pptxThe Art of the Possible with Graph by Dr Jim Webber Neo4j.pptx
The Art of the Possible with Graph by Dr Jim Webber Neo4j.pptx
 
Knowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptx
Knowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptxKnowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptx
Knowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptx
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with Graph
 
Age of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide DiscoveryAge of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide Discovery
 
The path to success with Graph Database and Graph Data Science
The path to success with Graph Database and Graph Data ScienceThe path to success with Graph Database and Graph Data Science
The path to success with Graph Database and Graph Data Science
 
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...
 

More from Neo4j

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansNeo4j
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...Neo4j
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosNeo4j
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Neo4j
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jNeo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeNeo4j
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsNeo4j
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j
 
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...SWIFT: Maintaining Critical Standards in the Financial Services Industry with...
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...Neo4j
 
Deloitte & Red Cross: Talk to your data with Knowledge-enriched Generative AI
Deloitte & Red Cross: Talk to your data with Knowledge-enriched Generative AIDeloitte & Red Cross: Talk to your data with Knowledge-enriched Generative AI
Deloitte & Red Cross: Talk to your data with Knowledge-enriched Generative AINeo4j
 
Ingka Digital: Linked Metadata by Design
Ingka Digital: Linked Metadata by DesignIngka Digital: Linked Metadata by Design
Ingka Digital: Linked Metadata by DesignNeo4j
 
GraphSummit Copenhagen 2024 - Neo4j Vision and Roadmap.pptx
GraphSummit Copenhagen 2024 - Neo4j Vision and Roadmap.pptxGraphSummit Copenhagen 2024 - Neo4j Vision and Roadmap.pptx
GraphSummit Copenhagen 2024 - Neo4j Vision and Roadmap.pptxNeo4j
 
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptx
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptxEmil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptx
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptxNeo4j
 
Identification of insulin-resistance genes with Knowledge Graphs topology and...
Identification of insulin-resistance genes with Knowledge Graphs topology and...Identification of insulin-resistance genes with Knowledge Graphs topology and...
Identification of insulin-resistance genes with Knowledge Graphs topology and...Neo4j
 

More from Neo4j (20)

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
 
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...SWIFT: Maintaining Critical Standards in the Financial Services Industry with...
SWIFT: Maintaining Critical Standards in the Financial Services Industry with...
 
Deloitte & Red Cross: Talk to your data with Knowledge-enriched Generative AI
Deloitte & Red Cross: Talk to your data with Knowledge-enriched Generative AIDeloitte & Red Cross: Talk to your data with Knowledge-enriched Generative AI
Deloitte & Red Cross: Talk to your data with Knowledge-enriched Generative AI
 
Ingka Digital: Linked Metadata by Design
Ingka Digital: Linked Metadata by DesignIngka Digital: Linked Metadata by Design
Ingka Digital: Linked Metadata by Design
 
GraphSummit Copenhagen 2024 - Neo4j Vision and Roadmap.pptx
GraphSummit Copenhagen 2024 - Neo4j Vision and Roadmap.pptxGraphSummit Copenhagen 2024 - Neo4j Vision and Roadmap.pptx
GraphSummit Copenhagen 2024 - Neo4j Vision and Roadmap.pptx
 
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptx
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptxEmil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptx
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptx
 
Identification of insulin-resistance genes with Knowledge Graphs topology and...
Identification of insulin-resistance genes with Knowledge Graphs topology and...Identification of insulin-resistance genes with Knowledge Graphs topology and...
Identification of insulin-resistance genes with Knowledge Graphs topology and...
 

Recently uploaded

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 

Recently uploaded (20)

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 

Neo4j: Data Engineering for RAG (retrieval augmented generation)

  • 1. Data Engineering for RAG ABK of Neo4j (Andreas Kollegger)
  • 2. Generative AI Neo4j Inc. All rights reserved 2024 2
  • 3. Generative AI ● Learns random sentences from random people ● Talks like a person but doesn't really understand what it's saying ● Occasionally speaks absolute nonsense ● Sensitive to question phrasing ● Answers reflect the person asking ● Can't explain or verify answers ● Limited to public "knowledge" Neo4j Inc. All rights reserved 2024 3
  • 4. Generative AI ● Learns random sentences from random people ● Talks like a person but doesn't really understand what it's saying ● Occasionally speaks absolute nonsense ● Sensitive to question phrasing ● Answers reflect the person asking ● Can't explain or verify answers ● Limited to public "knowledge" Neo4j Inc. All rights reserved 2024 4
  • 5. How do we integrate with the alien technology? Neo4j Inc. All rights reserved 2024 5
  • 6. Everything starts with practical work, using RAG… Neo4j Inc. All rights reserved 2024 6
  • 7. Retrieval Augmented Generation (RAG) RAG is a software design pattern for integrating GenAI Apps with custom data sources, like a database. Neo4j Inc. All rights reserved 2024 7
  • 8. A Generative AI application uses an LLM to provide responses to user prompts (aka ChatGPT) Neo4j Inc. All rights reserved 2024 8 8 User Prompt Complete Response GenAI Application LLM User Prompt Response
  • 9. RAG augments the LLM by intercepting a user's prompt, then making a query to a database, then using the query results as context for the user's prompt, creating a new prompt that is passed to the LLM for a complete, curated response Neo4j Inc. All rights reserved 2024 9 9 Database GenAI Application Complete Response User Prompt LLM User Prompt + Context Response User Prompt Context 1 2 1 2
  • 10. This sets up a knowledge stack… the user knows something about the question they're asking the application knows something about the user the database knows about particular information and data the LLM knows about whatever it found on the internet Neo4j Inc. All rights reserved 2024 10 10 User Knowledge App Knowledge Database Knowledge LLM Knowledge Knowledge Stack
  • 11. This sets up a knowledge stack… the user knows something about the question they're asking the application knows something about the user the database knows about particular information and data the LLM knows about whatever it found on the internet Neo4j Inc. All rights reserved 2024 11 11 User Knowledge App Knowledge Database Knowledge LLM Knowledge Knowledge Stack Knowledge you control, in the app and the database.
  • 12. Three Sources of Data for RAG Each with different access patterns, supporting different kinds of questions. Neo4j Inc. All rights reserved 2023 12
  • 13. Neo4j Inc. All rights reserved 2024 13 Pure Text
  • 14. Neo4j Inc. All rights reserved 2024 14 Pure Text Unstructured data in PDFs, plain text files, or images
  • 15. Information search: “What is Apple's primary business?” Answer with: Implicit knowledge derived from text. Neo4j Inc. All rights reserved 2024 15 Pure Text Unstructured data in PDFs, plain text files, or images
  • 16. Neo4j Inc. All rights reserved 2024 16 Pure Data Pure Text
  • 17. Neo4j Inc. All rights reserved 2024 17 Pure Data Structured data in a database Pure Text
  • 18. Neo4j Inc. All rights reserved 2024 18 Pure Data Structured data in a database Pure Text Information query: “How many iPhones did Apple sell this quarter?” Answer with: Explicit facts from a database query.
  • 19. Neo4j Inc. All rights reserved 2024 19 Pure Text Pure Data Mixed Text + Data
  • 20. Neo4j Inc. All rights reserved 2024 20 Pure Text Pure Data Mixed Text + Data Structured data together with long-form text
  • 21. Neo4j Inc. All rights reserved 2024 21 Pure Text Pure Data Mixed Text + Data Structured data together with long-form text Information discovery: “Which investors will be impacted by a chip shortage?” Answer with: Combined search and data query.
  • 22. Neo4j Inc. All rights reserved 2024 22 Pure Text Pure Data Mixed Text + Data
  • 23. Neo4j Inc. All rights reserved 2024 23 Pure Text Pure Data Mixed Text + Data A Knowledge Graph: Information architecture for data, organized using graph structures, which places data within context.
  • 24. Neo4j Inc. All rights reserved 2024 24 Pure Text Pure Data Mixed Text + Data A Knowledge Graph: Information architecture for data, organized using graph structures, which places data within context. Graph RAG: Supports multiple modes of information retrieval, including information search, information query, and information discovery.
  • 25. Neo4j Inc. All rights reserved 2024 25 Pure Text Pure Data Mixed Text + Data Vector Search Search + Pattern Matching Graph Queries Find relevant documents plus context for information search Expand context and rank the relevance for information discovery Directly query the knowledge graph for information query
  • 26. GenAI Example: SEC Edgar Financial Forms Neo4j Inc. All rights reserved 2024 26
  • 27. SEC Edgar Financial Data The EDGAR database provides free public access to company information, allowing research about public company financial information and operations through the filings they submit to the SEC. There are two forms that we'll look at today: 1. Form 10K-filings from publicly traded companies 2. Form 13 -filings from institutional investment management firms Neo4j Inc. All rights reserved 2024 27
  • 28. Data Modeling Strategy Start with a Minimum Viable Graph (MVG) Create, Enhance, Connect then repeat to grow the graph 1. Create-identify interesting information, create records 2. Enhance-supercharge the data by enhancing some dimension 3. Connect-connect information to expand context and reveal knowledge Neo4j Inc. All rights reserved 2024 28
  • 29. Form 10k Chunk Chunk Chunk Chunk Create -Form 10K text chunks exercitation ullamco laboris nisi ut aliquip enim ad minim veniam, quis nostrud incididunt ut labore et dolore magna aliqua. Ut adipiscing elit, sed do eiusmod tempor Lorem ipsum dolor sit amet, consectetur Chunk Neo4j Inc. All rights reserved 2024 29 2. Split Text 1. Source - Form 10K 3. Create Nodes
  • 30. Form 10k Chunk Chunk Chunk Chunk Enhance -Text with an embedding exercitation ullamco laboris nisi ut aliquip enim ad minim veniam, quis nostrud incididunt ut labore et dolore magna aliqua. Ut adipiscing elit, sed do eiusmod tempor Lorem ipsum dolor sit amet, consectetur [0.6,0.2,0.1,0.7] [0.5,0.2,0.1,0.7] [0.4,0.2,0.1,0.7] [0.3,0.2,0.1,0.5] [0.2,0.2,0.1,0.7] 1. Source - Chunks Chunk Neo4j Inc. All rights reserved 2024 Vector Index 30 4. Add Embedding
  • 31. Form 10k Chunk Chunk Chunk Chunk Connect -Connect chunks into a list exercitation ullamco laboris nisi ut aliquip enim ad minim veniam, quis nostrud incididunt ut labore et dolore magna aliqua. Ut adipiscing elit, sed do eiusmod tempor Lorem ipsum dolor sit amet, consectetur [0.6,0.2,0.1,0.7] [0.5,0.2,0.1,0.7] [0.4,0.2,0.1,0.7] [0.3,0.2,0.1,0.5] [0.2,0.2,0.1,0.7] Chunk NEXT 1. Connect Chunks Chunk Chunk Neo4j Inc. All rights reserved 2024 31
  • 32. Form 10k Chunk Chunk Chunk Chunk Create, Enhance, Connect Form 10K exercitation ullamco laboris nisi ut aliquip enim ad minim veniam, quis nostrud incididunt ut labore et dolore magna aliqua. Ut adipiscing elit, sed do eiusmod tempor Lorem ipsum dolor sit amet, consectetur 2. Split Text [0.6,0.2,0.1,0.7] [0.5,0.2,0.1,0.7] [0.4,0.2,0.1,0.7] [0.3,0.2,0.1,0.5] [0.2,0.2,0.1,0.7] 4. Add Embedding 1. Source - Form 10K 3. Create Nodes Chunk NEXT 5. Connect Chunk Chunk Extract Enhance Expand Neo4j Inc. All rights reserved 2024 32
  • 33. Benefits: ● vector similarity search to find relevant text ● expand context window with previous/next chunks ● enable paging through text Neo4j Inc. All rights reserved 2024 33 Chunk NEXT Minimum Viable Graph formId: string chunkId: string text: string textEmbedding: float[] vector index Linked List of Text
  • 34. Create-create separate Form nodes for each Form 10K. Add summary. Enhance-vector index of summary. Connect-connect from Form to first node in linked list. Then from each chunk back to the Form Node. Benefits: ● expand context of chunk with summary text ● navigate from form to text Neo4j Inc. All rights reserved 2024 34 Chunk NEXT Improve Context cusip6: string formId: string summary: string summaryEmbedding: float[] vector index Hierarchical Summary Form PART_OF SECTION
  • 35. Add Form 13 Neo4j Inc. All rights reserved 2024 35 Company Manager OWNS_STOCK_IN Create-create Manager and Company nodes Enhance-full-text index of names Connect-connect Manager nodes to Company nodes through investments Benefits: ● pattern-matching queries ● search names by text similarity (Apple and Apple Inc) rather than conceptual similarity (Apple and Banana) name: string address: string full-text index shares: integer value: float name: string address: string full-text index Structured Data
  • 36. Company Manager OWNS_STOCK_IN Address L O C A T E D _ A T L O C A T E D _ A T Located at Address Neo4j Inc. All rights reserved 2024 36 Create-create Address nodes Enhance-geospatial index of address Connect-connect Manager and Company nodes to Address Benefits: ● pattern-based location queries ● distance-based calculations, search companies within radius or bounding box city: string state: string country: string location: Point geospatial index Geospatial Search
  • 37. Combine Graphs Neo4j Inc. All rights reserved 2024 37 Connect-connect Company nodes to the Form they filed Benefits: ● expanded context for vector-based search ● refine search results by location ● expanded pattern matches Mixed Text & Data Chunk Company FILED Form PART_OF SECTION Manager OWNS_STOCK_IN NEXT Address L O C A T E D _ A T L O C A T E D _ A T
  • 38. Create, Enhance, Connect SEC Financial Forms Sections from a Form Form 10K Nodes Public Companies Management Firms Addresses Source Form 10K json files (:Chunk) Form 13 CSV Form 13 CSV (:Company), (:Manager) 1. Create (:Chunk) (:Form) (:Company) (:Manager) (:Address) 2. Enhance Vector embedding Vector embedding Full-text index Full-text index Geospatial index 3. Connect (Chunk) -[NEXT]->(Chunk) (Chunk) -[PART_OF]->(Form) (Company) -[FILED]->(Form) (Manager) -[OWNS_STOCK_IN]->(Company) (Company|Manager) -[LOCATED_AT]->(Address) You can continue to grow the knowledge graph… ● cross-link Companies that mention each other ● add People, Places, Topics extracted from text (named entity recognition) ● add more Form data, or other related sources ● add User information to keep history, refine relevance and enable feedback Neo4j Inc. All rights reserved 2024 38
  • 39. Resources & Next Steps Neo4j Inc. All rights reserved 2024 39 Code github.com/neo4j-examples/sec-edgar-notebooks Get Started with Neo4j -Aura Free neo4j.com/cloud/aura-free/ GenAI Ecosystem & Free Learning Resources neo4j.com/labs/genai-ecosystem/ graphacademy.neo4j.com/categories/llms/