SlideShare a Scribd company logo
1 of 68
Download to read offline
Agenda
● Logistics
● Introduction
● Use Case Explanation
● Modeling
● Building the solution
● Q & A
Logistics
WIFI Access:
Restrooms:
Chargers:
Material for the
workshop:
https://github.com/cskardon/gsummit2023
A short overview of the Neo4j Product
Graph components
Node (Vertex)
• The main data element from which graphs are constructed
Keanu
Reeves
The
Matrix
Graph components
Node (Vertex)
• The main data element from which graphs are constructed
Relationship (Edge)
• A link between two nodes
• Direction
• Type
Keanu
Reeves
The
Matrix
A node without
relationships is
permitted, a
relationship
without nodes is
not
Property graph database
Person Movie
Keanu
Reeves
The
Matrix
Node (Vertex)
Relationship (Edge)
Label
• Define node role (optional)
Property graph database
Node (Vertex)
Relationship (Edge)
Label
• Define node role (optional)
• Can have more than one
Person
Action
Movie
Property graph database
Node (Vertex)
Relationship (Edge)
Label
• Define node role (optional)
• Can have more than one
Properties
• Enrich
• nodes
• relationships
• No need for nulls
Person
Action
Movie
Cypher: powerful and expressive query language
:LOVES
Cypher: Matching
:MARRIED_TO
Neo4j Graph Data Science
Pop Quiz
Which of the coloured
nodes would be
considered the
most “important”?
Graphs Contain Implicit Knowledge
D
D has the highest degree centrality (7)
This is the most connected individual in
the network. If important is now well you
are personally known, you pick D.
G
G has the highest closeness centrality (0.52)
Information will disperse through the network
more quickly through this individual. If you
need to get a message out rapidly, choose G.
I
I has the highest betweenness centrality (0.59)
This person is an efficient connector of other
people. Risk of network disruption is higher if you
lose this individual
Most Important?
Better Predictions With Data You Already Have
● Traditional ML ignores network structure because it’s difficult to extract
● Graphs use relationships to unlock otherwise unattainable predictions
● Add graph data to existing ML pipelines to increase accuracy
Machine Learning Pipeline
Graph Algorithm Categories
Determines the importance of
distinct nodes in the network
Centrality /
Importance
Finds optimal paths or evaluates
route availability and quality
Pathfinding
& Search
Detects group clustering or
partition
Community
Detection
Compute low-dimensional vector
representations of nodes in a graph, and
allow you to train supervised machine
learning models
Embeddings
& ML
Evaluates how alike nodes are
by neighbours and relationships
Similarity
Estimates the likelihood of
nodes forming a future
relationship
Link
Prediction
Available Algorithms (as of 2.5)
• Page Rank
• Article Rank
• Betweenness Centrality
• Influence Maximization (CELF)
• Closeness Centrality
• Degree Centrality
• Eigenvector Centrality
• Harmonic Centrality
• Hyperlink Induced Topic Search (HITS)
Centrality /
Importance
• Delta-Stepping Single-Source Shortest Path
• Dijkstra Source-Target Shortest Path
• Dijkstra Single-Source Shortest Path
• A* Shortest Path
• Yen’s Shortest Path
• Breadth First Search
• Depth First Search
• Random Walk
• Bellman-Ford Single-Source Shortest Path
• Minimum Weight Spanning Tree
• Minimum Directed Steiner Tree
• Minimum Weight k-Spanning Tree
• All Pairs Shortest Path
• Longest Path for DAG
Pathfinding
& Search
• Louvain
• Weakly Connected Components
• Conductance metric
• K-Core Decomposition
• K-1 Coloring
• K-Means Clustering
• Label Propagation
• Leiden
• Local Clustering Coefficient
• Modularity metric
• Modularity Optimization
• Strongly Connected Components
• Triangle Count
• Approximate Maximum k-cut
• Speaker-Listener Label Propagation
Community
Detection
https://neo4j.com/docs/graph-data-science/current
• FastRP
• GraphSAGE
• Node2Vec
• HashGNN
• Node Classifaction Pipelines
• Link Prediction Pipelines
• Node Regression Pipelines
Embeddings
& ML
• Node Similarity
• K-Nearest Neighbors (KNN)
• Cosine Similarity
• Euclidean Similarity
• Euclidean Distance Similarity
• Jaccard Similarity
• Overlap Similarity
• Pearson Similarity
Similarity
• Adamic Adar
• Common Neighbors
• Preferential Attachment
• Resource Allocations
• Same Community
• Total Neighbors
Link
Prediction
Visualisation
Data Visualization with Neo4j Bloom
Neo4j’s user-friendly graph database visualization, exploration and collaboration tool.
● Visually explore graphs
● Prototype faster
● Visualize and discover
● Easy for non-technical users
NeoDash
● Fully open source
﹣ https://github.com/neo4j-labs/neodash
﹣ Extensible
● Can be integrated in existing portal
● Supported through Neo4j’s Professional
Services team
Digital Twin - An Overview
What is a Digital Twin?
A Digital Twin is a digital
representation of a […] real-world
physical product, system, or process
[…] that serves as the effectively
indistinguishable digital counterpart of
it for practical purposes, such as
simulation, integration, testing,
monitoring and maintenance.
It has been done before
EU Rail Network
● Challenge: Legacy technology could
not section and analyse train journeys
● Solution: Neo4j Knowledge Graph
● Identify and avoid bottlenecks
What is graph data modeling?
Collaborative effort where the application domain is analysed by
stakeholders and developers to come up with the optimal model for use
with Neo4j.
Who are the stakeholders?
● Business analysts
● Architects
● Managers
● Project leaders
● Data Scientists
Graph data modeling workflow
Cypher
queries
Define
model
Import
data
Additional
questions
Modify
model
Refactor
database
Cypher
queries
Developers
Defining the model
Define
model
Sample
data
Define
questions
High-level
requirements
Identify
entities
Identify
connections
Test
model
Test
scalability
Stakeholders &
Developers
Defining the model
Define
model
Sample
data
Define
questions
High-level
requirements
Identify
entities
Identify
connections
Test
model
Test
scalability
Stakeholders &
Developers
GENERIC GRAPH SPECIFIC
Domain knowledge –
High level requirements
We’ve already got this!
● Normally…
● In this case we’ve talked about the Domain
● You will have the knowledge of your own Domain
Sample Data
Get Sample Data
Static Rail Network*
- Sections of lines
- Length,
- Speed
- Operational Points (Stations etc)
- Geolocation information,
- Points of Interest (POI) along lines
*Source: Register of Infrastructure (RINF) - https://data-interop.era.europa.eu/
Operational Points (OP) - Data Explanation
CSV Header titles:
The identifier of the OP
The type of the OP (Station, Switch, Border Point…)
The name of the OP
The latitude of the OP
The longitude of the OP
Section Length Data
CSV Header titles:
The identifier of the start OP for the section
The identifier of the end OP for the section
The distince (in Kilometers) of the section
Section Speed Data
CSV Header titles:
The identifier of the start OP for the section
The identifier of the end OP for the section
The maximum speed allowed on that section
Point of Interest (POI) Data
CSV Header titles:
The name of the City the POI is in, or close to
A short description of of the POI
A link to a picture of the POI
A link to a web page about the POI
The latitude of the POI
The Longitude of the POI
Domain Questions
Data Modeling – Example Domain Questions
1. What is the route from Operational Point X to Operational Point Y?
﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch?
2. What is an alternative route if an Operational Point on a route is closed?
﹣ A Switch is broken and we need to reroute Trains
3. How many routes are affected if I need to upgrade an Operational Point?
﹣ A Switch needs to be upgraded to support the network
4. What POIs are along a route?
﹣ Can we make revenue from referral commissions? Find busier routes during tourism season?
Identifying entities and
connections
Identify Entities from Questions
Entities are the nouns in the domain questions:
1. What ingredients are used in a recipe?
2. Who is married to this person?
● The generic nouns often become labels in the model
● Use domain knowledge deciding how to further group or differentiate entities
Person
Recipe
Ingredient
Identify Connections between Entities
Connections are the verbs in the domain questions:
● What ingredients are used in a recipe?
● Who is married to this person?
Recipe Ingredient
Person
Person
Using our Questions – Question 1
1. What is the route from Operational Point X to Operational Point Y?
﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch?
2. What is an alternative route if an Operational Point on a Section is closed?
﹣ A Switch is broken and we need to reroute Trains
3. How many routes are affected if I need to upgrade an Operational Point?
﹣ A Switch needs to be upgraded to support the network
4. What POIs are near Station Operational Points on a Section?
﹣ Can we make revenue from referral commissions? Find busier routes during tourism season?
Operational Point Operational Point
from to
Using our Questions – Question 1 – Model
1. What is the route from Operational Point X to Operational Point Y?
﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch?
TO
Using our Questions – Question 2
1. What is the route from Operational Point X to Operational Point Y?
﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch?
2. What is an alternative route if an Operational Point on a Section is closed?
﹣ A Switch is broken and we need to reroute Trains
3. How many routes are affected if I need to upgrade an Operational Point?
﹣ A Switch needs to be upgraded to support the network
4. What POIs are near Station Operational Points on a Section?
﹣ Can we make revenue from referral commissions? Find busier routes during tourism season?
Operational Point Section
Using our Questions – Question 2 – Model
2. What is an alternative route if an Operational Point on a Section is closed?
﹣ A Switch is broken and we need to reroute Trains
TO
Section
SECTION
Using our Questions – Question 3
1. What is the route from Operational Point X to Operational Point Y?
﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch?
2. What is an alternative route if an Operational Point on a Section is closed?
﹣ A Switch is broken and we need to reroute Trains
3. How many routes are affected if I need to upgrade an Operational Point?
﹣ A Switch needs to be upgraded to support the network
4. What POIs are near Station Operational Points on a Section?
﹣ Can we make revenue from referral commissions? Find busier routes during tourism season?
Operational Point
Using our Questions – Question 3 – Model
3. How many routes are affected if I need to upgrade an Operational Point?
﹣ A Switch needs to be upgraded to support the network
SECTION
Using our Questions – Question 4
1. What is the route from Operational Point X to Operational Point Y?
﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch?
2. What is an alternative route if an Operational Point on a Section is closed?
﹣ A Switch is broken and we need to reroute Trains
3. How many routes are affected if I need to upgrade an Operational Point?
﹣ A Switch needs to be upgraded to support the network
4. What POIs are near Station Operational Points on a Section?
﹣ Can we make revenue from referral commissions? Find busier routes during tourism season?
POI Operational Point
Station
near Section
Using our Questions – Question 4 – Model
4. What POIs are near Station Operational Points on a Section?
﹣ Can we make revenue from referral commissions? Find busier routes during tourism season?
Added ‘Station’
as a label
SECTION
Hang on…
We have duplicate entries
Wait, what?
How do we
deal with
this?
Name nodes
SECTION
NAMED
Add a name node
How does that look in reality?
Whilst we’re here, let’s talk countries
● Multiple ways to model
﹣ Properties
﹣ Labels
● What should we choose?
Cheat mode
enabled!
Before we go further
Environment for later
● Neo4j Desktop
● Neo4j Sandbox
Before we go further – Sandbox Login
● Create a sandbox for later
● https://sandbox.neo4j.com/
Before we go further – Blank Sandbox
● https://sandbox.neo4j.com/
● Create a ‘Blank Sandbox’
● If you want to get ahead of the game
﹣ Open Browser and run the code from
https://bit.ly/gsummit2023-load
Examples
● Go to the Neo4j Sandbox you created earlier (or Desktop!)
﹣ https://sandbox.neo4j.com/
● ‘Open with Browser’
Get the code
● Open the Github page:
● https://github.com/cskardon/gsummit2023
62
Bloom - Sandbox
Bloom - Desktop
• Open Bloom
• Open Bloom
Bloom – Import Perspective
NeoDash
● Open NeoDash:
﹣ http://neodash.graphapp.io/
66
NeoDash - Sandbox
NeoDash - Desktop
• Open Sandbox
• Get connection details
• Host: localhost
• User: neo4j
• Password: YOURS!
NeoDash – Import Dashboard
● Create New Dashboard
● Import from Source files
Contact us at
sales@neo4j.com

More Related Content

Similar to Neo4j workshop at GraphSummit London 14 Nov 2023.pdf

PWL Seattle #16 - Chord: A Scalable Peer-to-peer Lookup Protocol for Internet...
PWL Seattle #16 - Chord: A Scalable Peer-to-peer Lookup Protocol for Internet...PWL Seattle #16 - Chord: A Scalable Peer-to-peer Lookup Protocol for Internet...
PWL Seattle #16 - Chord: A Scalable Peer-to-peer Lookup Protocol for Internet...Tristan Penman
 
State of Florida Neo4j Graph Briefing - Cyber IAM
State of Florida Neo4j Graph Briefing - Cyber IAMState of Florida Neo4j Graph Briefing - Cyber IAM
State of Florida Neo4j Graph Briefing - Cyber IAMNeo4j
 
Unit1 - Individual Project Due on 03292014A company wan.docx
Unit1 - Individual Project       Due on  03292014A company wan.docxUnit1 - Individual Project       Due on  03292014A company wan.docx
Unit1 - Individual Project Due on 03292014A company wan.docxdickonsondorris
 
Semantic web design for www.data.gov.sg - Presentation
Semantic web design for www.data.gov.sg - PresentationSemantic web design for www.data.gov.sg - Presentation
Semantic web design for www.data.gov.sg - PresentationMuthu Kumaar Thangavelu
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesCédric Fauvet
 
Disrupting Data Discovery
Disrupting Data DiscoveryDisrupting Data Discovery
Disrupting Data Discoverymarkgrover
 
Discussion Board Deliverable Length 4 – 6 paragraphs An.docx
Discussion Board Deliverable Length 4 – 6 paragraphs An.docxDiscussion Board Deliverable Length 4 – 6 paragraphs An.docx
Discussion Board Deliverable Length 4 – 6 paragraphs An.docxduketjoy27252
 
Discussion Board Deliverable Length 4 – 6 paragraphs An.docx
Discussion Board Deliverable Length 4 – 6 paragraphs An.docxDiscussion Board Deliverable Length 4 – 6 paragraphs An.docx
Discussion Board Deliverable Length 4 – 6 paragraphs An.docxedgar6wallace88877
 
UMAP 2019 talk Evaluating Visual Explanations for Similarity-Based Recommenda...
UMAP 2019 talk Evaluating Visual Explanations for Similarity-Based Recommenda...UMAP 2019 talk Evaluating Visual Explanations for Similarity-Based Recommenda...
UMAP 2019 talk Evaluating Visual Explanations for Similarity-Based Recommenda...Peter Brusilovsky
 
Clickstream data with spark
Clickstream data with sparkClickstream data with spark
Clickstream data with sparkMarissa Saunders
 
Revolutionise your Machine Learning Workflow using Scikit-Learn Pipelines
Revolutionise your Machine Learning Workflow using Scikit-Learn PipelinesRevolutionise your Machine Learning Workflow using Scikit-Learn Pipelines
Revolutionise your Machine Learning Workflow using Scikit-Learn PipelinesPhilip Goddard
 
Built around answering questions
Built around answering questionsBuilt around answering questions
Built around answering questionsLarry Smarr
 
Value Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the TrenchesValue Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the TrenchesDevOps.com
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-Systeminside-BigData.com
 
Transport for London - London's Operations Digital Twin
Transport for London - London's Operations Digital TwinTransport for London - London's Operations Digital Twin
Transport for London - London's Operations Digital TwinNeo4j
 
Effective Semantic Web Service Composition Framework Based on QoS
Effective Semantic Web Service Composition Framework Based on QoSEffective Semantic Web Service Composition Framework Based on QoS
Effective Semantic Web Service Composition Framework Based on QoSsethuraman R
 
GIS in the Rockies
GIS in the RockiesGIS in the Rockies
GIS in the Rockiesmkhinke
 
Workshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data ScienceWorkshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data ScienceNeo4j
 

Similar to Neo4j workshop at GraphSummit London 14 Nov 2023.pdf (20)

PWL Seattle #16 - Chord: A Scalable Peer-to-peer Lookup Protocol for Internet...
PWL Seattle #16 - Chord: A Scalable Peer-to-peer Lookup Protocol for Internet...PWL Seattle #16 - Chord: A Scalable Peer-to-peer Lookup Protocol for Internet...
PWL Seattle #16 - Chord: A Scalable Peer-to-peer Lookup Protocol for Internet...
 
State of Florida Neo4j Graph Briefing - Cyber IAM
State of Florida Neo4j Graph Briefing - Cyber IAMState of Florida Neo4j Graph Briefing - Cyber IAM
State of Florida Neo4j Graph Briefing - Cyber IAM
 
Unit1 - Individual Project Due on 03292014A company wan.docx
Unit1 - Individual Project       Due on  03292014A company wan.docxUnit1 - Individual Project       Due on  03292014A company wan.docx
Unit1 - Individual Project Due on 03292014A company wan.docx
 
Semantic web design for www.data.gov.sg - Presentation
Semantic web design for www.data.gov.sg - PresentationSemantic web design for www.data.gov.sg - Presentation
Semantic web design for www.data.gov.sg - Presentation
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphes
 
Disrupting Data Discovery
Disrupting Data DiscoveryDisrupting Data Discovery
Disrupting Data Discovery
 
Discussion Board Deliverable Length 4 – 6 paragraphs An.docx
Discussion Board Deliverable Length 4 – 6 paragraphs An.docxDiscussion Board Deliverable Length 4 – 6 paragraphs An.docx
Discussion Board Deliverable Length 4 – 6 paragraphs An.docx
 
Discussion Board Deliverable Length 4 – 6 paragraphs An.docx
Discussion Board Deliverable Length 4 – 6 paragraphs An.docxDiscussion Board Deliverable Length 4 – 6 paragraphs An.docx
Discussion Board Deliverable Length 4 – 6 paragraphs An.docx
 
UMAP 2019 talk Evaluating Visual Explanations for Similarity-Based Recommenda...
UMAP 2019 talk Evaluating Visual Explanations for Similarity-Based Recommenda...UMAP 2019 talk Evaluating Visual Explanations for Similarity-Based Recommenda...
UMAP 2019 talk Evaluating Visual Explanations for Similarity-Based Recommenda...
 
Clickstream data with spark
Clickstream data with sparkClickstream data with spark
Clickstream data with spark
 
Revolutionise your Machine Learning Workflow using Scikit-Learn Pipelines
Revolutionise your Machine Learning Workflow using Scikit-Learn PipelinesRevolutionise your Machine Learning Workflow using Scikit-Learn Pipelines
Revolutionise your Machine Learning Workflow using Scikit-Learn Pipelines
 
Built around answering questions
Built around answering questionsBuilt around answering questions
Built around answering questions
 
Value Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the TrenchesValue Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the Trenches
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-System
 
SDN Framework and APIs
SDN Framework and APIsSDN Framework and APIs
SDN Framework and APIs
 
Transport for London - London's Operations Digital Twin
Transport for London - London's Operations Digital TwinTransport for London - London's Operations Digital Twin
Transport for London - London's Operations Digital Twin
 
Moran wsmx
Moran wsmxMoran wsmx
Moran wsmx
 
Effective Semantic Web Service Composition Framework Based on QoS
Effective Semantic Web Service Composition Framework Based on QoSEffective Semantic Web Service Composition Framework Based on QoS
Effective Semantic Web Service Composition Framework Based on QoS
 
GIS in the Rockies
GIS in the RockiesGIS in the Rockies
GIS in the Rockies
 
Workshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data ScienceWorkshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data Science
 

More from Neo4j

Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit MilanNeo4j
 
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...Neo4j
 
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4jGraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4jNeo4j
 
GraphSummit Milan - Neo4j: The Art of the Possible with Graph
GraphSummit Milan - Neo4j: The Art of the Possible with GraphGraphSummit Milan - Neo4j: The Art of the Possible with Graph
GraphSummit Milan - Neo4j: The Art of the Possible with GraphNeo4j
 
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...Neo4j
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaNeo4j
 
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...Neo4j
 
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxNeo4j
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNeo4j
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansNeo4j
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...Neo4j
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosNeo4j
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Neo4j
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jNeo4j
 

More from Neo4j (20)

Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
 
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
 
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4jGraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
 
GraphSummit Milan - Neo4j: The Art of the Possible with Graph
GraphSummit Milan - Neo4j: The Art of the Possible with GraphGraphSummit Milan - Neo4j: The Art of the Possible with Graph
GraphSummit Milan - Neo4j: The Art of the Possible with Graph
 
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
 
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
 
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMs
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 

Recently uploaded

WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformWSO2
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringWSO2
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceIES VE
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingWSO2
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 

Recently uploaded (20)

WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational Performance
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation Computing
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 

Neo4j workshop at GraphSummit London 14 Nov 2023.pdf

  • 1.
  • 2. Agenda ● Logistics ● Introduction ● Use Case Explanation ● Modeling ● Building the solution ● Q & A
  • 3. Logistics WIFI Access: Restrooms: Chargers: Material for the workshop: https://github.com/cskardon/gsummit2023
  • 4. A short overview of the Neo4j Product
  • 5. Graph components Node (Vertex) • The main data element from which graphs are constructed Keanu Reeves The Matrix
  • 6. Graph components Node (Vertex) • The main data element from which graphs are constructed Relationship (Edge) • A link between two nodes • Direction • Type Keanu Reeves The Matrix A node without relationships is permitted, a relationship without nodes is not
  • 7. Property graph database Person Movie Keanu Reeves The Matrix Node (Vertex) Relationship (Edge) Label • Define node role (optional)
  • 8. Property graph database Node (Vertex) Relationship (Edge) Label • Define node role (optional) • Can have more than one Person Action Movie
  • 9. Property graph database Node (Vertex) Relationship (Edge) Label • Define node role (optional) • Can have more than one Properties • Enrich • nodes • relationships • No need for nulls Person Action Movie
  • 10. Cypher: powerful and expressive query language :LOVES
  • 12. Neo4j Graph Data Science
  • 13. Pop Quiz Which of the coloured nodes would be considered the most “important”?
  • 14. Graphs Contain Implicit Knowledge D D has the highest degree centrality (7) This is the most connected individual in the network. If important is now well you are personally known, you pick D. G G has the highest closeness centrality (0.52) Information will disperse through the network more quickly through this individual. If you need to get a message out rapidly, choose G. I I has the highest betweenness centrality (0.59) This person is an efficient connector of other people. Risk of network disruption is higher if you lose this individual Most Important?
  • 15. Better Predictions With Data You Already Have ● Traditional ML ignores network structure because it’s difficult to extract ● Graphs use relationships to unlock otherwise unattainable predictions ● Add graph data to existing ML pipelines to increase accuracy Machine Learning Pipeline
  • 16. Graph Algorithm Categories Determines the importance of distinct nodes in the network Centrality / Importance Finds optimal paths or evaluates route availability and quality Pathfinding & Search Detects group clustering or partition Community Detection Compute low-dimensional vector representations of nodes in a graph, and allow you to train supervised machine learning models Embeddings & ML Evaluates how alike nodes are by neighbours and relationships Similarity Estimates the likelihood of nodes forming a future relationship Link Prediction
  • 17. Available Algorithms (as of 2.5) • Page Rank • Article Rank • Betweenness Centrality • Influence Maximization (CELF) • Closeness Centrality • Degree Centrality • Eigenvector Centrality • Harmonic Centrality • Hyperlink Induced Topic Search (HITS) Centrality / Importance • Delta-Stepping Single-Source Shortest Path • Dijkstra Source-Target Shortest Path • Dijkstra Single-Source Shortest Path • A* Shortest Path • Yen’s Shortest Path • Breadth First Search • Depth First Search • Random Walk • Bellman-Ford Single-Source Shortest Path • Minimum Weight Spanning Tree • Minimum Directed Steiner Tree • Minimum Weight k-Spanning Tree • All Pairs Shortest Path • Longest Path for DAG Pathfinding & Search • Louvain • Weakly Connected Components • Conductance metric • K-Core Decomposition • K-1 Coloring • K-Means Clustering • Label Propagation • Leiden • Local Clustering Coefficient • Modularity metric • Modularity Optimization • Strongly Connected Components • Triangle Count • Approximate Maximum k-cut • Speaker-Listener Label Propagation Community Detection https://neo4j.com/docs/graph-data-science/current • FastRP • GraphSAGE • Node2Vec • HashGNN • Node Classifaction Pipelines • Link Prediction Pipelines • Node Regression Pipelines Embeddings & ML • Node Similarity • K-Nearest Neighbors (KNN) • Cosine Similarity • Euclidean Similarity • Euclidean Distance Similarity • Jaccard Similarity • Overlap Similarity • Pearson Similarity Similarity • Adamic Adar • Common Neighbors • Preferential Attachment • Resource Allocations • Same Community • Total Neighbors Link Prediction
  • 19. Data Visualization with Neo4j Bloom Neo4j’s user-friendly graph database visualization, exploration and collaboration tool. ● Visually explore graphs ● Prototype faster ● Visualize and discover ● Easy for non-technical users
  • 20. NeoDash ● Fully open source ﹣ https://github.com/neo4j-labs/neodash ﹣ Extensible ● Can be integrated in existing portal ● Supported through Neo4j’s Professional Services team
  • 21. Digital Twin - An Overview
  • 22. What is a Digital Twin? A Digital Twin is a digital representation of a […] real-world physical product, system, or process […] that serves as the effectively indistinguishable digital counterpart of it for practical purposes, such as simulation, integration, testing, monitoring and maintenance.
  • 23. It has been done before EU Rail Network ● Challenge: Legacy technology could not section and analyse train journeys ● Solution: Neo4j Knowledge Graph ● Identify and avoid bottlenecks
  • 24.
  • 25. What is graph data modeling? Collaborative effort where the application domain is analysed by stakeholders and developers to come up with the optimal model for use with Neo4j. Who are the stakeholders? ● Business analysts ● Architects ● Managers ● Project leaders ● Data Scientists
  • 26. Graph data modeling workflow Cypher queries Define model Import data Additional questions Modify model Refactor database Cypher queries Developers
  • 29. Domain knowledge – High level requirements
  • 30. We’ve already got this! ● Normally… ● In this case we’ve talked about the Domain ● You will have the knowledge of your own Domain
  • 32. Get Sample Data Static Rail Network* - Sections of lines - Length, - Speed - Operational Points (Stations etc) - Geolocation information, - Points of Interest (POI) along lines *Source: Register of Infrastructure (RINF) - https://data-interop.era.europa.eu/
  • 33. Operational Points (OP) - Data Explanation CSV Header titles: The identifier of the OP The type of the OP (Station, Switch, Border Point…) The name of the OP The latitude of the OP The longitude of the OP
  • 34. Section Length Data CSV Header titles: The identifier of the start OP for the section The identifier of the end OP for the section The distince (in Kilometers) of the section
  • 35. Section Speed Data CSV Header titles: The identifier of the start OP for the section The identifier of the end OP for the section The maximum speed allowed on that section
  • 36. Point of Interest (POI) Data CSV Header titles: The name of the City the POI is in, or close to A short description of of the POI A link to a picture of the POI A link to a web page about the POI The latitude of the POI The Longitude of the POI
  • 38. Data Modeling – Example Domain Questions 1. What is the route from Operational Point X to Operational Point Y? ﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch? 2. What is an alternative route if an Operational Point on a route is closed? ﹣ A Switch is broken and we need to reroute Trains 3. How many routes are affected if I need to upgrade an Operational Point? ﹣ A Switch needs to be upgraded to support the network 4. What POIs are along a route? ﹣ Can we make revenue from referral commissions? Find busier routes during tourism season?
  • 40. Identify Entities from Questions Entities are the nouns in the domain questions: 1. What ingredients are used in a recipe? 2. Who is married to this person? ● The generic nouns often become labels in the model ● Use domain knowledge deciding how to further group or differentiate entities Person Recipe Ingredient
  • 41. Identify Connections between Entities Connections are the verbs in the domain questions: ● What ingredients are used in a recipe? ● Who is married to this person? Recipe Ingredient Person Person
  • 42. Using our Questions – Question 1 1. What is the route from Operational Point X to Operational Point Y? ﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch? 2. What is an alternative route if an Operational Point on a Section is closed? ﹣ A Switch is broken and we need to reroute Trains 3. How many routes are affected if I need to upgrade an Operational Point? ﹣ A Switch needs to be upgraded to support the network 4. What POIs are near Station Operational Points on a Section? ﹣ Can we make revenue from referral commissions? Find busier routes during tourism season? Operational Point Operational Point from to
  • 43. Using our Questions – Question 1 – Model 1. What is the route from Operational Point X to Operational Point Y? ﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch? TO
  • 44. Using our Questions – Question 2 1. What is the route from Operational Point X to Operational Point Y? ﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch? 2. What is an alternative route if an Operational Point on a Section is closed? ﹣ A Switch is broken and we need to reroute Trains 3. How many routes are affected if I need to upgrade an Operational Point? ﹣ A Switch needs to be upgraded to support the network 4. What POIs are near Station Operational Points on a Section? ﹣ Can we make revenue from referral commissions? Find busier routes during tourism season? Operational Point Section
  • 45. Using our Questions – Question 2 – Model 2. What is an alternative route if an Operational Point on a Section is closed? ﹣ A Switch is broken and we need to reroute Trains TO Section SECTION
  • 46. Using our Questions – Question 3 1. What is the route from Operational Point X to Operational Point Y? ﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch? 2. What is an alternative route if an Operational Point on a Section is closed? ﹣ A Switch is broken and we need to reroute Trains 3. How many routes are affected if I need to upgrade an Operational Point? ﹣ A Switch needs to be upgraded to support the network 4. What POIs are near Station Operational Points on a Section? ﹣ Can we make revenue from referral commissions? Find busier routes during tourism season? Operational Point
  • 47. Using our Questions – Question 3 – Model 3. How many routes are affected if I need to upgrade an Operational Point? ﹣ A Switch needs to be upgraded to support the network SECTION
  • 48. Using our Questions – Question 4 1. What is the route from Operational Point X to Operational Point Y? ﹣ What’s the quickest way to get a repair crew from Technical Services to a given Switch? 2. What is an alternative route if an Operational Point on a Section is closed? ﹣ A Switch is broken and we need to reroute Trains 3. How many routes are affected if I need to upgrade an Operational Point? ﹣ A Switch needs to be upgraded to support the network 4. What POIs are near Station Operational Points on a Section? ﹣ Can we make revenue from referral commissions? Find busier routes during tourism season? POI Operational Point Station near Section
  • 49. Using our Questions – Question 4 – Model 4. What POIs are near Station Operational Points on a Section? ﹣ Can we make revenue from referral commissions? Find busier routes during tourism season? Added ‘Station’ as a label SECTION
  • 50. Hang on… We have duplicate entries Wait, what? How do we deal with this?
  • 52. How does that look in reality?
  • 53. Whilst we’re here, let’s talk countries ● Multiple ways to model ﹣ Properties ﹣ Labels ● What should we choose? Cheat mode enabled!
  • 54. Before we go further Environment for later ● Neo4j Desktop ● Neo4j Sandbox
  • 55. Before we go further – Sandbox Login ● Create a sandbox for later ● https://sandbox.neo4j.com/
  • 56. Before we go further – Blank Sandbox ● https://sandbox.neo4j.com/ ● Create a ‘Blank Sandbox’ ● If you want to get ahead of the game ﹣ Open Browser and run the code from https://bit.ly/gsummit2023-load
  • 57.
  • 58. Examples ● Go to the Neo4j Sandbox you created earlier (or Desktop!) ﹣ https://sandbox.neo4j.com/ ● ‘Open with Browser’
  • 59. Get the code ● Open the Github page: ● https://github.com/cskardon/gsummit2023
  • 60.
  • 61. 62 Bloom - Sandbox Bloom - Desktop • Open Bloom • Open Bloom
  • 62. Bloom – Import Perspective
  • 63.
  • 64. NeoDash ● Open NeoDash: ﹣ http://neodash.graphapp.io/
  • 65. 66 NeoDash - Sandbox NeoDash - Desktop • Open Sandbox • Get connection details • Host: localhost • User: neo4j • Password: YOURS!
  • 66. NeoDash – Import Dashboard ● Create New Dashboard ● Import from Source files
  • 67.