SlideShare a Scribd company logo
1 of 35
Neo4j for
Graph Data Science™
Alicia Frame @AliciaFrame1
Lead Product Manager for Graph Data Science
Amy E. Hodler @AmyHodler
Director, Product Marketing & Programs for Graph Data Science
2
• Graph Data Science
(GDS)
• Neo4j for GDS and the
GDS Library
• DEMO!
• Questions
#GraphDataScience
#Neo4j
Alicia Frame
Lead Product Manager
Graph Data Science
Amy E. Hodler
Director, Product
Marketing & Programs
Graph Data Science
It’s Not What You Know
It’s Who You Know And Where They Are
Whose pay will
increase the most?
Photo by Helena Lopes on Unsplash
Network Structure
is highly predictive of
pay and promotions
• People Near Structural Holes
• Organizational Misfits
“Organizational Misfits and the Origins of Brokerage in Intrafirm Networks” A. Kleinbaum
“Structural Holes and Good Ideas” R. Burt
Relationships and Network Structure
Strongest Predictors of Behavior & Complex Outcomes
“Research into networks reveal that,
surprisingly, the most connected
people inside a tight group within a
single industry are less valuable than
the people who span the gaps ...”
7
“…jumping from ladder to ladder is a
more effective strategy, and that lateral
or even downward moves across an
organization are more promising in the
longer run . . .”
It’s a counter-intuitive
notion
8
Which is why
graph data science
is so powerful
9
“Data science is an interdisciplinary
field that uses scientific methods,
processes, algorithms and systems
to extract knowledge and insights
from structured and unstructured
data.” - Wikipedia
10
What is data science?
Data scientists use data to
answer questions.
Graph Data Science is a
science-driven approach to gain
knowledge from the relationships
and structures in data, typically to
power predictions.
11
What is Graph data science?
Data scientists use
relationships to answer
questions.
Query (e.g. Cypher/Python)
Real-time, local decisioning
and pattern matching
Graph Algorithms
Global analysis
and iterations
You know what you’re
looking for and making a
decision
You’re learning the overall structure
of a network, updating data, and
predicting
Local
Patterns
Global
Computation
Relationships and
network structures
are highly predictive
and underutilized
– and already in your data.
Graph are a natural way to
store and use this predictive
information, but different
than what you’re doing today.
How do you continually put more
accurate, predictive models into
production quickly?
Predictive
Maintenance
Churn
Prediction
Fraud
Detection
Life
Sciences
Personalized
Recommendations
Cybersecurity
Disambiguation &
Segmentation
Search &
Master Data Mgmt.
Graph Data Science Applications
Just a few examples…
15
• 27 Million warranty & service documents
parsed for text to knowledge graph
• Graph is context for AI to learn “prime
examples” and anticipate maintenance
• Improves satisfaction and equipment
lifespan
• Connecting 50 research databases, 100k’s of
Excel workbooks, 30 bio-sample databases
• Bytes 4 Diabetes Award for use of a
knowledge graph, graph analytics, and AI
• Customized views for flexible research angles
• Almost 70% of CC fraud was missed
• ~1B Nodes and Relationships to analyse
• Graph analytics with queries & algorithms
help find $ millions of fraud in 1st year
Improving Analytics, ML & AI for Enterprises
Caterpillar’s AI Supply
Chain & Maintenance
German Center for
Diabetes Research (DZD)
Financial Fraud
Detection & Recovery Top 10
Bank
Evolution of Graph Data Science
Decision
Support
Graph Based
Predictions
Predictions within
a Graphs
16
Graph Feature
Engineering
Graph
Embeddings
Graph Native
Learning
Knowledge
Graphs
Graph
Analytics
16
Evolution of Graph Data Science
Graph Feature
Engineering
Graph
Embeddings
Graph
Networks
17
Graph
AnalyticsKnowledge
Graphs
Graph search
and queries
Support domain
experts
Deceptively Simple Queries
Collaborative filtering: users who
bought X, also bought Y (open-ended
pattern matching)
What items make you more likely to
buy additional items in subsequent
transactions?
Traverse hierarchies - what items are
similar 4+ hops out?
Difficult for RDMS systems
Knowledge Graph Queries
e.g. Retail Recommendation
18
Evolution of Graph Data Science
Graph Feature
Engineering
Graph
Embeddings
Graph
Networks
19
Knowledge
Graphs
Graph
Analytics
Graph queries &
algorithms for
offline analysis
Understanding
Structures
• Degree Centrality
• Closeness Centrality
• CC Variations: Harmonic, Dangalchev,
Wasserman & Faust
• Betweenness Centrality
• Approximate Betweenness Centrality
• PageRank
• Personalized PageRank
• ArticleRank
• Eigenvector Centrality
• Triangle Count
• Clustering Coefficients
• Connected Components (Union Find)
• Strongly Connected Components
• Label Propagation
• Louvain Modularity
• Balanced Triad (identification)
Graph Algorithms & Functions in Neo4j
• Shortest Path
• Single-Source Shortest Path
• All Pairs Shortest Path
• A* Shortest Path
• Yen’s K Shortest Path
• Minimum Weight Spanning Tree
• K-Spanning Tree (MST)
• Random Walk
• Degree Centrality
• Closeness Centrality
• CC Variations: Harmonic, Dangalchev,
Wasserman & Faust
• Betweenness Centrality & Approximate
• PageRank
• Personalized PageRank
• ArticleRank
• Eigenvector Centrality
• Triangle Count
• Clustering Coefficients
• Connected Components (Union Find)
• Strongly Connected Components
• Label Propagation
• Louvain Modularity
• K-1 Coloring
• Modularity Optimization
• Euclidean Distance
• Cosine Similarity
• Node Similarity (Jaccard)
• Overlap Similarity
• Pearson Similarity
• Approximate KNN
Pathfinding
& Search
Centrality /
Importance
Community
Detection
Similarity
Link
Prediction
• Adamic Adar
• Common Neighbors
• Preferential Attachment
• Resource Allocations
• Same Community
• Total Neighbors
...and also Auxiliary Functions:
• Random graph generation
• One hot encoding
• Distributions & metrics
45
Graph Algorithms
e.g. Retail Recommendations
Graph algorithms enable reasoning
about network structure
Louvain to identify customer
segmentation based on topology
PageRank to measure
transaction volumes
Connected components
identify unique users
Jaccard to measure purchasing
similarity
21
Evolution of Graph Data Science
Graph
Embeddings
Graph
Networks
22
Knowledge
Graphs
Graph
Analytics
Graph Feature
Engineering
Graph algorithms
& queries for
machine learning
Improve Prediction
Accuracy
Graph Feature Engineering
Feature Engineering is how we combine and process the
data to create new, more meaningful features, such as
clustering or connectivity metrics.
23
Evolution of Graph Data Science
Decision
Support
Graph Based
Predictions
Predictions within
a Graphs
24
Graph Feature
Engineering
Graph
Embeddings
Graph Native
Learning
Knowledge
Graphs
Graph
Analytics
FUTURE
24
for Enterprise Graph Data Science
Neo4j Graph Data
Science Library
Scalable Graph
Algorithms & Analytics
Workspace
Native Graph
Creation & Persistence
Neo4j
Database
Visual Graph
Exploration
& Prototyping
Neo4j
Bloom
Practical Integrated Intuitive
A graph catalog that creates an
efficient analytics workspace
• Reshape transactional database into an in memory analytics
graph, and manage these operations
• Optimized for analytics with global traversals and aggregation
Algorithms
• Run on the loaded graph to compute metrics about the
topology and connectivity
• Highly parallelized and scale to billions of nodes
26
What is the GDS Library?
• Neo4j automates data
transformations
• Fast iterations & layering
• Production ready features,
parallelization & enterprise
support
Neo4j for GDS
enterprise-grade features and scale   
A graph-specific analytics workspace that’s mutable – integrated
with a native-graph database
Mutable In-Memory Workspace
Computational Graph
Native Graph Store
Answer previously intractable questions
with the data you already have
• Deep Path Analytics & Structural Pattern Matching
• Community & Neighbors Detection
• Influencer and Risk Identification
• Disambiguation
• Link and Behavior Prediction
Massive scale to 10’s billions of nodes with optimized
algorithms
Increase your predictive accuracy with
Neo4j GDS Algorithms  
Take advantage of hardened, validated graph algorithms that
enable reasoning about network structure.
Find Value Faster with Neo4j’s
practical Graph Data Science framework
Drastically simplified and
standardized API that
enables custom, flexible
configurations
Documentation, training,
and examples so getting
started is simple
Explore graphs and
algorithm results visually
with Bloom
Share insights across
teams for better
collaboration
Friendly data science
experience with logical
guardrails like memory
mgmt.
Reshapping, node &
relationship aggregation /
deduplication and
multipartite algos
30
Simplify Your Data Science Experience
Dozens of
libraries,
hundreds of
algos & no docs!
How do we
shape data into
a graph in the
first place?
We’ve picked a
library...good
luck learning
the syntax
WTF? We have to
build the entire
ETL pipeline for
this?
Are the results
right? How do
we get into
production?
Data Modeling
Which
Algorithms?
Learn Syntax What Now?
Reshape
31
Simplify Your Data Science Experience
Dozens of
libraries,
hundreds of
algos & no docs!
With Neo4j it’s
already a
graph
We’ve picked a
library...good
luck learning
the syntax
WTF? We have to
build the entire
ETL pipeline for
this?
Are the results
right? How do
we get into
production?
Data Modeling
Which
Algorithms?
Learn Syntax What Now?
Reshape
We have
validated algos,
clear docs, and
tutorials
Neo4j syntax is
standardized
and simplified
We seamlessly
reshape your
data with 1
command
Easily write
results to Neo4j
& move straight
into production
32
DEMO
• Real data from online retailer in the UK with all-occasion items
• 28K nodes and 1.1M relationships
• “Data mining for the online retail industry” by D. Chen et al
• Neo4j 4.0 and Graph Data Science Library 1.2
33
Retail Data
1. Customer segmentation
a. Break down the graph into customers with similar buying patterns
i. Similarities, Community Detection, Mutate & Export Graph
2. Item recommendations
a. What item should we recommend to different customer
segments?
i. Co-Purchase Similarity, Centarlities, Mutate & Export Graph
3. Explore and answer our business questions
a. Recommendation for a person that buys something specific?
b. What to promote to drive sales in a category?
34
Retail Demo
Thank You
&
Questions
35
- O’Reilly Book on Graph Algorithms
neo4j.com/graph-algorithms-book/
- GDS website, whitepapers, links
neo4j.com/use-cases/graph-data-s
cience-artificial-intelligence/
- Data for Retail example github.com
/AliciaFrame/GDS_Retail_Demo
- GDS Sandbox sandbox.neo4j.com/
?usecase=graph-data-science
Resources
Alicia Frame
Alicia.Frame@Neo4j.com
@AliciaFrame1
Amy E. Hodler
Amy.Hodler@Neo4j.com
@AmyHodler

More Related Content

What's hot

Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
Toronto Metropolitan University
 

What's hot (20)

Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Representation learning on graphs
Representation learning on graphsRepresentation learning on graphs
Representation learning on graphs
 
Graph database Use Cases
Graph database Use CasesGraph database Use Cases
Graph database Use Cases
 
Introduction of Knowledge Graphs
Introduction of Knowledge GraphsIntroduction of Knowledge Graphs
Introduction of Knowledge Graphs
 
Intro to Neo4j - Nicole White
Intro to Neo4j - Nicole WhiteIntro to Neo4j - Nicole White
Intro to Neo4j - Nicole White
 
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
 
3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning
 
GPT and Graph Data Science to power your Knowledge Graph
GPT and Graph Data Science to power your Knowledge GraphGPT and Graph Data Science to power your Knowledge Graph
GPT and Graph Data Science to power your Knowledge Graph
 
Knowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptx
Knowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptxKnowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptx
Knowledge Graphs and Generative AI_GraphSummit Minneapolis Sept 20.pptx
 
Road to NODES - Healthcare Analytics
Road to NODES - Healthcare AnalyticsRoad to NODES - Healthcare Analytics
Road to NODES - Healthcare Analytics
 
Workshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data ScienceWorkshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data Science
 
Introduction to Graph Databases
Introduction to Graph DatabasesIntroduction to Graph Databases
Introduction to Graph Databases
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
 
Schemaless Databases
Schemaless DatabasesSchemaless Databases
Schemaless Databases
 
Introduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AIIntroduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AI
 
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
 
Graph Representation Learning
Graph Representation LearningGraph Representation Learning
Graph Representation Learning
 
ntroducing to the Power of Graph Technology
ntroducing to the Power of Graph Technologyntroducing to the Power of Graph Technology
ntroducing to the Power of Graph Technology
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge Graphs
 

Similar to What Is GDS and Neo4j’s GDS Library

Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
Caserta
 
Graph Data Science with Neo4j: Nordics Webinar
Graph Data Science with Neo4j: Nordics WebinarGraph Data Science with Neo4j: Nordics Webinar
Graph Data Science with Neo4j: Nordics Webinar
Neo4j
 
Introduction to Neo4j for the Emirates & Bahrain
Introduction to Neo4j for the Emirates & BahrainIntroduction to Neo4j for the Emirates & Bahrain
Introduction to Neo4j for the Emirates & Bahrain
Neo4j
 
Big data week 2018 - Graph Analytics on Big Data
Big data week 2018 - Graph Analytics on Big DataBig data week 2018 - Graph Analytics on Big Data
Big data week 2018 - Graph Analytics on Big Data
Christos Hadjinikolis
 
Graphs for Finance - AML with Neo4j Graph Data Science
Graphs for Finance - AML with Neo4j Graph Data Science Graphs for Finance - AML with Neo4j Graph Data Science
Graphs for Finance - AML with Neo4j Graph Data Science
Neo4j
 

Similar to What Is GDS and Neo4j’s GDS Library (20)

GraphTour London 2020 - Graphs for AI, Amy Hodler
GraphTour London 2020  - Graphs for AI, Amy HodlerGraphTour London 2020  - Graphs for AI, Amy Hodler
GraphTour London 2020 - Graphs for AI, Amy Hodler
 
GraphTour 2020 - Graphs & AI: A Path for Data Science
GraphTour 2020 - Graphs & AI: A Path for Data ScienceGraphTour 2020 - Graphs & AI: A Path for Data Science
GraphTour 2020 - Graphs & AI: A Path for Data Science
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
 
How Graphs are Changing AI
How Graphs are Changing AIHow Graphs are Changing AI
How Graphs are Changing AI
 
Graph Data Science with Neo4j: Nordics Webinar
Graph Data Science with Neo4j: Nordics WebinarGraph Data Science with Neo4j: Nordics Webinar
Graph Data Science with Neo4j: Nordics Webinar
 
Azure Databricks for Data Scientists
Azure Databricks for Data ScientistsAzure Databricks for Data Scientists
Azure Databricks for Data Scientists
 
Neo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j Graph Data Science - Webinar
Neo4j Graph Data Science - Webinar
 
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4jTransforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
 
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4jTransforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
 
Neo4j GraphTalk Oslo - Introduction to Graphs
Neo4j GraphTalk Oslo - Introduction to GraphsNeo4j GraphTalk Oslo - Introduction to Graphs
Neo4j GraphTalk Oslo - Introduction to Graphs
 
Relationships Matter: Using Connected Data for Better Machine Learning
Relationships Matter: Using Connected Data for Better Machine LearningRelationships Matter: Using Connected Data for Better Machine Learning
Relationships Matter: Using Connected Data for Better Machine Learning
 
State of Florida Neo4j Graph Briefing - Cyber IAM
State of Florida Neo4j Graph Briefing - Cyber IAMState of Florida Neo4j Graph Briefing - Cyber IAM
State of Florida Neo4j Graph Briefing - Cyber IAM
 
Government GraphSummit: Leveraging Graphs for AI and ML
Government GraphSummit: Leveraging Graphs for AI and MLGovernment GraphSummit: Leveraging Graphs for AI and ML
Government GraphSummit: Leveraging Graphs for AI and ML
 
Introduction to Neo4j for the Emirates & Bahrain
Introduction to Neo4j for the Emirates & BahrainIntroduction to Neo4j for the Emirates & Bahrain
Introduction to Neo4j for the Emirates & Bahrain
 
Leveraging Graphs for Better AI
Leveraging Graphs for Better AILeveraging Graphs for Better AI
Leveraging Graphs for Better AI
 
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceGet Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
 
Einstieg in Neo4j Graph Data Science
Einstieg in Neo4j Graph Data ScienceEinstieg in Neo4j Graph Data Science
Einstieg in Neo4j Graph Data Science
 
Big data week 2018 - Graph Analytics on Big Data
Big data week 2018 - Graph Analytics on Big DataBig data week 2018 - Graph Analytics on Big Data
Big data week 2018 - Graph Analytics on Big Data
 
Graphs for Finance - AML with Neo4j Graph Data Science
Graphs for Finance - AML with Neo4j Graph Data Science Graphs for Finance - AML with Neo4j Graph Data Science
Graphs for Finance - AML with Neo4j Graph Data Science
 
Leveraging Graphs for Better AI
Leveraging Graphs for Better AILeveraging Graphs for Better AI
Leveraging Graphs for Better AI
 

More from Neo4j

More from Neo4j (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with Graph
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Recently uploaded (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

What Is GDS and Neo4j’s GDS Library

  • 1. Neo4j for Graph Data Science™ Alicia Frame @AliciaFrame1 Lead Product Manager for Graph Data Science Amy E. Hodler @AmyHodler Director, Product Marketing & Programs for Graph Data Science
  • 2. 2 • Graph Data Science (GDS) • Neo4j for GDS and the GDS Library • DEMO! • Questions #GraphDataScience #Neo4j Alicia Frame Lead Product Manager Graph Data Science Amy E. Hodler Director, Product Marketing & Programs Graph Data Science
  • 3. It’s Not What You Know
  • 4. It’s Who You Know And Where They Are
  • 6. Photo by Helena Lopes on Unsplash Network Structure is highly predictive of pay and promotions • People Near Structural Holes • Organizational Misfits “Organizational Misfits and the Origins of Brokerage in Intrafirm Networks” A. Kleinbaum “Structural Holes and Good Ideas” R. Burt
  • 7. Relationships and Network Structure Strongest Predictors of Behavior & Complex Outcomes “Research into networks reveal that, surprisingly, the most connected people inside a tight group within a single industry are less valuable than the people who span the gaps ...” 7 “…jumping from ladder to ladder is a more effective strategy, and that lateral or even downward moves across an organization are more promising in the longer run . . .”
  • 9. Which is why graph data science is so powerful 9
  • 10. “Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data.” - Wikipedia 10 What is data science? Data scientists use data to answer questions.
  • 11. Graph Data Science is a science-driven approach to gain knowledge from the relationships and structures in data, typically to power predictions. 11 What is Graph data science? Data scientists use relationships to answer questions.
  • 12. Query (e.g. Cypher/Python) Real-time, local decisioning and pattern matching Graph Algorithms Global analysis and iterations You know what you’re looking for and making a decision You’re learning the overall structure of a network, updating data, and predicting Local Patterns Global Computation
  • 13. Relationships and network structures are highly predictive and underutilized – and already in your data. Graph are a natural way to store and use this predictive information, but different than what you’re doing today. How do you continually put more accurate, predictive models into production quickly?
  • 15. 15 • 27 Million warranty & service documents parsed for text to knowledge graph • Graph is context for AI to learn “prime examples” and anticipate maintenance • Improves satisfaction and equipment lifespan • Connecting 50 research databases, 100k’s of Excel workbooks, 30 bio-sample databases • Bytes 4 Diabetes Award for use of a knowledge graph, graph analytics, and AI • Customized views for flexible research angles • Almost 70% of CC fraud was missed • ~1B Nodes and Relationships to analyse • Graph analytics with queries & algorithms help find $ millions of fraud in 1st year Improving Analytics, ML & AI for Enterprises Caterpillar’s AI Supply Chain & Maintenance German Center for Diabetes Research (DZD) Financial Fraud Detection & Recovery Top 10 Bank
  • 16. Evolution of Graph Data Science Decision Support Graph Based Predictions Predictions within a Graphs 16 Graph Feature Engineering Graph Embeddings Graph Native Learning Knowledge Graphs Graph Analytics 16
  • 17. Evolution of Graph Data Science Graph Feature Engineering Graph Embeddings Graph Networks 17 Graph AnalyticsKnowledge Graphs Graph search and queries Support domain experts
  • 18. Deceptively Simple Queries Collaborative filtering: users who bought X, also bought Y (open-ended pattern matching) What items make you more likely to buy additional items in subsequent transactions? Traverse hierarchies - what items are similar 4+ hops out? Difficult for RDMS systems Knowledge Graph Queries e.g. Retail Recommendation 18
  • 19. Evolution of Graph Data Science Graph Feature Engineering Graph Embeddings Graph Networks 19 Knowledge Graphs Graph Analytics Graph queries & algorithms for offline analysis Understanding Structures
  • 20. • Degree Centrality • Closeness Centrality • CC Variations: Harmonic, Dangalchev, Wasserman & Faust • Betweenness Centrality • Approximate Betweenness Centrality • PageRank • Personalized PageRank • ArticleRank • Eigenvector Centrality • Triangle Count • Clustering Coefficients • Connected Components (Union Find) • Strongly Connected Components • Label Propagation • Louvain Modularity • Balanced Triad (identification) Graph Algorithms & Functions in Neo4j • Shortest Path • Single-Source Shortest Path • All Pairs Shortest Path • A* Shortest Path • Yen’s K Shortest Path • Minimum Weight Spanning Tree • K-Spanning Tree (MST) • Random Walk • Degree Centrality • Closeness Centrality • CC Variations: Harmonic, Dangalchev, Wasserman & Faust • Betweenness Centrality & Approximate • PageRank • Personalized PageRank • ArticleRank • Eigenvector Centrality • Triangle Count • Clustering Coefficients • Connected Components (Union Find) • Strongly Connected Components • Label Propagation • Louvain Modularity • K-1 Coloring • Modularity Optimization • Euclidean Distance • Cosine Similarity • Node Similarity (Jaccard) • Overlap Similarity • Pearson Similarity • Approximate KNN Pathfinding & Search Centrality / Importance Community Detection Similarity Link Prediction • Adamic Adar • Common Neighbors • Preferential Attachment • Resource Allocations • Same Community • Total Neighbors ...and also Auxiliary Functions: • Random graph generation • One hot encoding • Distributions & metrics 45
  • 21. Graph Algorithms e.g. Retail Recommendations Graph algorithms enable reasoning about network structure Louvain to identify customer segmentation based on topology PageRank to measure transaction volumes Connected components identify unique users Jaccard to measure purchasing similarity 21
  • 22. Evolution of Graph Data Science Graph Embeddings Graph Networks 22 Knowledge Graphs Graph Analytics Graph Feature Engineering Graph algorithms & queries for machine learning Improve Prediction Accuracy
  • 23. Graph Feature Engineering Feature Engineering is how we combine and process the data to create new, more meaningful features, such as clustering or connectivity metrics. 23
  • 24. Evolution of Graph Data Science Decision Support Graph Based Predictions Predictions within a Graphs 24 Graph Feature Engineering Graph Embeddings Graph Native Learning Knowledge Graphs Graph Analytics FUTURE 24
  • 25. for Enterprise Graph Data Science Neo4j Graph Data Science Library Scalable Graph Algorithms & Analytics Workspace Native Graph Creation & Persistence Neo4j Database Visual Graph Exploration & Prototyping Neo4j Bloom Practical Integrated Intuitive
  • 26. A graph catalog that creates an efficient analytics workspace • Reshape transactional database into an in memory analytics graph, and manage these operations • Optimized for analytics with global traversals and aggregation Algorithms • Run on the loaded graph to compute metrics about the topology and connectivity • Highly parallelized and scale to billions of nodes 26 What is the GDS Library?
  • 27. • Neo4j automates data transformations • Fast iterations & layering • Production ready features, parallelization & enterprise support Neo4j for GDS enterprise-grade features and scale    A graph-specific analytics workspace that’s mutable – integrated with a native-graph database Mutable In-Memory Workspace Computational Graph Native Graph Store
  • 28. Answer previously intractable questions with the data you already have • Deep Path Analytics & Structural Pattern Matching • Community & Neighbors Detection • Influencer and Risk Identification • Disambiguation • Link and Behavior Prediction Massive scale to 10’s billions of nodes with optimized algorithms Increase your predictive accuracy with Neo4j GDS Algorithms   Take advantage of hardened, validated graph algorithms that enable reasoning about network structure.
  • 29. Find Value Faster with Neo4j’s practical Graph Data Science framework Drastically simplified and standardized API that enables custom, flexible configurations Documentation, training, and examples so getting started is simple Explore graphs and algorithm results visually with Bloom Share insights across teams for better collaboration Friendly data science experience with logical guardrails like memory mgmt. Reshapping, node & relationship aggregation / deduplication and multipartite algos
  • 30. 30 Simplify Your Data Science Experience Dozens of libraries, hundreds of algos & no docs! How do we shape data into a graph in the first place? We’ve picked a library...good luck learning the syntax WTF? We have to build the entire ETL pipeline for this? Are the results right? How do we get into production? Data Modeling Which Algorithms? Learn Syntax What Now? Reshape
  • 31. 31 Simplify Your Data Science Experience Dozens of libraries, hundreds of algos & no docs! With Neo4j it’s already a graph We’ve picked a library...good luck learning the syntax WTF? We have to build the entire ETL pipeline for this? Are the results right? How do we get into production? Data Modeling Which Algorithms? Learn Syntax What Now? Reshape We have validated algos, clear docs, and tutorials Neo4j syntax is standardized and simplified We seamlessly reshape your data with 1 command Easily write results to Neo4j & move straight into production
  • 33. • Real data from online retailer in the UK with all-occasion items • 28K nodes and 1.1M relationships • “Data mining for the online retail industry” by D. Chen et al • Neo4j 4.0 and Graph Data Science Library 1.2 33 Retail Data
  • 34. 1. Customer segmentation a. Break down the graph into customers with similar buying patterns i. Similarities, Community Detection, Mutate & Export Graph 2. Item recommendations a. What item should we recommend to different customer segments? i. Co-Purchase Similarity, Centarlities, Mutate & Export Graph 3. Explore and answer our business questions a. Recommendation for a person that buys something specific? b. What to promote to drive sales in a category? 34 Retail Demo
  • 35. Thank You & Questions 35 - O’Reilly Book on Graph Algorithms neo4j.com/graph-algorithms-book/ - GDS website, whitepapers, links neo4j.com/use-cases/graph-data-s cience-artificial-intelligence/ - Data for Retail example github.com /AliciaFrame/GDS_Retail_Demo - GDS Sandbox sandbox.neo4j.com/ ?usecase=graph-data-science Resources Alicia Frame Alicia.Frame@Neo4j.com @AliciaFrame1 Amy E. Hodler Amy.Hodler@Neo4j.com @AmyHodler