Graphs for Ai and ML

Graphs for AI and ML
Dr. Jim Webber
Chief Scientist, Neo4j
@jimwebber

● Some definitions
● Accidental Skynet
● Graph theory
● Contemporary graph ML
● The future of graph AI
Overview

● ML - Machine Learning
○ Finding functions from historical data to guide future
interactions within a given domain
● AI - Artificial Intelligence
● The property of a system that it appears intelligent to its users
● Often, but not always, using ML techniques
● Or ML implementations that can be cheaply retrained to address
neighbouring domains
A Bluffer’s Guide to AI-cronyms

● Predictive analytics
● Use past data to predict the future
● General purpose AI
● ML with transfer learning such that learned experiences in one
domain can be applied elsewhere
● Human-like AI
Often conflated with

ML all the things
Where are we today?

Extract all the features!
• What do we do? Turn it to
vectors and pump it through a
classification or regression
model
• That’s actually not a bad
thing
• But we can do so much before
we even get to ML…
• … if we have graph data

Credit: Graph Algorithms, Holder and Needham, O’Reilly 2019

http://www.bbc.co.uk/london/travel/downloads/tube_map.html

• Nodes with optional properties and optional labels
• Named, directed relationships with optional properties
• Relationships have exactly one start and end node
• Which may be the same node
Labeled Property graph model

Fearless querying
MATCH path = (:author {name:’Jim Webber’}
-[*]->(:character {name:’The Doctor’})
RETURN path
OR
MATCH (me:author {name:’Jim Webber’},
(doc:character {name:’The Doctor’}),
path = shortestPath((me)-[*]->(doc))
RETURN path

Take a step back
We can be smarter about this

Realtime Predictive Analytics
(circa 2008)
+ +
=

Not AI, but extremely effective
Credit: https://medium.com/basecs/breaking-down-breadth-first-search-cebe696709d9

Credit:
https://www.networkworld.com/article/3211410
/lan-wan/the-10-most-powerful-companies-in-
enterprise-networking.html

Toolkit matures into
proper database
• Cypher and Neo4j server make
real time graph analytical
patterns simple to apply
• Amazing and humane to
implement

Firstname:
Mickey
Surname: Smith
DoB: 19781006
SKU: 5e175641
Product:
Badgers
Nadgers Ale
SKU: 2555f258
Product:
Peewee Pilsner
Category: beer
SKU: 49d102bc
Product: Baby
Dry Nights
Category:
nappies
Category: baby Category:
alcoholic
drinks
SKU: 49d102bc
Product: XBox
360
Category:
consumer
electronics
Category:
console
BOUGHTBOUGHT
MEMBER_OF
MEMBER_OFMEMBER_OF
MEMBER_OFMEMBER_OF

Firstname: *
Surname: *
DoB: 1996 > x
> 1972
Category: beerCategory:
nappies
BOUGHTCategory: game
console
Young fathers pattern

Firstname: *
Surname: *
DoB: 1996 > x
> 1972
Category: beerCategory:
nappies
!BOUGHTCategory: game
console
Business opportunity

(beer)(nappies)
(console)
(daddy)
() ()
()

(d)-[:BOUGHT]->()-[:MEMBER_OF]->(n)
(d)-[:BOUGHT]->()-[:MEMBER_OF]->(b)
(d)-[:BOUGHT]->()-[:MEMBER_OF]->(c)
Flatten the graph

(d:Person)-[:BOUGHT]->()-[:MEMBER_OF]->(n:Category)
(d:Person)-[:BOUGHT]->()-[:MEMBER_OF]->(b:Category)
(d:Person)-[:BOUGHT]->()-[:MEMBER_OF]->(c:Category)
Include any labels

MATCH (d:Person)-[:BOUGHT]->()-[:MEMBER_OF]->(n:Category),
(d:Person)-[:BOUGHT]->()-[:MEMBER_OF]->(b:Category)
Add a MATCH clause

(d:Person)-[:BOUGHT]->()-[:MEMBER_OF]->(b:Category),
(c:Category)
WHERE NOT((d)-[:BOUGHT]->()-[:MEMBER_OF]->(c))
Constrain the Pattern

(c:Category)
WHERE n.category = "nappies" AND
b.category = "beer" AND
c.category = "console" AND
NOT((d)-[:BOUGHT]->()-[:MEMBER_OF]->(c))
Add property constraints

(c:Category)
WHERE n.category = "nappies" AND
b.category = "beer" AND
c.category = "console" AND
NOT((d)-[:BOUGHT]->()-[:MEMBER_OF]->(c))
RETURN DISTINCT d AS daddy
Profit!

==> +---------------------------------------------+
==> | daddy |
==> +---------------------------------------------+
==> | Node[15]{name:"Rory Williams",dob:19880121} |
==> +---------------------------------------------+
==> 1 row
==> 0 ms
==>
neo4j-sh (0)$
Results

Which sushi restaurants
in NYC do my friends
like?
Facebook Graph Search
See http://maxdemarzi.com/

Simple Query, Intelligent Results
MATCH (:Person {name: 'Jim'})
-[:IS_FRIEND_OF]->(:Person)
-[:LIKES]->(restaurant:Restaurant)
-[:LOCATED_IN]->(:Place {location: 'New York'}),
(restaurant)-[:SERVES]->(:Cuisine {cuisine: 'Sushi'})
RETURN restaurant

Graph Theory
• Rich knowledge of how graphs
operate in many domains
• Off the shelf algorithms to
process those graphs for
information, insight, predictions
• Low barrier to entry
• Amazingly powerful

Triadic Closure
name: Kyle
name: Stan name: Kenny

Triadic Closure
name: Kyle
name: Kyle
FRIEND

Structural Balance
name:
Cartman
name: Craig name: Tweek

Structural Balance
name:
Cartman
name:
Cartman
FRIEND

Structural Balance
name:
Cartman
name:
Cartman
ENEMY

Structural Balance
name: Kyle
name: Kyle
FRIEND

Structural Balance is a key
predictive technique
And it’s domain-agnostic

Allies and Enemies
UK
GermanyFrance
Russia Italy
Austria

Predicting WWI
[Easley and Kleinberg]

It if a node has strong relationships to two neighbours, then these
neighbours must have at least a weak relationship between them.
[Wikipedia]
Strong Triadic Closure

Triadic Closure
(weak relationship)
name: Kenny
name: Stan name: Cartman

Triadic Closure
(weak relationship)
name: Kenny
name: Kenny
FRIEND 50%

• Relationships can have “strength” as well as intent
• Think: weighting on a relationship in a property graph
• Weak links play another super-important structural role in graph
theory
• They bridge neighbourhoods
Weak relationships

Local Bridges
FRIEND
name: Kenny
name: Stanname: Kyle
FRIEND
FRIEND
name: Sally
name: Bebename: Wendy
FRIEND
FRIEND 50%
name:
Cartman
FRIEND
ENEMY

“If a node A in a network satisfies the Strong Triadic Closure Property
and is involved in at least two strong relationships, then any local
bridge it is involved in must be a weak relationship.”
[Easley and Kleinberg]
Local Bridge Property

• (NP) Hard problem
• Repeatedly remove the spanning links between dense regions
• Or recursively merge nodes into ever larger “subgraph” nodes
• Choose your algorithm carefully – some are better than others for
a given domain
• Can use to (almost exactly) predict the
break up of the karate club!
Graph Partitioning

University Karate Clubs
(predicted by Graph Theory)
9

University Karate Clubs
(what actually happened!)

• Label Propagation
• Union Find / Weakly Connected Components
• Strongly Connected Components
• Triangle-Count / Clustering Coefficient
ClusteringCentrality
• PageRank
• Betweenness
• Closeness
• Degree
Path Finding
• Breadth-first search
• Depth-first search
• Single-source shortest path
• All-pairs shortest path
• Minimum weight spanning
tree
Graph Algorithms in Neo4j

Amazing Native Graph Performance

Credit: https://reezocar.blob.core.windows.net/blog/2015/09/k2000.jpg

Find and stop spammers
Extract graph structure over time
Not message content!
(Fakhraei et al, KDD 2015)
Learning to stop bad guys
Result: find and classify 70% spammers with 90% accuracy

Much of modern graph ML is still about turning graphs to vectors
Graph2Vec and friends
Highly complementary techniques
Mixing structural data and features gives better results
Better data into the model, better results out
But we don’t have to always vectorize graphs...
Graph ML

Knowledge Graphs
• Semantic domain knowledge for
inference and understanding
• E.g. eBay Google Assistant
• What’s the next best question to ask
when a potential customer says they
want a bag?
• Price? Function? Colour?
• Depends on context! Demographic,
history, user journey.
• Richly connected data makes the
system seem intelligent
• But it’s “just” data and algorithms in
reality

Graph Convolutional
Neural Networks
A general architecture for
predicting node and relationship
attributes in graphs.
(Kipf and Welling, ICLR 2017)
Credit: Andrew Docherty (CSIRO), YowData 2017
https://www.youtube.com/watch?v=Gmxz41L70Fg

Graph Networks for
Structured Causal Models
• Position paper from Google,
MIT, Edinburgh
• Structured representations and
computations (graphs) are key
• Goal: generalize beyond direct
experience
• Like human infants can
https://arxiv.org/pdf/1806.01261.pdf

Thanks for listening
Ask the experts session tomorrow 14:30
@jimwebber

Graphs for Ai and ML

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Graphs for Ai and ML

Similar to Graphs for Ai and ML (20)

More from Neo4j

More from Neo4j (20)

Recently uploaded

Recently uploaded (20)

Graphs for Ai and ML

Editor's Notes