Logical Inference in a Hyper-Relational Database

Join our community at grakn.ai/community
T H E D A T A B A S E F O R A I
Logical Inference in a  
Hyper-Relational database
 
By Domenico Corapi
Lead Engineer at GRAKN.AI
1

Follow us @GraknLabs
• @DomenicoCorapi
• PhD in Computational Logic /
Machine Learning at Imperial
College
• Web Services and Distributed
Systems at Microsoft Yammer and
Skyscanner
• Finance stuff at Citi and
Bloomberg
WHO AM I?

What is inference?
LAZY INFERENCE IN GRAKN

• We know some facts about the
world: a Knowledge Base
• Using some inference rules,
what other facts can we derive?
Knowledge
Base
Conclusions
Inference
INFERENCE

• Deductive reasoning
• Abductive reasoning
• Inductive reasoning
• Probabilistic reasoning
• …
INFERENCE
Knowledge
Base
Today is a sunny day
I don’t work if it’s a sunny day
or if it’s a bank holiday
Conclusions
Inference
I don’t work today
Deduction

INFERENCE
Knowledge
Base
Conclusions
Inference
I don’t work today
SELECT * FROM …
Sunny days table, bank holiday table
+ Application logic

Some notation and
useful concepts

SOME NOTATION
First-order logic Formal logic
Computational logic
Prolog
Datalog
Answer Set Programming
Graql rules

SOME NOTATION
• human(socrates) 
socrates is human
• human(X) 
X is human
• likes(socrates, X), paradox(X) 
socrates likes X and X is a paradox
• likes(socrates, X) <- paradox(X) 
socrates likes X if X is a paradox
• h <- b1, b2, …, bn 
Definite clause

DEDUCTION
• human(socrates)
• human(X) -> mortal(X)
• mortal(socrates)

DEDUCTION
• good_at(me, numbers)
• made_of(stock_market, numbers)
• good_at(me, stock_market)
?
good_at(P, X),
made_of(Y, X) ->
good_at(P, Y)

HERBRAND BASE
• human(socrates)
• human(socrates)
A Herbrand Base is the
set of all possible ground
(no variables) assertions
Herbrand Base:

INTERPRETATION
• human(socrates)
A Interpretation is an
assignment of truth
values to elements in the
Herbrand Base
Herbrand Base:
• {}
• {human(socrates)}
• {mortal(socrates)}
• {mortal(socrates), human(socrates)}
Possible interpretations:

MODEL
• human(socrates)
• human(X) -> mortal(X)A Model for a Knowledge
Base KB is an
interpretation of KB that
satisfies each clause in KB
Possible interpretations:
• {}
• {human(socrates)}
• {mortal(socrates)}
• {mortal(socrates), human(socrates)}Model

SOME FUNDAMENTAL THEOREM
A Knowledge Base with no negation has one and only
one minimal model

COMPUTATIONAL LOGIC IN A NUTSHELL
KB ⊨ G
G is entailed by KB
KB ⊢ G
G is provable from KB
• What language is KB in?
• What semantics is represented by ⊨?
• What proof procedure is used in ⊢?
• In what conditions is true?

CLOSED WORLD ASSUMPTION
Flights
Airline Number Origin Dest
AS 98 ANC SEA
AA 2336 LAX PBI
US 840 SFO CLT
AA 258 LAX MIA
AS 153 SEA ANC
…
select * from flights
where origin = ‘LHR’
What do we know about entries that are NOT in the table?
Not added yet?
They don’t exist?
We don’t know?

NEGATION IS HARD
man(adam).
single(X) <- man(X), not husband(X).
husband(X) <- man(X), not single(X).
Herbrand Base:
{man(adam),  
single(adam),
husband(adam)}
Minimal models:
• {man(adam), single(adam)}
• {man(adam), husband(adam)}

Bottom Up
Inference

BOTTOM UP INFERENCE
• Answer Set Programming
• Multiple models as answer
• Use of disjunction, cardinality constraints, integrity constraints,
classical negation, cwa negation… (no functors)
• Variables are typed

BOTTOM UP INFERENCE
1. Ground the knowledge base
2. Compute the models

GROUNDING
• human(plato)
• human(socrates)
• human(socrates) -> mortal(socrates)
• human(plato) -> mortal(plato)
• human(plato)
• human(socrates)

COMPUTE THE MODELS
(NOT human(socrates) OR mortal(socrates))  
AND  
(NOT human(plato) OR mortal(plato))  
AND  
human(plato)  
AND  
human(socrates)
SAT SOLVER

Grakn

Entity
GRAKN KNOWLEDGE MODEL
has has
plays relates
Attribute
Relationship
Role

Entity
GRAKN KNOWLEDGE MODEL
Attribute
Relationship
Role
socrates, plato
mortal
e.g. mortal(socrates)
teaches
e.g. teaches(socrates, plato)
teacher, student
e.g. teaches(teacher: socrates,
student: plato)

LOCATIONS

Lazy inference in
Grakn

RULES IN GRAKN
human(X) -> mortal(X)
when {
$x isa human
}, then {
$x has mortal “true”
}
human sub mortal

RULES IN GRAKN
is_located_in(X, Y), is_located_in(Y, Z) -> is_located_in(X, Z)
when {
(geo-entity: $x, entity-location: $y) isa is-located-in;
(geo-entity: $y, entity-location: $z) isa is-located-in;
}, then {
(geo-entity: $x, entity-location: $z) isa is-located-in;
}

STEP BY STEP
match $x isa city; $y has name 'Poland'; ($x, $y) isa is-located-in; get;
iterator.next()
1. Pop a subgoal from the stack
2. If subgoal is an answer return
3. Else expand goal into subgoals and push to stack. 
Back to 1.

STEP BY STEP
$y has name ‘Poland’
($x, $y) isa is-located-in
match $x isa city
G
lookup $y such that $y has name ‘Poland’
match $x isa city
($x, NODE_POLAND) isa is-located-in
match $x isa city
$y / NODE_POLAND
lookup ($x, NODE_POLAND) isa is-located-in
match $x isa city

STEP BY STEP
match $x isa city
G
G
match $x isa city
$y / NODE_POLAND
match $x isa city
…
lookup $y has name Poland
…
match NODE_SILESIA isa city
$x / NODE_SILESIA

STEP BY STEP
match $x isa city
G
G
match $x isa city
$y / NODE_POLAND
match $x isa city
$x / NODE_SILESIA
{}

STEP BY STEP
match $x isa city
$x / NODE_SILESIA
match NODE_MASOVIA isa city
{}
$x / NODE_MASOVIA
{}
match $x isa city is-located-in($x, NODE_POLAND)
<- is-located-in($x, $z),
is-located-in($z, NODE_POLAND)
($x, $z) isa is-located-in
($z, NODE_POLAND) isa is-located-in 
match $x isa city
is_located_in(X, Z) <-
is_located_in(X, Y),
is_located_in(Y, Z)
X/$x
Z/NODE_POLAND

STEP BY STEP
($x, $z) isa is-located-in
($z, NODE_POLAND) isa is-located-in
($x, NODE_MASOVIA) isa is-located-in 
match $x isa city
$z / NODE_MASOVIA
Answer: {$x: NODE_WARSAW}
…
…

SLD RESOLUTION
This is very similar to what Prolog implementations do
Differences
• Things are represented as nodes! No index lookup, but just getting the edges
• Interleaving graph lookups with resolution steps
• Type checks and roles in relationships
• Fetching results lazily. We keep a pointer rather than saving goals on the stack
• Caching intermediate results (similar to tabling in Prolog)

CONCLUSION
Inference is a more powerful and general way to derive conclusions from data.
Grakn uses rules to define higher level concepts and applies a top-down procedure to perform
inference.
Lookups are expensive, we need to be lazy. 
Using a stack to save the state of the computation. We keep pointers to nodes in the underlying
graph from which we resume the inference procedure.

Thank you!
@GraknLabs
https://grakn.ai/
https://github.com/graknlabs/grakn
domenico@grakn.ai
@DomenicoCorapi

Logical Inference in a Hyper-Relational Database

Recommended

Recommended

More Related Content

What's hot

What's hot (8)

Similar to Logical Inference in a Hyper-Relational Database

Similar to Logical Inference in a Hyper-Relational Database (20)

More from Vaticle

More from Vaticle (20)

Recently uploaded

Recently uploaded (20)

Logical Inference in a Hyper-Relational Database