SlideShare a Scribd company logo
1 of 42
Download to read offline
Mariano Rodríguez-Muro and Diego Calvanese
KRDB Research Group
Free University of Bozen-Bolzano
KR’12
Realizing OBDA
with data dependencies
Thursday, June 14, 2012
OBDA and -Quest-
DL-Lite (OWL 2 QL) - promise
• Light-weight DL
• Allows for QA by query rewriting into FO-queries (i.e,. SQL)
• Mapping techniques that allow for OBDA
(sound, GAV mappings)
• Fast query rewriting techniques
• Implementations available
2
Thursday, June 14, 2012
OBDA and -Quest-
• UCQ = 729 CQs,
UNION of 729 SPJ SQL
queries
• Datalog = 9 rules,
1 SPJ of UNIONS (3
nested UNIONS of 9
elements each)
An trivial example
3
A
B
E D F
C
G H I
q(x) ← A(x), R(x, y), A(y), R(y, z), A(z)
Thursday, June 14, 2012
OBDA and -Quest-
• Focused on “size” of the
rewriting and dealing
with
ended up playing an
“encoding” game...
• The role of SQL engines,
their features and
limitations has been
neglected
DL-Lite (OWL 2 QL) - reality
• Query answering by query
rewriting still slow, but
not due to the rewriting
technique
• RDBMS cannot handle our
queries efficiently
4
A  ∃R
Thursday, June 14, 2012
OBDA and -Quest- 5
“Efficient query answering in OBDA or QAO requires exploiting
and optimizing every element/resource in the query answering
system, not just query rewriting algorithms”
Thursday, June 14, 2012
OBDA and -Quest-
ABOX DEPENDENCIES
Describing out data sources
6
Thursday, June 14, 2012
OBDA and -Quest-
QAO revisited
Reasoner Application
TBox
ABox
Inputs
Thursday, June 14, 2012
OBDA and -Quest-
QAO revisited
Reasoner Application
TBox
ABox
Inputs
Source
Thursday, June 14, 2012
OBDA and -Quest-
QAO revisited
Reasoner Application
TBox
ABox
Inputs
SourceOBDA
Model
Thursday, June 14, 2012
OBDA and -Quest-
OBDA
Reasoner
Source
Application
Direct
Communication
TBox
OBDA
Model
Inputs
Thursday, June 14, 2012
CardiacArrest  Condition
Clog  Condition
Patient  ∃name
Patient  ∃age
Patient  ∃ssn
Patient  ∃affectedBy
OBDA and -Quest-
An ontology
9
∃name  Patient
∃age  Patient
∃ssn  Patient
∃affectedBy  Patient
∃affectedBy−
 Condition
Thursday, June 14, 2012
OBDA and -Quest-
An example
id [PKEY] name age ssn
12 John 37 xxx-999
Table: patient
patient_id [FKEY] c_code
12 33
Table: conditionOBDA Model
Thursday, June 14, 2012
OBDA and -Quest-
An example
SELECT id,name,age,ssn FROM patient
:person/{$id} a Patient; name $name;
age $age^^xsd:int; ssn $ssn
id [PKEY] name age ssn
12 John 37 xxx-999
Table: patient
patient_id [FKEY] c_code
12 33
Table: conditionOBDA Model
Thursday, June 14, 2012
OBDA and -Quest-
An example
SELECT id,name,age,ssn FROM patient
:person/{$id} a Patient; name $name;
age $age^^xsd:int; ssn $ssn
id [PKEY] name age ssn
12 John 37 xxx-999
Table: patient
patient_id [FKEY] c_code
12 33
Table: condition
SELECT id, c_id FROM condition
:person/{$id} affectedBy :cond/{$id}.
:cond/{$id} a Condition
OBDA Model
Thursday, June 14, 2012
OBDA and -Quest-
An example
SELECT id,name,age,ssn FROM patient
:person/{$id} a Patient; name $name;
age $age^^xsd:int; ssn $ssn
id [PKEY] name age ssn
12 John 37 xxx-999
Table: patient
patient_id [FKEY] c_code
12 33
Table: condition
SELECT id, c_id FROM condition
:person/{$id} affectedBy :cond/{$id}.
:cond/{$id} a Condition
SELECT id FROM condition WHERE c_code = 33
:cond/{$id} a CardiacArrest
SELECT id,c_id FROM condition WHERE c_code = 27
:cond/{$id} a Clog
OBDA Model
Thursday, June 14, 2012
R := P | P−
∀x.B1(x) ∈ A → B2(x) ∈ A
∀x, y.R1(x, y) ∈ A → R2(x, y) ∈ A
B1 A B2
R1 A R2
OBDA and -Quest-
ABox dependencies for DL-Lite
11
Concept descriptions
Role descriptions
ABox constraints
• Semantics:
B := A | ∃R
• In addition to TBox and ABox (or OBDA model), we introduce
a set of ABox dependencies Sigma Σ
Thursday, June 14, 2012
CardiacArrest A Condition
Clog A Condition
Patient A ∃name
Patient A ∃age
Patient A ∃ssn
Patient A ∃affectedBy
∃name A Patient
∃age A Patient
∃ssn A Patient
∃affectedBy A Patient
∃affectedBy−
A Condition
OBDA and -Quest-
ABox constraints,example
12
Thursday, June 14, 2012
OBDA and -Quest-
ABOX CONSTRAINTS
13
Optimizing your TBox
Thursday, June 14, 2012
Given a TBox T and a set of ABox dependencies Σ we want to compute T 
s.t. each α ∈ T 
is not redundant w.r.t. Σ
OBDA and -Quest-
Redundancy inT w.r.t.Sigma
14
Thursday, June 14, 2012
Given a TBox T and a set of ABox dependencies Σ we want to compute T 
s.t. each α ∈ T 
is not redundant w.r.t. Σ
OBDA and -Quest-
Redundancy inT w.r.t.Sigma
14
A T -chain from B to C is a sequence of inclusion assertions (Bi  B
i)n
i=0
for some n ≥ 0, such that:
1. B0 = B, B
n = C, and
2. for 1 ≤ i ≤ n, we have that B
i−1 and Bi are basic concepts s.t., either
(i) B
i−1 = Bi, or (ii) B
i−1 = ∃R
and Bi = ∃R−
, for some basic role R
.
Respectively for role chains.
Chains in T and Sigma
Thursday, June 14, 2012
OBDA and -Quest-
Redundancy w.r.t.Sigma
15
Thursday, June 14, 2012
B  C is directly redundant in T w.r.t. Σ if (i) Σ |= B A C and (ii) for
every T -chain (Bi  B
i)n
i=0 with B
n = B in T , there is a Σ-chain (Bi A B
i)n
i=0.
Similarly for roles.
OBDA and -Quest-
Redundancy w.r.t.Sigma
16
Direct redundancy
T-chain Sigma-chain
Thursday, June 14, 2012
B  C is redundant in T w.r.t. Σ if
(a) it is directly redundant, or
(b) there exists B
= B s.t. (i) T |= B
 C, (ii) B
 C is not directly
redundant in T w.r.t. Σ, and (iii) B  B
is directly redundant.
OBDA and -Quest-
Redundancy w.r.t.Sigma
17
Redundancy
T-chain Sigma-chain
Thursday, June 14, 2012
OBDA and -Quest-
Redundancy w.r.t.Sigma
18
• We can compute T’ by reducing redundancy checks to
reachability in DAGs
• Computable in polynomial time
• Optimal (no redundant axioms in T’)
T 
is the set of inclusion assertions {α ∈ sat(T ) | α is not redundant in
sat(T ) w.r.t. sat(Σ)}.
Thursday, June 14, 2012
OBDA and -Quest-
ABOX CONSTRAINTS
19
Enforcing dependencies
Thursday, June 14, 2012
OBDA and -Quest-
OBDA with OWL 2 QL
OBDA
Model Manipulate mappings
Manipulate the data
Inference shifting. Separate
ground reasoning from existential
reasoning (efficiently)
Thursday, June 14, 2012
OBDA and -Quest-
OBDA with OWL 2 QL
Reasoner Application
TBox
ABox
Inputs
Thursday, June 14, 2012
OBDA and -Quest-
OBDA with OWL 2 QL
Reasoner Application
TBox
ABox
Inputs
Thursday, June 14, 2012
OBDA and -Quest-
OBDA with OWL 2 QL
Reasoner Application
TBox
ABox
Inputs
OBDA
Model
Thursday, June 14, 2012
OBDA and -Quest-
Creating semanticABox storage
24
ABox
1. Encode the hierarchies of T in indexes and intervals
Thursday, June 14, 2012
OBDA and -Quest-
Creating semanticABox storage
25
ABox
1. Encode the hierarchies of T in indexes and intervals
A
B
E D F
C
G H I
Thursday, June 14, 2012
OBDA and -Quest-
Creating semanticABox storage
26
ABox
1. Encode the hierarchies of T in indexes and intervals
A
B
E D F
C
G H I
1
2
3 4 5
6
7 8 9
Thursday, June 14, 2012
OBDA and -Quest-
Creating semanticABox storage
26
ABox
1. Encode the hierarchies of T in indexes and intervals
A
B
E D F
C
G H I
1
2
3 4 5
6
7 8 9
Insert into your DB using those indexes
C IDX
d 4
h 8
A = D(d), H(h)
Thursday, June 14, 2012
A
B
E D F
C
G H I
1 [1,8]
2 [2,5]
3
[3,3]
4
[4,4]
5
[5,5]
6 [6,9]
7
[7,7]
8
[8,8]
9
[9,9]
OBDA and -Quest-
Creating semanticABox storage
27
ABox
3. Define intervals to retrieve your data
Thursday, June 14, 2012
A
B
E D F
C
G H I
1 [1,8]
2 [2,5]
3
[3,3]
4
[4,4]
5
[5,5]
6 [6,9]
7
[7,7]
8
[8,8]
9
[9,9]
OBDA and -Quest-
Creating semanticABox storage
27
ABox
3. Define intervals to retrieve your data
?x a A
SELECT c FROM t WHERE
IDX = 1 AND IDX = 8
OBDA
Model
4. Create mappings using those intervals
Thursday, June 14, 2012
A
B
E D F
C
G H I
1 [1,8]
2 [2,5]
3
[3,3]
4
[4,4]
5
[5,5]
6 [6,9]
7
[7,7]
8
[8,8]
9
[9,9]
OBDA and -Quest-
Creating semanticABox storage
27
ABox
3. Define intervals to retrieve your data
?x a A
SELECT c FROM t WHERE
IDX = 1 AND IDX = 8
OBDA
Model
4. Create mappings using those intervals
5. Complement with mappings to cover
domain, range and inverse inferences
Thursday, June 14, 2012
Experiments
28
• Experimentation using Stanford’s “Resource Index”
• Semantic search over
annotated documents
• 200 ontologies from Bio-portal
(only hierarchies 200k concepts,
millions of subClassOf)
Thursday, June 14, 2012
Experiments
29
• Current system uses chase
• Naive chase: 7 days
• Optimized chase: 40 mins
• Cost 16 GB + 70 GB chase data
• Split second responses
• Pure query rewriting approaches
• not-feasible (UCQs or Datalog)
• Semantic index-based rewriting
• DAG computation and indexing 5 mins
• Cost: 16 GB
• Single queries, split second responses
Thursday, June 14, 2012
OBDA and -Quest-
CONCLUSIONS
Thursday, June 14, 2012
OBDA and -Quest-
Conclusions
• Introduced ABox dependencies to describe the structure
of data
• Showed how to optimize TBoxes w.r.t. dependencies,
• Introduced the idea of “shifting” inferences from TBox
reasoning to Mapping reasoning.
• Introduced “Semantic Index” repositories
• Not mentioned - “Equivalence optimization”
31
Thursday, June 14, 2012
OBDA and -Quest-
Conclusions
• Inference shifting is also possible in strict OBDA (see
T-mappings [AMW’09])
• All (including T-mappings) is implemented in Quest and
-ontop-, now available as a P4 plugin.
• Combined with a fast rewriting technique (see [Kontchakov
et. al 12] and [Rosati, 12]) T-mappings and Semantic indexes
allow for OBDA and QAO in practice
• Practical and Theoretical evidence that we have covered
relevant cases
• Time to put things to practice!
(and look at the future, hybrid
approaches, EL, Datalog+-, OWL 2 RL)
32
Thursday, June 14, 2012
OBDA and -Quest-
THANKYOU
Thursday, June 14, 2012

More Related Content

Viewers also liked (15)

Introduction to query rewriting optimisation with dependencies
Introduction to query rewriting optimisation with dependenciesIntroduction to query rewriting optimisation with dependencies
Introduction to query rewriting optimisation with dependencies
 
SWT Lab 3
SWT Lab 3SWT Lab 3
SWT Lab 3
 
4 sw architectures and sparql
4 sw architectures and sparql4 sw architectures and sparql
4 sw architectures and sparql
 
4 sesame
4 sesame4 sesame
4 sesame
 
SWT Lecture Session 4 - Sesame
SWT Lecture Session 4 - SesameSWT Lecture Session 4 - Sesame
SWT Lecture Session 4 - Sesame
 
SWT Lecture Session 4 - SW architectures and SPARQL
SWT Lecture Session 4 - SW architectures and SPARQLSWT Lecture Session 4 - SW architectures and SPARQL
SWT Lecture Session 4 - SW architectures and SPARQL
 
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfsSWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
 
SWT Lab 1
SWT Lab 1SWT Lab 1
SWT Lab 1
 
7 advanced uses of rdfs
7 advanced uses of rdfs7 advanced uses of rdfs
7 advanced uses of rdfs
 
SWT Lecture Session 8 - Inference in jena
SWT Lecture Session 8 - Inference in jenaSWT Lecture Session 8 - Inference in jena
SWT Lecture Session 8 - Inference in jena
 
SWT Lab 2
SWT Lab 2SWT Lab 2
SWT Lab 2
 
SWT Lecture Session 10 R2RML Part 1
SWT Lecture Session 10 R2RML Part 1SWT Lecture Session 10 R2RML Part 1
SWT Lecture Session 10 R2RML Part 1
 
DL'12 mastro at work
DL'12 mastro at workDL'12 mastro at work
DL'12 mastro at work
 
SWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDFSWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDF
 
Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...
Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...
Stanford'12 Intro to Ontology Based Data Access for RDBMS through query rewri...
 

Similar to KR12 Semantic Index and TBox optimisation with respect to dependencies

Ontology-based data access: why it is so cool!
Ontology-based data access: why it is so cool!Ontology-based data access: why it is so cool!
Ontology-based data access: why it is so cool!Josef Hardi
 
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Olaf Hartig
 
Jorge Silva, Sr. Research Statistician Developer, SAS at MLconf ATL - 9/18/15
Jorge Silva, Sr. Research Statistician Developer, SAS at MLconf ATL - 9/18/15Jorge Silva, Sr. Research Statistician Developer, SAS at MLconf ATL - 9/18/15
Jorge Silva, Sr. Research Statistician Developer, SAS at MLconf ATL - 9/18/15MLconf
 
袁晓如:大数据时代可视化和可视分析的机遇与挑战
袁晓如:大数据时代可视化和可视分析的机遇与挑战袁晓如:大数据时代可视化和可视分析的机遇与挑战
袁晓如:大数据时代可视化和可视分析的机遇与挑战hdhappy001
 
Compact binary-tree-representation-of-logic-function-with-enhanced-throughput-
Compact binary-tree-representation-of-logic-function-with-enhanced-throughput-Compact binary-tree-representation-of-logic-function-with-enhanced-throughput-
Compact binary-tree-representation-of-logic-function-with-enhanced-throughput-Cemal Ardil
 
RVC: A Multi-Decoder CAL Composer Tool
RVC: A Multi-Decoder CAL Composer ToolRVC: A Multi-Decoder CAL Composer Tool
RVC: A Multi-Decoder CAL Composer ToolMDC_UNICA
 
Power JSON with PostgreSQL
Power JSON with PostgreSQLPower JSON with PostgreSQL
Power JSON with PostgreSQLEDB
 
Learning do discover: machine learning in high-energy physics
Learning do discover: machine learning in high-energy physicsLearning do discover: machine learning in high-energy physics
Learning do discover: machine learning in high-energy physicsBalázs Kégl
 
Exchanging OWL 2 QL Knowledge Bases
Exchanging OWL 2 QL Knowledge BasesExchanging OWL 2 QL Knowledge Bases
Exchanging OWL 2 QL Knowledge Basesnet2-project
 
Compressing Graphs and Indexes with Recursive Graph Bisection
Compressing Graphs and Indexes with Recursive Graph Bisection Compressing Graphs and Indexes with Recursive Graph Bisection
Compressing Graphs and Indexes with Recursive Graph Bisection aftab alam
 
CMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureCMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureMasud Rahman
 
Data Time Travel by Delta Time Machine
Data Time Travel by Delta Time MachineData Time Travel by Delta Time Machine
Data Time Travel by Delta Time MachineDatabricks
 
polystore_NYC_inrae_sysinfo2021-1.pdf
polystore_NYC_inrae_sysinfo2021-1.pdfpolystore_NYC_inrae_sysinfo2021-1.pdf
polystore_NYC_inrae_sysinfo2021-1.pdfRim Moussa
 

Similar to KR12 Semantic Index and TBox optimisation with respect to dependencies (20)

Ontology-based data access: why it is so cool!
Ontology-based data access: why it is so cool!Ontology-based data access: why it is so cool!
Ontology-based data access: why it is so cool!
 
Property Alignment on Linked Open Data
Property Alignment on Linked Open DataProperty Alignment on Linked Open Data
Property Alignment on Linked Open Data
 
PAL
PALPAL
PAL
 
Fa18_P2.pptx
Fa18_P2.pptxFa18_P2.pptx
Fa18_P2.pptx
 
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
 
Jorge Silva, Sr. Research Statistician Developer, SAS at MLconf ATL - 9/18/15
Jorge Silva, Sr. Research Statistician Developer, SAS at MLconf ATL - 9/18/15Jorge Silva, Sr. Research Statistician Developer, SAS at MLconf ATL - 9/18/15
Jorge Silva, Sr. Research Statistician Developer, SAS at MLconf ATL - 9/18/15
 
袁晓如:大数据时代可视化和可视分析的机遇与挑战
袁晓如:大数据时代可视化和可视分析的机遇与挑战袁晓如:大数据时代可视化和可视分析的机遇与挑战
袁晓如:大数据时代可视化和可视分析的机遇与挑战
 
Compact binary-tree-representation-of-logic-function-with-enhanced-throughput-
Compact binary-tree-representation-of-logic-function-with-enhanced-throughput-Compact binary-tree-representation-of-logic-function-with-enhanced-throughput-
Compact binary-tree-representation-of-logic-function-with-enhanced-throughput-
 
RVC: A Multi-Decoder CAL Composer Tool
RVC: A Multi-Decoder CAL Composer ToolRVC: A Multi-Decoder CAL Composer Tool
RVC: A Multi-Decoder CAL Composer Tool
 
Data structure-question-bank
Data structure-question-bankData structure-question-bank
Data structure-question-bank
 
Lec11 object-re-id
Lec11 object-re-idLec11 object-re-id
Lec11 object-re-id
 
Power JSON with PostgreSQL
Power JSON with PostgreSQLPower JSON with PostgreSQL
Power JSON with PostgreSQL
 
Learning do discover: machine learning in high-energy physics
Learning do discover: machine learning in high-energy physicsLearning do discover: machine learning in high-energy physics
Learning do discover: machine learning in high-energy physics
 
Relational Databases_RDF Graphs_and_Constraints.pdf
Relational Databases_RDF Graphs_and_Constraints.pdfRelational Databases_RDF Graphs_and_Constraints.pdf
Relational Databases_RDF Graphs_and_Constraints.pdf
 
Exchanging OWL 2 QL Knowledge Bases
Exchanging OWL 2 QL Knowledge BasesExchanging OWL 2 QL Knowledge Bases
Exchanging OWL 2 QL Knowledge Bases
 
Compressing Graphs and Indexes with Recursive Graph Bisection
Compressing Graphs and Indexes with Recursive Graph Bisection Compressing Graphs and Indexes with Recursive Graph Bisection
Compressing Graphs and Indexes with Recursive Graph Bisection
 
Object oriented databases
Object oriented databasesObject oriented databases
Object oriented databases
 
CMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureCMPT470-usask-guest-lecture
CMPT470-usask-guest-lecture
 
Data Time Travel by Delta Time Machine
Data Time Travel by Delta Time MachineData Time Travel by Delta Time Machine
Data Time Travel by Delta Time Machine
 
polystore_NYC_inrae_sysinfo2021-1.pdf
polystore_NYC_inrae_sysinfo2021-1.pdfpolystore_NYC_inrae_sysinfo2021-1.pdf
polystore_NYC_inrae_sysinfo2021-1.pdf
 

More from Mariano Rodriguez-Muro (11)

SWT Lab 5
SWT Lab 5SWT Lab 5
SWT Lab 5
 
SWT Lecture Session 11 - R2RML part 2
SWT Lecture Session 11 - R2RML part 2SWT Lecture Session 11 - R2RML part 2
SWT Lecture Session 11 - R2RML part 2
 
SWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mappingSWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mapping
 
SWT Lecture Session 7 - Advanced uses of RDFS
SWT Lecture Session 7 - Advanced uses of RDFSSWT Lecture Session 7 - Advanced uses of RDFS
SWT Lecture Session 7 - Advanced uses of RDFS
 
SWT Lecture Session 5 - RDFS
SWT Lecture Session 5 - RDFSSWT Lecture Session 5 - RDFS
SWT Lecture Session 5 - RDFS
 
SWT Lecture Session 3 - SPARQL
SWT Lecture Session 3 - SPARQLSWT Lecture Session 3 - SPARQL
SWT Lecture Session 3 - SPARQL
 
5 rdfs
5 rdfs5 rdfs
5 rdfs
 
SWT Lecture Session 1 - Introduction
SWT Lecture Session 1 - IntroductionSWT Lecture Session 1 - Introduction
SWT Lecture Session 1 - Introduction
 
ontop: A tutorial
ontop: A tutorialontop: A tutorial
ontop: A tutorial
 
OXFORD'13 Optimising OWL 2 QL query rewriring
OXFORD'13 Optimising OWL 2 QL query rewriringOXFORD'13 Optimising OWL 2 QL query rewriring
OXFORD'13 Optimising OWL 2 QL query rewriring
 
IMAS'08 obda plugin
IMAS'08 obda pluginIMAS'08 obda plugin
IMAS'08 obda plugin
 

Recently uploaded

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 

Recently uploaded (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

KR12 Semantic Index and TBox optimisation with respect to dependencies

  • 1. Mariano Rodríguez-Muro and Diego Calvanese KRDB Research Group Free University of Bozen-Bolzano KR’12 Realizing OBDA with data dependencies Thursday, June 14, 2012
  • 2. OBDA and -Quest- DL-Lite (OWL 2 QL) - promise • Light-weight DL • Allows for QA by query rewriting into FO-queries (i.e,. SQL) • Mapping techniques that allow for OBDA (sound, GAV mappings) • Fast query rewriting techniques • Implementations available 2 Thursday, June 14, 2012
  • 3. OBDA and -Quest- • UCQ = 729 CQs, UNION of 729 SPJ SQL queries • Datalog = 9 rules, 1 SPJ of UNIONS (3 nested UNIONS of 9 elements each) An trivial example 3 A B E D F C G H I q(x) ← A(x), R(x, y), A(y), R(y, z), A(z) Thursday, June 14, 2012
  • 4. OBDA and -Quest- • Focused on “size” of the rewriting and dealing with ended up playing an “encoding” game... • The role of SQL engines, their features and limitations has been neglected DL-Lite (OWL 2 QL) - reality • Query answering by query rewriting still slow, but not due to the rewriting technique • RDBMS cannot handle our queries efficiently 4 A ∃R Thursday, June 14, 2012
  • 5. OBDA and -Quest- 5 “Efficient query answering in OBDA or QAO requires exploiting and optimizing every element/resource in the query answering system, not just query rewriting algorithms” Thursday, June 14, 2012
  • 6. OBDA and -Quest- ABOX DEPENDENCIES Describing out data sources 6 Thursday, June 14, 2012
  • 7. OBDA and -Quest- QAO revisited Reasoner Application TBox ABox Inputs Thursday, June 14, 2012
  • 8. OBDA and -Quest- QAO revisited Reasoner Application TBox ABox Inputs Source Thursday, June 14, 2012
  • 9. OBDA and -Quest- QAO revisited Reasoner Application TBox ABox Inputs SourceOBDA Model Thursday, June 14, 2012
  • 11. CardiacArrest Condition Clog Condition Patient ∃name Patient ∃age Patient ∃ssn Patient ∃affectedBy OBDA and -Quest- An ontology 9 ∃name Patient ∃age Patient ∃ssn Patient ∃affectedBy Patient ∃affectedBy− Condition Thursday, June 14, 2012
  • 12. OBDA and -Quest- An example id [PKEY] name age ssn 12 John 37 xxx-999 Table: patient patient_id [FKEY] c_code 12 33 Table: conditionOBDA Model Thursday, June 14, 2012
  • 13. OBDA and -Quest- An example SELECT id,name,age,ssn FROM patient :person/{$id} a Patient; name $name; age $age^^xsd:int; ssn $ssn id [PKEY] name age ssn 12 John 37 xxx-999 Table: patient patient_id [FKEY] c_code 12 33 Table: conditionOBDA Model Thursday, June 14, 2012
  • 14. OBDA and -Quest- An example SELECT id,name,age,ssn FROM patient :person/{$id} a Patient; name $name; age $age^^xsd:int; ssn $ssn id [PKEY] name age ssn 12 John 37 xxx-999 Table: patient patient_id [FKEY] c_code 12 33 Table: condition SELECT id, c_id FROM condition :person/{$id} affectedBy :cond/{$id}. :cond/{$id} a Condition OBDA Model Thursday, June 14, 2012
  • 15. OBDA and -Quest- An example SELECT id,name,age,ssn FROM patient :person/{$id} a Patient; name $name; age $age^^xsd:int; ssn $ssn id [PKEY] name age ssn 12 John 37 xxx-999 Table: patient patient_id [FKEY] c_code 12 33 Table: condition SELECT id, c_id FROM condition :person/{$id} affectedBy :cond/{$id}. :cond/{$id} a Condition SELECT id FROM condition WHERE c_code = 33 :cond/{$id} a CardiacArrest SELECT id,c_id FROM condition WHERE c_code = 27 :cond/{$id} a Clog OBDA Model Thursday, June 14, 2012
  • 16. R := P | P− ∀x.B1(x) ∈ A → B2(x) ∈ A ∀x, y.R1(x, y) ∈ A → R2(x, y) ∈ A B1 A B2 R1 A R2 OBDA and -Quest- ABox dependencies for DL-Lite 11 Concept descriptions Role descriptions ABox constraints • Semantics: B := A | ∃R • In addition to TBox and ABox (or OBDA model), we introduce a set of ABox dependencies Sigma Σ Thursday, June 14, 2012
  • 17. CardiacArrest A Condition Clog A Condition Patient A ∃name Patient A ∃age Patient A ∃ssn Patient A ∃affectedBy ∃name A Patient ∃age A Patient ∃ssn A Patient ∃affectedBy A Patient ∃affectedBy− A Condition OBDA and -Quest- ABox constraints,example 12 Thursday, June 14, 2012
  • 18. OBDA and -Quest- ABOX CONSTRAINTS 13 Optimizing your TBox Thursday, June 14, 2012
  • 19. Given a TBox T and a set of ABox dependencies Σ we want to compute T s.t. each α ∈ T is not redundant w.r.t. Σ OBDA and -Quest- Redundancy inT w.r.t.Sigma 14 Thursday, June 14, 2012
  • 20. Given a TBox T and a set of ABox dependencies Σ we want to compute T s.t. each α ∈ T is not redundant w.r.t. Σ OBDA and -Quest- Redundancy inT w.r.t.Sigma 14 A T -chain from B to C is a sequence of inclusion assertions (Bi B i)n i=0 for some n ≥ 0, such that: 1. B0 = B, B n = C, and 2. for 1 ≤ i ≤ n, we have that B i−1 and Bi are basic concepts s.t., either (i) B i−1 = Bi, or (ii) B i−1 = ∃R and Bi = ∃R− , for some basic role R . Respectively for role chains. Chains in T and Sigma Thursday, June 14, 2012
  • 21. OBDA and -Quest- Redundancy w.r.t.Sigma 15 Thursday, June 14, 2012
  • 22. B C is directly redundant in T w.r.t. Σ if (i) Σ |= B A C and (ii) for every T -chain (Bi B i)n i=0 with B n = B in T , there is a Σ-chain (Bi A B i)n i=0. Similarly for roles. OBDA and -Quest- Redundancy w.r.t.Sigma 16 Direct redundancy T-chain Sigma-chain Thursday, June 14, 2012
  • 23. B C is redundant in T w.r.t. Σ if (a) it is directly redundant, or (b) there exists B = B s.t. (i) T |= B C, (ii) B C is not directly redundant in T w.r.t. Σ, and (iii) B B is directly redundant. OBDA and -Quest- Redundancy w.r.t.Sigma 17 Redundancy T-chain Sigma-chain Thursday, June 14, 2012
  • 24. OBDA and -Quest- Redundancy w.r.t.Sigma 18 • We can compute T’ by reducing redundancy checks to reachability in DAGs • Computable in polynomial time • Optimal (no redundant axioms in T’) T is the set of inclusion assertions {α ∈ sat(T ) | α is not redundant in sat(T ) w.r.t. sat(Σ)}. Thursday, June 14, 2012
  • 25. OBDA and -Quest- ABOX CONSTRAINTS 19 Enforcing dependencies Thursday, June 14, 2012
  • 26. OBDA and -Quest- OBDA with OWL 2 QL OBDA Model Manipulate mappings Manipulate the data Inference shifting. Separate ground reasoning from existential reasoning (efficiently) Thursday, June 14, 2012
  • 27. OBDA and -Quest- OBDA with OWL 2 QL Reasoner Application TBox ABox Inputs Thursday, June 14, 2012
  • 28. OBDA and -Quest- OBDA with OWL 2 QL Reasoner Application TBox ABox Inputs Thursday, June 14, 2012
  • 29. OBDA and -Quest- OBDA with OWL 2 QL Reasoner Application TBox ABox Inputs OBDA Model Thursday, June 14, 2012
  • 30. OBDA and -Quest- Creating semanticABox storage 24 ABox 1. Encode the hierarchies of T in indexes and intervals Thursday, June 14, 2012
  • 31. OBDA and -Quest- Creating semanticABox storage 25 ABox 1. Encode the hierarchies of T in indexes and intervals A B E D F C G H I Thursday, June 14, 2012
  • 32. OBDA and -Quest- Creating semanticABox storage 26 ABox 1. Encode the hierarchies of T in indexes and intervals A B E D F C G H I 1 2 3 4 5 6 7 8 9 Thursday, June 14, 2012
  • 33. OBDA and -Quest- Creating semanticABox storage 26 ABox 1. Encode the hierarchies of T in indexes and intervals A B E D F C G H I 1 2 3 4 5 6 7 8 9 Insert into your DB using those indexes C IDX d 4 h 8 A = D(d), H(h) Thursday, June 14, 2012
  • 34. A B E D F C G H I 1 [1,8] 2 [2,5] 3 [3,3] 4 [4,4] 5 [5,5] 6 [6,9] 7 [7,7] 8 [8,8] 9 [9,9] OBDA and -Quest- Creating semanticABox storage 27 ABox 3. Define intervals to retrieve your data Thursday, June 14, 2012
  • 35. A B E D F C G H I 1 [1,8] 2 [2,5] 3 [3,3] 4 [4,4] 5 [5,5] 6 [6,9] 7 [7,7] 8 [8,8] 9 [9,9] OBDA and -Quest- Creating semanticABox storage 27 ABox 3. Define intervals to retrieve your data ?x a A SELECT c FROM t WHERE IDX = 1 AND IDX = 8 OBDA Model 4. Create mappings using those intervals Thursday, June 14, 2012
  • 36. A B E D F C G H I 1 [1,8] 2 [2,5] 3 [3,3] 4 [4,4] 5 [5,5] 6 [6,9] 7 [7,7] 8 [8,8] 9 [9,9] OBDA and -Quest- Creating semanticABox storage 27 ABox 3. Define intervals to retrieve your data ?x a A SELECT c FROM t WHERE IDX = 1 AND IDX = 8 OBDA Model 4. Create mappings using those intervals 5. Complement with mappings to cover domain, range and inverse inferences Thursday, June 14, 2012
  • 37. Experiments 28 • Experimentation using Stanford’s “Resource Index” • Semantic search over annotated documents • 200 ontologies from Bio-portal (only hierarchies 200k concepts, millions of subClassOf) Thursday, June 14, 2012
  • 38. Experiments 29 • Current system uses chase • Naive chase: 7 days • Optimized chase: 40 mins • Cost 16 GB + 70 GB chase data • Split second responses • Pure query rewriting approaches • not-feasible (UCQs or Datalog) • Semantic index-based rewriting • DAG computation and indexing 5 mins • Cost: 16 GB • Single queries, split second responses Thursday, June 14, 2012
  • 40. OBDA and -Quest- Conclusions • Introduced ABox dependencies to describe the structure of data • Showed how to optimize TBoxes w.r.t. dependencies, • Introduced the idea of “shifting” inferences from TBox reasoning to Mapping reasoning. • Introduced “Semantic Index” repositories • Not mentioned - “Equivalence optimization” 31 Thursday, June 14, 2012
  • 41. OBDA and -Quest- Conclusions • Inference shifting is also possible in strict OBDA (see T-mappings [AMW’09]) • All (including T-mappings) is implemented in Quest and -ontop-, now available as a P4 plugin. • Combined with a fast rewriting technique (see [Kontchakov et. al 12] and [Rosati, 12]) T-mappings and Semantic indexes allow for OBDA and QAO in practice • Practical and Theoretical evidence that we have covered relevant cases • Time to put things to practice! (and look at the future, hybrid approaches, EL, Datalog+-, OWL 2 RL) 32 Thursday, June 14, 2012