SlideShare a Scribd company logo
Completeness Statements about RDF Data Sources and
Their Use for Query Answering
Fariz Darari
Werner Nutt
Giuseppe Pirrò
Simon Razniewski
Oct 23rd, 2013
Supported by the project MAGIC: Managing Completeness of Data,
funded by the province of Bozen- Bolzano
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 1 / 31
Motivation
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 2 / 31
Motivation
Completeness
statement about the
IMDB data source
Quentin Tarantino
was the character
Mr. Brown
……………
……………
……………
http://www.imdb.com/title/tt0105236/fullcredits?ref_=tt_ov_st_sm#cast
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 3 / 31
Motivation
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 4 / 31
Motivation
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 5 / 31
Motivation
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 6 / 31
Incomplete Data Source
Quentin Tarantino is missing..
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 7 / 31
Incomplete Data Source
An incomplete data source of Tarantino movies,
Gdbp = (Ga
dbp, Gi
dbp):
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 8 / 31
Incomplete Data Source
An incomplete data source of Tarantino movies,
Gdbp = (Ga
dbp,Gi
dbp):
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 9 / 31
Incomplete Data Source
An incomplete data source of Tarantino movies,
Gdbp = (Ga
dbp, Gi
dbp):
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 10 / 31
Incomplete Data Source
Incomplete Data Source
An incomplete data source is a pair of two graphs,
G = (Ga
, Gi
), where Ga
⊆ Gi
.
We call Ga
the available graph and Gi
the ideal graph.
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 11 / 31
Incomplete Data Source
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 12 / 31
Completeness Statements: Examples
To express that a source is complete
for all the triples about movies directed by Tarantino,
we use the statement
Cdir = Compl((?m, type, Movie), (?m, director, tarantino) | ∅),
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 13 / 31
Completeness Statements: Examples
To express that a source is complete
for all the triples about movies directed by Tarantino,
we use the statement
Cdir = Compl((?m, type, Movie), (?m, director, tarantino) | ∅),
To express that a source is complete
for all triples about actors in movies directed by Tarantino,
we use
Cact =
Compl((?m, actor, ?a) | (?m, type, Movie), (?m, director, tara))
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 13 / 31
Completeness Statement: Definition
Let P1 be a non-empty BGP (Basic Graph Pattern) and
P2 a BGP.
A completeness statement is defined as
Compl(P1 | P2)
where we call P1 the pattern and P2 the condition
of the statement.
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 14 / 31
Satisfaction of Completeness Statements
To the statement
C = Compl(P1 | P2),
we associate the CONSTRUCT query
QC = (P1, P1 ∪ P2).
Then, we say:
C is satisfied by an IDS G = (Ga
, Gi
), written G |= C, if
QC Gi ⊆ Ga
.
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 15 / 31
Completeness Statements
Cdir = Compl((?m, type, Movie), (?m, director, tarantino) | ∅)
Question: Gdbp |= Cdir ?
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 16 / 31
Completeness Statements
Cdir = Compl((?m, type, Movie), (?m, director, tarantino) | ∅)
Question: Gdbp |= Cdir ?
Yes, because QCdir Gi
dbp
⊆ Ga
dbp.
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 16 / 31
Completeness Statements
Cact = Compl((?m, actor, ?a) | (?m, type, Movie), (?m, director, tara)
Question: Gdbp |= Cact ?
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 17 / 31
Completeness Statements
Cact = Compl((?m, actor, ?a) | (?m, type, Movie), (?m, director, tara)
Question: Gdbp |= Cact ?
No, because among the results of QCact Gi
dbp
,
there is (reservoirDogs, actor, tarantino) not in Ga
dbp.
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 17 / 31
Completeness Statements in RDF
Cact = Compl((?m, actor, ?a) | (?m, type, Movie), (?m, director, tara)
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 18 / 31
Query Completeness: Example
Qdir = ({?m}, {(?m, type, Movie), (?m, director, tarantino)})
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 19 / 31
Query Completeness: Example
Qdir = ({?m}, {(?m, type, Movie), (?m, director, tarantino)})
Then, Qdir Gi
dbp
= Qdir Ga
dbp
, and therefore Gdbp |= Compl(Qdir ).
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 19 / 31
Query Completeness: Definition
Definition
Let Q be a SELECT query. We write
Compl(Q)
to say that Q is complete.
An incomplete data source G = (Ga
, Gi
) satisfies Compl(Q),
written
G |= Compl(Q) if and only if Q Gi = Q Ga .
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 20 / 31
Completeness Entailment
Let C be a set of completeness statements and
Q a SELECT query.
We say that C entails the completeness of Q, written
C |= Compl(Q),
if any incomplete data source satisfying C
also satisfies Compl(Q).
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 21 / 31
Completeness Reasoning
Transfer Operator
For any set C of completeness statements and a graph G,
we define the transfer operator TC:
TC(G) =
C∈ C
QC G
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 22 / 31
Completeness Reasoning
Transfer Operator
For any set C of completeness statements and a graph G,
we define the transfer operator TC:
TC(G) =
C∈ C
QC G
Prototypical Graph
Let Q = (W, P) be a SELECT query.
The prototypical graph ˜P is the graph resulting
from the mapping of variables in P to fresh, unique IRIs.
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 22 / 31
Completeness of Basic Queries
Theorem
Let C be a set of completeness statements and
Q = (W, P) a basic query. Then,
C |= Compl(Q) if and only if ˜P = TC(˜P).
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 23 / 31
Completeness Reasoning: Example
Consider the set of completeness statements
Cdir,act = { Cdir , Cact }
and the query
Qdir+act = ({ ?m }, Pdir+act )
where
Pdir+act =
{ (?m, type, Movie), (?m, director, tarantino), (?m, actor, tarantino) }.
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 24 / 31
Completeness Reasoning: Example
Consider the set of completeness statements
Cdir,act = { Cdir , Cact }
and the query
Qdir+act = ({ ?m }, Pdir+act )
Then,
˜Pdir+act =
{ (c?m, type, Movie), (c?m, director, tarantino), (c?m, actor, tarantino) }
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 25 / 31
Completeness Reasoning: Example
Cdir,act = { Cdir , Cact }
Qdir+act = ({ ?m }, Pdir+act )
Therefore,
TCdir,act
(˜Pdir+act )
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 26 / 31
Completeness Reasoning: Example
Cdir,act = { Cdir , Cact }
Qdir+act = ({ ?m }, Pdir+act )
Therefore,
TCdir,act
(˜Pdir+act )
= QCdir ˜Pdir+act
∪ QCact ˜Pdir+act
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 26 / 31
Completeness Reasoning: Example
Cdir,act = { Cdir , Cact }
Qdir+act = ({ ?m }, Pdir+act )
Therefore,
TCdir,act
(˜Pdir+act )
= QCdir ˜Pdir+act
∪ QCact ˜Pdir+act
= { (c?m, type, Movie), (c?m, director, tara),
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 26 / 31
Completeness Reasoning: Example
Cdir,act = { Cdir , Cact }
Qdir+act = ({ ?m }, Pdir+act )
Therefore,
TCdir,act
(˜Pdir+act )
= QCdir ˜Pdir+act
∪ QCact ˜Pdir+act
= { (c?m, type, Movie), (c?m, director, tara), (c?m, actor, tara) }
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 26 / 31
Completeness Reasoning: Example
Cdir,act = { Cdir , Cact }
Qdir+act = ({ ?m }, Pdir+act )
Therefore,
TCdir,act
(˜Pdir+act )
= QCdir ˜Pdir+act
∪ QCact ˜Pdir+act
= { (c?m, type, Movie), (c?m, director, tara), (c?m, actor, tara) }
= ˜Pdir+act .
Thus, Cdir,act |= Compl(Qdir+act )
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 26 / 31
The framework can also be applied to:
DISTINCT Queries
OPT Queries
Queries with RDFS Data Sources
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 27 / 31
Federated Framework
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 28 / 31
CoRner: Completeness Reasoner
Can check the completeness of a subset of SPARQL
queries depending on the completeness statements of
a single data source.
Developed in Java using the Apache Jena library.
Takes three inputs:
Completeness statements in RDF format
A SPARQL query
(optional) an RDFS ontology
Available at http://rdfcorner.wordpress.com/.
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 29 / 31
Conclusions and Future Work
We developed a theoretical framework for the expression of
completeness statements about data sources, from which
we can ensure query completeness.
We provide a compact RDF representation of completeness
statements, which can be embedded in VoID descriptions,
and implemented the framework in CoRner.
We are interested in studying completeness reasoning with
more expressive queries and OWL 2.
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 30 / 31
Terima kasih, grazie, danke!!
Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 31 / 31

More Related Content

More from Fariz Darari

Data X Museum - Hari Museum Internasional 2022 - WMID
Data X Museum - Hari Museum Internasional 2022 - WMIDData X Museum - Hari Museum Internasional 2022 - WMID
Data X Museum - Hari Museum Internasional 2022 - WMID
Fariz Darari
 
[PUBLIC] quiz-01-midterm-solutions.pdf
[PUBLIC] quiz-01-midterm-solutions.pdf[PUBLIC] quiz-01-midterm-solutions.pdf
[PUBLIC] quiz-01-midterm-solutions.pdf
Fariz Darari
 
Free AI Kit - Game Theory
Free AI Kit - Game TheoryFree AI Kit - Game Theory
Free AI Kit - Game Theory
Fariz Darari
 
Neural Networks and Deep Learning: An Intro
Neural Networks and Deep Learning: An IntroNeural Networks and Deep Learning: An Intro
Neural Networks and Deep Learning: An Intro
Fariz Darari
 
NLP guest lecture: How to get text to confess what knowledge it has
NLP guest lecture: How to get text to confess what knowledge it hasNLP guest lecture: How to get text to confess what knowledge it has
NLP guest lecture: How to get text to confess what knowledge it has
Fariz Darari
 
Supply and Demand - AI Talents
Supply and Demand - AI TalentsSupply and Demand - AI Talents
Supply and Demand - AI Talents
Fariz Darari
 
Basic Python Programming: Part 01 and Part 02
Basic Python Programming: Part 01 and Part 02Basic Python Programming: Part 01 and Part 02
Basic Python Programming: Part 01 and Part 02
Fariz Darari
 
AI in education done properly
AI in education done properlyAI in education done properly
AI in education done properly
Fariz Darari
 
Artificial Neural Networks: Pointers
Artificial Neural Networks: PointersArtificial Neural Networks: Pointers
Artificial Neural Networks: Pointers
Fariz Darari
 
Open Tridharma at ICACSIS 2019
Open Tridharma at ICACSIS 2019Open Tridharma at ICACSIS 2019
Open Tridharma at ICACSIS 2019
Fariz Darari
 
Defense Slides of Avicenna Wisesa - PROWD
Defense Slides of Avicenna Wisesa - PROWDDefense Slides of Avicenna Wisesa - PROWD
Defense Slides of Avicenna Wisesa - PROWD
Fariz Darari
 
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz DarariSeminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Fariz Darari
 
Foundations of Programming - Java OOP
Foundations of Programming - Java OOPFoundations of Programming - Java OOP
Foundations of Programming - Java OOP
Fariz Darari
 
Recursion in Python
Recursion in PythonRecursion in Python
Recursion in Python
Fariz Darari
 
Testing in Python: doctest and unittest (Updated)
Testing in Python: doctest and unittest (Updated)Testing in Python: doctest and unittest (Updated)
Testing in Python: doctest and unittest (Updated)
Fariz Darari
 
Testing in Python: doctest and unittest
Testing in Python: doctest and unittestTesting in Python: doctest and unittest
Testing in Python: doctest and unittest
Fariz Darari
 
Dissertation Defense - Managing and Consuming Completeness Information for RD...
Dissertation Defense - Managing and Consuming Completeness Information for RD...Dissertation Defense - Managing and Consuming Completeness Information for RD...
Dissertation Defense - Managing and Consuming Completeness Information for RD...
Fariz Darari
 
Research Writing - 2018.07.18
Research Writing - 2018.07.18Research Writing - 2018.07.18
Research Writing - 2018.07.18
Fariz Darari
 
KOI - Knowledge Of Incidents - SemEval 2018
KOI - Knowledge Of Incidents - SemEval 2018KOI - Knowledge Of Incidents - SemEval 2018
KOI - Knowledge Of Incidents - SemEval 2018
Fariz Darari
 
Comparing Index Structures for Completeness Reasoning
Comparing Index Structures for Completeness ReasoningComparing Index Structures for Completeness Reasoning
Comparing Index Structures for Completeness Reasoning
Fariz Darari
 

More from Fariz Darari (20)

Data X Museum - Hari Museum Internasional 2022 - WMID
Data X Museum - Hari Museum Internasional 2022 - WMIDData X Museum - Hari Museum Internasional 2022 - WMID
Data X Museum - Hari Museum Internasional 2022 - WMID
 
[PUBLIC] quiz-01-midterm-solutions.pdf
[PUBLIC] quiz-01-midterm-solutions.pdf[PUBLIC] quiz-01-midterm-solutions.pdf
[PUBLIC] quiz-01-midterm-solutions.pdf
 
Free AI Kit - Game Theory
Free AI Kit - Game TheoryFree AI Kit - Game Theory
Free AI Kit - Game Theory
 
Neural Networks and Deep Learning: An Intro
Neural Networks and Deep Learning: An IntroNeural Networks and Deep Learning: An Intro
Neural Networks and Deep Learning: An Intro
 
NLP guest lecture: How to get text to confess what knowledge it has
NLP guest lecture: How to get text to confess what knowledge it hasNLP guest lecture: How to get text to confess what knowledge it has
NLP guest lecture: How to get text to confess what knowledge it has
 
Supply and Demand - AI Talents
Supply and Demand - AI TalentsSupply and Demand - AI Talents
Supply and Demand - AI Talents
 
Basic Python Programming: Part 01 and Part 02
Basic Python Programming: Part 01 and Part 02Basic Python Programming: Part 01 and Part 02
Basic Python Programming: Part 01 and Part 02
 
AI in education done properly
AI in education done properlyAI in education done properly
AI in education done properly
 
Artificial Neural Networks: Pointers
Artificial Neural Networks: PointersArtificial Neural Networks: Pointers
Artificial Neural Networks: Pointers
 
Open Tridharma at ICACSIS 2019
Open Tridharma at ICACSIS 2019Open Tridharma at ICACSIS 2019
Open Tridharma at ICACSIS 2019
 
Defense Slides of Avicenna Wisesa - PROWD
Defense Slides of Avicenna Wisesa - PROWDDefense Slides of Avicenna Wisesa - PROWD
Defense Slides of Avicenna Wisesa - PROWD
 
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz DarariSeminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
 
Foundations of Programming - Java OOP
Foundations of Programming - Java OOPFoundations of Programming - Java OOP
Foundations of Programming - Java OOP
 
Recursion in Python
Recursion in PythonRecursion in Python
Recursion in Python
 
Testing in Python: doctest and unittest (Updated)
Testing in Python: doctest and unittest (Updated)Testing in Python: doctest and unittest (Updated)
Testing in Python: doctest and unittest (Updated)
 
Testing in Python: doctest and unittest
Testing in Python: doctest and unittestTesting in Python: doctest and unittest
Testing in Python: doctest and unittest
 
Dissertation Defense - Managing and Consuming Completeness Information for RD...
Dissertation Defense - Managing and Consuming Completeness Information for RD...Dissertation Defense - Managing and Consuming Completeness Information for RD...
Dissertation Defense - Managing and Consuming Completeness Information for RD...
 
Research Writing - 2018.07.18
Research Writing - 2018.07.18Research Writing - 2018.07.18
Research Writing - 2018.07.18
 
KOI - Knowledge Of Incidents - SemEval 2018
KOI - Knowledge Of Incidents - SemEval 2018KOI - Knowledge Of Incidents - SemEval 2018
KOI - Knowledge Of Incidents - SemEval 2018
 
Comparing Index Structures for Completeness Reasoning
Comparing Index Structures for Completeness ReasoningComparing Index Structures for Completeness Reasoning
Comparing Index Structures for Completeness Reasoning
 

Recently uploaded

OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
CAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on BlockchainCAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on Blockchain
Claudio Di Ciccio
 

Recently uploaded (20)

OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
CAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on BlockchainCAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on Blockchain
 

[ISWC 2013] Completeness statements about RDF data sources and their use for query answering

  • 1. Completeness Statements about RDF Data Sources and Their Use for Query Answering Fariz Darari Werner Nutt Giuseppe Pirrò Simon Razniewski Oct 23rd, 2013 Supported by the project MAGIC: Managing Completeness of Data, funded by the province of Bozen- Bolzano Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 1 / 31
  • 2. Motivation Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 2 / 31
  • 3. Motivation Completeness statement about the IMDB data source Quentin Tarantino was the character Mr. Brown …………… …………… …………… http://www.imdb.com/title/tt0105236/fullcredits?ref_=tt_ov_st_sm#cast Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 3 / 31
  • 4. Motivation Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 4 / 31
  • 5. Motivation Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 5 / 31
  • 6. Motivation Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 6 / 31
  • 7. Incomplete Data Source Quentin Tarantino is missing.. Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 7 / 31
  • 8. Incomplete Data Source An incomplete data source of Tarantino movies, Gdbp = (Ga dbp, Gi dbp): Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 8 / 31
  • 9. Incomplete Data Source An incomplete data source of Tarantino movies, Gdbp = (Ga dbp,Gi dbp): Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 9 / 31
  • 10. Incomplete Data Source An incomplete data source of Tarantino movies, Gdbp = (Ga dbp, Gi dbp): Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 10 / 31
  • 11. Incomplete Data Source Incomplete Data Source An incomplete data source is a pair of two graphs, G = (Ga , Gi ), where Ga ⊆ Gi . We call Ga the available graph and Gi the ideal graph. Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 11 / 31
  • 12. Incomplete Data Source Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 12 / 31
  • 13. Completeness Statements: Examples To express that a source is complete for all the triples about movies directed by Tarantino, we use the statement Cdir = Compl((?m, type, Movie), (?m, director, tarantino) | ∅), Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 13 / 31
  • 14. Completeness Statements: Examples To express that a source is complete for all the triples about movies directed by Tarantino, we use the statement Cdir = Compl((?m, type, Movie), (?m, director, tarantino) | ∅), To express that a source is complete for all triples about actors in movies directed by Tarantino, we use Cact = Compl((?m, actor, ?a) | (?m, type, Movie), (?m, director, tara)) Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 13 / 31
  • 15. Completeness Statement: Definition Let P1 be a non-empty BGP (Basic Graph Pattern) and P2 a BGP. A completeness statement is defined as Compl(P1 | P2) where we call P1 the pattern and P2 the condition of the statement. Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 14 / 31
  • 16. Satisfaction of Completeness Statements To the statement C = Compl(P1 | P2), we associate the CONSTRUCT query QC = (P1, P1 ∪ P2). Then, we say: C is satisfied by an IDS G = (Ga , Gi ), written G |= C, if QC Gi ⊆ Ga . Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 15 / 31
  • 17. Completeness Statements Cdir = Compl((?m, type, Movie), (?m, director, tarantino) | ∅) Question: Gdbp |= Cdir ? Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 16 / 31
  • 18. Completeness Statements Cdir = Compl((?m, type, Movie), (?m, director, tarantino) | ∅) Question: Gdbp |= Cdir ? Yes, because QCdir Gi dbp ⊆ Ga dbp. Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 16 / 31
  • 19. Completeness Statements Cact = Compl((?m, actor, ?a) | (?m, type, Movie), (?m, director, tara) Question: Gdbp |= Cact ? Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 17 / 31
  • 20. Completeness Statements Cact = Compl((?m, actor, ?a) | (?m, type, Movie), (?m, director, tara) Question: Gdbp |= Cact ? No, because among the results of QCact Gi dbp , there is (reservoirDogs, actor, tarantino) not in Ga dbp. Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 17 / 31
  • 21. Completeness Statements in RDF Cact = Compl((?m, actor, ?a) | (?m, type, Movie), (?m, director, tara) Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 18 / 31
  • 22. Query Completeness: Example Qdir = ({?m}, {(?m, type, Movie), (?m, director, tarantino)}) Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 19 / 31
  • 23. Query Completeness: Example Qdir = ({?m}, {(?m, type, Movie), (?m, director, tarantino)}) Then, Qdir Gi dbp = Qdir Ga dbp , and therefore Gdbp |= Compl(Qdir ). Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 19 / 31
  • 24. Query Completeness: Definition Definition Let Q be a SELECT query. We write Compl(Q) to say that Q is complete. An incomplete data source G = (Ga , Gi ) satisfies Compl(Q), written G |= Compl(Q) if and only if Q Gi = Q Ga . Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 20 / 31
  • 25. Completeness Entailment Let C be a set of completeness statements and Q a SELECT query. We say that C entails the completeness of Q, written C |= Compl(Q), if any incomplete data source satisfying C also satisfies Compl(Q). Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 21 / 31
  • 26. Completeness Reasoning Transfer Operator For any set C of completeness statements and a graph G, we define the transfer operator TC: TC(G) = C∈ C QC G Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 22 / 31
  • 27. Completeness Reasoning Transfer Operator For any set C of completeness statements and a graph G, we define the transfer operator TC: TC(G) = C∈ C QC G Prototypical Graph Let Q = (W, P) be a SELECT query. The prototypical graph ˜P is the graph resulting from the mapping of variables in P to fresh, unique IRIs. Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 22 / 31
  • 28. Completeness of Basic Queries Theorem Let C be a set of completeness statements and Q = (W, P) a basic query. Then, C |= Compl(Q) if and only if ˜P = TC(˜P). Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 23 / 31
  • 29. Completeness Reasoning: Example Consider the set of completeness statements Cdir,act = { Cdir , Cact } and the query Qdir+act = ({ ?m }, Pdir+act ) where Pdir+act = { (?m, type, Movie), (?m, director, tarantino), (?m, actor, tarantino) }. Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 24 / 31
  • 30. Completeness Reasoning: Example Consider the set of completeness statements Cdir,act = { Cdir , Cact } and the query Qdir+act = ({ ?m }, Pdir+act ) Then, ˜Pdir+act = { (c?m, type, Movie), (c?m, director, tarantino), (c?m, actor, tarantino) } Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 25 / 31
  • 31. Completeness Reasoning: Example Cdir,act = { Cdir , Cact } Qdir+act = ({ ?m }, Pdir+act ) Therefore, TCdir,act (˜Pdir+act ) Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 26 / 31
  • 32. Completeness Reasoning: Example Cdir,act = { Cdir , Cact } Qdir+act = ({ ?m }, Pdir+act ) Therefore, TCdir,act (˜Pdir+act ) = QCdir ˜Pdir+act ∪ QCact ˜Pdir+act Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 26 / 31
  • 33. Completeness Reasoning: Example Cdir,act = { Cdir , Cact } Qdir+act = ({ ?m }, Pdir+act ) Therefore, TCdir,act (˜Pdir+act ) = QCdir ˜Pdir+act ∪ QCact ˜Pdir+act = { (c?m, type, Movie), (c?m, director, tara), Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 26 / 31
  • 34. Completeness Reasoning: Example Cdir,act = { Cdir , Cact } Qdir+act = ({ ?m }, Pdir+act ) Therefore, TCdir,act (˜Pdir+act ) = QCdir ˜Pdir+act ∪ QCact ˜Pdir+act = { (c?m, type, Movie), (c?m, director, tara), (c?m, actor, tara) } Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 26 / 31
  • 35. Completeness Reasoning: Example Cdir,act = { Cdir , Cact } Qdir+act = ({ ?m }, Pdir+act ) Therefore, TCdir,act (˜Pdir+act ) = QCdir ˜Pdir+act ∪ QCact ˜Pdir+act = { (c?m, type, Movie), (c?m, director, tara), (c?m, actor, tara) } = ˜Pdir+act . Thus, Cdir,act |= Compl(Qdir+act ) Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 26 / 31
  • 36. The framework can also be applied to: DISTINCT Queries OPT Queries Queries with RDFS Data Sources Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 27 / 31
  • 37. Federated Framework Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 28 / 31
  • 38. CoRner: Completeness Reasoner Can check the completeness of a subset of SPARQL queries depending on the completeness statements of a single data source. Developed in Java using the Apache Jena library. Takes three inputs: Completeness statements in RDF format A SPARQL query (optional) an RDFS ontology Available at http://rdfcorner.wordpress.com/. Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 29 / 31
  • 39. Conclusions and Future Work We developed a theoretical framework for the expression of completeness statements about data sources, from which we can ensure query completeness. We provide a compact RDF representation of completeness statements, which can be embedded in VoID descriptions, and implemented the framework in CoRner. We are interested in studying completeness reasoning with more expressive queries and OWL 2. Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 30 / 31
  • 40. Terima kasih, grazie, danke!! Fariz Darari (ISWC 2013) Completeness Reasoning @ Linked Data 31 / 31