SlideShare a Scribd company logo
Data Exchange over RDF

       Andr´s Letelier
            e
   Advisor: Marcelo Arenas

Pontificia Universidad Cat´lica de Chile
                         o


       September 1, 2011
What is data exchange?




   Problem
   Data under one schema S needs to be restructured and translated
   into a target schema T


                              S −→ T
                              IS −→ IT
Schema mappings



  Question
  Which source instances corresponds to which target instances?

  Answer
  Schema mappings:

                 M ⊆ Instances(S) × Instances(T)

  Usually, schema mappings are defined as M = (S, T, ΣST )
Definition (Solution)
I2 is a solution of I1 under M iif (I1 , I2 ) ∈ M
The set of all solutions for I1 under M is denoted by SolM (I1 )
Resource Description Framework (RDF)



      Data model for representing information about World Wide
      Web resources
      W3C Recommendation (1998)
      Part of the semantic web stack
      Directed, labeled graphs
      Blank nodes (labeled nulls)
      Basically, sets of triples (s, p, o)
Example
 D=   {
          (B1   name    paul)
          (B1   email   paul@example.edu)
          (B2   name    john)
          (B2   city    Liverpool)
                                            }
SPARQL (pronounced “sparkle”)


      Query language for RDF
      W3C Recommendation(2008)
      Standard for querying RDF datasets
      Returns sets of partial mappings
      Operators:
          Projection
          AND (inner join)
          OPT (left join)
          FILTER
          UNION
          and more
Example

          P1 = (?X, name, ?Y )

                     ?X    ?Y
          P1   D   = B1   paul
                     B2   john
Example

          P2 = (?X, name, ?Y ) AND (?X, email, ?Z)

                        ?X   ?Y            ?Z
           P2   D   =
                        B1   paul   paul@example.edu
Example

          P3 = (?X, name, ?Y ) OPT (?X, email, ?Z)

                      ?X    ?Y           ?Z
           P3   D   = B1   paul   paul@example.edu
                      B2   john
Well-designed SPARQL patterns


   Definition (Well-designed patterns)
   A pattern P is well designed if for every subpattern P of the form
   P1 OPT P2 , every variable that appears in P2 and outside P also
   appears in P1 .

   Example
       (?X, name, ?Y ) OPT ((?X, email, ?Z) OPT (?X, city, ?A))
       is well-designed
       (?X, name, ?Y ) OPT ((?W, email, ?Z) OPT (?X, city, ?A))
       is not
Data Exchange over RDF




      S and T are fixed to be RDF triples
      Tuple generating dependencies have to be redefined
      But first, we need some definitions...
RDF Tuple Generating Dependencies



   Let P be a SPARQL pattern, µ1 and µ2 be partial mappings, and
   Ω1 and Ω2 be sets of mappings. Then:
       var(P ) are the variables mentioned in P
       dom(µ1 ) is the domain of µ1
       A SPARQL SELECT query (denoted by (W, P ), where
       W ⊆ var(P )) is the projection of the evaluation of P onto
       the variables in W
RDF Tuple Generating Dependencies



   Let P be a SPARQL pattern, µ1 and µ2 be partial mappings, and
   Ω1 and Ω2 be sets of mappings. Then:
       µ1 is subsumed by µ2 (µ1 µ2 ) if dom(µ1 ) ⊆ dom(µ2 ), for
       every ?X in dom(µ1 ) that is not bound to a blank node we
       have that µ1 (?X) = µ2 (?X) and for every pair of variables
       ?X and ?Y in dom(µ1 ) such that µ1 (?X) = µ1 (?Y ) it is the
       case that µ2 (?X) = µ2 (?Y ).
       Ω1 is subsumed by Ω2 (Ω1 Ω2 ) if for every mapping µ1 in
       Ω1 there exists a mapping µ2 in Ω2 such that µ1 µ2 .
RDF Tuple Generating Dependencies



   (Re)Definition (Tuple Generating Dependencies)
   Let P1 and P2 be SPARQL patterns, and W ⊂ var(P1 ) ∩ var(P2 ).
   An RDF tgd is a sentence of the form

                            (W, P1 ) → (W, P2 )

   Given two RDF graphs G1 and G2 , and a set of tgds Σ,
   (G1 , G2 ) |= Σ if for every tgd (W, P1 ) → (W, P2 ) in Σ it is the
   case that (W, P1 ) G1        (W, P2 ) G2
RDF Schema Mappings




  Since S and T are fixed,

                             M=Σ


                G2 ∈ SolM (G1 ) ←→ (G1 , G2 ) |= Σ
Universal solutions

   Example
   Let W = {?X}, Σ =
   {(W, (?X, name, ?Y ) AND (?X, email, ?Z)) →
   (W, (?Y, hasmail, ?Z))}
   and consider the dataset D:

   Solution 1
    G2 =     {
                 (paul   hasmail   paul@example.edu)
                                                       }

   Solution 2
    G2 =     {
                 (paul   hasmail   paul@example.edu)
                 (john   hasmail    n)
                                                       }
Universal solutions




   Definition
   A solution G2 is universal if for every other solution G2 , G2   G2

       Solution 1 is universal
       Solution 2 is not
Universal solutions




   Not all settings have universal solutions:
   Consider G1 = {(1, 2, 3)}, W = {?X, ?Y } and

             Σ = {(W, (?X, ?Y, ?Z)) →
                   (W, ((?X, a, b) OPT (?W, b, ?Y ))
                    AND ((?X, c, d) OPT (?Z, d, ?Y )))}
Solution 1
 G2 =    {
             (1     a   b)
             ( n1   b   2)
             (1     c   d)
                             }

Solution 2
 G2 =    {
             (1     a   b)
             ( n2   d   2)
             (1     c   d)
                             }
This setting has no universal solution!
Good and bad news



  Bad news
  There is no ensurance that an exchange setting that has a solution
  will have a universal solution

  Good news
  If the heads of all tgds in Σ are well-designed and there is a
  solution, there is always a universal solution

  Better news
  We have an algorithm
“Chasing” SPARQL queries

 input A mapping µ and a (well-designed) SPARQL pattern P
output An RDF graph G such that µ ∈ P     G

  Chase(µ, ν, P, G)
      t:
      add unbound variables in t as fresh blank nodes to ν
      add ν(t) to G
      P1 AND P2 :
      Chase(µ, ν, P1 , G)
      Chase(µ, ν, P2 , G)
      P1 OPT P2 :
      Chase(µ, ν, P1 , G)
      if dom(µ)  dom(ν) ∩ var(P2 ) = ∅: Chase(µ, ν, P2 , G)
After chasing:




       µ     ν
       ν∈ P      G
       {µ}       P   G
       If we chase with every P2 in Heads(Σ) the evaluations of
        (W, P1 ) G1 , we get a universal solution.
Certain answers



   Definition (Certain answers on a regular data exchange setting)
   The set of certain answers is the intersection of the evaluation of
   the query over all the valid solutions

   Example
   Consider G1 = {(1, 2, 3)} and

              {({?X},(?X, ?Y, ?Z)) →
                      ({?X}, (?X, 1, 2) OPT (?X, ?Y, 3))}
Solution 1
 G2 =   {
             (1   1   2)                  (W, P2 )     G2   = {{?X → 1}}
                           }

Solution 2
 G2 =   {
             (1   1   2)
                                      (W, P2 )   G2   = {{?X → 1, ?Y → 2}}
             (1   2   3)
                        }
  The intersection of (W, P2 )   G2   and (W, P2 )     G2   is empty!
Certain answers


   Given a pattern P and a set of RDF graphs G, let Lower(P, G) be
   the set of all lower bounds of G w.r.t. subsumption.
   (Re)Definition (Certain Answers)
   The set of certain answers of a set of RDF graphs and a SPARQL
   pattern P is defined as any mapping Ω in Lower(P, G), such that
   for any other Ω in Lower(P, G) it is the case that Ω Ω .

   Claim
   All the possible sets of certain answers to an RDF data exchange
   setting are homomorfically equivalent.
Back in our previous example...



 Solution 1
  G2 =   {                            (W, P2 )   G2   = {{?X → 1}}
              (1   1   2)
                            }

 Solution 2
  G2 =   {
              (1   1   2)
              (1   2   3)
                                    (W, P2 ) G2 = {{?X → 1, ?Y → 2}}
                          }
   The set of certain answers is now {{?X → 1}}
In conclusion...




   Our contributions so far:
       RDF and SPARQL TGDs
       RDF Schema mappings
       Universal solutions
       Materialization of universal solutions
       Certain answers
In conclusion...




   To do:
       Prove remaining claims
       Query answering (using universal solutions)
       Incomplete information in the source instance
       Knowledge exchange over RDFs
Thank you for listening




   Any questions?

More Related Content

What's hot

Lesson 22: Optimization Problems (slides)
Lesson 22: Optimization Problems (slides)Lesson 22: Optimization Problems (slides)
Lesson 22: Optimization Problems (slides)
Matthew Leingang
 
Dmss2011 public
Dmss2011 publicDmss2011 public
Dmss2011 public
Yasuo Tabei
 
Tele4653 l4
Tele4653 l4Tele4653 l4
Tele4653 l4Vin Voro
 
Ecfft zk studyclub 9.9
Ecfft zk studyclub 9.9Ecfft zk studyclub 9.9
Ecfft zk studyclub 9.9
Alex Pruden
 
Gwt sdm public
Gwt sdm publicGwt sdm public
Gwt sdm public
Yasuo Tabei
 
Total Dominating Color Transversal Number of Graphs And Graph Operations
Total Dominating Color Transversal Number of Graphs And Graph OperationsTotal Dominating Color Transversal Number of Graphs And Graph Operations
Total Dominating Color Transversal Number of Graphs And Graph Operations
inventionjournals
 
Classification of Groups and Homomorphism -By-Rajesh Bandari Yadav
Classification of Groups and Homomorphism -By-Rajesh Bandari YadavClassification of Groups and Homomorphism -By-Rajesh Bandari Yadav
Classification of Groups and Homomorphism -By-Rajesh Bandari Yadav
Rajesh Yadav
 
Gwt presen alsip-20111201
Gwt presen alsip-20111201Gwt presen alsip-20111201
Gwt presen alsip-20111201Yasuo Tabei
 
A STUDY ON L-FUZZY NORMAL SUBl -GROUP
A STUDY ON L-FUZZY NORMAL SUBl -GROUPA STUDY ON L-FUZZY NORMAL SUBl -GROUP
A STUDY ON L-FUZZY NORMAL SUBl -GROUP
ijejournal
 
Biconnectivity
BiconnectivityBiconnectivity
Biconnectivity
msramanujan
 
Some Concepts on Constant Interval Valued Intuitionistic Fuzzy Graphs
Some Concepts on Constant Interval Valued Intuitionistic Fuzzy GraphsSome Concepts on Constant Interval Valued Intuitionistic Fuzzy Graphs
Some Concepts on Constant Interval Valued Intuitionistic Fuzzy Graphs
iosrjce
 

What's hot (14)

Grup
GrupGrup
Grup
 
Lesson 22: Optimization Problems (slides)
Lesson 22: Optimization Problems (slides)Lesson 22: Optimization Problems (slides)
Lesson 22: Optimization Problems (slides)
 
1 cb02e45d01
1 cb02e45d011 cb02e45d01
1 cb02e45d01
 
Dmss2011 public
Dmss2011 publicDmss2011 public
Dmss2011 public
 
Tele4653 l4
Tele4653 l4Tele4653 l4
Tele4653 l4
 
Ecfft zk studyclub 9.9
Ecfft zk studyclub 9.9Ecfft zk studyclub 9.9
Ecfft zk studyclub 9.9
 
Gwt sdm public
Gwt sdm publicGwt sdm public
Gwt sdm public
 
Total Dominating Color Transversal Number of Graphs And Graph Operations
Total Dominating Color Transversal Number of Graphs And Graph OperationsTotal Dominating Color Transversal Number of Graphs And Graph Operations
Total Dominating Color Transversal Number of Graphs And Graph Operations
 
Classification of Groups and Homomorphism -By-Rajesh Bandari Yadav
Classification of Groups and Homomorphism -By-Rajesh Bandari YadavClassification of Groups and Homomorphism -By-Rajesh Bandari Yadav
Classification of Groups and Homomorphism -By-Rajesh Bandari Yadav
 
Gwt presen alsip-20111201
Gwt presen alsip-20111201Gwt presen alsip-20111201
Gwt presen alsip-20111201
 
A STUDY ON L-FUZZY NORMAL SUBl -GROUP
A STUDY ON L-FUZZY NORMAL SUBl -GROUPA STUDY ON L-FUZZY NORMAL SUBl -GROUP
A STUDY ON L-FUZZY NORMAL SUBl -GROUP
 
Biconnectivity
BiconnectivityBiconnectivity
Biconnectivity
 
15 predicate
15 predicate15 predicate
15 predicate
 
Some Concepts on Constant Interval Valued Intuitionistic Fuzzy Graphs
Some Concepts on Constant Interval Valued Intuitionistic Fuzzy GraphsSome Concepts on Constant Interval Valued Intuitionistic Fuzzy Graphs
Some Concepts on Constant Interval Valued Intuitionistic Fuzzy Graphs
 

Viewers also liked

Evolving web, evolving search
Evolving web, evolving searchEvolving web, evolving search
Evolving web, evolving searchnet2-project
 
Vector spaces for information extraction - Random Projection Example
Vector spaces for information extraction - Random Projection ExampleVector spaces for information extraction - Random Projection Example
Vector spaces for information extraction - Random Projection Example
net2-project
 
Exchanging More than Complete Data
Exchanging More than Complete DataExchanging More than Complete Data
Exchanging More than Complete Datanet2-project
 
Managing Social Communities
Managing Social CommunitiesManaging Social Communities
Managing Social Communitiesnet2-project
 
Extracting Information for Context-aware Meeting Preparation
Extracting Information for Context-aware Meeting PreparationExtracting Information for Context-aware Meeting Preparation
Extracting Information for Context-aware Meeting Preparation
net2-project
 
Exchanging More than Complete Data
Exchanging More than Complete DataExchanging More than Complete Data
Exchanging More than Complete Datanet2-project
 
Exchanging more than Complete Data
Exchanging more than Complete DataExchanging more than Complete Data
Exchanging more than Complete Datanet2-project
 

Viewers also liked (7)

Evolving web, evolving search
Evolving web, evolving searchEvolving web, evolving search
Evolving web, evolving search
 
Vector spaces for information extraction - Random Projection Example
Vector spaces for information extraction - Random Projection ExampleVector spaces for information extraction - Random Projection Example
Vector spaces for information extraction - Random Projection Example
 
Exchanging More than Complete Data
Exchanging More than Complete DataExchanging More than Complete Data
Exchanging More than Complete Data
 
Managing Social Communities
Managing Social CommunitiesManaging Social Communities
Managing Social Communities
 
Extracting Information for Context-aware Meeting Preparation
Extracting Information for Context-aware Meeting PreparationExtracting Information for Context-aware Meeting Preparation
Extracting Information for Context-aware Meeting Preparation
 
Exchanging More than Complete Data
Exchanging More than Complete DataExchanging More than Complete Data
Exchanging More than Complete Data
 
Exchanging more than Complete Data
Exchanging more than Complete DataExchanging more than Complete Data
Exchanging more than Complete Data
 

Similar to Data Exchange over RDF

Enumeration of 2-level polytopes
Enumeration of 2-level polytopesEnumeration of 2-level polytopes
Enumeration of 2-level polytopes
Vissarion Fisikopoulos
 
Introduction to harmonic analysis on groups, links with spatial correlation.
Introduction to harmonic analysis on groups, links with spatial correlation.Introduction to harmonic analysis on groups, links with spatial correlation.
Introduction to harmonic analysis on groups, links with spatial correlation.
Valentin De Bortoli
 
Darmon Points for fields of mixed signature
Darmon Points for fields of mixed signatureDarmon Points for fields of mixed signature
Darmon Points for fields of mixed signature
mmasdeu
 
Predicate Logic
Predicate LogicPredicate Logic
Predicate Logic
Darío Garigliotti
 
Probabilistic Retrieval TFIDF
Probabilistic Retrieval TFIDFProbabilistic Retrieval TFIDF
Probabilistic Retrieval TFIDF
DKALab
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
The Statistical and Applied Mathematical Sciences Institute
 
590-Article Text.pdf
590-Article Text.pdf590-Article Text.pdf
590-Article Text.pdf
BenoitValea
 
590-Article Text.pdf
590-Article Text.pdf590-Article Text.pdf
590-Article Text.pdf
BenoitValea
 
Fixed point result in menger space with ea property
Fixed point result in menger space with ea propertyFixed point result in menger space with ea property
Fixed point result in menger space with ea property
Alexander Decker
 
Query Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
Query Answering in Probabilistic Datalog+/{ Ontologies under Group PreferencesQuery Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
Query Answering in Probabilistic Datalog+/{ Ontologies under Group PreferencesOana Tifrea-Marciuska
 
Time complexity
Time complexityTime complexity
Time complexity
LAKSHMITHARUN PONNAM
 
RDFS with Attribute Equations via SPARQL Rewriting
RDFS with Attribute Equations via SPARQL RewritingRDFS with Attribute Equations via SPARQL Rewriting
RDFS with Attribute Equations via SPARQL Rewriting
Stefan Bischof
 
Paraproducts with general dilations
Paraproducts with general dilationsParaproducts with general dilations
Paraproducts with general dilations
VjekoslavKovac1
 
Probabilistic Retrieval
Probabilistic RetrievalProbabilistic Retrieval
Probabilistic Retrieval
otisg
 
Algorithm Design and Complexity - Course 11
Algorithm Design and Complexity - Course 11Algorithm Design and Complexity - Course 11
Algorithm Design and Complexity - Course 11Traian Rebedea
 
Tales on two commuting transformations or flows
Tales on two commuting transformations or flowsTales on two commuting transformations or flows
Tales on two commuting transformations or flows
VjekoslavKovac1
 
Darmon Points for fields of mixed signature
Darmon Points for fields of mixed signatureDarmon Points for fields of mixed signature
Darmon Points for fields of mixed signature
mmasdeu
 
Improving estimates for discrete polynomial averaging operators
Improving estimates for discrete polynomial averaging operatorsImproving estimates for discrete polynomial averaging operators
Improving estimates for discrete polynomial averaging operators
VjekoslavKovac1
 

Similar to Data Exchange over RDF (20)

Enumeration of 2-level polytopes
Enumeration of 2-level polytopesEnumeration of 2-level polytopes
Enumeration of 2-level polytopes
 
Introduction to harmonic analysis on groups, links with spatial correlation.
Introduction to harmonic analysis on groups, links with spatial correlation.Introduction to harmonic analysis on groups, links with spatial correlation.
Introduction to harmonic analysis on groups, links with spatial correlation.
 
Darmon Points for fields of mixed signature
Darmon Points for fields of mixed signatureDarmon Points for fields of mixed signature
Darmon Points for fields of mixed signature
 
Number theory lecture (part 2)
Number theory lecture (part 2)Number theory lecture (part 2)
Number theory lecture (part 2)
 
Predicate Logic
Predicate LogicPredicate Logic
Predicate Logic
 
Probabilistic Retrieval TFIDF
Probabilistic Retrieval TFIDFProbabilistic Retrieval TFIDF
Probabilistic Retrieval TFIDF
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 
590-Article Text.pdf
590-Article Text.pdf590-Article Text.pdf
590-Article Text.pdf
 
590-Article Text.pdf
590-Article Text.pdf590-Article Text.pdf
590-Article Text.pdf
 
Fixed point result in menger space with ea property
Fixed point result in menger space with ea propertyFixed point result in menger space with ea property
Fixed point result in menger space with ea property
 
Query Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
Query Answering in Probabilistic Datalog+/{ Ontologies under Group PreferencesQuery Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
Query Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
 
Time complexity
Time complexityTime complexity
Time complexity
 
RDFS with Attribute Equations via SPARQL Rewriting
RDFS with Attribute Equations via SPARQL RewritingRDFS with Attribute Equations via SPARQL Rewriting
RDFS with Attribute Equations via SPARQL Rewriting
 
Paraproducts with general dilations
Paraproducts with general dilationsParaproducts with general dilations
Paraproducts with general dilations
 
Probabilistic Retrieval
Probabilistic RetrievalProbabilistic Retrieval
Probabilistic Retrieval
 
Algorithm Design and Complexity - Course 11
Algorithm Design and Complexity - Course 11Algorithm Design and Complexity - Course 11
Algorithm Design and Complexity - Course 11
 
Tales on two commuting transformations or flows
Tales on two commuting transformations or flowsTales on two commuting transformations or flows
Tales on two commuting transformations or flows
 
Darmon Points for fields of mixed signature
Darmon Points for fields of mixed signatureDarmon Points for fields of mixed signature
Darmon Points for fields of mixed signature
 
Improving estimates for discrete polynomial averaging operators
Improving estimates for discrete polynomial averaging operatorsImproving estimates for discrete polynomial averaging operators
Improving estimates for discrete polynomial averaging operators
 
Induced subgraphs
Induced subgraphsInduced subgraphs
Induced subgraphs
 

More from net2-project

Random Manhattan Indexing
Random Manhattan IndexingRandom Manhattan Indexing
Random Manhattan Indexing
net2-project
 
Borders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic SystemsBorders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic Systemsnet2-project
 
Exchanging OWL 2 QL Knowledge Bases
Exchanging OWL 2 QL Knowledge BasesExchanging OWL 2 QL Knowledge Bases
Exchanging OWL 2 QL Knowledge Basesnet2-project
 
Federation and Navigation in SPARQL 1.1
Federation and Navigation in SPARQL 1.1Federation and Navigation in SPARQL 1.1
Federation and Navigation in SPARQL 1.1net2-project
 
Mining Semi-structured Data: Understanding Web-tables – Building a Taxonomy f...
Mining Semi-structured Data: Understanding Web-tables – Building a Taxonomy f...Mining Semi-structured Data: Understanding Web-tables – Building a Taxonomy f...
Mining Semi-structured Data: Understanding Web-tables – Building a Taxonomy f...net2-project
 
Extending DBpedia (LOD) using WikiTables
Extending DBpedia (LOD) using WikiTablesExtending DBpedia (LOD) using WikiTables
Extending DBpedia (LOD) using WikiTablesnet2-project
 
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
net2-project
 
Answer-set programming
Answer-set programmingAnswer-set programming
Answer-set programmingnet2-project
 
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
net2-project
 

More from net2-project (10)

Random Manhattan Indexing
Random Manhattan IndexingRandom Manhattan Indexing
Random Manhattan Indexing
 
Borders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic SystemsBorders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic Systems
 
Exchanging OWL 2 QL Knowledge Bases
Exchanging OWL 2 QL Knowledge BasesExchanging OWL 2 QL Knowledge Bases
Exchanging OWL 2 QL Knowledge Bases
 
Federation and Navigation in SPARQL 1.1
Federation and Navigation in SPARQL 1.1Federation and Navigation in SPARQL 1.1
Federation and Navigation in SPARQL 1.1
 
Mining Semi-structured Data: Understanding Web-tables – Building a Taxonomy f...
Mining Semi-structured Data: Understanding Web-tables – Building a Taxonomy f...Mining Semi-structured Data: Understanding Web-tables – Building a Taxonomy f...
Mining Semi-structured Data: Understanding Web-tables – Building a Taxonomy f...
 
Extending DBpedia (LOD) using WikiTables
Extending DBpedia (LOD) using WikiTablesExtending DBpedia (LOD) using WikiTables
Extending DBpedia (LOD) using WikiTables
 
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
 
Answer-set programming
Answer-set programmingAnswer-set programming
Answer-set programming
 
XSPARQL Tutorial
XSPARQL TutorialXSPARQL Tutorial
XSPARQL Tutorial
 
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
 

Recently uploaded

The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 

Recently uploaded (20)

The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 

Data Exchange over RDF

  • 1. Data Exchange over RDF Andr´s Letelier e Advisor: Marcelo Arenas Pontificia Universidad Cat´lica de Chile o September 1, 2011
  • 2. What is data exchange? Problem Data under one schema S needs to be restructured and translated into a target schema T S −→ T IS −→ IT
  • 3. Schema mappings Question Which source instances corresponds to which target instances? Answer Schema mappings: M ⊆ Instances(S) × Instances(T) Usually, schema mappings are defined as M = (S, T, ΣST )
  • 4. Definition (Solution) I2 is a solution of I1 under M iif (I1 , I2 ) ∈ M The set of all solutions for I1 under M is denoted by SolM (I1 )
  • 5. Resource Description Framework (RDF) Data model for representing information about World Wide Web resources W3C Recommendation (1998) Part of the semantic web stack Directed, labeled graphs Blank nodes (labeled nulls) Basically, sets of triples (s, p, o)
  • 6. Example D= { (B1 name paul) (B1 email paul@example.edu) (B2 name john) (B2 city Liverpool) }
  • 7. SPARQL (pronounced “sparkle”) Query language for RDF W3C Recommendation(2008) Standard for querying RDF datasets Returns sets of partial mappings Operators: Projection AND (inner join) OPT (left join) FILTER UNION and more
  • 8. Example P1 = (?X, name, ?Y ) ?X ?Y P1 D = B1 paul B2 john
  • 9. Example P2 = (?X, name, ?Y ) AND (?X, email, ?Z) ?X ?Y ?Z P2 D = B1 paul paul@example.edu
  • 10. Example P3 = (?X, name, ?Y ) OPT (?X, email, ?Z) ?X ?Y ?Z P3 D = B1 paul paul@example.edu B2 john
  • 11. Well-designed SPARQL patterns Definition (Well-designed patterns) A pattern P is well designed if for every subpattern P of the form P1 OPT P2 , every variable that appears in P2 and outside P also appears in P1 . Example (?X, name, ?Y ) OPT ((?X, email, ?Z) OPT (?X, city, ?A)) is well-designed (?X, name, ?Y ) OPT ((?W, email, ?Z) OPT (?X, city, ?A)) is not
  • 12. Data Exchange over RDF S and T are fixed to be RDF triples Tuple generating dependencies have to be redefined But first, we need some definitions...
  • 13. RDF Tuple Generating Dependencies Let P be a SPARQL pattern, µ1 and µ2 be partial mappings, and Ω1 and Ω2 be sets of mappings. Then: var(P ) are the variables mentioned in P dom(µ1 ) is the domain of µ1 A SPARQL SELECT query (denoted by (W, P ), where W ⊆ var(P )) is the projection of the evaluation of P onto the variables in W
  • 14. RDF Tuple Generating Dependencies Let P be a SPARQL pattern, µ1 and µ2 be partial mappings, and Ω1 and Ω2 be sets of mappings. Then: µ1 is subsumed by µ2 (µ1 µ2 ) if dom(µ1 ) ⊆ dom(µ2 ), for every ?X in dom(µ1 ) that is not bound to a blank node we have that µ1 (?X) = µ2 (?X) and for every pair of variables ?X and ?Y in dom(µ1 ) such that µ1 (?X) = µ1 (?Y ) it is the case that µ2 (?X) = µ2 (?Y ). Ω1 is subsumed by Ω2 (Ω1 Ω2 ) if for every mapping µ1 in Ω1 there exists a mapping µ2 in Ω2 such that µ1 µ2 .
  • 15. RDF Tuple Generating Dependencies (Re)Definition (Tuple Generating Dependencies) Let P1 and P2 be SPARQL patterns, and W ⊂ var(P1 ) ∩ var(P2 ). An RDF tgd is a sentence of the form (W, P1 ) → (W, P2 ) Given two RDF graphs G1 and G2 , and a set of tgds Σ, (G1 , G2 ) |= Σ if for every tgd (W, P1 ) → (W, P2 ) in Σ it is the case that (W, P1 ) G1 (W, P2 ) G2
  • 16. RDF Schema Mappings Since S and T are fixed, M=Σ G2 ∈ SolM (G1 ) ←→ (G1 , G2 ) |= Σ
  • 17. Universal solutions Example Let W = {?X}, Σ = {(W, (?X, name, ?Y ) AND (?X, email, ?Z)) → (W, (?Y, hasmail, ?Z))} and consider the dataset D: Solution 1 G2 = { (paul hasmail paul@example.edu) } Solution 2 G2 = { (paul hasmail paul@example.edu) (john hasmail n) }
  • 18. Universal solutions Definition A solution G2 is universal if for every other solution G2 , G2 G2 Solution 1 is universal Solution 2 is not
  • 19. Universal solutions Not all settings have universal solutions: Consider G1 = {(1, 2, 3)}, W = {?X, ?Y } and Σ = {(W, (?X, ?Y, ?Z)) → (W, ((?X, a, b) OPT (?W, b, ?Y )) AND ((?X, c, d) OPT (?Z, d, ?Y )))}
  • 20. Solution 1 G2 = { (1 a b) ( n1 b 2) (1 c d) } Solution 2 G2 = { (1 a b) ( n2 d 2) (1 c d) } This setting has no universal solution!
  • 21. Good and bad news Bad news There is no ensurance that an exchange setting that has a solution will have a universal solution Good news If the heads of all tgds in Σ are well-designed and there is a solution, there is always a universal solution Better news We have an algorithm
  • 22. “Chasing” SPARQL queries input A mapping µ and a (well-designed) SPARQL pattern P output An RDF graph G such that µ ∈ P G Chase(µ, ν, P, G) t: add unbound variables in t as fresh blank nodes to ν add ν(t) to G P1 AND P2 : Chase(µ, ν, P1 , G) Chase(µ, ν, P2 , G) P1 OPT P2 : Chase(µ, ν, P1 , G) if dom(µ) dom(ν) ∩ var(P2 ) = ∅: Chase(µ, ν, P2 , G)
  • 23. After chasing: µ ν ν∈ P G {µ} P G If we chase with every P2 in Heads(Σ) the evaluations of (W, P1 ) G1 , we get a universal solution.
  • 24. Certain answers Definition (Certain answers on a regular data exchange setting) The set of certain answers is the intersection of the evaluation of the query over all the valid solutions Example Consider G1 = {(1, 2, 3)} and {({?X},(?X, ?Y, ?Z)) → ({?X}, (?X, 1, 2) OPT (?X, ?Y, 3))}
  • 25. Solution 1 G2 = { (1 1 2) (W, P2 ) G2 = {{?X → 1}} } Solution 2 G2 = { (1 1 2) (W, P2 ) G2 = {{?X → 1, ?Y → 2}} (1 2 3) } The intersection of (W, P2 ) G2 and (W, P2 ) G2 is empty!
  • 26. Certain answers Given a pattern P and a set of RDF graphs G, let Lower(P, G) be the set of all lower bounds of G w.r.t. subsumption. (Re)Definition (Certain Answers) The set of certain answers of a set of RDF graphs and a SPARQL pattern P is defined as any mapping Ω in Lower(P, G), such that for any other Ω in Lower(P, G) it is the case that Ω Ω . Claim All the possible sets of certain answers to an RDF data exchange setting are homomorfically equivalent.
  • 27. Back in our previous example... Solution 1 G2 = { (W, P2 ) G2 = {{?X → 1}} (1 1 2) } Solution 2 G2 = { (1 1 2) (1 2 3) (W, P2 ) G2 = {{?X → 1, ?Y → 2}} } The set of certain answers is now {{?X → 1}}
  • 28. In conclusion... Our contributions so far: RDF and SPARQL TGDs RDF Schema mappings Universal solutions Materialization of universal solutions Certain answers
  • 29. In conclusion... To do: Prove remaining claims Query answering (using universal solutions) Incomplete information in the source instance Knowledge exchange over RDFs
  • 30. Thank you for listening Any questions?