SlideShare a Scribd company logo
1 of 6
Quality Reasoning in the Semantic Web
 Chris Baillie, Pete Edwards and Edoardo Pignotti


 • Evaluating data quality – a challenge for agents (human
   and machine)
 • Data quality is a measure of “fitness for use”
 • To evaluate quality an agent must examine the context
   surrounding data (Linked Data)
 • Part of this context should describe data provenance:
     – The entities, activities, and agents associated with data



c.baillie@abdn.ac.uk
Illustrative Scenario
                                                   ssn:FeatureOfInterest
                                                        busroute:X95

                                     ssn:featureOfInterest         ssn:featureOfInterest
  -2.789
               geo:long                                                                       ssn:SensingDevice
                            ssn:Observation                                                        "iPhone1"
                                 Value
                geo:lat        "ObsValue2"
  55.605
                                                                                           ssn:observedBy
                             ssn:hasValue

                            ssn:Observation
                                                                                ssn:Observation
                             "Observation2"
                                                                                 "Observation1"




            prov:Entity                                prov:Activity                          prov:Entity
           "Observation2"         prov:generated                           prov:used
                                                       "Map Matching"                        "Observation1"



                                                   prov:wasDerivedFrom
c.baillie@abdn.ac.uk
Research Questions

 1. Is it possible to reason about quality in the
    Web of Linked Sensor Data?

 2. Is it possible to reason about quality using
    the provenance of sensor observations?

 3. Can the provenance of existing quality scores
    facilitate new quality assessments?
c.baillie@abdn.ac.uk
Work To Date

 • Developed a quality reasoning framework
   enabling:
     – Linked Data representation of sensor observations
       using W3C Semantic Sensor Network ontology
     – Document observation provenance with W3C PROV
       ontology
     – Definition of quality requirements using SPARQL rules
     – Generation of quality scores via SPIN reasoner
 • Deployed as part of a RTPIS (GetThere)
     – See my poster for more info or dotrural.ac.uk/irp

c.baillie@abdn.ac.uk
Target Results

 • Demonstrate that it is feasible to evaluate the
   quality of linked data (DONE)
 • Demonstrate that the presence of provenance
   enhances quality assessment (QA)
     – Empirical studies involving benchmark cases with and
       without provenance
 • Demonstrate that we can re-use existing quality
   assessment results based on their provenance.
     – Empirical studies examining quality score re-use;
       measure impact on QA computational overhead

c.baillie@abdn.ac.uk
Reasoning about Quality
                           CON ST RU CT {
                             _:b0 a QualityS core .
                             _:b0 score ? .
                                          qs
                             _:b0 dqm:ruleViolation _:b1 .
                             _:b1 a dqm:RequirementViolation .
                             _:b1 dqm:affectedInstance ? instance
                           } W H ERE {
                             ?instance a prov:Entity .
                             ?instance prov:wasDerivedFrom ? .  obs
         spin:rule           N OT EX IST S {
                               ? prov:wasDerivedFrom ?
                                obs                         entity . }
                             ? _:distanceFromRoute ?
                              obs                         distance .
     Relevance
                             LET (? := (1 - (?
                                    qs          distance / 100))) .
   Requirement              }
   "Requirement1"


  dqm:basedOn
                                          dqm:Requirement
 dqm:QualityScore    dqm:ruleViolation         Violation
   "QualityScore1"                        "RelevanceViolation1"

  dqm:plainScore                         dqm:affectedInstance


         0                                    prov:Entity
                                             "Obser vation2"
                                             "Observation2"

More Related Content

Viewers also liked

Circuitos Digitales - Registro de corrimiento bidireccional de 4 bits con var...
Circuitos Digitales - Registro de corrimiento bidireccional de 4 bits con var...Circuitos Digitales - Registro de corrimiento bidireccional de 4 bits con var...
Circuitos Digitales - Registro de corrimiento bidireccional de 4 bits con var...Fernando Marcos Marcos
 
Circuitos Digitales - Corrimiento de bits
Circuitos Digitales - Corrimiento de bits Circuitos Digitales - Corrimiento de bits
Circuitos Digitales - Corrimiento de bits Fernando Marcos Marcos
 
11.mon div
11.mon div11.mon div
11.mon divwdwasile
 
Connect and combine
Connect and combineConnect and combine
Connect and combinedoeniadee
 
Evaluating Data Quality using Sensor Metadata and Provenance
Evaluating Data Quality using Sensor Metadata and ProvenanceEvaluating Data Quality using Sensor Metadata and Provenance
Evaluating Data Quality using Sensor Metadata and ProvenanceChris Baillie
 
10.mon write
10.mon write 10.mon write
10.mon write wdwasile
 
Circuitos Digitales - Contador ascendente y descendente con reset
Circuitos Digitales - Contador ascendente y descendente con resetCircuitos Digitales - Contador ascendente y descendente con reset
Circuitos Digitales - Contador ascendente y descendente con resetFernando Marcos Marcos
 

Viewers also liked (9)

Circuitos Digitales - Registro de corrimiento bidireccional de 4 bits con var...
Circuitos Digitales - Registro de corrimiento bidireccional de 4 bits con var...Circuitos Digitales - Registro de corrimiento bidireccional de 4 bits con var...
Circuitos Digitales - Registro de corrimiento bidireccional de 4 bits con var...
 
Circuitos Digitales - Corrimiento de bits
Circuitos Digitales - Corrimiento de bits Circuitos Digitales - Corrimiento de bits
Circuitos Digitales - Corrimiento de bits
 
11.mon div
11.mon div11.mon div
11.mon div
 
10.mon pr
10.mon pr10.mon pr
10.mon pr
 
Connect and combine
Connect and combineConnect and combine
Connect and combine
 
Evaluating Data Quality using Sensor Metadata and Provenance
Evaluating Data Quality using Sensor Metadata and ProvenanceEvaluating Data Quality using Sensor Metadata and Provenance
Evaluating Data Quality using Sensor Metadata and Provenance
 
10.mon write
10.mon write 10.mon write
10.mon write
 
Grammar book
Grammar bookGrammar book
Grammar book
 
Circuitos Digitales - Contador ascendente y descendente con reset
Circuitos Digitales - Contador ascendente y descendente con resetCircuitos Digitales - Contador ascendente y descendente con reset
Circuitos Digitales - Contador ascendente y descendente con reset
 

Quality Reasoning in the Semantic Web

  • 1. Quality Reasoning in the Semantic Web Chris Baillie, Pete Edwards and Edoardo Pignotti • Evaluating data quality – a challenge for agents (human and machine) • Data quality is a measure of “fitness for use” • To evaluate quality an agent must examine the context surrounding data (Linked Data) • Part of this context should describe data provenance: – The entities, activities, and agents associated with data c.baillie@abdn.ac.uk
  • 2. Illustrative Scenario ssn:FeatureOfInterest busroute:X95 ssn:featureOfInterest ssn:featureOfInterest -2.789 geo:long ssn:SensingDevice ssn:Observation "iPhone1" Value geo:lat "ObsValue2" 55.605 ssn:observedBy ssn:hasValue ssn:Observation ssn:Observation "Observation2" "Observation1" prov:Entity prov:Activity prov:Entity "Observation2" prov:generated prov:used "Map Matching" "Observation1" prov:wasDerivedFrom c.baillie@abdn.ac.uk
  • 3. Research Questions 1. Is it possible to reason about quality in the Web of Linked Sensor Data? 2. Is it possible to reason about quality using the provenance of sensor observations? 3. Can the provenance of existing quality scores facilitate new quality assessments? c.baillie@abdn.ac.uk
  • 4. Work To Date • Developed a quality reasoning framework enabling: – Linked Data representation of sensor observations using W3C Semantic Sensor Network ontology – Document observation provenance with W3C PROV ontology – Definition of quality requirements using SPARQL rules – Generation of quality scores via SPIN reasoner • Deployed as part of a RTPIS (GetThere) – See my poster for more info or dotrural.ac.uk/irp c.baillie@abdn.ac.uk
  • 5. Target Results • Demonstrate that it is feasible to evaluate the quality of linked data (DONE) • Demonstrate that the presence of provenance enhances quality assessment (QA) – Empirical studies involving benchmark cases with and without provenance • Demonstrate that we can re-use existing quality assessment results based on their provenance. – Empirical studies examining quality score re-use; measure impact on QA computational overhead c.baillie@abdn.ac.uk
  • 6. Reasoning about Quality CON ST RU CT { _:b0 a QualityS core . _:b0 score ? . qs _:b0 dqm:ruleViolation _:b1 . _:b1 a dqm:RequirementViolation . _:b1 dqm:affectedInstance ? instance } W H ERE { ?instance a prov:Entity . ?instance prov:wasDerivedFrom ? . obs spin:rule N OT EX IST S { ? prov:wasDerivedFrom ? obs entity . } ? _:distanceFromRoute ? obs distance . Relevance LET (? := (1 - (? qs distance / 100))) . Requirement } "Requirement1" dqm:basedOn dqm:Requirement dqm:QualityScore dqm:ruleViolation Violation "QualityScore1" "RelevanceViolation1" dqm:plainScore dqm:affectedInstance 0 prov:Entity "Obser vation2" "Observation2"

Editor's Notes

  1. Consider a crowd-sourcing applicationthat enables users to provide location information on public transport.We can enhance the context around user observations using, e.g. the W3C SSN ontology.This is enough to perform quality assessment, and we’d probably conclude that this observation is quite high quality.Describing observation provenance using, e.g. W3C PROV ontology enables us to examine how the observation was created.It was actually derived from some other observation, highlighted in green, that has been forced back onto the expected route by a map matching algorithm.Considering new context this as part of a quality assessment, it is likely that Observation 2 is lower quality than we initially concluded.