SlideShare a Scribd company logo
1 of 34
Download to read offline
Provenance Information
     in the Web of Data




         Olaf Hartig
         Humboldt-Universität zu Berlin
         http://olafhartig.de/foaf.rdf#olaf
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         2
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         3
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         4
Outline


                                      Towards a model of
                                      Web data provenance

                                      Provenance information
                                      in the Web of data today

                                      Upcoming
                                      tasks
Olaf Hartig - Provenance Information in the Web of Data          5
Existing Provenance Research
     Main research areas: (scientific) workflows, DBMSs
 ●




     General focus:
 ●

           data creation




Olaf Hartig - Provenance Information in the Web of Data   6
Olaf Hartig - Provenance Information in the Web of Data   7
Olaf Hartig - Provenance Information in the Web of Data   8
Olaf Hartig - Provenance Information in the Web of Data   9
Olaf Hartig - Provenance Information in the Web of Data   10
Web data provenance
                            comprises
                         two dimensions:
        Data Creation • Data Access

Olaf Hartig - Provenance Information in the Web of Data   11
Basics of the Provenance Model
     Provenance graph describes provenance of a data item
 ●


         Nodes: provenance elements – pieces of provenance info
     ●


         Edges: relate provenance elements to each other
     ●


         Subgraphs for related data items possible
     ●




Olaf Hartig - Provenance Information in the Web of Data       12
Basics of the Provenance Model
     Provenance model defines:
 ●


         Types of provenance elements
     ●


         Relationships
     ●




Olaf Hartig - Provenance Information in the Web of Data   13
Basics of the Provenance Model
     Provenance model defines:
 ●


         Types of provenance elements
     ●


         Relationships
     ●



     High level of abstraction (only main element types)
 ●




Olaf Hartig - Provenance Information in the Web of Data    14
Basics of the Provenance Model



     General differentiation:
 ●


                                                          Actors
                                                          Executions
                                                          Artifacts




Olaf Hartig - Provenance Information in the Web of Data                15
Data Access Dimension
                                                                                               Data Item
                 Data Accessor
                  (Non-Human)
                                                                                                   contains
                                                                        Information Resource

                  Access Time
                                                          Data Access


                        Data Providing Service
                              (Non-Human)
                                                            controls
          uses
                                                Service Provider
       Data Publisher
           (Human)


              Relation to
       the provided Information
              Resource



Olaf Hartig - Provenance Information in the Web of Data                                                       16
Data Access Dimension cont.


                                                  owns
                                                                         Public Key
               Signer


         Relation to
       the signed Data                             Integrity Assurance
                                                                                      Digital Signature




                                                                                                signs
                             Verification Result

                                                                         (Signed)
                                                                         Artifact




Olaf Hartig - Provenance Information in the Web of Data                                                   17
Data Creation Dimension
                                                                        Provenance
                                                                        Information

                                                                              Source Data
                                                                                                         Provenance
                                              Creation Time
                                                                                                         Information

                                                                                            Creation Guidelines
                    Data Creator
                                                             Data Creation
              (Human or Non-human)

   {complete,disjoint}


                                                  Data Creating Device
                                                        (e.g. Sensor)                        Data Item

                          Data Creating Service
                            (e.g. Software Agent)                                       part of
                                                                                                    Provenance
                                 responsible for responsible for
   Data Creating Entity                                                                             Information
 (e.g. Person, Group, Orga.)
                                                                                      (Encompassing)
                                                                                        Data Item
          Relation to
Olaf Hartig created Data
      the - Provenance Information in the Web of Data                                                                  18
Provenance information
 in the Web of data today


Olaf Hartig - Provenance Information in the Web of Data   19
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
              FOAF – Friend of a Friend
              SIOC – Semantically-Interlinked Online Communities
              SWP – Semantic Web Publishing vocabulary
          ●



              WOT – Web of Trust schema
          ●



              OMV – Ontology Metadata Vocabulary
          ●



              PML – Proof Markup Language
          ●



              Changeset vocabulary
          ●



              Ouzo Provenance Ontology
          ●




Olaf Hartig - Provenance Information in the Web of Data            20
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms




Olaf Hartig - Provenance Information in the Web of Data   21
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●




                 dc:contributor
             ●




                 dc:source
             ●




                 dc:created
             ●




                 dc:modified
             ●




                 dc:publisher
             ●




                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data   22
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●




                 dc:contributor
             ●




                 dc:source
             ●




                 dc:created
             ●




                 dc:modified
             ●




                 dc:publisher – “an entity responsible for making the
             ●

                                 resource available”
                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data                 23
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●



                                                                              Data Access
                 dc:contributor
             ●



                                                     Data Providing Service
                 dc:source
             ●
                                                          (Non-Human)
                                                                                controls
                                       uses

                 dc:created Publisher                                   Service Provider
             ●
                          Data
                                        (Human)

                 dc:modified
             ●




                 dc:publisher – “an entity responsible for making the
             ●

                                 resource available”
                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data                                     24
Main Issues Today



                   Vocabularies:
               ●


                       Partly unsuitable
                   ●


                       Lack of certain features
                   ●


                       Coverage of provenance model impossible
                   ●




Olaf Hartig - Provenance Information in the Web of Data          25
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms

                            Property                      Occurrences*
                  dc:creator                                 about 24,284
                  dc:contributor                                     476
                  dc:source                                   about 3,631
                  dc:created                                 about 82,720
                  dc:modified                                about 12,020
                  dc:provenance                                          7

            *Measured by querying Sindice; Feb. 7, 2009 (by that
            time Sindice indexed about 48,99 million documents)
Olaf Hartig - Provenance Information in the Web of Data                      26
Main Issues Today



                   Vocabularies:
               ●


                       Partly unsuitable
                   ●


                       Lack of certain features
                   ●


                       Coverage of provenance model impossible
                   ●



                   General lack of provenance-related metadata
               ●

                   on the Web of data




Olaf Hartig - Provenance Information in the Web of Data          27
Possible Reasons



                             Lack of suitable vocabularies
                         ●


                             Lack of usable tools
                         ●


                             Ignorance / lack of sensitization
                         ●




Olaf Hartig - Provenance Information in the Web of Data          28
Upcoming
                                                          tasks
Olaf Hartig - Provenance Information in the Web of Data          29
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements




Olaf Hartig - Provenance Information in the Web of Data          30
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements

     Let's develop usable tools for data providers
 ●


         Edit and publish provenance-related metadata
     ●


         Automatic generation if possible
     ●




Olaf Hartig - Provenance Information in the Web of Data          31
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements

     Let's develop usable tools for data providers
 ●


         Edit and publish provenance-related metadata
     ●


         Automatic generation if possible
     ●




     Let's raise awareness of data providers
 ●


         Probably the hardest task
     ●


         Maybe voiD can help
     ●



Olaf Hartig - Provenance Information in the Web of Data          32
Thank you!

Olaf Hartig
Humboldt-Universität zu Berlin
http://olafhartig.de/foaf.rdf#olaf
These slides have been created by
                                             Olaf Hartig
                                                  http://olafhartig.de

                              This work is licensed under a
                Creative Commons Attribution-Share Alike 3.0 License
                    (http://creativecommons.org/licenses/by-sa/3.0/)


                              Attribution:
                                  http://www.flickr.com/photos/adrenalin/3032734/
                              ●

                                  http://www.hasslefreeclipart.com
                              ●

                                  http://www.flickr.com/photos/dullhunk/428079229/
                              ●

                                  http://www.flickr.com/photos/darwinbell/1337963794/
                              ●


                                  http://www.flickr.com/photos/alandd/2780700767/
                              ●


                                  http://www.flickr.com/photos/simeon_barkas/2872099696/
                              ●


                                  http://www.flickr.com/photos/robinh00d/122544491/
                              ●


                                  http://www.flickr.com/photos/adrenalin/3032747/
                              ●




Olaf Hartig - Provenance Information in the Web of Data                                    34

More Related Content

Similar to Provenance Information in the Web of Data

Omitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddataOmitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddataTope Omitola
 
Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Apigee | Google Cloud
 
Query Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataQuery Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataOlaf Hartig
 
The open semantic enterprise enterprise data meets web data
The open semantic enterprise   enterprise data meets web dataThe open semantic enterprise   enterprise data meets web data
The open semantic enterprise enterprise data meets web dataGeorg Guentner
 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataEdward Curry
 
Creating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingCreating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingInnoTech
 
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYSEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYAmit Sheth
 
Alitora Innovation Networks
Alitora Innovation NetworksAlitora Innovation Networks
Alitora Innovation Networksalitora
 
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...Cloudera, Inc.
 
A fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP SpainA fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP SpainChristian Martorella
 
Tech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataTech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataSteve Watt
 
sones company presentation
sones company presentationsones company presentation
sones company presentationsones GmbH
 

Similar to Provenance Information in the Web of Data (15)

Omitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddataOmitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddata
 
Open Data
Open DataOpen Data
Open Data
 
Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)
 
Query Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataQuery Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked Data
 
The open semantic enterprise enterprise data meets web data
The open semantic enterprise   enterprise data meets web dataThe open semantic enterprise   enterprise data meets web data
The open semantic enterprise enterprise data meets web data
 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial Data
 
Creating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingCreating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information Sharing
 
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYSEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
 
Alitora Innovation Networks
Alitora Innovation NetworksAlitora Innovation Networks
Alitora Innovation Networks
 
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
 
A fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP SpainA fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP Spain
 
Tech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataTech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big Data
 
Steve Watt Presentation
Steve Watt PresentationSteve Watt Presentation
Steve Watt Presentation
 
Provenance and Trust
Provenance and TrustProvenance and Trust
Provenance and Trust
 
sones company presentation
sones company presentationsones company presentation
sones company presentation
 

More from Olaf Hartig

LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataOlaf Hartig
 
A Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the WebA Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the WebOlaf Hartig
 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationOlaf Hartig
 
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...Olaf Hartig
 
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Olaf Hartig
 
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...Olaf Hartig
 
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Olaf Hartig
 
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)Olaf Hartig
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Olaf Hartig
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Olaf Hartig
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Olaf Hartig
 
An Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and QueryAn Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and QueryOlaf Hartig
 
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)Olaf Hartig
 
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Olaf Hartig
 
The Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataThe Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataOlaf Hartig
 
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked DataHow Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked DataOlaf Hartig
 
A Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked DataA Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked DataOlaf Hartig
 
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...Olaf Hartig
 
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Olaf Hartig
 
Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Olaf Hartig
 

More from Olaf Hartig (20)

LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked Data
 
A Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the WebA Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the Web
 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
 
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
 
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
 
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
 
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
 
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
 
An Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and QueryAn Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and Query
 
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
 
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
 
The Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataThe Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked Data
 
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked DataHow Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
 
A Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked DataA Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked Data
 
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
 
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
 
Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)
 

Recently uploaded

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Recently uploaded (20)

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

Provenance Information in the Web of Data

  • 1. Provenance Information in the Web of Data Olaf Hartig Humboldt-Universität zu Berlin http://olafhartig.de/foaf.rdf#olaf
  • 2. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 2
  • 3. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 3
  • 4. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 4
  • 5. Outline Towards a model of Web data provenance Provenance information in the Web of data today Upcoming tasks Olaf Hartig - Provenance Information in the Web of Data 5
  • 6. Existing Provenance Research Main research areas: (scientific) workflows, DBMSs ● General focus: ● data creation Olaf Hartig - Provenance Information in the Web of Data 6
  • 7. Olaf Hartig - Provenance Information in the Web of Data 7
  • 8. Olaf Hartig - Provenance Information in the Web of Data 8
  • 9. Olaf Hartig - Provenance Information in the Web of Data 9
  • 10. Olaf Hartig - Provenance Information in the Web of Data 10
  • 11. Web data provenance comprises two dimensions: Data Creation • Data Access Olaf Hartig - Provenance Information in the Web of Data 11
  • 12. Basics of the Provenance Model Provenance graph describes provenance of a data item ● Nodes: provenance elements – pieces of provenance info ● Edges: relate provenance elements to each other ● Subgraphs for related data items possible ● Olaf Hartig - Provenance Information in the Web of Data 12
  • 13. Basics of the Provenance Model Provenance model defines: ● Types of provenance elements ● Relationships ● Olaf Hartig - Provenance Information in the Web of Data 13
  • 14. Basics of the Provenance Model Provenance model defines: ● Types of provenance elements ● Relationships ● High level of abstraction (only main element types) ● Olaf Hartig - Provenance Information in the Web of Data 14
  • 15. Basics of the Provenance Model General differentiation: ● Actors Executions Artifacts Olaf Hartig - Provenance Information in the Web of Data 15
  • 16. Data Access Dimension Data Item Data Accessor (Non-Human) contains Information Resource Access Time Data Access Data Providing Service (Non-Human) controls uses Service Provider Data Publisher (Human) Relation to the provided Information Resource Olaf Hartig - Provenance Information in the Web of Data 16
  • 17. Data Access Dimension cont. owns Public Key Signer Relation to the signed Data Integrity Assurance Digital Signature signs Verification Result (Signed) Artifact Olaf Hartig - Provenance Information in the Web of Data 17
  • 18. Data Creation Dimension Provenance Information Source Data Provenance Creation Time Information Creation Guidelines Data Creator Data Creation (Human or Non-human) {complete,disjoint} Data Creating Device (e.g. Sensor) Data Item Data Creating Service (e.g. Software Agent) part of Provenance responsible for responsible for Data Creating Entity Information (e.g. Person, Group, Orga.) (Encompassing) Data Item Relation to Olaf Hartig created Data the - Provenance Information in the Web of Data 18
  • 19. Provenance information in the Web of data today Olaf Hartig - Provenance Information in the Web of Data 19
  • 20. Provenance-related Vocabularies DC – Dublin Core Metadata Terms FOAF – Friend of a Friend SIOC – Semantically-Interlinked Online Communities SWP – Semantic Web Publishing vocabulary ● WOT – Web of Trust schema ● OMV – Ontology Metadata Vocabulary ● PML – Proof Markup Language ● Changeset vocabulary ● Ouzo Provenance Ontology ● Olaf Hartig - Provenance Information in the Web of Data 20
  • 21. Provenance-related Vocabularies DC – Dublin Core Metadata Terms Olaf Hartig - Provenance Information in the Web of Data 21
  • 22. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● dc:contributor ● dc:source ● dc:created ● dc:modified ● dc:publisher ● dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 22
  • 23. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● dc:contributor ● dc:source ● dc:created ● dc:modified ● dc:publisher – “an entity responsible for making the ● resource available” dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 23
  • 24. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● Data Access dc:contributor ● Data Providing Service dc:source ● (Non-Human) controls uses dc:created Publisher Service Provider ● Data (Human) dc:modified ● dc:publisher – “an entity responsible for making the ● resource available” dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 24
  • 25. Main Issues Today Vocabularies: ● Partly unsuitable ● Lack of certain features ● Coverage of provenance model impossible ● Olaf Hartig - Provenance Information in the Web of Data 25
  • 26. Provenance-related Vocabularies DC – Dublin Core Metadata Terms Property Occurrences* dc:creator about 24,284 dc:contributor 476 dc:source about 3,631 dc:created about 82,720 dc:modified about 12,020 dc:provenance 7 *Measured by querying Sindice; Feb. 7, 2009 (by that time Sindice indexed about 48,99 million documents) Olaf Hartig - Provenance Information in the Web of Data 26
  • 27. Main Issues Today Vocabularies: ● Partly unsuitable ● Lack of certain features ● Coverage of provenance model impossible ● General lack of provenance-related metadata ● on the Web of data Olaf Hartig - Provenance Information in the Web of Data 27
  • 28. Possible Reasons Lack of suitable vocabularies ● Lack of usable tools ● Ignorance / lack of sensitization ● Olaf Hartig - Provenance Information in the Web of Data 28
  • 29. Upcoming tasks Olaf Hartig - Provenance Information in the Web of Data 29
  • 30. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Olaf Hartig - Provenance Information in the Web of Data 30
  • 31. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Let's develop usable tools for data providers ● Edit and publish provenance-related metadata ● Automatic generation if possible ● Olaf Hartig - Provenance Information in the Web of Data 31
  • 32. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Let's develop usable tools for data providers ● Edit and publish provenance-related metadata ● Automatic generation if possible ● Let's raise awareness of data providers ● Probably the hardest task ● Maybe voiD can help ● Olaf Hartig - Provenance Information in the Web of Data 32
  • 33. Thank you! Olaf Hartig Humboldt-Universität zu Berlin http://olafhartig.de/foaf.rdf#olaf
  • 34. These slides have been created by Olaf Hartig http://olafhartig.de This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License (http://creativecommons.org/licenses/by-sa/3.0/) Attribution: http://www.flickr.com/photos/adrenalin/3032734/ ● http://www.hasslefreeclipart.com ● http://www.flickr.com/photos/dullhunk/428079229/ ● http://www.flickr.com/photos/darwinbell/1337963794/ ● http://www.flickr.com/photos/alandd/2780700767/ ● http://www.flickr.com/photos/simeon_barkas/2872099696/ ● http://www.flickr.com/photos/robinh00d/122544491/ ● http://www.flickr.com/photos/adrenalin/3032747/ ● Olaf Hartig - Provenance Information in the Web of Data 34