SlideShare a Scribd company logo
1 of 34
Download to read offline
Provenance Information
     in the Web of Data




         Olaf Hartig
         Humboldt-UniversitΓ€t zu Berlin
         http://olafhartig.de/foaf.rdf#olaf
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         2
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         3
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         4
Outline


                                      Towards a model of
                                      Web data provenance

                                      Provenance information
                                      in the Web of data today

                                      Upcoming
                                      tasks
Olaf Hartig - Provenance Information in the Web of Data          5
Existing Provenance Research
     Main research areas: (scientific) workflows, DBMSs
 ●




     General focus:
 ●

           data creation




Olaf Hartig - Provenance Information in the Web of Data   6
Olaf Hartig - Provenance Information in the Web of Data   7
Olaf Hartig - Provenance Information in the Web of Data   8
Olaf Hartig - Provenance Information in the Web of Data   9
Olaf Hartig - Provenance Information in the Web of Data   10
Web data provenance
                            comprises
                         two dimensions:
        Data Creation β€’ Data Access

Olaf Hartig - Provenance Information in the Web of Data   11
Basics of the Provenance Model
     Provenance graph describes provenance of a data item
 ●


         Nodes: provenance elements – pieces of provenance info
     ●


         Edges: relate provenance elements to each other
     ●


         Subgraphs for related data items possible
     ●




Olaf Hartig - Provenance Information in the Web of Data       12
Basics of the Provenance Model
     Provenance model defines:
 ●


         Types of provenance elements
     ●


         Relationships
     ●




Olaf Hartig - Provenance Information in the Web of Data   13
Basics of the Provenance Model
     Provenance model defines:
 ●


         Types of provenance elements
     ●


         Relationships
     ●



     High level of abstraction (only main element types)
 ●




Olaf Hartig - Provenance Information in the Web of Data    14
Basics of the Provenance Model



     General differentiation:
 ●


                                                          Actors
                                                          Executions
                                                          Artifacts




Olaf Hartig - Provenance Information in the Web of Data                15
Data Access Dimension
                                                                                               Data Item
                 Data Accessor
                  (Non-Human)
                                                                                                   contains
                                                                        Information Resource

                  Access Time
                                                          Data Access


                        Data Providing Service
                              (Non-Human)
                                                            controls
          uses
                                                Service Provider
       Data Publisher
           (Human)


              Relation to
       the provided Information
              Resource



Olaf Hartig - Provenance Information in the Web of Data                                                       16
Data Access Dimension cont.


                                                  owns
                                                                         Public Key
               Signer


         Relation to
       the signed Data                             Integrity Assurance
                                                                                      Digital Signature




                                                                                                signs
                             Verification Result

                                                                         (Signed)
                                                                         Artifact




Olaf Hartig - Provenance Information in the Web of Data                                                   17
Data Creation Dimension
                                                                        Provenance
                                                                        Information

                                                                              Source Data
                                                                                                         Provenance
                                              Creation Time
                                                                                                         Information

                                                                                            Creation Guidelines
                    Data Creator
                                                             Data Creation
              (Human or Non-human)

   {complete,disjoint}


                                                  Data Creating Device
                                                        (e.g. Sensor)                        Data Item

                          Data Creating Service
                            (e.g. Software Agent)                                       part of
                                                                                                    Provenance
                                 responsible for responsible for
   Data Creating Entity                                                                             Information
 (e.g. Person, Group, Orga.)
                                                                                      (Encompassing)
                                                                                        Data Item
          Relation to
Olaf Hartig created Data
      the - Provenance Information in the Web of Data                                                                  18
Provenance information
 in the Web of data today


Olaf Hartig - Provenance Information in the Web of Data   19
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
              FOAF – Friend of a Friend
              SIOC – Semantically-Interlinked Online Communities
              SWP – Semantic Web Publishing vocabulary
          ●



              WOT – Web of Trust schema
          ●



              OMV – Ontology Metadata Vocabulary
          ●



              PML – Proof Markup Language
          ●



              Changeset vocabulary
          ●



              Ouzo Provenance Ontology
          ●




Olaf Hartig - Provenance Information in the Web of Data            20
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms




Olaf Hartig - Provenance Information in the Web of Data   21
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●




                 dc:contributor
             ●




                 dc:source
             ●




                 dc:created
             ●




                 dc:modified
             ●




                 dc:publisher
             ●




                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data   22
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●




                 dc:contributor
             ●




                 dc:source
             ●




                 dc:created
             ●




                 dc:modified
             ●




                 dc:publisher – β€œan entity responsible for making the
             ●

                                 resource available”
                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data                 23
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●



                                                                              Data Access
                 dc:contributor
             ●



                                                     Data Providing Service
                 dc:source
             ●
                                                          (Non-Human)
                                                                                controls
                                       uses

                 dc:created Publisher                                   Service Provider
             ●
                          Data
                                        (Human)

                 dc:modified
             ●




                 dc:publisher – β€œan entity responsible for making the
             ●

                                 resource available”
                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data                                     24
Main Issues Today



                   Vocabularies:
               ●


                       Partly unsuitable
                   ●


                       Lack of certain features
                   ●


                       Coverage of provenance model impossible
                   ●




Olaf Hartig - Provenance Information in the Web of Data          25
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms

                            Property                      Occurrences*
                  dc:creator                                 about 24,284
                  dc:contributor                                     476
                  dc:source                                   about 3,631
                  dc:created                                 about 82,720
                  dc:modified                                about 12,020
                  dc:provenance                                          7

            *Measured by querying Sindice; Feb. 7, 2009 (by that
            time Sindice indexed about 48,99 million documents)
Olaf Hartig - Provenance Information in the Web of Data                      26
Main Issues Today



                   Vocabularies:
               ●


                       Partly unsuitable
                   ●


                       Lack of certain features
                   ●


                       Coverage of provenance model impossible
                   ●



                   General lack of provenance-related metadata
               ●

                   on the Web of data




Olaf Hartig - Provenance Information in the Web of Data          27
Possible Reasons



                             Lack of suitable vocabularies
                         ●


                             Lack of usable tools
                         ●


                             Ignorance / lack of sensitization
                         ●




Olaf Hartig - Provenance Information in the Web of Data          28
Upcoming
                                                          tasks
Olaf Hartig - Provenance Information in the Web of Data          29
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements




Olaf Hartig - Provenance Information in the Web of Data          30
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements

     Let's develop usable tools for data providers
 ●


         Edit and publish provenance-related metadata
     ●


         Automatic generation if possible
     ●




Olaf Hartig - Provenance Information in the Web of Data          31
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements

     Let's develop usable tools for data providers
 ●


         Edit and publish provenance-related metadata
     ●


         Automatic generation if possible
     ●




     Let's raise awareness of data providers
 ●


         Probably the hardest task
     ●


         Maybe voiD can help
     ●



Olaf Hartig - Provenance Information in the Web of Data          32
Thank you!

Olaf Hartig
Humboldt-UniversitΓ€t zu Berlin
http://olafhartig.de/foaf.rdf#olaf
These slides have been created by
                                             Olaf Hartig
                                                  http://olafhartig.de

                              This work is licensed under a
                Creative Commons Attribution-Share Alike 3.0 License
                    (http://creativecommons.org/licenses/by-sa/3.0/)


                              Attribution:
                                  http://www.flickr.com/photos/adrenalin/3032734/
                              ●

                                  http://www.hasslefreeclipart.com
                              ●

                                  http://www.flickr.com/photos/dullhunk/428079229/
                              ●

                                  http://www.flickr.com/photos/darwinbell/1337963794/
                              ●


                                  http://www.flickr.com/photos/alandd/2780700767/
                              ●


                                  http://www.flickr.com/photos/simeon_barkas/2872099696/
                              ●


                                  http://www.flickr.com/photos/robinh00d/122544491/
                              ●


                                  http://www.flickr.com/photos/adrenalin/3032747/
                              ●




Olaf Hartig - Provenance Information in the Web of Data                                    34

More Related Content

Similar to Provenance Information in the Web of Data

Omitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddataOmitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddataTope Omitola
Β 
Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Apigee | Google Cloud
Β 
Query Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataQuery Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataOlaf Hartig
Β 
The open semantic enterprise enterprise data meets web data
The open semantic enterprise   enterprise data meets web dataThe open semantic enterprise   enterprise data meets web data
The open semantic enterprise enterprise data meets web dataGeorg Guentner
Β 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataEdward Curry
Β 
Creating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingCreating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingInnoTech
Β 
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYSEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYAmit Sheth
Β 
Alitora Innovation Networks
Alitora Innovation NetworksAlitora Innovation Networks
Alitora Innovation Networksalitora
Β 
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...Cloudera, Inc.
Β 
A fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP SpainA fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP SpainChristian Martorella
Β 
Tech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataTech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataSteve Watt
Β 
Steve Watt Presentation
Steve Watt PresentationSteve Watt Presentation
Steve Watt PresentationBig Data Houston
Β 
sones company presentation
sones company presentationsones company presentation
sones company presentationsones GmbH
Β 

Similar to Provenance Information in the Web of Data (15)

Omitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddataOmitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddata
Β 
Open Data
Open DataOpen Data
Open Data
Β 
Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)
Β 
Query Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataQuery Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked Data
Β 
The open semantic enterprise enterprise data meets web data
The open semantic enterprise   enterprise data meets web dataThe open semantic enterprise   enterprise data meets web data
The open semantic enterprise enterprise data meets web data
Β 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial Data
Β 
Creating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingCreating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information Sharing
Β 
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYSEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
Β 
Alitora Innovation Networks
Alitora Innovation NetworksAlitora Innovation Networks
Alitora Innovation Networks
Β 
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Β 
A fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP SpainA fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP Spain
Β 
Tech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataTech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big Data
Β 
Steve Watt Presentation
Steve Watt PresentationSteve Watt Presentation
Steve Watt Presentation
Β 
Provenance and Trust
Provenance and TrustProvenance and Trust
Provenance and Trust
Β 
sones company presentation
sones company presentationsones company presentation
sones company presentation
Β 

More from Olaf Hartig

LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataOlaf Hartig
Β 
A Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the WebA Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the WebOlaf Hartig
Β 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationOlaf Hartig
Β 
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...Olaf Hartig
Β 
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Olaf Hartig
Β 
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...Olaf Hartig
Β 
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Olaf Hartig
Β 
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)Olaf Hartig
Β 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Olaf Hartig
Β 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Olaf Hartig
Β 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Olaf Hartig
Β 
An Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and QueryAn Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and QueryOlaf Hartig
Β 
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)Olaf Hartig
Β 
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Olaf Hartig
Β 
The Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataThe Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataOlaf Hartig
Β 
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked DataHow Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked DataOlaf Hartig
Β 
A Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked DataA Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked DataOlaf Hartig
Β 
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...Olaf Hartig
Β 
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Olaf Hartig
Β 
Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Olaf Hartig
Β 

More from Olaf Hartig (20)

LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked Data
Β 
A Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the WebA Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the Web
Β 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Β 
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Β 
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Β 
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Β 
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Β 
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Β 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Β 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Β 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Β 
An Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and QueryAn Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and Query
Β 
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
Β 
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Β 
The Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataThe Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked Data
Β 
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked DataHow Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
Β 
A Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked DataA Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked Data
Β 
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Β 
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Β 
Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)
Β 

Recently uploaded

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
Β 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
Β 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
Β 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
Β 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
Β 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
Β 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
Β 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
Β 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel AraΓΊjo
Β 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
Β 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
Β 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
Β 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
Β 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
Β 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
Β 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
Β 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
Β 
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
Β 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
Β 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
Β 

Recently uploaded (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
Β 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Β 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
Β 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
Β 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Β 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
Β 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
Β 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Β 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Β 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
Β 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Β 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
Β 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
Β 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Β 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
Β 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
Β 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Β 
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure service
Β 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
Β 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
Β 

Provenance Information in the Web of Data

  • 1. Provenance Information in the Web of Data Olaf Hartig Humboldt-UniversitΓ€t zu Berlin http://olafhartig.de/foaf.rdf#olaf
  • 2. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 2
  • 3. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 3
  • 4. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 4
  • 5. Outline Towards a model of Web data provenance Provenance information in the Web of data today Upcoming tasks Olaf Hartig - Provenance Information in the Web of Data 5
  • 6. Existing Provenance Research Main research areas: (scientific) workflows, DBMSs ● General focus: ● data creation Olaf Hartig - Provenance Information in the Web of Data 6
  • 7. Olaf Hartig - Provenance Information in the Web of Data 7
  • 8. Olaf Hartig - Provenance Information in the Web of Data 8
  • 9. Olaf Hartig - Provenance Information in the Web of Data 9
  • 10. Olaf Hartig - Provenance Information in the Web of Data 10
  • 11. Web data provenance comprises two dimensions: Data Creation β€’ Data Access Olaf Hartig - Provenance Information in the Web of Data 11
  • 12. Basics of the Provenance Model Provenance graph describes provenance of a data item ● Nodes: provenance elements – pieces of provenance info ● Edges: relate provenance elements to each other ● Subgraphs for related data items possible ● Olaf Hartig - Provenance Information in the Web of Data 12
  • 13. Basics of the Provenance Model Provenance model defines: ● Types of provenance elements ● Relationships ● Olaf Hartig - Provenance Information in the Web of Data 13
  • 14. Basics of the Provenance Model Provenance model defines: ● Types of provenance elements ● Relationships ● High level of abstraction (only main element types) ● Olaf Hartig - Provenance Information in the Web of Data 14
  • 15. Basics of the Provenance Model General differentiation: ● Actors Executions Artifacts Olaf Hartig - Provenance Information in the Web of Data 15
  • 16. Data Access Dimension Data Item Data Accessor (Non-Human) contains Information Resource Access Time Data Access Data Providing Service (Non-Human) controls uses Service Provider Data Publisher (Human) Relation to the provided Information Resource Olaf Hartig - Provenance Information in the Web of Data 16
  • 17. Data Access Dimension cont. owns Public Key Signer Relation to the signed Data Integrity Assurance Digital Signature signs Verification Result (Signed) Artifact Olaf Hartig - Provenance Information in the Web of Data 17
  • 18. Data Creation Dimension Provenance Information Source Data Provenance Creation Time Information Creation Guidelines Data Creator Data Creation (Human or Non-human) {complete,disjoint} Data Creating Device (e.g. Sensor) Data Item Data Creating Service (e.g. Software Agent) part of Provenance responsible for responsible for Data Creating Entity Information (e.g. Person, Group, Orga.) (Encompassing) Data Item Relation to Olaf Hartig created Data the - Provenance Information in the Web of Data 18
  • 19. Provenance information in the Web of data today Olaf Hartig - Provenance Information in the Web of Data 19
  • 20. Provenance-related Vocabularies DC – Dublin Core Metadata Terms FOAF – Friend of a Friend SIOC – Semantically-Interlinked Online Communities SWP – Semantic Web Publishing vocabulary ● WOT – Web of Trust schema ● OMV – Ontology Metadata Vocabulary ● PML – Proof Markup Language ● Changeset vocabulary ● Ouzo Provenance Ontology ● Olaf Hartig - Provenance Information in the Web of Data 20
  • 21. Provenance-related Vocabularies DC – Dublin Core Metadata Terms Olaf Hartig - Provenance Information in the Web of Data 21
  • 22. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● dc:contributor ● dc:source ● dc:created ● dc:modified ● dc:publisher ● dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 22
  • 23. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● dc:contributor ● dc:source ● dc:created ● dc:modified ● dc:publisher – β€œan entity responsible for making the ● resource available” dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 23
  • 24. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● Data Access dc:contributor ● Data Providing Service dc:source ● (Non-Human) controls uses dc:created Publisher Service Provider ● Data (Human) dc:modified ● dc:publisher – β€œan entity responsible for making the ● resource available” dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 24
  • 25. Main Issues Today Vocabularies: ● Partly unsuitable ● Lack of certain features ● Coverage of provenance model impossible ● Olaf Hartig - Provenance Information in the Web of Data 25
  • 26. Provenance-related Vocabularies DC – Dublin Core Metadata Terms Property Occurrences* dc:creator about 24,284 dc:contributor 476 dc:source about 3,631 dc:created about 82,720 dc:modified about 12,020 dc:provenance 7 *Measured by querying Sindice; Feb. 7, 2009 (by that time Sindice indexed about 48,99 million documents) Olaf Hartig - Provenance Information in the Web of Data 26
  • 27. Main Issues Today Vocabularies: ● Partly unsuitable ● Lack of certain features ● Coverage of provenance model impossible ● General lack of provenance-related metadata ● on the Web of data Olaf Hartig - Provenance Information in the Web of Data 27
  • 28. Possible Reasons Lack of suitable vocabularies ● Lack of usable tools ● Ignorance / lack of sensitization ● Olaf Hartig - Provenance Information in the Web of Data 28
  • 29. Upcoming tasks Olaf Hartig - Provenance Information in the Web of Data 29
  • 30. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Olaf Hartig - Provenance Information in the Web of Data 30
  • 31. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Let's develop usable tools for data providers ● Edit and publish provenance-related metadata ● Automatic generation if possible ● Olaf Hartig - Provenance Information in the Web of Data 31
  • 32. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Let's develop usable tools for data providers ● Edit and publish provenance-related metadata ● Automatic generation if possible ● Let's raise awareness of data providers ● Probably the hardest task ● Maybe voiD can help ● Olaf Hartig - Provenance Information in the Web of Data 32
  • 33. Thank you! Olaf Hartig Humboldt-UniversitΓ€t zu Berlin http://olafhartig.de/foaf.rdf#olaf
  • 34. These slides have been created by Olaf Hartig http://olafhartig.de This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License (http://creativecommons.org/licenses/by-sa/3.0/) Attribution: http://www.flickr.com/photos/adrenalin/3032734/ ● http://www.hasslefreeclipart.com ● http://www.flickr.com/photos/dullhunk/428079229/ ● http://www.flickr.com/photos/darwinbell/1337963794/ ● http://www.flickr.com/photos/alandd/2780700767/ ● http://www.flickr.com/photos/simeon_barkas/2872099696/ ● http://www.flickr.com/photos/robinh00d/122544491/ ● http://www.flickr.com/photos/adrenalin/3032747/ ● Olaf Hartig - Provenance Information in the Web of Data 34