SlideShare a Scribd company logo
1 of 34
Download to read offline
Provenance Information
     in the Web of Data




         Olaf Hartig
         Humboldt-Universität zu Berlin
         http://olafhartig.de/foaf.rdf#olaf
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         2
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         3
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         4
Outline


                                      Towards a model of
                                      Web data provenance

                                      Provenance information
                                      in the Web of data today

                                      Upcoming
                                      tasks
Olaf Hartig - Provenance Information in the Web of Data          5
Existing Provenance Research
     Main research areas: (scientific) workflows, DBMSs
 ●




     General focus:
 ●

           data creation




Olaf Hartig - Provenance Information in the Web of Data   6
Olaf Hartig - Provenance Information in the Web of Data   7
Olaf Hartig - Provenance Information in the Web of Data   8
Olaf Hartig - Provenance Information in the Web of Data   9
Olaf Hartig - Provenance Information in the Web of Data   10
Web data provenance
                            comprises
                         two dimensions:
        Data Creation • Data Access

Olaf Hartig - Provenance Information in the Web of Data   11
Basics of the Provenance Model
     Provenance graph describes provenance of a data item
 ●


         Nodes: provenance elements – pieces of provenance info
     ●


         Edges: relate provenance elements to each other
     ●


         Subgraphs for related data items possible
     ●




Olaf Hartig - Provenance Information in the Web of Data       12
Basics of the Provenance Model
     Provenance model defines:
 ●


         Types of provenance elements
     ●


         Relationships
     ●




Olaf Hartig - Provenance Information in the Web of Data   13
Basics of the Provenance Model
     Provenance model defines:
 ●


         Types of provenance elements
     ●


         Relationships
     ●



     High level of abstraction (only main element types)
 ●




Olaf Hartig - Provenance Information in the Web of Data    14
Basics of the Provenance Model



     General differentiation:
 ●


                                                          Actors
                                                          Executions
                                                          Artifacts




Olaf Hartig - Provenance Information in the Web of Data                15
Data Access Dimension
                                                                                               Data Item
                 Data Accessor
                  (Non-Human)
                                                                                                   contains
                                                                        Information Resource

                  Access Time
                                                          Data Access


                        Data Providing Service
                              (Non-Human)
                                                            controls
          uses
                                                Service Provider
       Data Publisher
           (Human)


              Relation to
       the provided Information
              Resource



Olaf Hartig - Provenance Information in the Web of Data                                                       16
Data Access Dimension cont.


                                                  owns
                                                                         Public Key
               Signer


         Relation to
       the signed Data                             Integrity Assurance
                                                                                      Digital Signature




                                                                                                signs
                             Verification Result

                                                                         (Signed)
                                                                         Artifact




Olaf Hartig - Provenance Information in the Web of Data                                                   17
Data Creation Dimension
                                                                        Provenance
                                                                        Information

                                                                              Source Data
                                                                                                         Provenance
                                              Creation Time
                                                                                                         Information

                                                                                            Creation Guidelines
                    Data Creator
                                                             Data Creation
              (Human or Non-human)

   {complete,disjoint}


                                                  Data Creating Device
                                                        (e.g. Sensor)                        Data Item

                          Data Creating Service
                            (e.g. Software Agent)                                       part of
                                                                                                    Provenance
                                 responsible for responsible for
   Data Creating Entity                                                                             Information
 (e.g. Person, Group, Orga.)
                                                                                      (Encompassing)
                                                                                        Data Item
          Relation to
Olaf Hartig created Data
      the - Provenance Information in the Web of Data                                                                  18
Provenance information
 in the Web of data today


Olaf Hartig - Provenance Information in the Web of Data   19
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
              FOAF – Friend of a Friend
              SIOC – Semantically-Interlinked Online Communities
              SWP – Semantic Web Publishing vocabulary
          ●



              WOT – Web of Trust schema
          ●



              OMV – Ontology Metadata Vocabulary
          ●



              PML – Proof Markup Language
          ●



              Changeset vocabulary
          ●



              Ouzo Provenance Ontology
          ●




Olaf Hartig - Provenance Information in the Web of Data            20
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms




Olaf Hartig - Provenance Information in the Web of Data   21
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●




                 dc:contributor
             ●




                 dc:source
             ●




                 dc:created
             ●




                 dc:modified
             ●




                 dc:publisher
             ●




                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data   22
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●




                 dc:contributor
             ●




                 dc:source
             ●




                 dc:created
             ●




                 dc:modified
             ●




                 dc:publisher – “an entity responsible for making the
             ●

                                 resource available”
                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data                 23
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●



                                                                              Data Access
                 dc:contributor
             ●



                                                     Data Providing Service
                 dc:source
             ●
                                                          (Non-Human)
                                                                                controls
                                       uses

                 dc:created Publisher                                   Service Provider
             ●
                          Data
                                        (Human)

                 dc:modified
             ●




                 dc:publisher – “an entity responsible for making the
             ●

                                 resource available”
                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data                                     24
Main Issues Today



                   Vocabularies:
               ●


                       Partly unsuitable
                   ●


                       Lack of certain features
                   ●


                       Coverage of provenance model impossible
                   ●




Olaf Hartig - Provenance Information in the Web of Data          25
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms

                            Property                      Occurrences*
                  dc:creator                                 about 24,284
                  dc:contributor                                     476
                  dc:source                                   about 3,631
                  dc:created                                 about 82,720
                  dc:modified                                about 12,020
                  dc:provenance                                          7

            *Measured by querying Sindice; Feb. 7, 2009 (by that
            time Sindice indexed about 48,99 million documents)
Olaf Hartig - Provenance Information in the Web of Data                      26
Main Issues Today



                   Vocabularies:
               ●


                       Partly unsuitable
                   ●


                       Lack of certain features
                   ●


                       Coverage of provenance model impossible
                   ●



                   General lack of provenance-related metadata
               ●

                   on the Web of data




Olaf Hartig - Provenance Information in the Web of Data          27
Possible Reasons



                             Lack of suitable vocabularies
                         ●


                             Lack of usable tools
                         ●


                             Ignorance / lack of sensitization
                         ●




Olaf Hartig - Provenance Information in the Web of Data          28
Upcoming
                                                          tasks
Olaf Hartig - Provenance Information in the Web of Data          29
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements




Olaf Hartig - Provenance Information in the Web of Data          30
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements

     Let's develop usable tools for data providers
 ●


         Edit and publish provenance-related metadata
     ●


         Automatic generation if possible
     ●




Olaf Hartig - Provenance Information in the Web of Data          31
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements

     Let's develop usable tools for data providers
 ●


         Edit and publish provenance-related metadata
     ●


         Automatic generation if possible
     ●




     Let's raise awareness of data providers
 ●


         Probably the hardest task
     ●


         Maybe voiD can help
     ●



Olaf Hartig - Provenance Information in the Web of Data          32
Thank you!

Olaf Hartig
Humboldt-Universität zu Berlin
http://olafhartig.de/foaf.rdf#olaf
These slides have been created by
                                             Olaf Hartig
                                                  http://olafhartig.de

                              This work is licensed under a
                Creative Commons Attribution-Share Alike 3.0 License
                    (http://creativecommons.org/licenses/by-sa/3.0/)


                              Attribution:
                                  http://www.flickr.com/photos/adrenalin/3032734/
                              ●

                                  http://www.hasslefreeclipart.com
                              ●

                                  http://www.flickr.com/photos/dullhunk/428079229/
                              ●

                                  http://www.flickr.com/photos/darwinbell/1337963794/
                              ●


                                  http://www.flickr.com/photos/alandd/2780700767/
                              ●


                                  http://www.flickr.com/photos/simeon_barkas/2872099696/
                              ●


                                  http://www.flickr.com/photos/robinh00d/122544491/
                              ●


                                  http://www.flickr.com/photos/adrenalin/3032747/
                              ●




Olaf Hartig - Provenance Information in the Web of Data                                    34

More Related Content

Similar to Provenance Information in the Web of Data

Omitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddataOmitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddataTope Omitola
 
Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Apigee | Google Cloud
 
Query Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataQuery Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataOlaf Hartig
 
The open semantic enterprise enterprise data meets web data
The open semantic enterprise   enterprise data meets web dataThe open semantic enterprise   enterprise data meets web data
The open semantic enterprise enterprise data meets web dataGeorg Guentner
 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataEdward Curry
 
Creating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingCreating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingInnoTech
 
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYSEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYAmit Sheth
 
Alitora Innovation Networks
Alitora Innovation NetworksAlitora Innovation Networks
Alitora Innovation Networksalitora
 
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...Cloudera, Inc.
 
A fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP SpainA fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP SpainChristian Martorella
 
Tech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataTech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataSteve Watt
 
sones company presentation
sones company presentationsones company presentation
sones company presentationsones GmbH
 

Similar to Provenance Information in the Web of Data (15)

Omitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddataOmitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddata
 
Open Data
Open DataOpen Data
Open Data
 
Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)
 
Query Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataQuery Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked Data
 
The open semantic enterprise enterprise data meets web data
The open semantic enterprise   enterprise data meets web dataThe open semantic enterprise   enterprise data meets web data
The open semantic enterprise enterprise data meets web data
 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial Data
 
Creating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingCreating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information Sharing
 
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYSEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
 
Alitora Innovation Networks
Alitora Innovation NetworksAlitora Innovation Networks
Alitora Innovation Networks
 
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
 
A fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP SpainA fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP Spain
 
Tech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataTech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big Data
 
Steve Watt Presentation
Steve Watt PresentationSteve Watt Presentation
Steve Watt Presentation
 
Provenance and Trust
Provenance and TrustProvenance and Trust
Provenance and Trust
 
sones company presentation
sones company presentationsones company presentation
sones company presentation
 

More from Olaf Hartig

LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataOlaf Hartig
 
A Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the WebA Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the WebOlaf Hartig
 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationOlaf Hartig
 
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...Olaf Hartig
 
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Olaf Hartig
 
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...Olaf Hartig
 
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Olaf Hartig
 
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)Olaf Hartig
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Olaf Hartig
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Olaf Hartig
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Olaf Hartig
 
An Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and QueryAn Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and QueryOlaf Hartig
 
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)Olaf Hartig
 
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Olaf Hartig
 
The Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataThe Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataOlaf Hartig
 
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked DataHow Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked DataOlaf Hartig
 
A Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked DataA Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked DataOlaf Hartig
 
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...Olaf Hartig
 
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Olaf Hartig
 
Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Olaf Hartig
 

More from Olaf Hartig (20)

LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked Data
 
A Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the WebA Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the Web
 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
 
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
 
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
 
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
 
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
 
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
 
An Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and QueryAn Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and Query
 
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
 
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
 
The Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataThe Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked Data
 
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked DataHow Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
 
A Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked DataA Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked Data
 
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
 
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
 
Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)
 

Recently uploaded

Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 

Recently uploaded (20)

Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 

Provenance Information in the Web of Data

  • 1. Provenance Information in the Web of Data Olaf Hartig Humboldt-Universität zu Berlin http://olafhartig.de/foaf.rdf#olaf
  • 2. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 2
  • 3. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 3
  • 4. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 4
  • 5. Outline Towards a model of Web data provenance Provenance information in the Web of data today Upcoming tasks Olaf Hartig - Provenance Information in the Web of Data 5
  • 6. Existing Provenance Research Main research areas: (scientific) workflows, DBMSs ● General focus: ● data creation Olaf Hartig - Provenance Information in the Web of Data 6
  • 7. Olaf Hartig - Provenance Information in the Web of Data 7
  • 8. Olaf Hartig - Provenance Information in the Web of Data 8
  • 9. Olaf Hartig - Provenance Information in the Web of Data 9
  • 10. Olaf Hartig - Provenance Information in the Web of Data 10
  • 11. Web data provenance comprises two dimensions: Data Creation • Data Access Olaf Hartig - Provenance Information in the Web of Data 11
  • 12. Basics of the Provenance Model Provenance graph describes provenance of a data item ● Nodes: provenance elements – pieces of provenance info ● Edges: relate provenance elements to each other ● Subgraphs for related data items possible ● Olaf Hartig - Provenance Information in the Web of Data 12
  • 13. Basics of the Provenance Model Provenance model defines: ● Types of provenance elements ● Relationships ● Olaf Hartig - Provenance Information in the Web of Data 13
  • 14. Basics of the Provenance Model Provenance model defines: ● Types of provenance elements ● Relationships ● High level of abstraction (only main element types) ● Olaf Hartig - Provenance Information in the Web of Data 14
  • 15. Basics of the Provenance Model General differentiation: ● Actors Executions Artifacts Olaf Hartig - Provenance Information in the Web of Data 15
  • 16. Data Access Dimension Data Item Data Accessor (Non-Human) contains Information Resource Access Time Data Access Data Providing Service (Non-Human) controls uses Service Provider Data Publisher (Human) Relation to the provided Information Resource Olaf Hartig - Provenance Information in the Web of Data 16
  • 17. Data Access Dimension cont. owns Public Key Signer Relation to the signed Data Integrity Assurance Digital Signature signs Verification Result (Signed) Artifact Olaf Hartig - Provenance Information in the Web of Data 17
  • 18. Data Creation Dimension Provenance Information Source Data Provenance Creation Time Information Creation Guidelines Data Creator Data Creation (Human or Non-human) {complete,disjoint} Data Creating Device (e.g. Sensor) Data Item Data Creating Service (e.g. Software Agent) part of Provenance responsible for responsible for Data Creating Entity Information (e.g. Person, Group, Orga.) (Encompassing) Data Item Relation to Olaf Hartig created Data the - Provenance Information in the Web of Data 18
  • 19. Provenance information in the Web of data today Olaf Hartig - Provenance Information in the Web of Data 19
  • 20. Provenance-related Vocabularies DC – Dublin Core Metadata Terms FOAF – Friend of a Friend SIOC – Semantically-Interlinked Online Communities SWP – Semantic Web Publishing vocabulary ● WOT – Web of Trust schema ● OMV – Ontology Metadata Vocabulary ● PML – Proof Markup Language ● Changeset vocabulary ● Ouzo Provenance Ontology ● Olaf Hartig - Provenance Information in the Web of Data 20
  • 21. Provenance-related Vocabularies DC – Dublin Core Metadata Terms Olaf Hartig - Provenance Information in the Web of Data 21
  • 22. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● dc:contributor ● dc:source ● dc:created ● dc:modified ● dc:publisher ● dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 22
  • 23. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● dc:contributor ● dc:source ● dc:created ● dc:modified ● dc:publisher – “an entity responsible for making the ● resource available” dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 23
  • 24. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● Data Access dc:contributor ● Data Providing Service dc:source ● (Non-Human) controls uses dc:created Publisher Service Provider ● Data (Human) dc:modified ● dc:publisher – “an entity responsible for making the ● resource available” dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 24
  • 25. Main Issues Today Vocabularies: ● Partly unsuitable ● Lack of certain features ● Coverage of provenance model impossible ● Olaf Hartig - Provenance Information in the Web of Data 25
  • 26. Provenance-related Vocabularies DC – Dublin Core Metadata Terms Property Occurrences* dc:creator about 24,284 dc:contributor 476 dc:source about 3,631 dc:created about 82,720 dc:modified about 12,020 dc:provenance 7 *Measured by querying Sindice; Feb. 7, 2009 (by that time Sindice indexed about 48,99 million documents) Olaf Hartig - Provenance Information in the Web of Data 26
  • 27. Main Issues Today Vocabularies: ● Partly unsuitable ● Lack of certain features ● Coverage of provenance model impossible ● General lack of provenance-related metadata ● on the Web of data Olaf Hartig - Provenance Information in the Web of Data 27
  • 28. Possible Reasons Lack of suitable vocabularies ● Lack of usable tools ● Ignorance / lack of sensitization ● Olaf Hartig - Provenance Information in the Web of Data 28
  • 29. Upcoming tasks Olaf Hartig - Provenance Information in the Web of Data 29
  • 30. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Olaf Hartig - Provenance Information in the Web of Data 30
  • 31. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Let's develop usable tools for data providers ● Edit and publish provenance-related metadata ● Automatic generation if possible ● Olaf Hartig - Provenance Information in the Web of Data 31
  • 32. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Let's develop usable tools for data providers ● Edit and publish provenance-related metadata ● Automatic generation if possible ● Let's raise awareness of data providers ● Probably the hardest task ● Maybe voiD can help ● Olaf Hartig - Provenance Information in the Web of Data 32
  • 33. Thank you! Olaf Hartig Humboldt-Universität zu Berlin http://olafhartig.de/foaf.rdf#olaf
  • 34. These slides have been created by Olaf Hartig http://olafhartig.de This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License (http://creativecommons.org/licenses/by-sa/3.0/) Attribution: http://www.flickr.com/photos/adrenalin/3032734/ ● http://www.hasslefreeclipart.com ● http://www.flickr.com/photos/dullhunk/428079229/ ● http://www.flickr.com/photos/darwinbell/1337963794/ ● http://www.flickr.com/photos/alandd/2780700767/ ● http://www.flickr.com/photos/simeon_barkas/2872099696/ ● http://www.flickr.com/photos/robinh00d/122544491/ ● http://www.flickr.com/photos/adrenalin/3032747/ ● Olaf Hartig - Provenance Information in the Web of Data 34