SlideShare a Scribd company logo
1 of 34
Download to read offline
Provenance Information
     in the Web of Data




         Olaf Hartig
         Humboldt-Universität zu Berlin
         http://olafhartig.de/foaf.rdf#olaf
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         2
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         3
Provenance of a data item: information about the history
 ●




Olaf Hartig - Provenance Information in the Web of Data         4
Outline


                                      Towards a model of
                                      Web data provenance

                                      Provenance information
                                      in the Web of data today

                                      Upcoming
                                      tasks
Olaf Hartig - Provenance Information in the Web of Data          5
Existing Provenance Research
     Main research areas: (scientific) workflows, DBMSs
 ●




     General focus:
 ●

           data creation




Olaf Hartig - Provenance Information in the Web of Data   6
Olaf Hartig - Provenance Information in the Web of Data   7
Olaf Hartig - Provenance Information in the Web of Data   8
Olaf Hartig - Provenance Information in the Web of Data   9
Olaf Hartig - Provenance Information in the Web of Data   10
Web data provenance
                            comprises
                         two dimensions:
        Data Creation • Data Access

Olaf Hartig - Provenance Information in the Web of Data   11
Basics of the Provenance Model
     Provenance graph describes provenance of a data item
 ●


         Nodes: provenance elements – pieces of provenance info
     ●


         Edges: relate provenance elements to each other
     ●


         Subgraphs for related data items possible
     ●




Olaf Hartig - Provenance Information in the Web of Data       12
Basics of the Provenance Model
     Provenance model defines:
 ●


         Types of provenance elements
     ●


         Relationships
     ●




Olaf Hartig - Provenance Information in the Web of Data   13
Basics of the Provenance Model
     Provenance model defines:
 ●


         Types of provenance elements
     ●


         Relationships
     ●



     High level of abstraction (only main element types)
 ●




Olaf Hartig - Provenance Information in the Web of Data    14
Basics of the Provenance Model



     General differentiation:
 ●


                                                          Actors
                                                          Executions
                                                          Artifacts




Olaf Hartig - Provenance Information in the Web of Data                15
Data Access Dimension
                                                                                               Data Item
                 Data Accessor
                  (Non-Human)
                                                                                                   contains
                                                                        Information Resource

                  Access Time
                                                          Data Access


                        Data Providing Service
                              (Non-Human)
                                                            controls
          uses
                                                Service Provider
       Data Publisher
           (Human)


              Relation to
       the provided Information
              Resource



Olaf Hartig - Provenance Information in the Web of Data                                                       16
Data Access Dimension cont.


                                                  owns
                                                                         Public Key
               Signer


         Relation to
       the signed Data                             Integrity Assurance
                                                                                      Digital Signature




                                                                                                signs
                             Verification Result

                                                                         (Signed)
                                                                         Artifact




Olaf Hartig - Provenance Information in the Web of Data                                                   17
Data Creation Dimension
                                                                        Provenance
                                                                        Information

                                                                              Source Data
                                                                                                         Provenance
                                              Creation Time
                                                                                                         Information

                                                                                            Creation Guidelines
                    Data Creator
                                                             Data Creation
              (Human or Non-human)

   {complete,disjoint}


                                                  Data Creating Device
                                                        (e.g. Sensor)                        Data Item

                          Data Creating Service
                            (e.g. Software Agent)                                       part of
                                                                                                    Provenance
                                 responsible for responsible for
   Data Creating Entity                                                                             Information
 (e.g. Person, Group, Orga.)
                                                                                      (Encompassing)
                                                                                        Data Item
          Relation to
Olaf Hartig created Data
      the - Provenance Information in the Web of Data                                                                  18
Provenance information
 in the Web of data today


Olaf Hartig - Provenance Information in the Web of Data   19
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
              FOAF – Friend of a Friend
              SIOC – Semantically-Interlinked Online Communities
              SWP – Semantic Web Publishing vocabulary
          ●



              WOT – Web of Trust schema
          ●



              OMV – Ontology Metadata Vocabulary
          ●



              PML – Proof Markup Language
          ●



              Changeset vocabulary
          ●



              Ouzo Provenance Ontology
          ●




Olaf Hartig - Provenance Information in the Web of Data            20
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms




Olaf Hartig - Provenance Information in the Web of Data   21
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●




                 dc:contributor
             ●




                 dc:source
             ●




                 dc:created
             ●




                 dc:modified
             ●




                 dc:publisher
             ●




                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data   22
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●




                 dc:contributor
             ●




                 dc:source
             ●




                 dc:created
             ●




                 dc:modified
             ●




                 dc:publisher – “an entity responsible for making the
             ●

                                 resource available”
                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data                 23
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms
                 dc:creator
             ●



                                                                              Data Access
                 dc:contributor
             ●



                                                     Data Providing Service
                 dc:source
             ●
                                                          (Non-Human)
                                                                                controls
                                       uses

                 dc:created Publisher                                   Service Provider
             ●
                          Data
                                        (Human)

                 dc:modified
             ●




                 dc:publisher – “an entity responsible for making the
             ●

                                 resource available”
                 dc:provenance
             ●



Olaf Hartig - Provenance Information in the Web of Data                                     24
Main Issues Today



                   Vocabularies:
               ●


                       Partly unsuitable
                   ●


                       Lack of certain features
                   ●


                       Coverage of provenance model impossible
                   ●




Olaf Hartig - Provenance Information in the Web of Data          25
Provenance-related Vocabularies
              DC – Dublin Core Metadata Terms

                            Property                      Occurrences*
                  dc:creator                                 about 24,284
                  dc:contributor                                     476
                  dc:source                                   about 3,631
                  dc:created                                 about 82,720
                  dc:modified                                about 12,020
                  dc:provenance                                          7

            *Measured by querying Sindice; Feb. 7, 2009 (by that
            time Sindice indexed about 48,99 million documents)
Olaf Hartig - Provenance Information in the Web of Data                      26
Main Issues Today



                   Vocabularies:
               ●


                       Partly unsuitable
                   ●


                       Lack of certain features
                   ●


                       Coverage of provenance model impossible
                   ●



                   General lack of provenance-related metadata
               ●

                   on the Web of data




Olaf Hartig - Provenance Information in the Web of Data          27
Possible Reasons



                             Lack of suitable vocabularies
                         ●


                             Lack of usable tools
                         ●


                             Ignorance / lack of sensitization
                         ●




Olaf Hartig - Provenance Information in the Web of Data          28
Upcoming
                                                          tasks
Olaf Hartig - Provenance Information in the Web of Data          29
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements




Olaf Hartig - Provenance Information in the Web of Data          30
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements

     Let's develop usable tools for data providers
 ●


         Edit and publish provenance-related metadata
     ●


         Automatic generation if possible
     ●




Olaf Hartig - Provenance Information in the Web of Data          31
Address the Issues
     Let's develop a vocabulary for Web data provenance
 ●


         Proposal: refine the presented provenance model
     ●


         Integrate existing vocabularies for specific types of
     ●

         provenance elements

     Let's develop usable tools for data providers
 ●


         Edit and publish provenance-related metadata
     ●


         Automatic generation if possible
     ●




     Let's raise awareness of data providers
 ●


         Probably the hardest task
     ●


         Maybe voiD can help
     ●



Olaf Hartig - Provenance Information in the Web of Data          32
Thank you!

Olaf Hartig
Humboldt-Universität zu Berlin
http://olafhartig.de/foaf.rdf#olaf
These slides have been created by
                                             Olaf Hartig
                                                  http://olafhartig.de

                              This work is licensed under a
                Creative Commons Attribution-Share Alike 3.0 License
                    (http://creativecommons.org/licenses/by-sa/3.0/)


                              Attribution:
                                  http://www.flickr.com/photos/adrenalin/3032734/
                              ●

                                  http://www.hasslefreeclipart.com
                              ●

                                  http://www.flickr.com/photos/dullhunk/428079229/
                              ●

                                  http://www.flickr.com/photos/darwinbell/1337963794/
                              ●


                                  http://www.flickr.com/photos/alandd/2780700767/
                              ●


                                  http://www.flickr.com/photos/simeon_barkas/2872099696/
                              ●


                                  http://www.flickr.com/photos/robinh00d/122544491/
                              ●


                                  http://www.flickr.com/photos/adrenalin/3032747/
                              ●




Olaf Hartig - Provenance Information in the Web of Data                                    34

More Related Content

Similar to Provenance Information in the Web of Data

Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)
Apigee | Google Cloud
 
The open semantic enterprise enterprise data meets web data
The open semantic enterprise   enterprise data meets web dataThe open semantic enterprise   enterprise data meets web data
The open semantic enterprise enterprise data meets web data
Georg Guentner
 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial Data
Edward Curry
 
Tech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataTech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big Data
Steve Watt
 
Provenance and Trust
Provenance and TrustProvenance and Trust
Provenance and Trust
Jose Manuel Gómez-Pérez
 

Similar to Provenance Information in the Web of Data (15)

Omitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddataOmitola w3 c_govtlinkeddata
Omitola w3 c_govtlinkeddata
 
Open Data
Open DataOpen Data
Open Data
 
Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)Big Data: Beyond the "Bigness" and the Technology (webcast)
Big Data: Beyond the "Bigness" and the Technology (webcast)
 
Query Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataQuery Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked Data
 
The open semantic enterprise enterprise data meets web data
The open semantic enterprise   enterprise data meets web dataThe open semantic enterprise   enterprise data meets web data
The open semantic enterprise enterprise data meets web data
 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial Data
 
Creating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingCreating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information Sharing
 
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYSEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
 
Alitora Innovation Networks
Alitora Innovation NetworksAlitora Innovation Networks
Alitora Innovation Networks
 
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
Hadoop World 2011: Completing the Big Data Picture Understanding Why and Not ...
 
A fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP SpainA fresh new look into Information Gathering - OWASP Spain
A fresh new look into Information Gathering - OWASP Spain
 
Tech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataTech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big Data
 
Steve Watt Presentation
Steve Watt PresentationSteve Watt Presentation
Steve Watt Presentation
 
Provenance and Trust
Provenance and TrustProvenance and Trust
Provenance and Trust
 
sones company presentation
sones company presentationsones company presentation
sones company presentation
 

More from Olaf Hartig

Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Olaf Hartig
 
The Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataThe Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked Data
Olaf Hartig
 
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Olaf Hartig
 

More from Olaf Hartig (20)

LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked Data
 
A Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the WebA Context-Based Semantics for SPARQL Property Paths over the Web
A Context-Based Semantics for SPARQL Property Paths over the Web
 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
 
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
Tutorial "Linked Data Query Processing" Part 5 "Query Planning and Optimizati...
 
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
 
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
Tutorial "Linked Data Query Processing" Part 3 "Source Selection Strategies" ...
 
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
Tutorial "Linked Data Query Processing" Part 2 "Theoretical Foundations" (WWW...
 
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
Tutorial "Linked Data Query Processing" Part 1 "Introduction" (WWW 2013 Ed.)
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
 
An Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and QueryAn Overview on PROV-AQ: Provenance Access and Query
An Overview on PROV-AQ: Provenance Access and Query
 
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
(An Overview on) Linked Data Management and SPARQL Querying (ISSLOD2011)
 
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversa...
 
The Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataThe Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked Data
 
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked DataHow Caching Improves Efficiency and Result Completeness for Querying Linked Data
How Caching Improves Efficiency and Result Completeness for Querying Linked Data
 
A Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked DataA Main Memory Index Structure to Query Linked Data
A Main Memory Index Structure to Query Linked Data
 
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
Towards a Data-Centric Notion of Trust in the Semantic Web (A Position Statem...
 
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)
 
Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)Querying Linked Data with SPARQL (2010)
Querying Linked Data with SPARQL (2010)
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 

Provenance Information in the Web of Data

  • 1. Provenance Information in the Web of Data Olaf Hartig Humboldt-Universität zu Berlin http://olafhartig.de/foaf.rdf#olaf
  • 2. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 2
  • 3. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 3
  • 4. Provenance of a data item: information about the history ● Olaf Hartig - Provenance Information in the Web of Data 4
  • 5. Outline Towards a model of Web data provenance Provenance information in the Web of data today Upcoming tasks Olaf Hartig - Provenance Information in the Web of Data 5
  • 6. Existing Provenance Research Main research areas: (scientific) workflows, DBMSs ● General focus: ● data creation Olaf Hartig - Provenance Information in the Web of Data 6
  • 7. Olaf Hartig - Provenance Information in the Web of Data 7
  • 8. Olaf Hartig - Provenance Information in the Web of Data 8
  • 9. Olaf Hartig - Provenance Information in the Web of Data 9
  • 10. Olaf Hartig - Provenance Information in the Web of Data 10
  • 11. Web data provenance comprises two dimensions: Data Creation • Data Access Olaf Hartig - Provenance Information in the Web of Data 11
  • 12. Basics of the Provenance Model Provenance graph describes provenance of a data item ● Nodes: provenance elements – pieces of provenance info ● Edges: relate provenance elements to each other ● Subgraphs for related data items possible ● Olaf Hartig - Provenance Information in the Web of Data 12
  • 13. Basics of the Provenance Model Provenance model defines: ● Types of provenance elements ● Relationships ● Olaf Hartig - Provenance Information in the Web of Data 13
  • 14. Basics of the Provenance Model Provenance model defines: ● Types of provenance elements ● Relationships ● High level of abstraction (only main element types) ● Olaf Hartig - Provenance Information in the Web of Data 14
  • 15. Basics of the Provenance Model General differentiation: ● Actors Executions Artifacts Olaf Hartig - Provenance Information in the Web of Data 15
  • 16. Data Access Dimension Data Item Data Accessor (Non-Human) contains Information Resource Access Time Data Access Data Providing Service (Non-Human) controls uses Service Provider Data Publisher (Human) Relation to the provided Information Resource Olaf Hartig - Provenance Information in the Web of Data 16
  • 17. Data Access Dimension cont. owns Public Key Signer Relation to the signed Data Integrity Assurance Digital Signature signs Verification Result (Signed) Artifact Olaf Hartig - Provenance Information in the Web of Data 17
  • 18. Data Creation Dimension Provenance Information Source Data Provenance Creation Time Information Creation Guidelines Data Creator Data Creation (Human or Non-human) {complete,disjoint} Data Creating Device (e.g. Sensor) Data Item Data Creating Service (e.g. Software Agent) part of Provenance responsible for responsible for Data Creating Entity Information (e.g. Person, Group, Orga.) (Encompassing) Data Item Relation to Olaf Hartig created Data the - Provenance Information in the Web of Data 18
  • 19. Provenance information in the Web of data today Olaf Hartig - Provenance Information in the Web of Data 19
  • 20. Provenance-related Vocabularies DC – Dublin Core Metadata Terms FOAF – Friend of a Friend SIOC – Semantically-Interlinked Online Communities SWP – Semantic Web Publishing vocabulary ● WOT – Web of Trust schema ● OMV – Ontology Metadata Vocabulary ● PML – Proof Markup Language ● Changeset vocabulary ● Ouzo Provenance Ontology ● Olaf Hartig - Provenance Information in the Web of Data 20
  • 21. Provenance-related Vocabularies DC – Dublin Core Metadata Terms Olaf Hartig - Provenance Information in the Web of Data 21
  • 22. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● dc:contributor ● dc:source ● dc:created ● dc:modified ● dc:publisher ● dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 22
  • 23. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● dc:contributor ● dc:source ● dc:created ● dc:modified ● dc:publisher – “an entity responsible for making the ● resource available” dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 23
  • 24. Provenance-related Vocabularies DC – Dublin Core Metadata Terms dc:creator ● Data Access dc:contributor ● Data Providing Service dc:source ● (Non-Human) controls uses dc:created Publisher Service Provider ● Data (Human) dc:modified ● dc:publisher – “an entity responsible for making the ● resource available” dc:provenance ● Olaf Hartig - Provenance Information in the Web of Data 24
  • 25. Main Issues Today Vocabularies: ● Partly unsuitable ● Lack of certain features ● Coverage of provenance model impossible ● Olaf Hartig - Provenance Information in the Web of Data 25
  • 26. Provenance-related Vocabularies DC – Dublin Core Metadata Terms Property Occurrences* dc:creator about 24,284 dc:contributor 476 dc:source about 3,631 dc:created about 82,720 dc:modified about 12,020 dc:provenance 7 *Measured by querying Sindice; Feb. 7, 2009 (by that time Sindice indexed about 48,99 million documents) Olaf Hartig - Provenance Information in the Web of Data 26
  • 27. Main Issues Today Vocabularies: ● Partly unsuitable ● Lack of certain features ● Coverage of provenance model impossible ● General lack of provenance-related metadata ● on the Web of data Olaf Hartig - Provenance Information in the Web of Data 27
  • 28. Possible Reasons Lack of suitable vocabularies ● Lack of usable tools ● Ignorance / lack of sensitization ● Olaf Hartig - Provenance Information in the Web of Data 28
  • 29. Upcoming tasks Olaf Hartig - Provenance Information in the Web of Data 29
  • 30. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Olaf Hartig - Provenance Information in the Web of Data 30
  • 31. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Let's develop usable tools for data providers ● Edit and publish provenance-related metadata ● Automatic generation if possible ● Olaf Hartig - Provenance Information in the Web of Data 31
  • 32. Address the Issues Let's develop a vocabulary for Web data provenance ● Proposal: refine the presented provenance model ● Integrate existing vocabularies for specific types of ● provenance elements Let's develop usable tools for data providers ● Edit and publish provenance-related metadata ● Automatic generation if possible ● Let's raise awareness of data providers ● Probably the hardest task ● Maybe voiD can help ● Olaf Hartig - Provenance Information in the Web of Data 32
  • 33. Thank you! Olaf Hartig Humboldt-Universität zu Berlin http://olafhartig.de/foaf.rdf#olaf
  • 34. These slides have been created by Olaf Hartig http://olafhartig.de This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License (http://creativecommons.org/licenses/by-sa/3.0/) Attribution: http://www.flickr.com/photos/adrenalin/3032734/ ● http://www.hasslefreeclipart.com ● http://www.flickr.com/photos/dullhunk/428079229/ ● http://www.flickr.com/photos/darwinbell/1337963794/ ● http://www.flickr.com/photos/alandd/2780700767/ ● http://www.flickr.com/photos/simeon_barkas/2872099696/ ● http://www.flickr.com/photos/robinh00d/122544491/ ● http://www.flickr.com/photos/adrenalin/3032747/ ● Olaf Hartig - Provenance Information in the Web of Data 34