2010 09 opm_tutorial_01-jun-usecase-datagovuk

3,027 views

Published on

Provenance use cases from the data.gov.uk project. Part of the OPM tutorial for FIS'2010@Berlin.

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
3,027
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
4
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • Tooling: Converting & generating; Linked data API; Syndication & merging; Validation; Predicate-based services; Data enrichment services; Visualisations
  • Stylesheet could import another stylesheet One stylesheet could be included in another stylesheet
  • Stylesheet could import another stylesheet One stylesheet could be included in another stylesheet
  • 2010 09 opm_tutorial_01-jun-usecase-datagovuk

    1. 1. Open Provenance Model Tutorial Session 4: Use cases from data.gov.uk Jun Zhao University of Oxford [email_address]
    2. 2. Outline <ul><li>Background about data.gov.uk </li></ul><ul><li>The use cases </li></ul><ul><ul><li>XML serialization </li></ul></ul><ul><ul><li>Data transformation on the fly </li></ul></ul><ul><ul><li>Complex and nested processes </li></ul></ul>
    3. 3. data.gov.uk <ul><li>Linking UK government data </li></ul><ul><li>Aims: </li></ul><ul><ul><li>Provide a set of best practices for government agencies </li></ul></ul><ul><ul><li>Provide the minimum set of tooling and specification to facilitate the publication of data </li></ul></ul><ul><ul><li>Encourage “responsible” data publishing </li></ul></ul>
    4. 4. XML -> RDF XSLT Processor XSLT Parameter Binding XSLT Stylesheet XSLT Template input output RDF File Who, when, which version, how Contributed by Jeni Tennison
    5. 5. XSLT Processor input output RDF File XSLT Parameter Binding XSLT Stylesheet XSLT Template Downloaded from; Unzipped from, etc Made accessible Who, when, which version, how Contributed by Jeni Tennison
    6. 6. On-the-fly Transformation Data transformation wrapper Who, when, which version, how Contributed by Stuart Williams http://mytransportatio.db/j10
    7. 7. Complex Data Creation Pipeline GATE Pipeline GateXMLRegressionTransformation GateXMLRdfaTransformation RdfaRdfXmlTransformation Courtesy of Paul Appleby from TSO (Data Enrichment Service)
    8. 8. Complex Data Creation Pipeline GATE Pipeline GateXMLRegressionTransformation GateXMLRdfaTransformation RdfaRdfXmlTransformation Document Reset PR ANNIE English Tokeniser ANNIE English Splitter ANNIE POS Tagger Data.gov.uk Morphological Analyzer Data.gov.uk Flexible Roof Gazetteer Data.gov.uk Generic Gazeteer GATE Noun Phrase Chunker Data.gov.uk Generic Transducer TSO Coreference Courtesy of Paul Appleby from TSO (Data Enrichment Service)
    9. 9. wasGeneratedBy wasGeneratedBy wasGeneratedBy hasParentProcess iterationOfProcess Level 1: Provenance of execution at higher level Level 0: Provenance of execution at detailed level Services used by executions Artifacts followed wasDerivedFrom A data collection wasTriggeredBy wasTriggeredBy accessedService
    10. 10. Non-digital Data Objects <ul><li>Organizations </li></ul><ul><ul><li>Organizational structure changes over time </li></ul></ul><ul><ul><li>Origin organization, resulting Organization </li></ul></ul><ul><li>Boundary </li></ul><ul><li>Legislation </li></ul>An organization ontology: http://www.epimorphics.com/public/vocabulary/org.html
    11. 11. The Challenges <ul><li>Data of different representations, of physical forms, of granularity </li></ul><ul><li>Not tooling support </li></ul><ul><li>Provenance across different types of systems </li></ul><ul><ul><li>Identification </li></ul></ul><ul><ul><li>Different terminologies </li></ul></ul>
    12. 12. The Gaps <ul><li>A vocabulary being able to describe provenance of all types of data, from different systems </li></ul><ul><li>A vocabulary still providing enough terms to describe provenance accurately </li></ul>
    13. 13. This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License (http://creativecommons.org/licenses/by-sa/3.0/)

    ×