• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Self-service Linked Government Data
 

Self-service Linked Government Data

on

  • 2,573 views

A publishing pipeline for Linked Government Data

A publishing pipeline for Linked Government Data

Statistics

Views

Total Views
2,573
Views on SlideShare
2,462
Embed Views
111

Actions

Likes
4
Downloads
28
Comments
0

3 Embeds 111

http://www.scoop.it 109
http://flashattackcrew.blogspot.com 1
http://www.onlydoo.com 1

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Self-service Linked Government Data Self-service Linked Government Data Presentation Transcript

    • Digital Enterprise Research Institute www.deri.ie Self-service Linked Government Data Fadi Maali, Richard Cyganiak, Vassilios Peristeras firstname.lastname@deri.org Copyright 2011 Digital Enterprise Research Institute. All rights reserved. Enabling networked knowledge
    • data.gov.ukDigital Enterprise Research Institute www.deri.ie Enabling networked knowledge 2
    • data.gov.ukDigital Enterprise Research Institute www.deri.ie Enabling networked knowledge 3
    • data.govDigital Enterprise Research Institute www.deri.ie Enabling networked knowledge 4
    • data.govDigital Enterprise Research Institute www.deri.ie 4997 datasets 2590 in CSV 272 in RDF Enabling networked knowledge 5
    • Why Linked Governemnt Data (LGD)?Digital Enterprise Research Institute www.deri.ie  Web accessible  Interlinkable  Decentralised publishing of data  Standardised Enabling networked knowledge 6
    • LGDDigital Enterprise Research Institute www.deri.ie We need government data as Linked Data not just Raw Data ….aha, and of a good quality! Enabling networked knowledge 7
    • LGD is CostlyDigital Enterprise Research Institute www.deri.ie We want governments to provide Linked Data not just Raw Data… and of good quality http://code.google.com/p/google-refine/ Enabling networked knowledge 8
    • Self-service ApproachDigital Enterprise Research Institute www.deri.ie DIY Enabling networked knowledge 9
    • Self-service ApproachDigital Enterprise Research Institute www.deri.ie DIY Provide tools, models and algorithms that enable the self-service approach (a publishing pipeline) Enabling networked knowledge 10
    • Publishing pipeline requirementsDigital Enterprise Research Institute www.deri.ie  Interactive approach  Graphical user interface  Reproducibility and traceability  Flexibility  Decentralisation  Results sharing Enabling networked knowledge 11
    • Publishing pipeline requirementsDigital Enterprise Research Institute www.deri.ie  Interactive approach  Graphical user interface  Reproducibility and traceability  Flexibility  Decentralisation  Results sharing Enabling networked knowledge 12
    • Google RefineDigital Enterprise Research Institute www.deri.ie  Powerful data editing, transformation and enriching capabilities  Import capabilities e.g. JSON, Excel, CSV, TSV, XML, etc.  Persistent undo/redo history  Popular in open data community  Extensible and under active development  Free and open source http://code.google.com/p/google-refine/ Enabling networked knowledge 13
    • DIY Recipe (1000 feet view)Digital Enterprise Research Institute www.deri.ie Publishers provide RDF Tool support to select representation of their datasets of interest and User shares the RDF catalogues put them into RDF data Enabling networked knowledge 14
    • DIY Recipe (100 feet view)Digital Enterprise Research Institute www.deri.ie Publishers provide RDF representation of their catalogues Tool support to select datasets of interest User shares the and put them into RDF RDF data dcat Enabling networked knowledge 15
    • DIY Recipe (100 feet view)Digital Enterprise Research Institute www.deri.ie Tool support to select datasets ofPublishers provideRDF representation of interest and put them into RDF User shares the RDFtheir catalogues data dcat Google Refine + RDF export extension + RDF reconciliation extension Enabling networked knowledge 16
    • DIY Recipe (100 feet view)Digital Enterprise Research Institute www.deri.iePublishers provide Tool support to selectRDF representation of datasets of interest and put User shares the RDF datatheir catalogues them into RDF dcat Google Refine Share RDF data publicly (on + RDF export extension CKAN.net) along with the sufficient + RDF reconciliation extension provenance description Enabling networked knowledge 17
    • A Walk-through (1/5)Digital Enterprise Research Institute www.deri.ie Enabling networked knowledge 18
    • A Walk-through (2/5)Digital Enterprise Research Institute www.deri.ie Enabling networked knowledge 19
    • A Walk-through (3/5)Digital Enterprise Research Institute www.deri.ie Enabling networked knowledge 20
    • A Walk-through (4/5)Digital Enterprise Research Institute www.deri.ie Enabling networked knowledge 21
    • A Walk-through (5/5)Digital Enterprise Research Institute www.deri.ie Enabling networked knowledge 22
    • Data on CKAN.netDigital Enterprise Research Institute www.deri.ie Enabling networked knowledge 23
    • Data Provenance (simplified)Digital Enterprise Research Institute www.deri.ie :dataset dct:source :wasExportedBy :json-history :export-process :csv-ds :operations :usedData Enabling networked knowledge 24
    • DIY Recipe (10 feet view)Digital Enterprise Research Institute www.deri.ie Dcat  An RDF vocabulary to describe government catalogues  Current status: First Public Working Draft by the W3C GLD Working Group http://www.w3.org/TR/vocab-dcat/  Used on data.gov.uk (RDFa) and CKAN-based catalogues “Enabling Interoperability of Government Data Catalogues.” EGOV 2010 Enabling networked knowledge 25
    • DIY Recipe (10 feet view)Digital Enterprise Research Institute www.deri.ie RDF Mapping Enabling networked knowledge 26
    • More on RDF MappingDigital Enterprise Research Institute www.deri.ie  RDF-centric mapping  Multiple tree structure  Expression language for custom expression  Vocabularies/ontologies support Enabling networked knowledge 27
    • DIY Recipe (10 feet view)Digital Enterprise Research Institute www.deri.ie Interlinking Silk LSL RDF Reconcile Crafted RDF Silk Server Google Extension Refine SPARQL endpoint SPARQL endpoint with fulltext extension Enabling networked knowledge 28
    • More on InterlinkingDigital Enterprise Research Institute www.deri.ie  Interlinking as a pre-RDF-creation step  less unnecessary owl:sameAs  Focus on the interface  Semi-automatic process with good user support “Re-using Cool URIs: Entity Reconciliation Against LOD Hubs.” LDOW 2011 Enabling networked knowledge 29
    • DIY Recipe (10 feet view)Digital Enterprise Research Institute www.deri.ie Sharing  Captures the operations applied to the data  Represent them according to Open Provenance Model Vocabulary (OPMV)  Share the data and its provennce on CKAN.net CKAN Extension fro Google Refine http://lab.linkeddata.deri.ie/2011/grefine-ckan/ Enabling networked knowledge 30
    • Case study - Fingal CatalogueDigital Enterprise Research Institute www.deri.ie Number of datasets: 74 (68 available in CSV and 56 in XML) Fingal county Council (41), Central Statistics Top publishers: Office (17), Department of Education and Science (4) Demographics(18), Citizen Participation(18), Top domains: Education(9) http://data.fingal.ie Enabling networked knowledge 31
    • Case study - Fingal CatalogueDigital Enterprise Research Institute www.deri.ie  The catalogue was represented in Dcat  60 datasets were converted to RDF using the publishing pipeline (~300K triples)  Data Cube was used for statistical data  URIs were used consistently and shared among datasets  the data was interlinked  Externally linked to DBpedia Enabling networked knowledge 32
    • Open IssuesDigital Enterprise Research Institute www.deri.ie  Evaluating/Refining the crowd-sourcing aspects of the RDF creation process  RDF Modeling: Can we assist RDF modeling by examining the raw data? Enabling networked knowledge 33
    • Lessons LearnedDigital Enterprise Research Institute www.deri.ie  Interactive approach  Focus on plumbing tools together but don’t enforce a rigid process  Make it easy to adopt best-practices and good recipes Enabling networked knowledge 34