Data Integration,
A Linked Data Approach
Boris Villazón-Terrazas
@boricles
Slides available at: http://www.slideshare.net/...
2
ToC
» Introduction
» Linked Data
» Use Cases
3
Introduction
 Current data systems combine data from a tremendous number
of resources
….
4
Introduction
 We use the term data shape to refer on how data is arranged
and structured.
resource data shape
1. Michae...
5
Introduction
 Data Integration
6
Classic Web
MovieDB
CIA
World
FactBook
Data exposed to
the Web via
HTML, pdf, etc.
© Slide adapted from “5min Introducti...
7
Classic Web
Information from
single pages
can be found via
search engines
Complex queries
over multiple
pages / data
sou...
8
What do we actually want?
Use the Web like a single global database
Move from a Web of documents to a Web of Data
MovieD...
9
Linked Data enables such Web of Data
MovieDB
CIA
World
FactBook
Global Identifier: URI (Uniform Resource Identifier), wh...
11
Exploitation
Streaming resources
12
Enterprise Linked Data
 Linked Data is not necessarily free data
 Enterprises have many disparate data sources and da...
13
Enterprise Linked Data
Office A
Office B
Office C
Products
Company 1
Agency A
Services
Agency B
Company 2
14
GeoLinkedData Ecuador – http://geo.linkeddata.ec
Image taken from http://www.spatialytics.org/projects/geokettle/
RDF G...
15
Publication and exploitation
Parliament
SPARQL
http://purl.org/Ecuador/geo/sparql
Rivers of the province of Guayas
PREF...
16
Publication and exploitation
Parliament
SPARQL
Rivers of the province of Guayas
PREFIX geo: <http://www.opengis.net/ont...
17
iSOCO, tentative example
Enterprise Linked Data
Data Source
iSOCO ICM
Data Soource
iSOCO Lab
Data Source
iSOCO ST
Linke...
18
http://datosenlazados.org/cms/
http://linkeddata.ec/
19
Data Integration,
A Linked Data Approach
Boris Villazón-Terrazas
@boricles
Slides available at: http://www.slideshare.net/...
Upcoming SlideShare
Loading in …5
×

[Databeers] 06/05/2014 - Boris Villazon: “Data Integration - A Linked Data approach”

382 views
326 views

Published on

Published in: Data & Analytics, Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
382
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
2
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

[Databeers] 06/05/2014 - Boris Villazon: “Data Integration - A Linked Data approach”

  1. 1. Data Integration, A Linked Data Approach Boris Villazón-Terrazas @boricles Slides available at: http://www.slideshare.net/boricles/
  2. 2. 2 ToC » Introduction » Linked Data » Use Cases
  3. 3. 3 Introduction  Current data systems combine data from a tremendous number of resources ….
  4. 4. 4 Introduction  We use the term data shape to refer on how data is arranged and structured. resource data shape 1. Michael Hausenblas, Boris Villazon-Terrazas, Richard Cyganiak. Data shapes and data transformations. arXiv preprint arXiv:1211.1565 Fundamental data shapes • tabular • tree • graph
  5. 5. 5 Introduction  Data Integration
  6. 6. 6 Classic Web MovieDB CIA World FactBook Data exposed to the Web via HTML, pdf, etc. © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
  7. 7. 7 Classic Web Information from single pages can be found via search engines Complex queries over multiple pages / data sources? © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
  8. 8. 8 What do we actually want? Use the Web like a single global database Move from a Web of documents to a Web of Data MovieDB CIA World FactBook © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
  9. 9. 9 Linked Data enables such Web of Data MovieDB CIA World FactBook Global Identifier: URI (Uniform Resource Identifier), which is a string of characters used to identify a name or a resource on the Internet. http://cia.../Bolivia http://imdb.../TLLuvia Data Model: RDF (Resource Description Framework), which is a standard model for data interchange on the Web http://.../population http://.../name 8000000 “Even the Rain” Access Mechanism: HTTP Connection: Typed Links http://.../filming_location © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
  10. 10. 11 Exploitation Streaming resources
  11. 11. 12 Enterprise Linked Data  Linked Data is not necessarily free data  Enterprises have many disparate data sources and data silos  Linked Data allows having global identifiers for data that can be accessed using the Web infrastructure and typed links between data possibly from different applications  Graph-based RDF data model allows consuming and merging data without having to do complex structural transformation
  12. 12. 13 Enterprise Linked Data Office A Office B Office C Products Company 1 Agency A Services Agency B Company 2
  13. 13. 14 GeoLinkedData Ecuador – http://geo.linkeddata.ec Image taken from http://www.spatialytics.org/projects/geokettle/ RDF Generator Plugins • GeoKettle - Spatially-enabled version of the generic ETL tool, Kettle (Pentaho Data Integration) - Powerful, metadata-driven spatial ETL tool dedicates to the integration of different geospatial data resources Extract Transform Load
  14. 14. 15 Publication and exploitation Parliament SPARQL http://purl.org/Ecuador/geo/sparql Rivers of the province of Guayas PREFIX geo: <http://www.opengis.net/ont/geosparql#> PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX geof: <http://www.opengis.net/def/function/geosparql/> PREFIX sf: <http://www.opengis.net/ont/sf#> PREFIX units: <http://www.opengis.net/def/uom/OGC/1.0/> SELECT distinct ?r ?label ?Figure ?r2 ?Figure2 WHERE { ?r rdf:type <http://geo.linkeddata.ec/ontology/riosdobles_promsa>. ?r rdfs:label ?label. ?r geo:hasGeometry ?geo. ?geo rdf:type ?geoType . ?geo geo:asWKT ?Figure. ?r2 rdf:type <http://geo.linkeddata.ec/ontology/provincias_promsa>. ?r2 rdfs:label "GUAYAS"@es. ?r2 geo:hasGeometry ?geo2. ?geo2 rdf:type ?geoType2 . ?geo2 geo:asWKT ?Figure2. FILTER (geof:sfIntersects(?Figure2, ?Figure)). }
  15. 15. 16 Publication and exploitation Parliament SPARQL Rivers of the province of Guayas PREFIX geo: <http://www.opengis.net/ont/geosparql#> PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX geof: <http://www.opengis.net/def/function/geosparql/> PREFIX sf: <http://www.opengis.net/ont/sf#> PREFIX units: <http://www.opengis.net/def/uom/OGC/1.0/> SELECT distinct ?r ?label ?Figure ?r2 ?Figure2 WHERE { ?r a <http://geo.linkeddata.ec/ontology/riosdobles_promsa>. ?r rdfs:label ?label. ?r geo:hasGeometry ?geo. ?geo rdf:type ?geoType . ?geo geo:asWKT ?Figure. ?r2 a <http://geo.linkeddata.ec/ontology/provincias_promsa>. ?r2 rdfs:label "GUAYAS"@es. ?r2 geo:hasGeometry ?geo2. ?geo2 rdf:type ?geoType2 . ?geo2 geo:asWKT ?Figure2. FILTER (geof:sfIntersects(?Figure2, ?Figure)). } http://200.0.31.28:8081/map4rdf-0.0.4-OL-SNAPSHOT/#dashboard
  16. 16. 17 iSOCO, tentative example Enterprise Linked Data Data Source iSOCO ICM Data Soource iSOCO Lab Data Source iSOCO ST Linked Data Platform External data sources Added value services
  17. 17. 18 http://datosenlazados.org/cms/ http://linkeddata.ec/
  18. 18. 19
  19. 19. Data Integration, A Linked Data Approach Boris Villazón-Terrazas @boricles Slides available at: http://www.slideshare.net/boricles/

×