GeoLinkedData<br />Asunción Gómez-Pérez, Alexander de Leon, Victor Saquicela, Luis M. Vilches, <br />Oscar Corcho, and Bor...
ToC<br />Motivation<br />Related Work<br />GeoLinkedData<br />Identification of the data sources<br />Vocabulary Developme...
Motivation<br />		99.171 % English<br />		0.019 % Spanish<br />Source:Billion Triples dataset at http://km.aifb.kit.edu/pr...
Related Work<br />
GeoLinkedData<br />It is an open initiative whose aim is to enrich the Web of Data with Spanish geospatial data.<br />It h...
Process for Publishing Linked Data on the Web<br />Identification<br />of the data sources<br />Vocabulary<br />developmen...
1. Identification and selection of the data sources<br />IGN<br />INE<br />
2. Lightweight Ontology Development<br />WGS84 Geo Positioning: an RDF vocabulary<br />scv:Dimension<br />scv:Item<br />sc...
3. Generation of the RDF Data<br />NOR2O<br />INE<br />ODEMapster<br />IGN<br />Geometry2RDF<br />Geospatial<br />column<b...
3. Generation of the RDF Data – NOR2O<br />NOR2O<br />Year<br />Industry Production Index<br />Province<br />
3. Generation of the RDF Data – R2O & ODEMapster<br />R2O is an extensible, fully declarative language to describe mapping...
3. Generation of the RDF Data – R2O & ODEMapster<br />Creation of the R2O Mappings<br />
3. Generation of the RDF Data – R2O & ODEMapster<br />Excerpt of the R2O document<br />
3. Generation of the RDF Data – Geometry2RDF<br />Oracle STO UTIL package <br />SELECT  TO_CHAR(SDO_UTIL.TO_GML311GEOMETRY...
3. Generation of the RDF Data – Geometry2RDF<br />
3. Generation of the RDF Data – Geometry2RDF<br />
3. Generation of the RDF data – RDF graphs<br />			IGN				        INE<br />So far<br />	7 RDF Named Graphs<br />     14122...
4. Publication of the RDF Data<br />SPARQL<br />Linked Data<br />HTML<br />Including Provenance<br />Support<br />Pubby<br...
4. Publication of the RDF Data<br />
4. Publication of the RDF Data - License<br />License for GeoLinkedData<br />Creative Commons Attribution-ShareAlike 3.0 <...
5. Data cleansing<br />Lack of documentation of the IGN datasets<br />Broken links: Spain, IGN resources<br />Lack of docu...
5. Data cleansing<br />URIs in Spanish<br />http://geo.linkeddata.es/ontology/Río<br />RDF allows UTF-8 characters for URI...
6. Linking of the RDF Data<br />Silk - A Link Discovery Framework for the Web of Data<br />First set of links: Provinces o...
7. Enable effective discovery<br />
DEMO<br />http://geo.linkeddata.es/<br />
Provinces<br />
Industry Production Index – Capital of Province<br />
Rivers<br />
Beaches<br />
Future Work<br />Generate more datasets from other domains, e.g. universities in Spain.<br />Identify more links to DBPedi...
Go raibh maith agaibh <br />
GeoLinkedData<br />Asunción Gómez-Pérez,  Alexander de Leon, Victor Saquicela, Luis M. Vilches, <br />Oscar Corcho, and Bo...
Upcoming SlideShare
Loading in...5
×

GeoLinkedData

1,494

Published on

GeoLinked Data (.es) is an open initiative whose aim is to enrich the Web of Data with Spanish geospatial data. This initiative started off by publishing diverse information sources belonging to the Spanish National Geographic Institute. Such sources are made available as RDF (Resource Description Framework) knowledge bases according to the Linked Data principles. With this work, Spain has joined the Linked Data initiative, in which the United Kingdom and Germany are already participating. In this presentation, we provide an overview of the process that has been followed for the development of this initiative.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
1,494
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
47
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

GeoLinkedData

  1. 1. GeoLinkedData<br />Asunción Gómez-Pérez, Alexander de Leon, Victor Saquicela, Luis M. Vilches, <br />Oscar Corcho, and Boris Villazón-Terrazas<br />Facultad de Informática, Universidad Politécnica de Madrid<br />Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid<br />http://www.oeg-upm.net<br />Phone: 34.91.3366605, Fax: 34.91.3524819<br />
  2. 2. ToC<br />Motivation<br />Related Work<br />GeoLinkedData<br />Identification of the data sources<br />Vocabulary Development<br />Generation of the RDF data<br />Publication of the RDF data<br />Data cleansing<br />Linking the RDF data<br />Enable effective discovery<br />Future Work<br />
  3. 3. Motivation<br /> 99.171 % English<br /> 0.019 % Spanish<br />Source:Billion Triples dataset at http://km.aifb.kit.edu/projects/btc-2010/<br />Thanks to Aidan and Richard<br />
  4. 4. Related Work<br />
  5. 5. GeoLinkedData<br />It is an open initiative whose aim is to enrich the Web of Data with Spanish geospatial data.<br />It has started off by publishing diverse information sources, such as National Geographic Institute of Spain (IGN).<br />http://geo.linkeddata.es<br />Recently, National Statistics Institute (INE)<br />
  6. 6. Process for Publishing Linked Data on the Web<br />Identification<br />of the data sources<br />Vocabulary<br />development<br />Generation<br />of the RDF Data<br />Publication<br />of the RDF data <br />Data cleansing<br />Linking <br />the RDF data<br />Enable effective <br />discovery<br />
  7. 7. 1. Identification and selection of the data sources<br />IGN<br />INE<br />
  8. 8. 2. Lightweight Ontology Development<br />WGS84 Geo Positioning: an RDF vocabulary<br />scv:Dimension<br />scv:Item<br />scv:Dataset<br />hydrographical phenomena (rivers, lakes, etc.)<br />Vocabulary for instants, intervals, durations, etc.<br />Names and international code systems for territories and groups<br />Ontology for OGC Geography Markup Language <br />Following the INSPIRE <br />(INfrastructure for SPatial InfoRmation in Europe) recommendation.<br />hydrOntology,SCOVO, FAO Geopolitcal, WGS84, GML, and Time<br />
  9. 9. 3. Generation of the RDF Data<br />NOR2O<br />INE<br />ODEMapster<br />IGN<br />Geometry2RDF<br />Geospatial<br />column<br />IGN<br />
  10. 10. 3. Generation of the RDF Data – NOR2O<br />NOR2O<br />Year<br />Industry Production Index<br />Province<br />
  11. 11. 3. Generation of the RDF Data – R2O & ODEMapster<br />R2O is an extensible, fully declarative language to describe mappings between relational database schemas and ontologies.<br />The ODEMapster processor generates Semantic Web instances from relational instances based on the mapping description expressed in the R2O document <br />
  12. 12. 3. Generation of the RDF Data – R2O & ODEMapster<br />Creation of the R2O Mappings<br />
  13. 13. 3. Generation of the RDF Data – R2O & ODEMapster<br />Excerpt of the R2O document<br />
  14. 14. 3. Generation of the RDF Data – Geometry2RDF<br />Oracle STO UTIL package <br />SELECT TO_CHAR(SDO_UTIL.TO_GML311GEOMETRY(geometry)) <br /> AS Gml311Geometry<br />FROM "BCN200"."BCN200_0301L_RIO" c<br />WHERE c.Etiqueta='Arroyo'<br />
  15. 15. 3. Generation of the RDF Data – Geometry2RDF<br />
  16. 16. 3. Generation of the RDF Data – Geometry2RDF<br />
  17. 17. 3. Generation of the RDF data – RDF graphs<br /> IGN INE<br />So far<br /> 7 RDF Named Graphs<br /> 1412248 triples<br />BTN25<br />BCN200<br />IPI<br />….<br />http://geo.linkeddata.es/dataset/IGN/BTN25<br />http://geo.linkeddata.es/dataset/IGN/BCN200<br />http://geo.linkeddata.es/dataset/INE/IPI<br />
  18. 18. 4. Publication of the RDF Data<br />SPARQL<br />Linked Data<br />HTML<br />Including Provenance<br />Support<br />Pubby<br />Pubby 0.3<br />Virtuoso 6.1.0<br />
  19. 19. 4. Publication of the RDF Data<br />
  20. 20. 4. Publication of the RDF Data - License<br />License for GeoLinkedData<br />Creative Commons Attribution-ShareAlike 3.0 <br />GNU Free Documentation License<br />Each dataset will have its own specific license, IGN, INE, etc.<br />
  21. 21. 5. Data cleansing<br />Lack of documentation of the IGN datasets<br />Broken links: Spain, IGN resources<br />Lack of documentation of the ontology<br />Missing english and spanish labels<br />Building a spanish ontology and importing some concepts of other ontology (in English):<br />Importing the English ontology. Add annotations like a Spanish label to them.<br /> Importing the English ontology, creating new concepts and properties with a Spanish name and map those to the English equivalents.<br /> Re-declaring the terms of the English ontology that we need (using the same URI as in the English ontology), and adding a Spanish label.<br /> Creating your own class and properties that model the same things as the English ontology. <br />select DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}<br />select DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}<br />
  22. 22. 5. Data cleansing<br />URIs in Spanish<br />http://geo.linkeddata.es/ontology/Río<br />RDF allows UTF-8 characters for URIs<br />But, Linked Data URIs has to be URLs as well<br />So, non ASCII-US characters have to be %code<br />http://geo.linkeddata.es/ontology/R%C3%ADo<br />select DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}<br />select DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}<br />
  23. 23. 6. Linking of the RDF Data<br />Silk - A Link Discovery Framework for the Web of Data<br />First set of links: Provinces of Spain<br />86% accuracy<br />Geonames<br />GeoLinkedData<br />DBPedia<br />
  24. 24. 7. Enable effective discovery<br />
  25. 25. DEMO<br />http://geo.linkeddata.es/<br />
  26. 26. Provinces<br />
  27. 27. Industry Production Index – Capital of Province<br />
  28. 28. Rivers<br />
  29. 29. Beaches<br />
  30. 30. Future Work<br />Generate more datasets from other domains, e.g. universities in Spain.<br />Identify more links to DBPedia and Geonames.<br />Cover complex geometrical information, i.e. not only Point and LineString-like data; we will also treat information representation through polygons.<br />
  31. 31. Go raibh maith agaibh <br />
  32. 32. GeoLinkedData<br />Asunción Gómez-Pérez, Alexander de Leon, Victor Saquicela, Luis M. Vilches, <br />Oscar Corcho, and Boris Villazón-Terrazas<br />Facultad de Informática, Universidad Politécnica de Madrid<br />Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid<br />http://www.oeg-upm.net<br />Phone: 34.91.3366605, Fax: 34.91.3524819<br />
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×