GeoLinkedData

GeoLinked Data (.es) is an open initiative whose aim is to enrich the Web of Data with Spanish geospatial data. This initiative started off by publishing diverse information sources belonging to the Spanish National Geographic Institute. Such sources are made available as RDF (Resource Description Framework) knowledge bases according to the Linked Data principles. With this work, Spain has joined the Linked Data initiative, in which the United Kingdom and Germany are already participating. In this presentation, we provide an overview of the process that has been followed for the development of this initiative.

Usage Rights

CC Attribution-NonCommercial-ShareAlike License

GeoLinkedData Presentation Transcript

  • GeoLinkedData
    Asunción Gómez-Pérez, Alexander de Leon, Victor Saquicela, Luis M. Vilches,
    Oscar Corcho, and Boris Villazón-Terrazas
    Facultad de Informática, Universidad Politécnica de Madrid
    Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid
    http://www.oeg-upm.net
    Phone: 34.91.3366605, Fax: 34.91.3524819
  • ToC
    Motivation
    Related Work
    GeoLinkedData
    Identification of the data sources
    Vocabulary Development
    Generation of the RDF data
    Publication of the RDF data
    Data cleansing
    Linking the RDF data
    Enable effective discovery
    Future Work
  • Motivation
    99.171 % English
    0.019 % Spanish
    Source: Billion Triples dataset at http://km.aifb.kit.edu/projects/btc-2010/
    Thanks to Aidan and Richard
  • Related Work
  • GeoLinkedData
    It is an open initiative whose aim is to enrich the Web of Data with Spanish geospatial data.
    It started off by publishing diverse information sources, such as those of the National Geographic Institute of Spain (IGN).
    http://geo.linkeddata.es
    More recently, sources from the National Statistics Institute (INE) as well.
  • Process for Publishing Linked Data on the Web
    Identification of the data sources
    Vocabulary development
    Generation of the RDF data
    Publication of the RDF data
    Data cleansing
    Linking the RDF data
    Enable effective discovery
  • 1. Identification and selection of the data sources
    IGN
    INE
  • 2. Lightweight Ontology Development
    Following the INSPIRE (INfrastructure for SPatial InfoRmation in Europe) recommendation.
    Reused vocabularies: hydrOntology, SCOVO, FAO Geopolitical, WGS84, GML, and Time
    hydrOntology: hydrographical phenomena (rivers, lakes, etc.)
    SCOVO: scv:Dimension, scv:Item, scv:Dataset
    FAO Geopolitical: names and international code systems for territories and groups
    WGS84 Geo Positioning: an RDF vocabulary
    GML: ontology for OGC Geography Markup Language
    Time: vocabulary for instants, intervals, durations, etc.
  • 3. Generation of the RDF Data
    NOR2O: INE
    ODEMapster: IGN
    Geometry2RDF: geospatial column, IGN
  • 3. Generation of the RDF Data – NOR2O
    NOR2O
    Year
    Industry Production Index
    Province
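The table on this slide is indexed by year and province, which fits the SCOVO terms listed in the vocabulary step (scv:Item, scv:Dimension, scv:Dataset). The following is a minimal sketch of how one cell of such a table might be written as an scv:Item. It is not NOR2O's actual output: the `example.org` base URI, the function name, and the sample value are hypothetical.

```python
SCV = "http://purl.org/NET/scovo#"
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
XSD = "http://www.w3.org/2001/XMLSchema#"
BASE = "http://example.org/ipi/"  # hypothetical base URI, not the project's


def scovo_item(item_id, dataset, year, province, value):
    """Return N-Triples lines describing one statistical cell as an scv:Item."""
    item = f"<{BASE}item/{item_id}>"
    return [
        f"{item} <{RDF}type> <{SCV}Item> .",
        f"{item} <{SCV}dataset> <{BASE}dataset/{dataset}> .",
        # Each axis of the table (year, province) becomes one dimension.
        f"{item} <{SCV}dimension> <{BASE}year/{year}> .",
        f"{item} <{SCV}dimension> <{BASE}province/{province}> .",
        f'{item} <{RDF}value> "{value}"^^<{XSD}decimal> .',
    ]


# Made-up sample cell, for illustration only.
for line in scovo_item("1", "IPI", 2009, "Madrid", "95.3"):
    print(line)
```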
  • 3. Generation of the RDF Data – R2O & ODEMapster
    R2O is an extensible, fully declarative language to describe mappings between relational database schemas and ontologies.
    The ODEMapster processor generates Semantic Web instances from relational instances based on the mapping description expressed in the R2O document.
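The idea of a declarative mapping applied by a generic processor can be sketched in a few lines. This is only an illustration of the principle, not R2O syntax or the ODEMapster implementation: the mapping structure, the `ex:` terms, and the sample row are hypothetical.

```python
def rows_to_triples(rows, mapping):
    """Apply a declarative column-to-property mapping to relational rows."""
    triples = []
    for row in rows:
        # The URI template mints one subject URI per row.
        subject = mapping["uri_template"].format(**row)
        triples.append((subject, "rdf:type", mapping["class"]))
        for column, prop in mapping["properties"].items():
            # NULL columns produce no triple.
            if row.get(column) is not None:
                triples.append((subject, prop, row[column]))
    return triples


# Hypothetical mapping: the processor code above never changes,
# only this description does.
mapping = {
    "class": "ex:River",
    "uri_template": "http://example.org/resource/river/{id}",
    "properties": {"name": "rdfs:label", "length_km": "ex:lengthKm"},
}
rows = [{"id": 1, "name": "Arroyo de ejemplo", "length_km": None}]

for triple in rows_to_triples(rows, mapping):
    print(triple)
```

Keeping the mapping as data rather than code is the design choice this sketch is meant to show: the same processor can serve any schema once a mapping document exists.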
  • 3. Generation of the RDF Data – R2O & ODEMapster
    Creation of the R2O Mappings
  • 3. Generation of the RDF Data – R2O & ODEMapster
    Excerpt of the R2O document
  • 3. Generation of the RDF Data – Geometry2RDF
    Oracle SDO_UTIL package
    SELECT TO_CHAR(SDO_UTIL.TO_GML311GEOMETRY(geometry))
    AS Gml311Geometry
    FROM "BCN200"."BCN200_0301L_RIO" c
    WHERE c.Etiqueta='Arroyo'
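Once the query above returns a GML string, the coordinates still have to be turned into RDF. The sketch below shows one way that step might look; it is not the Geometry2RDF code. It assumes a two-dimensional `gml:LineString` with a `gml:posList`, assumes longitude-then-latitude axis order, and uses a hypothetical `hasPoint` property and subject URI.

```python
import xml.etree.ElementTree as ET

GML_NS = {"gml": "http://www.opengis.net/gml"}
GEO = "http://www.w3.org/2003/01/geo/wgs84_pos#"
HAS_POINT = "http://example.org/ontology/hasPoint"  # hypothetical property


def gml_line_to_points(gml):
    """Extract (x, y) pairs from the first gml:posList in a GML fragment."""
    root = ET.fromstring(gml)
    pos_list = root.find(".//gml:posList", GML_NS)
    if pos_list is None or not pos_list.text:
        raise ValueError("no gml:posList found")
    values = [float(v) for v in pos_list.text.split()]
    # Assumes srsDimension=2: consecutive values are x, y, x, y, ...
    return list(zip(values[0::2], values[1::2]))


def points_to_triples(subject, points):
    """Emit one blank node per point with WGS84 geo:lat / geo:long values."""
    lines = []
    for i, (lon, lat) in enumerate(points):  # assumed axis order: lon, lat
        node = f"_:p{i}"
        lines.append(f"<{subject}> <{HAS_POINT}> {node} .")
        lines.append(f'{node} <{GEO}lat> "{lat}" .')
        lines.append(f'{node} <{GEO}long> "{lon}" .')
    return lines


# Made-up GML fragment in the shape the sketch expects.
SAMPLE = (
    '<gml:LineString xmlns:gml="http://www.opengis.net/gml">'
    '<gml:posList srsDimension="2">-3.70 40.41 -3.69 40.42</gml:posList>'
    "</gml:LineString>"
)
points = gml_line_to_points(SAMPLE)
for line in points_to_triples("http://example.org/resource/river/1", points):
    print(line)
```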
  • 3. Generation of the RDF Data – Geometry2RDF
  • 3. Generation of the RDF Data – Geometry2RDF
  • 3. Generation of the RDF data – RDF graphs
    IGN INE
    So far: 7 RDF named graphs, 1,412,248 triples
    BTN25
    BCN200
    IPI
    ….
    http://geo.linkeddata.es/dataset/IGN/BTN25
    http://geo.linkeddata.es/dataset/IGN/BCN200
    http://geo.linkeddata.es/dataset/INE/IPI
  • 4. Publication of the RDF Data
    SPARQL, Linked Data, HTML
    Including provenance support
    Pubby 0.3
    Virtuoso 6.1.0
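A SPARQL endpoint is queried over HTTP by passing the query text in a `query` parameter, as defined by the SPARQL Protocol. The sketch below only builds such a request URL and sends nothing. The endpoint address is an assumption and is not stated in the slides; the query is the named-graph listing that appears elsewhere in this transcript.

```python
from urllib.parse import urlencode


def sparql_get_url(endpoint, query):
    """Build a SPARQL Protocol GET URL with the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


ENDPOINT = "http://geo.linkeddata.es/sparql"  # assumed endpoint location
QUERY = "SELECT DISTINCT ?graph WHERE { GRAPH ?graph { ?s ?p ?o } }"

url = sparql_get_url(ENDPOINT, QUERY)
print(url)
```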
  • 4. Publication of the RDF Data
  • 4. Publication of the RDF Data - License
    License for GeoLinkedData
    Creative Commons Attribution-ShareAlike 3.0
    GNU Free Documentation License
    Each dataset will have its own specific license, IGN, INE, etc.
  • 5. Data cleansing
    Lack of documentation of the IGN datasets
    Broken links: Spain, IGN resources
    Lack of documentation of the ontology
    Missing English and Spanish labels
    Building a Spanish ontology and importing some concepts from another ontology (in English):
    Importing the English ontology and adding annotations, such as a Spanish label, to its terms.
    Importing the English ontology, creating new concepts and properties with a Spanish name, and mapping those to the English equivalents.
    Re-declaring the terms of the English ontology that we need (using the same URI as in the English ontology), and adding a Spanish label.
    Creating your own classes and properties that model the same things as the English ontology.
    SELECT DISTINCT ?graph WHERE { GRAPH ?graph { ?s ?p ?o } }
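Of the options listed on this slide, re-declaring a term under its existing URI and attaching a Spanish label is the simplest to illustrate. The sketch below emits a single N-Triples statement with an `@es` language tag. The term URI is hypothetical and the function is only an illustration, not part of the project's tooling.

```python
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"


def spanish_label_triple(term_uri, label):
    """Return an N-Triples line attaching a Spanish rdfs:label to a term."""
    # Escape backslashes and double quotes so the literal stays well-formed.
    escaped = label.replace("\\", "\\\\").replace('"', '\\"')
    return f'<{term_uri}> <{RDFS_LABEL}> "{escaped}"@es .'


# Hypothetical English-named term receiving a Spanish label.
print(spanish_label_triple("http://example.org/ontology/River", "Río"))
```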
  • 5. Data cleansing
    URIs in Spanish
    http://geo.linkeddata.es/ontology/Río
    RDF allows UTF-8 characters in URIs
    But Linked Data URIs have to be URLs as well
    So non-US-ASCII characters have to be percent-encoded (%code)
    http://geo.linkeddata.es/ontology/R%C3%ADo
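The percent-encoding step on this slide can be reproduced with the Python standard library. The sketch below encodes the UTF-8 bytes of the local name `Río` and appends it to the ontology namespace shown above; the helper name is hypothetical.

```python
from urllib.parse import quote, unquote


def encode_local_name(base, local_name):
    """Percent-encode the UTF-8 bytes of a local name and append it to base."""
    return base + quote(local_name, safe="")


uri = encode_local_name("http://geo.linkeddata.es/ontology/", "Río")
print(uri)           # http://geo.linkeddata.es/ontology/R%C3%ADo
print(unquote(uri))  # http://geo.linkeddata.es/ontology/Río
```

The character `í` is two bytes in UTF-8 (C3 AD), which is why a single letter becomes `%C3%AD`.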
  • 6. Linking of the RDF Data
    Silk - A Link Discovery Framework for the Web of Data
    First set of links: Provinces of Spain
    86% accuracy
    Geonames
    GeoLinkedData
    DBpedia
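Link discovery between two datasets can be approximated by comparing labels and keeping pairs whose similarity exceeds a threshold. The sketch below shows that general idea with the standard library only. It is not Silk's algorithm or configuration language, and the URIs, labels, and the 0.9 threshold are hypothetical.

```python
import unicodedata
from difflib import SequenceMatcher


def normalize(label):
    """Lowercase and strip accents so 'Álava' and 'Alava' compare equal."""
    decomposed = unicodedata.normalize("NFKD", label.lower())
    return "".join(c for c in decomposed if not unicodedata.combining(c))


def find_links(source, target, threshold=0.9):
    """Return (source_uri, target_uri, score) for each best match >= threshold."""
    links = []
    for s_uri, s_label in source.items():
        scored = [
            (t_uri, SequenceMatcher(None, normalize(s_label), normalize(t_label)).ratio())
            for t_uri, t_label in target.items()
        ]
        if not scored:
            continue
        best_uri, best_score = max(scored, key=lambda pair: pair[1])
        if best_score >= threshold:
            # Each accepted pair would become an owl:sameAs link.
            links.append((s_uri, best_uri, best_score))
    return links


# Hypothetical province resources in two datasets.
source = {
    "http://example.org/a/Madrid": "Madrid",
    "http://example.org/a/Alava": "Álava",
}
target = {
    "http://example.org/b/1": "Madrid",
    "http://example.org/b/2": "Alava",
    "http://example.org/b/3": "Sevilla",
}

for s_uri, t_uri, score in find_links(source, target):
    print(f"<{s_uri}> <http://www.w3.org/2002/07/owl#sameAs> <{t_uri}> .")
```

A label-only comparison like this one produces false matches for places that share a name, which is one reason a reported accuracy below 100% is to be expected from automatic linking.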
  • 7. Enable effective discovery
  • DEMO
    http://geo.linkeddata.es/
  • Provinces
  • Industry Production Index – Capital of Province
  • Rivers
  • Beaches
  • Future Work
    Generate more datasets from other domains, e.g. universities in Spain.
    Identify more links to DBpedia and Geonames.
    Cover more complex geometrical information: not only Point and LineString-like data, but also information represented through polygons.
  • Thank you