GeoLinkedDataAsunción Gómez-Pérez, Alexander de Leon, Victor Saquicela, Luis M. Vilches, Oscar Corcho, and Boris Villazón-TerrazasFacultad de Informática, Universidad Politécnica de MadridCampus de Montegancedo sn, 28660 Boadilla del Monte, Madridhttp://www.oeg-upm.netPhone: 34.91.3366605, Fax: 34.91.3524819
ToCMotivationRelated WorkGeoLinkedDataIdentification of the data sourcesVocabulary DevelopmentGeneration of the RDF dataPublication of the RDF dataData cleansingLinking the RDF dataEnable effective discoveryFuture Work
Motivation		99.171 % English		0.019 % SpanishSource:Billion Triples dataset at http://km.aifb.kit.edu/projects/btc-2010/Thanks to Aidan and Richard
Related Work
GeoLinkedDataIt is an open initiative whose aim is to enrich the Web of Data with Spanish geospatial data.It has started off by publishing diverse information sources, such as National Geographic Institute of Spain (IGN).http://geo.linkeddata.esRecently, National Statistics Institute (INE)
Process for Publishing Linked Data on the WebIdentificationof the data sourcesVocabularydevelopmentGenerationof the RDF DataPublicationof the RDF data Data cleansingLinking the RDF dataEnable effective discovery
1. Identification and selection of the data sourcesIGNINE
2. Lightweight Ontology DevelopmentWGS84 Geo Positioning: an RDF vocabularyscv:Dimensionscv:Itemscv:Datasethydrographical phenomena (rivers, lakes, etc.)Vocabulary for instants, intervals, durations, etc.Names and international code systems for territories and groupsOntology for OGC Geography Markup Language Following the INSPIRE (INfrastructure for SPatial InfoRmation in Europe) recommendation.hydrOntology,SCOVO, FAO Geopolitcal, WGS84, GML, and Time
3. Generation of the RDF DataNOR2OINEODEMapsterIGNGeometry2RDFGeospatialcolumnIGN
3. Generation of the RDF Data – NOR2ONOR2OYearIndustry Production IndexProvince
3. Generation of the RDF Data – R2O & ODEMapsterR2O is an extensible, fully declarative language to describe mappings between relational database schemas and ontologies.The ODEMapster processor generates Semantic Web instances from relational instances based on the mapping description expressed in the R2O document
3. Generation of the RDF Data – R2O & ODEMapsterCreation of the R2O Mappings
3. Generation of the RDF Data – R2O & ODEMapsterExcerpt of the R2O document
3. Generation of the RDF Data – Geometry2RDFOracle STO UTIL package SELECT  TO_CHAR(SDO_UTIL.TO_GML311GEOMETRY(geometry)) 	AS Gml311GeometryFROM "BCN200"."BCN200_0301L_RIO" cWHERE   c.Etiqueta='Arroyo'
3. Generation of the RDF Data – Geometry2RDF
3. Generation of the RDF Data – Geometry2RDF
3. Generation of the RDF data – RDF graphs			IGN				        INESo far	7 RDF Named Graphs     1412248 triplesBTN25BCN200IPI….http://geo.linkeddata.es/dataset/IGN/BTN25http://geo.linkeddata.es/dataset/IGN/BCN200http://geo.linkeddata.es/dataset/INE/IPI
4. Publication of the RDF DataSPARQLLinked DataHTMLIncluding ProvenanceSupportPubbyPubby 0.3Virtuoso 6.1.0
4. Publication of the RDF Data
4. Publication of the RDF Data - LicenseLicense for GeoLinkedDataCreative Commons Attribution-ShareAlike 3.0 GNU Free Documentation LicenseEach dataset will have its own specific license, IGN, INE, etc.
5. Data cleansingLack of documentation of the IGN datasetsBroken links: Spain, IGN resourcesLack of documentation of the ontologyMissing english and spanish labelsBuilding a spanish ontology and importing some concepts of other ontology (in English):Importing the English ontology. Add annotations like a Spanish label to them. Importing the English ontology, creating new concepts and properties with a Spanish name and map those to the English equivalents. Re-declaring the terms of the English ontology that we need (using the same URI as in the English ontology), and adding a Spanish label. Creating your own class and properties that model the same things as the English ontology. select DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}select DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}
5. Data cleansingURIs in Spanishhttp://geo.linkeddata.es/ontology/RíoRDF allows UTF-8 characters for URIsBut, Linked Data URIs has to be URLs as wellSo, non ASCII-US characters have to be %codehttp://geo.linkeddata.es/ontology/R%C3%ADoselect DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}select DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}
6. Linking of the RDF DataSilk - A Link Discovery Framework for the Web of DataFirst set of links: Provinces of Spain86% accuracyGeonamesGeoLinkedDataDBPedia
7. Enable effective discovery
DEMOhttp://geo.linkeddata.es/
Provinces
Industry Production Index – Capital of Province
Rivers
Beaches
Future WorkGenerate more datasets from other domains, e.g. universities in Spain.Identify more links to DBPedia and Geonames.Cover complex geometrical information, i.e. not only Point and LineString-like data; we will also treat information representation through polygons.
Go raibh maith agaibh
GeoLinkedDataAsunción Gómez-Pérez,  Alexander de Leon, Victor Saquicela, Luis M. Vilches, Oscar Corcho, and Boris Villazón-TerrazasFacultad de Informática, Universidad Politécnica de MadridCampus de Montegancedo sn, 28660 Boadilla del Monte, Madridhttp://www.oeg-upm.netPhone: 34.91.3366605, Fax: 34.91.3524819

GeoLinkedData

  • 1.
    GeoLinkedDataAsunción Gómez-Pérez, Alexanderde Leon, Victor Saquicela, Luis M. Vilches, Oscar Corcho, and Boris Villazón-TerrazasFacultad de Informática, Universidad Politécnica de MadridCampus de Montegancedo sn, 28660 Boadilla del Monte, Madridhttp://www.oeg-upm.netPhone: 34.91.3366605, Fax: 34.91.3524819
  • 2.
    ToCMotivationRelated WorkGeoLinkedDataIdentification ofthe data sourcesVocabulary DevelopmentGeneration of the RDF dataPublication of the RDF dataData cleansingLinking the RDF dataEnable effective discoveryFuture Work
  • 3.
    Motivation 99.171 % English 0.019% SpanishSource:Billion Triples dataset at http://km.aifb.kit.edu/projects/btc-2010/Thanks to Aidan and Richard
  • 4.
  • 5.
    GeoLinkedDataIt is anopen initiative whose aim is to enrich the Web of Data with Spanish geospatial data.It has started off by publishing diverse information sources, such as National Geographic Institute of Spain (IGN).http://geo.linkeddata.esRecently, National Statistics Institute (INE)
  • 6.
    Process for PublishingLinked Data on the WebIdentificationof the data sourcesVocabularydevelopmentGenerationof the RDF DataPublicationof the RDF data Data cleansingLinking the RDF dataEnable effective discovery
  • 7.
    1. Identification andselection of the data sourcesIGNINE
  • 8.
    2. Lightweight OntologyDevelopmentWGS84 Geo Positioning: an RDF vocabularyscv:Dimensionscv:Itemscv:Datasethydrographical phenomena (rivers, lakes, etc.)Vocabulary for instants, intervals, durations, etc.Names and international code systems for territories and groupsOntology for OGC Geography Markup Language Following the INSPIRE (INfrastructure for SPatial InfoRmation in Europe) recommendation.hydrOntology,SCOVO, FAO Geopolitcal, WGS84, GML, and Time
  • 9.
    3. Generation ofthe RDF DataNOR2OINEODEMapsterIGNGeometry2RDFGeospatialcolumnIGN
  • 10.
    3. Generation ofthe RDF Data – NOR2ONOR2OYearIndustry Production IndexProvince
  • 11.
    3. Generation ofthe RDF Data – R2O & ODEMapsterR2O is an extensible, fully declarative language to describe mappings between relational database schemas and ontologies.The ODEMapster processor generates Semantic Web instances from relational instances based on the mapping description expressed in the R2O document
  • 12.
    3. Generation ofthe RDF Data – R2O & ODEMapsterCreation of the R2O Mappings
  • 13.
    3. Generation ofthe RDF Data – R2O & ODEMapsterExcerpt of the R2O document
  • 14.
    3. Generation ofthe RDF Data – Geometry2RDFOracle STO UTIL package SELECT TO_CHAR(SDO_UTIL.TO_GML311GEOMETRY(geometry)) AS Gml311GeometryFROM "BCN200"."BCN200_0301L_RIO" cWHERE c.Etiqueta='Arroyo'
  • 15.
    3. Generation ofthe RDF Data – Geometry2RDF
  • 16.
    3. Generation ofthe RDF Data – Geometry2RDF
  • 17.
    3. Generation ofthe RDF data – RDF graphs IGN INESo far 7 RDF Named Graphs 1412248 triplesBTN25BCN200IPI….http://geo.linkeddata.es/dataset/IGN/BTN25http://geo.linkeddata.es/dataset/IGN/BCN200http://geo.linkeddata.es/dataset/INE/IPI
  • 18.
    4. Publication ofthe RDF DataSPARQLLinked DataHTMLIncluding ProvenanceSupportPubbyPubby 0.3Virtuoso 6.1.0
  • 19.
    4. Publication ofthe RDF Data
  • 20.
    4. Publication ofthe RDF Data - LicenseLicense for GeoLinkedDataCreative Commons Attribution-ShareAlike 3.0 GNU Free Documentation LicenseEach dataset will have its own specific license, IGN, INE, etc.
  • 21.
    5. Data cleansingLackof documentation of the IGN datasetsBroken links: Spain, IGN resourcesLack of documentation of the ontologyMissing english and spanish labelsBuilding a spanish ontology and importing some concepts of other ontology (in English):Importing the English ontology. Add annotations like a Spanish label to them. Importing the English ontology, creating new concepts and properties with a Spanish name and map those to the English equivalents. Re-declaring the terms of the English ontology that we need (using the same URI as in the English ontology), and adding a Spanish label. Creating your own class and properties that model the same things as the English ontology. select DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}select DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}
  • 22.
    5. Data cleansingURIsin Spanishhttp://geo.linkeddata.es/ontology/RíoRDF allows UTF-8 characters for URIsBut, Linked Data URIs has to be URLs as wellSo, non ASCII-US characters have to be %codehttp://geo.linkeddata.es/ontology/R%C3%ADoselect DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}select DISTINCT ?graph where {GRAPH ?graph {?s ?p ?o.}.}
  • 23.
    6. Linking ofthe RDF DataSilk - A Link Discovery Framework for the Web of DataFirst set of links: Provinces of Spain86% accuracyGeonamesGeoLinkedDataDBPedia
  • 24.
  • 25.
  • 26.
  • 27.
    Industry Production Index– Capital of Province
  • 28.
  • 29.
  • 30.
    Future WorkGenerate moredatasets from other domains, e.g. universities in Spain.Identify more links to DBPedia and Geonames.Cover complex geometrical information, i.e. not only Point and LineString-like data; we will also treat information representation through polygons.
  • 31.
  • 32.
    GeoLinkedDataAsunción Gómez-Pérez, Alexander de Leon, Victor Saquicela, Luis M. Vilches, Oscar Corcho, and Boris Villazón-TerrazasFacultad de Informática, Universidad Politécnica de MadridCampus de Montegancedo sn, 28660 Boadilla del Monte, Madridhttp://www.oeg-upm.netPhone: 34.91.3366605, Fax: 34.91.3524819