Linked Data Applications:There is no One-Size-Fits-All Formula. (Long Presentation). Summer school on Ontological Engineering and the Semantic Web. Cercedilla, Spain
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Linked DAta Applications: There is no One-Size-Fits All Formula (Long presentation)
1. Linked Data Applications:
There is no One-Size-Fits-All
Formula
Asunción Gómez-Pérez
Facultad de Informática, Universidad Politécnica de Madrid
Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid
http://www.oeg-upm.net
asun@fi.upm.es
Acknowledgements:
O.Corcho, D. Garijo, D. Vila, L.Vilches, B. Villazón
Work distributed under the license Creative Commons Attribution-Noncommercial-Share Alike 3.0
2. Table of content
1. The concept
2. Foundations
3. The process
4. Examples
• Libraries: http://datos.bne.es
• Geo: http://geo.linkeddata.es/
• Metereology:http://aemet.linkeddata.es/
• Travelling: http://webenemasuno.linkeddata.es/
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 2
3. Complex queries using data from heterogeneous Web
pages
http://www.bne.es/
http://elviajero.elpais.com/
Cervantes enthusiast from Germany
visiting Madrid and willing to know
more about Cervantes’ work and life
http://www.viaf.org/
http://www.aemet
SSSW-12: 9th Summer School on Ontological Engineering andattribution: http://commons.wikimedia.org/wiki/User:Gugerell
*Picture the Semantic Web. Cercedilla. Spain 3
4. BD BD BD IGN BD BD
BD BNE VIAF AEMET Prisa DBpedia Data Integration
BNE
Ubicado en
Alcalá de Henares
1605 El Quijote
Año de Same as
Publicación Autor birthPlace
M. Cervantes Alcalá de Henares
M. Cervantes
M. Cervantes
creator
Year of
publication Don Quixote
1960 Alcalá de Henares
Alcalá de Henares
Translated
into
Temperatura
located guía
Hebrew
20º
Tapas Siglo
de Oro
VIAF
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 4
5. Table of content
1. The concept
2. Foundations
3. The process
4. Examples
• Libraries: http://datos.bne.es
• Geo: http://geo.linkeddata.es/
• Metereology:http://aemet.linkeddata.es/
• Travelling: http://webenemasuno.linkeddata.es/
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 5
6. The model (Ontology) and the data
Idiom
translation
Is creator of birthPlace
Year Work Person Place Ontology
Publication date
Located at
Has subject
Library
Catalán
translation
Is creator of
birthPlace
1960 El Quijote Cervantes Alcalá de Henares
Publication date
Has subject
Located in Data
Vida de Cervantes
BNE
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 13
7. The model (Ontology) and the data
Language
http://iflastandards.info/ns/fr/frbr/frbrer/C1002
Ontology
translation
Is creator of
work Person
Año http://iflastandards.info/ns/fr/frbr/frbrer/C1001 http://iflastandards.info/ns/fr/frbr/frbrer/C1005
Publication date
birthPlace
Has subject
Located in http://geo.linkeddata.es/ontology/Municipio
Biblioteca
http://xmlns.com/foaf/0.1/Organization
Catalán
http://datos.bne.es/resource/XX1924295
translation http://geo.linkeddata.es/resource/Alcalá de Henares
Don Quijote de la Mancha
Cervantes Saavedra, Miguel de
Es autor birthPlace
1960 http://datos.bne.es/resource/XX3383563 http://datos.bne.es/resource/XX1718747
Publication date
Has subject
Located in http://datos.bne.es/resource/bimo0002045496
BNE Vida de Miguel de Cervantes Saavedra
http://datos.bne.es/# Data
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
14
8. Table of content
1. The concept Specification
2. Foundations Modelling
3. The process RDF
Generation
4. Examples Links
Generation
Publication
Exploitation
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 15
9. Specification
• Data sources analysis
Modelling
RDF
• URI Design
Generation
Links • License definition
Generation
Publication
Exploitation
Reunión bilateral CNIG – OEG
SSSW-12: 9th Summer
Proyecto OTALEX
School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 16
10. Specification
URI design
Specification • Meaningful URIs vs opaque URIs
• Separate TBox (ontology model) from ABox
Modelling • Base URI
http://linkeddata.es/
RDF http://geo.linkeddata.es/
Generation http://otalex.linkeddata.es/
Links • Ontología (TBox URIs)
Generation http://phenomenontology.linkeddata.es/ontology/{concept|property}
http://phenomenontology.linkeddata.es/ontology/Municipality
Publication
• Datos (ABox URIs)
Exploitation
http://geo.linkeddata.es/resource/{resource type}/{resource name}
http://geo.linkeddata.es/resource/Municipio/Azuaga
Reunión bilateral CNIG – OEG
SSSW-12: 9th Summer
Proyecto OTALEX
School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 17
11. Specification
License Definition
• Several possibilities
Specification
• The UK Open Government License
• Open Database License
Modelling
• Public Domain Dedication and License
RDF • Open Data Commons Attribution License
Generation
• The Creative Commons Licenses (CC)
Links
Generation
• It is also possible to reuse and apply an existing
Publication license of the (government) data sources.
Exploitation
Reunión bilateral CNIG – OEG
SSSW-12: 9th Summer
Proyecto OTALEX
School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 18
12. Modelling
Ontology
Specification • Ontologies:
• A set of terms
• A set of explicit assumptions regarding the intended meaning of
Modelling the terms.
• Almost always including concepts and their classification
• Almost always including properties between concepts
RDF Generation
Links Generation
• Shared understanding of a domain of interest
Publication • Ontologies expressed in OWL or RDF(S), both based on
RDF
Exploitation
• The NeOn methodology helps to build ontologies
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 19
13. 2. Vocabulary development
Identification • Features
of the data sources
• Lightweight :
Vocabulary • Taxonomies and a few properties
development
• Consensuated vocabularies
• To avoid the mapping problems
Generation
of the RDF Data • Multilingual
• Linked data are multilingual
Publication
of the RDF data • The NeOn methodology can help to
• Re-enginer Non ontological resources into ontologie
Data cleansing
• Pros: use domain terminology already
consensuated by domain experts
Linking
the RDF data • Withdraw in heavyweight ontologies those features
that you don’t need
Enable effective • Reuse existing vocabularies
discovery
Asunción Gómez Pérez 9th
SSSW-12: Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 21
14. NeOn Methodology
Knowledge Resources
Non Ontological Resources Ontological Resources
Glossaries O. Design Patterns O. Repositories and Registries 3 4
Dictionaries Lexicons
Flogic
5 6
Classification
Taxonomies Thesauri RDF(S)
Schemas
OWL Ontological Resource
2 Reuse
2 5 6
Ontology Design 4 O. Aligning
Non Ontological Resource
Pattern Reuse 3
Reuse
6 O. Merging
2 Ontological Resource
7 Reengineering 5 Alignments
Non Ontological Resource
Reengineering 4 6
1
RDF(S)
O. Specification O. Conceptualization O. Formalization O. Implementation
Flogic
8
9 Ontology Restructuring
O. Localization (Pruning, Extension, OWL
Specialization, Modularization)
1,2,3,4,5,6,7,8, 9
Ontology Support Activities: Knowledge Acquisition (Elicitation); Documentation;
Configuration Management; Evaluation (V&V); Assessment
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
15. Modelling
Reuse available vocabularies
Reuse suitable
Ontologies and
vocabularies
Linked Open Vocabularies
…
Search for suitable
non-ontological resources
are there Yes Build the vocabulary by
suitable transforming available
resources? resources
No Domain-related sites
Build the vocabulary from Government Catalogs
Highly reliable Web Sites
scratch
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 23
16. Publicación
Specification
Modelling
Data publication
RDF Metadata publicacion using VOID
Generation
Links
To facilitate the discovery
Generation
• Register in CKAN your dataset
Publication
• Use to sitemap4rdf to generate the site map
Exploitation
• Upload the site map to Google and Sindice
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
17. Table of content
1. The concept
2. Foundations
3. The process
4. Examples
• Libraries: http://datos.bne.es
• http://linkeddata3.dia.fi.upm.es/bne-demo
• Geo: http://geo.linkeddata.es/
• Metereology: http://aemet.linkeddata.es/
• Travelling: http://webenemasuno.linkeddata.es/
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 25
18. MARC21
Specification
• Different communication formats:
• MARC 21 format for Bibliographic Data
Modelling
• MARC 21 format for Authority Data
• Others: Holdings, Classification, etc.
RDF Generation
• Three main elements:
• Record structure: ISO 2709. Fields, indicators,
Links Generation subfields…
• Content designation: "Meaning" of codes and
conventions
Publication
• Content: Defined outside the MARC standard (ISBD,
AACR..)
Exploitation
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 26
19. Specification@ BNE
• Records in the MARC 21 format
• 3.9 million bibliographical records
Specification
• 4.2 million authority records
Modelling
• Version: November, 2011
AUTHORITY BIBLIOGRAPHIC
RDF Generation
Links Generation
Persons 76576 Maps
Corporate bodies 320727 Sound recordings
Publication Conferences 166017 Gravings, drawings, pictures
Titles 35770 Manuscripts
Subject 143959 Ancient books
2696560 Modern books
Exploitation
178473 Scores
3021 Electronic resources
156634 Serials
96672 Videos
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 27
20. MARC21 record structure
Specification • Authority record: Camus, Albert*
Control Field 001 XX1721208
005 200012181124
008 901120nn aijnnaabn n aaa
016 $a BNE19900178994
040 $a SpMaBN $b spa $c SpMaBN $e rdc $f
embne
Field Subfield Content 100 10 $a Camus, Albert
HEADING
Subfield Content 1XX
$d 1913-1960
670 $a El mite de Sísif, 1987 $b port. (Albert
Camus)
670 $a Dic. de filosofía, de J. Ferrater Mora,
1980$b(Camus., Albert (1913-1960); n.
Mondovi, Argel)
670 $a Aut. BN-OPALE, 1995 $b (Camus, Albert)
* http://datos.bne.es/resource/XX1721208
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 28
21. MARC21 record content designation
• Authority record: Camus, Albert*
Control Number 001 XX1721208
HEADING – Personal
Personal name Name 100 10 $a Camus, Albert Name
100
Dates associated with name $d 1913-1960
Source consulted Citation 670 $a El mite de Sísif, 1987 $b port. (Albert
Camus)
• Human reading:
An authority record that describes a Person, named
Camus, Albert with associated dates 1913-1960
* http://datos.bne.es/resource/XX1721208
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 29
22. Frecuency of codes in records
Specification
Modelling
RDF Generation
Links Generation
Publication
Exploitation
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 30
23. Specification
• Source data: MARC 21 records, not RDB. Very flat
Specification
structure difficult to map to richer models
Modelling • Domain experts (catalogers) need to be part of the mapping
process.
RDF Generation
• Data quality good but still many errors: reporting.
Links Generation
• Iterative and incremental transformation process: measure
coverage and progress.
Publication
• Highly specialized library models: FRBR, ISBD.
Exploitation • Multilinguality, collaboration with IFLA
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
24. Model: FRBR at a glance
Work 2
Specification
Works
Work 1
Modelling Work 3
RDF Generation
Expression 2
Links Generation
Expression1 Expressions
Publication
Exploitation
Manifestations
Manifestation1 Manifestation2
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 32
25. The Ontology: based on IFLA vocabularies
Specification
Modelling
RDF
Generation
Links
Generation
Publication
Exploitation
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
26. Who will be the mapping generator?
001 XX1721208
Specification
005 200012181124
008 901120nn aijnnaabn n aaa
016 $a BNE19900178994
Modelling 040 $a SpMaBN $b spa $c SpMaBN $e rdc $f embne
100 10 $a Camus, Albert
$d 1913-1960
RDF 670 $a El mite de Sísif, 1987 $b port. (Albert Camus)
Generation 670 $a Dic. de filosofía, de J. Ferrater Mora,
1980$b(Camus., Albert (1913-1960); n. Mondovi,
Argel)
Links 670 $a Aut. BN-OPALE, 1995 $b (Camus, Albert)
Generation
Publication
Exploitation
BNE
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
27. Similar to mapping ontologies
100a maps Person
maps
Content Content
(100a) (100at) is creator of
contained in
maps
100at Work
subfield
property
maps
100t title of work
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 35
28. Marimba allows librarians to create mappings
• Three spreadsheets:
Classification Basic structure
mapping
MARC21 Records count Content sample Mapping
info
100 $a $d 888.880 Camus, Albert foaf:Person
1913-1960
Annotation 100 $a 999.999 Cervantes, Miguel foaf:name
mapping de
100 $a $m 10.000 Cervantes, iguel ERROR
Relationships
mapping
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 36
29. Librarians create mappings using excell
Classification
mapping
Classification Basic structure
mapping
MARC21 Records count Content sample Mapping
info
100 $a $d 888.880 Camus, Albert foaf:Person
1913-1960
Annotation 100 $a 999.999 Cervantes, Miguel foaf:name
mapping de
100 $a $m 10.000 Cervantes, iguel ERROR
Relationships
mapping
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 37
30. Librarians create mappings using excell
Annotation
mapping
place of publication
has dimensions
Is part of work
Relationships
mapping
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 38
31. Marimba interprets the Mappings and generate the RDF
001 XX1721208
……
Specification 100 10 $a Camus, Albert
$d 1913-1960
……
Modelling • Classify: Exploiting the heading field and subfield codes.
100 $a $d Person (it has a personal name)
RDF 100 $a $d $t Work (it has a title)
Generation
• Annotate: Using subfield codes and the content.
Links
Generation 100 $a "Camus, Albert" frbr:3001 "Camus, Albert"
100 $t "La Peste" frbr:P3039 "La Peste"
Publication
MARC 21 record Action RDF (Output)
(Input)
Exploitation
100 $a $d Classify rdf:type frbr:C1005
100 $a Camus, Annotate frbr:P3039 "Camus,
BNE
Albert Albert"
100 $d 1913-1960 Annotate frbr:P3040 "1913-
1960"
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 39
32. Mapping process more in detail
• But, what about the relationships between the entities?
RDF • Relationships between records are not explicit in MARC.
Generation
Goal: The work "La Peste" was created by Albert Camus
001 XX1721208 001 XX1910518
100 10 $a Camus, Albert $d 1913-1960 100 10 $a Camus, Albert$d1913-1960 $tLa peste
Common Common Diff
Work
We know the type of R1 and R2, and we look at the heading diff
bne:XX1721208 frbr:2010 bne:XX1910518
(isCreatorOf)
* http://datos.bne.es/resource/XX1910518
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 40
33. Marimba: Mapping process summary
(MARC records)
001 XX1721208 001 XX1910518
Specification
100 10 $a Camus, Albert $d 1913-1960 100 10 $a Camus, Albert$d1913-1960 $tLa
peste
Modelling Classify
bne:XX1721208 a frbr:Person bne:XX1910518 a frbr:Work
RDF
Generation
Annotate
Links bne:XX1721208 a frbr:Person bne:XX1910518 a frbr:Work
Generation frbr:name "Camus, Albert" . frbr:title "La Peste"
frbr:hasDates 1913-1960
Publication
Relate
bne:XX1721208 a frbr:Person bne:XX1910518 a frbr:Work
frbr:name "Camus, Albert" . frbr:title "La Peste" .
Exploitation frbr:hasDates 1913-1960 . frbr:isCreatedBy bne:XX1721208
frbr:isCreatorOf bne:XX1721208
BNE
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 41
34. Marimba uses the ontology to generate RDF
Specification
Modelling
RDF
Generation
Links
Generation
Publication
Exploitation
BNE
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
35. Marimba links with other resources:
VIAF, DNB, SUDOC, LIBRIS, DBpedia
http://d-nb.info/gnd/11851993X
Specification
DNB
Modelling http://viaf.org/viaf/17220427
VIAF
Same As
RDF Same As http://dbpedia.org/resource/Miguel_de_Cervantes
Generation
DBpedia
Same As
Links
Generation
http://datos.bne.es/resource/XX1718747
BNE
Publication Same As
Same As
Exploitation http://www.idref.fr/026774771/id
SUDOC
http://libris.kb.se/resource/auth/45369
LIBRIS
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
36. Marimba links with other resources:
VIAF, DNB, SUDOC, LIBRIS, DBpedia
Specification
Modelling
RDF
Generation
Links
Generation
Publication
Exploitation
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
37. Publicación
Specification
Modelling
Data publication
RDF Metadata publicacion using VOID
Generation
Links
To facilitate the discovery
Generation
• Register in CKAN your dataset
Publication
• Use to sitemap4rdf to generate the site map
Exploitation
• Upload the site map to Google and Sindice
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
38. Exploitation
Web Interface
Especification
Specification
Modelling
Model
RDF
Generation
generation
Links
Publication
Generation
SPARQL queries
Exploitation
Publication
URI Cervantes
select distinct COUNT(?Obras) where {
http://datos.bne.es/resource/XX1718747 Is author
Exploitation
<http://iflastandards.info/ns/fr/frbr/frbrer/P2010>
?Obras
}
http://bne.linkeddata.es/
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
39. SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 48
40. SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 49
41. SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 50
42. Technological Support
• Modelling:
• Open Metadata Registry
• Neon Toolkit
• Mapping and generation
• MARiMbA: Library-oriented, supports and facilitates the
entire process od transformation from MARC21 to RDF
• Publication:
• Virtuoso Universal Server
• Pubby
• CKAN registry
• Sitemap4rdf
• Exploitation:
• Web Applications that visualize data using SPARQL
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
43. Results: datos.bne.es
• Total number of authority records: 4.100.000
• Total number of bibliographical records: 2.390.140
• Total number of RDF triples: 58.053.215
• Number of links: (15% authorities): 587.520
• Linked sources:
• VIAF
• SUDOC (French collective university catalogue) FR
• GND (German National Library of authorities) GER
• LIBRIS Sweden
• DBPedia
• Soon BNF
http://bne.linkeddata.es/
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 52
44. Table of content
1. The concept
2. Foundations
3. The process
4. Examples
• Libraries: http://datos.bne.es
• http://linkeddata3.dia.fi.upm.es/bne-demo
• Geo: http://geo.linkeddata.es/
• Metereology: http://aemet.linkeddata.es/
• Travelling: http://webenemasuno.linkeddata.es/
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 53
45. Identification and selection of data sources
Geographical
Specification Spanish
Institute
Modelling
RDF Generation
Links Generation
Statistical
Publication
Spanish
institute
Exploitation
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 54
46. • Geographic Spanish Institute
Specification • Multilingual (Spanish, Vasc, Gallician, Catalan)
• Conceptualization mistmatches
• Granularity (scale concept)
Modelling
• Textual information
Inform. Hidrográfica: reservoir, river, , etc.
RDF Generation Transportes. Vía desdoblada, Ferrocarril, …
Unidades Administrativas. Municipality
• Particularaties
Links Generation
• Longitude and latitude
Publication
• Statistic Spanish Institute
• Monolingual
Exploitation
• Numerical information
• Particularaties
• Geo (textual level) and Temporal
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 55
47. 1. Identification and selection of the data sources
IGN-E
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
48. Specification
Statistical information
Specification
Modelling
RDF
Generation
Links
Generation
Publication
Exploitation
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 57
49. 2. Lightweight Ontology Development
WGS84 Geo
Positioning: an RDF
vocabulary scv:Dimension
Specification hydrographical
phenomena (rivers, W3C 4
scv:Item
scv:Dataset
lakes, etc.) Vocabulary
O.
Modelling hasStatisticalData Statistics
hasLat/Long WGS84 hasLat/Long
SCOVO
RDF
on
Generation hydrOntology hasLocation/isLocated FAO
Vocabulary for
O. instants, intervals,
FAO durations, etc.
Links Time
UNESCO Geopolitical
Generation
EGM / ERM 4 hasGeometry
hasGeometry
ontology
W3C Time
GeoNames
… GML
Publication
Names and Legend
international code
Ontology for OGC GML 4 systems for
territories and
Ontology
Geography Markup Specification groups
Exploitation Language Specification 4
Thesaurus
Classes 33 33
Object Properties 44 44 reused
Data Properties 318 318 Following the INSPIRE
(INfrastructure for SPatial InfoRmation in Europe) recommendation.
hydrOntology,SCOVO, FAO Geopolitcal, WGS84, GML, and Time
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
50. Hydroontology
Nivel superior
Nivel inferior
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
51. Implementation
+ Pellet
1 4
2
5
3
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
52. Geospatial Model
geoes: http://geo.linkeddata.es/
geo: http://www.w3.org/2003/01/geo/wgs84_pos#
geoes:ontology/Geometry
rdfs:subClassOf rdfs:subClassOf
rdfs:subClassOf
geo:Point geoes:ontology/Curve geoes:ontology/Polygon
Composed by Composed by
geo:lat geo:long
Collection of 2 or Collection of 3 or
more geo:Points more geo:Points
62
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
53. 3. Generation of RDF
Identification
of the data sources
• From the Data sources
Vocabulary • Geographic information (Databases)
development
• Statistic information (.xsl)
Generation • Geospatial information
of the RDF Data
Publication • Different technologies for RDF
of the RDF data
generation
Data cleansing
• NOR20 (from excell, XML, text files, …)
• R20 and ODEMapster (from Databases)
Linking • Geometry2RDF and SPh2RDF (for Geo
the RDF data data)
Enable effective
discovery
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
54. 3. Generation of the RDF Data / instances
· Classification
schemes
· Thesauri NOR2O
· Lexicons
• PR-NORs define a procedure that transforms a Non-Ontological Resource
(NOR) components into ontology elements. http://ontologydesignpatterns.org/
NOR2O
FAO Water classification
· Classification scheme
· Path enumeration data model
· Implemented in a database
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
55. 3. Generation of the RDF Data – NOR2O
Industry Production Index Year
Province
NOR2O
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
56. RDF geoespatial
Specification
Modelling
RDF
Generation
geometry2RDF
Links
Generation
Publication
Exploitation shp2RDF
shp2RDF
Reunión bilateral CNIG – OEG
SSSW-12: 9th
Proyecto OTALEX
Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 70
57. Generación
Generación de RDF geoespacial (geometry2RDF)
Specification
Oracle STO UTIL package
Modelling
SELECT TO_CHAR(SDO_UTIL.TO_GML311GEOMETRY(geometry))
AS Gml311Geometry
RDF FROM "BCN200"."BCN200_0301L_RIO" c
Generation WHERE c.Etiqueta='Arroyo'
Links
Generation
Publication
Exploitation
Reunión bilateral CNIG – OEG
SSSW-12: 9th
Proyecto OTALEX
Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 71
58. Generación
Generación de RDF geoespacial (shp2RDF)
Specification
Modelling
RDF
Generation
Links
Generation
Publication
Exploitation
Reunión bilateral CNIG – OEG
SSSW-12: 9th
Proyecto OTALEX
Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 72
59. 3. Generation of the RDF Data – Geometry2RDF
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
61. Specification
Modelling
RDF
Generation
Links
Generation
Publication
Exploitation
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 75
62. Statistics in RDF
RDF Data Cube – DataSet rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns#
qb: http://purl.org/linked-data/cube#
qb:DataSet ota: http://otalex.linkeddata.es/resource/
rdf:type
ota:ds/population
qb:dataSet qb:dataSet 8396
otaonto:population
ota:ds/population/Atala …… ota:ds/population/Azu
otaonto:geoArea
ya_2008 aga_2008
rdf:type rdf:type ota:Municipio/Azuaga
qb:Observation
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 76
63. Specification
http://dbpedia.org/resource/Azuaga
Modelling
DBpedia
RDF Same As
Generation
Links
Generation
http://geo.linkeddata.es/page/resource/Municipio/Azuaga
Publication IGN
Exploitation
Same As
http://www.geonames.org/2521436/
Geonames
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
64. Exploitation
Exploitation
Especification
Specification Visualización
Web Interface
Modelling
Model
RDF
Generation
generation
map4rdf:
Links
• Google maps viewer of RDF resources
Publication
Generation • Resources with spatial information
Exploitation
• Used in other applications like AEMET, Goodrelations,
Publication
GeoLinked Data, El Viajero…
Exploitation SPARQL
map4rdf
http://oegdev.dia.fi.upm.es/projects/map4rdf/ Triplestore
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
65. • Simple SPARQL Query
• To get the RDF instances of the Laguna concept, and their
geometry property.
SELECT ?x ?geo WHERE {?x rdf:type <http://geo.linkeddata.es/ontology/Laguna> .
?x <http://www.w3.org/2003/01/geo/wgs84_pos#geometry> ?geo}
• Complex SPARQL Query:
• To get resources near the city of Azuaga, at a distance of 10Km
(0.1) and with labels in spanish. The query has a limit of 50
resources.
PREFIX geo: <http://www.w3.org/2003/01/geo/wgs84_pos#>
SELECT ?subject ?label ?latitude2 ?longitude2 WHERE
{<http://geo.linkeddata.es/resource/Municipio/Azuaga>
<http://www.w3.org/2003/01/geo/wgs84_pos#geometry> ?g. ?g geo:lat ?latitude. ?g geo:long ?longitude.
?subject <http://www.w3.org/2003/01/geo/wgs84_pos#geometry> ?g2. ?g2 geo:lat ?latitude2. ?g2 geo:long
?longitude2. ?subject rdfs:label ?label. FILTER(xsd:double(?latitude2) - xsd:double(?latitude) <= 0.1 &&
xsd:double(?latitude) - xsd:double(?latitude2) <= 0.1 && xsd:double(?longitude2) - xsd:double(?longitude) <=
0.1 && xsd:double(?longitude) - xsd:double(?longitude2) <= 0.1 && lang(?label) = "es"). } limit 50
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 81
66. SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
70. Phase BNE IGN AEMET PRISA INE
hydrontology Scovo
Modeling SSN ontology
Wgs84
DC SIOC Data cube
time
MARiMbA
RDF generation
geometry2rdf
NOR2O CSV parser CSV parser NOR2O
Silk Silk Silk
NOR2O
DBPEDIA
Links DNB DBPEDIA
VIAF Geolinkeddata.es Geolinkeddata.es
generation LIBRIS Geonames Geolinkeddata.es
DBPEDIA
Publication Pubby
sitemap4rdf
SPARQL
Exploitation map4rdf
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 87
71. Linked Data
Data & Knowledge
Visualization
Linking
RDF
Generation
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain
72. Results
http://datos.bne.es http://webenemasuno.linkeddata.es/
• Total number of authority records: 4.100.000 • Total number of guides: 27.876
• Total number of bibliographic records: • Total number of posts: 32.502
2.390.140 • Total number of locations: 6.838
• Total number of RDF triples: 58.053.215 • Total number of RDF triples: 9.462.339
• Links (15% authority): 587.520 • Linked sources: 12.750
• Linked sources: DBPedia (6024 links)
• VIAF GeoLinkedData (6726 links)
• SUDOC (Sistema Universitario de
Documentación) FR
• GND (Auth German National Library) GER
• LIBRIS Sweden
• DBPedia
http://geo.linkeddata.es/
Number of geo type phenomenon: 95 (Rivers, mountains, etc.)
Number of geo entities: 155.000
Total number of RDF triples: 21.564.199
Links: 1002 (outlinks) y 6782 (coming)
Linked sources: DBpedia y GeoNames (outlinks)
AEMET y El Viajero (entry)
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 89
73. Lessons learnt
• URI Learn about Linked Data
• Follow existing design guidelines for new URIs with UPM official courses in
• Reuse existing URIs from authoritative sources one week
• Models
• Reuse existing models when available
• Create new models from authoritative sources
• Do not forget to align your model with existing models
• Generation
• Vertical domains usually require specific tools for generation
• Link
• Generic link discovery tools performs well in vertical domains
• Link to other data sets using
bne:Cervantes Dbpedia:cervantes
• Equivalence links (sameAs) sameAs
Person Municipality
• Typed links birthPlace
• Discovery
• Use sitemap4rdf to allow search engines to find your data
• Use an iterative-incremental life cycle in your development
SSSW-12: 9th Summer School on Ontological Engineering and the Semantic Web. Cercedilla. Spain 90
74. Linked Data Applications:
There is no One-Size-Fits-All
Formula
Asunción Gómez-Pérez
Facultad de Informática, Universidad Politécnica de Madrid
Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid
http://www.oeg-upm.net
asun@fi.upm.es
Acknowledgements:
O.Corcho, D. Garijo, D. Vila, L.Vilches, B. Villazón
Work distributed under the license Creative Commons Attribution-Noncommercial-Share Alike 3.0