LOD2014 LINKED OPEN DATA: WHERE ARE WE? 
METHODS AND EXPERIENCES IN CULTURAL HERITAGE ENHANCEMENT 
Roma, 20th - 21st Feb 2014 
Archivio Centrale dello Stato, Roma 
Organized by W3C Italy 
Francesca Tomasi 
University of Bologna 
Fabio Ciotti 
University of Roma Tor Vergata 
Maurizio Lana 
University of Piemonte Orientale 
Diego Magro 
University of Torino 
Silvio Peroni 
University of Bologna 
Fabio Vitali 
University of Bologna
THE PROJECT 
CH and LOD 
Our appoach: conversion, extraction, creation 
Database conversion into LOD; 
Extraction of LOD from XML/TEI texts; 
Creation of new ontologies to produce LOD. 
The CH domain: people and roles, ancient and modern places, books and archival documents 
The aim: best pratices in LOD production and dissemination in the CH domain 
Common strategy: 
ontologies creation and reuse; 
stand-off markup and Open Annotation Data Model
THE CASES STUDY 
Relational database 
Zeri Photo Archive database (http://www.fondazionezeri.unibo.it/catalogo) 
Digital edition 
Vespasiano da Bisticci Letters (doi:10.6092/unibo/vespasianodabisticciletters) 
Digital library 
 digilibLT (http://www.digiliblt.unipmn.it) 
Geographic ontology 
Geolat project (http://www.geolat.it) 
Archival ontology 
Proles ontology (http://www.essepuntato.it/2013/10/politicalroles)
ZERI PHOTO ARCHIVE 
“is a rich digital catalog, and is today considered one of the most important repertories of Italian art on the web”. 
Our mission is to convert the database in LOD: 
 reengineer the E/R model implemented by the database tables, which contain data according to the Scheda F, into OWL, so as to obtain a first version of an ontology; 
 iteratively enhance the ontology according to the specifications described by the Scheda F and CIDOC-CRM, (changing the whole conceptual organisation and entity naming of the existing model as less as possible); 
 by using appropriate scripts transform data originally stored in the database into RDF statements compliant to the OWL ontology developed; 
 apply automatic and semi-automatic mechanisms to generate links to existing datasets, such as DBpedia and Europeana.
ZERI: THE PROCESS ONTOLOGY REUSE AND LOD POPULATION 
Scheda F 
Photograph 
Scheda OA 
WorkOfArt 
describes 
describes 
describes 
has subject 
FRBR Work 
FRBR Expression 
FRBR Manifestation 
FRBR Item 
Database Fondazione Zeri 
Create the 
ontology 
from the E/R 
Model and the 
data in DB 
Add links 
to LOD 
FRBR
VESPASIANO, LETTERS A DIGITAL EDITION 
A digital annoted (XML/TEI) collection of letters form the XV century sent/received to/by the florentine copyist Vespasiano da Bisticci. 
A web environment that focuses on: persons mentioned in the documents; classical latin and greek manuscripts requested/copied/proposed to/by Vespasiano da Bisticci’s school and their description. 
The purpose is to identify persons related to manuscripts in order to expose datasets of people related to manuscripts, these last described by technical words. 
The XML/TEI annotation (persons, manuscripts and technical terms) has been realized with embedded markup (@ref=”URI”) pointing to stand-off RDF file (with assertion) and controlled form of the names (VIAF, LCA, Geonames, etc.) for managing attributes values.
VESPASIANO: THE MODEL RDF SUPPORT TO STAND-OFF ANNOTATION 
SUBJECT 
PREDICATES 
OBJECT 
people.rdf#PdM 
URI: http://vespasianodabisticciletters/ people/PdM 
has_normalized_form 
Medici, Piero de’: 
Dbpedia: http://eu.dbpedia.org/page/Piero_de_Medici 
VIAF: http://viaf.org/viaf/25406033 
has_variant_forms 
Piero, 
Piero di Cosimo de’ Medici, 
Principe di Firenze 
is_owner_of 
manuscripts.rdf#P_SN 
manuscripts.rdf#L_D_III 
manuscripts.rdf#L_D_IV_E 
SUBJECT 
PREDICATES 
OBJECT 
manuscripts.rdf#P_SN 
URI: http://vespasianodabisticciletters/ manuscripts/P_SN 
has_normalized_form 
Plinio, Storia naturale 
is_requested_by 
is_owned_by 
is_copied_by 
is_illuminated_by 
people.rdf#PdM 
people.rdf#PdM 
people.rdf#PS 
people.rdf#FT 
SUBJECT 
PREDICATES 
OBJECT 
lexicon.rdf#min 
URI: http://vespasianodabisticciletters/ lexicon/min 
has_normalized_form 
miniare, miniatura, miniato 
is_referred_to 
manuscripts.rdf#L_D_IV_E
Work in progress 
Main aims: 
increasing the value of geographic references in latin texts 
enabling innovative access to latin works (e.g. through geography) 
contributing to the LOD cloud 
GEOLAT (PROJECT FUNDED BY COMPAGNIA DI SAN PAOLO)
GEOLAT: THE FRAMEWORK 
digilibLT 
(XML/TEI Resources) 
Bibliographic Resources 
RDF data 
Annotations 
Geographic entities RDF data 
Bibliographic Resource 
Ontology (bro) 
Ancient World Geographic Ontology (awgo) 
automatic 
extraction 
computer-aided annotation (Geographic NER) 
specified according to 
specified according to 
specified according to 
Open Annotation Data Model (oa) 
bridges the gap 
Mappings to other datasets (e.g. Pleiades)
rdf:type 
Primae frugiparos fetus mortalibus aegris dididerunt quondam praeclaro nomine Athenae et recreaverunt vitam legesque rogarunt [...] 
De rerum natura – Book VI 
GEOLAT:THE MODEL (SIMPLIFIED) 
athenaeWord 
bro:TextFragment 
bro:Book 
isPartOf 
rdf:type 
bro:LiteraryWork 
rdf:type 
isPartOf 
deRerumNatura 
athens 
awgo:GreekPolis 
rdf:type 
geographicSpace1 
awgo:GeographicSpace 
awgo:locatedIn 
bro:identifies 
anno1 
oa:Annotation 
oa:hasTarget 
trig:Graph 
rdf:type 
oa:hasBody 
rdf:type 
DRN_BookVI 
rdf:type 
pleiades: 579885 
skos:closeMatch
AN ARCHIVAL ONTOLOGY: PROLES 
The Political Roles (PRoles) Ontology is an OWL 2 DL ontology that allows one to represent political role attributions and their possible links to related events by means of particular classes and properties imported and used by several concepts from PRO, n-ary participation pattern and PROV-O. 
We are now managing an experiment on Andrea Costa fond, by exploiting the related authority record (http://archivi.ibc.regione.emilia-romagna.it/eac-cpf/IT- ER-IBC-SP00001-0000264), in collaboration with IBC, Soprintendenza per i Beni librari e documentari.
PROLES: THE MODEL ONTOLOGY CREATION AND REUSE 
The first layer of the PRoles Ontology: role attribution 
The third layer of the PRoles Ontology: provenance information 
The second layer of the PRoles Ontology: participation to events
FINAL REMARKS 
The common method: 
Ontology reuse; 
Definition of new classes and predicates; 
Ontology as the basis for LOD creation; 
Stand-off markup and OA data model; 
LOD cloud population; 
Mapping to other datasets
THANK YOU! FRANCESCA, FABIO C., MAURIZIO, DIEGO, SILVIO, FABIO V. THE GEOLAT RESEARCH IS FUNDED BY FONDAZIONE COMPAGNIA DI SANPAOLO

Methods and experiences in cultural heritage enhancement

  • 1.
    LOD2014 LINKED OPENDATA: WHERE ARE WE? METHODS AND EXPERIENCES IN CULTURAL HERITAGE ENHANCEMENT Roma, 20th - 21st Feb 2014 Archivio Centrale dello Stato, Roma Organized by W3C Italy Francesca Tomasi University of Bologna Fabio Ciotti University of Roma Tor Vergata Maurizio Lana University of Piemonte Orientale Diego Magro University of Torino Silvio Peroni University of Bologna Fabio Vitali University of Bologna
  • 2.
    THE PROJECT CHand LOD Our appoach: conversion, extraction, creation Database conversion into LOD; Extraction of LOD from XML/TEI texts; Creation of new ontologies to produce LOD. The CH domain: people and roles, ancient and modern places, books and archival documents The aim: best pratices in LOD production and dissemination in the CH domain Common strategy: ontologies creation and reuse; stand-off markup and Open Annotation Data Model
  • 3.
    THE CASES STUDY Relational database Zeri Photo Archive database (http://www.fondazionezeri.unibo.it/catalogo) Digital edition Vespasiano da Bisticci Letters (doi:10.6092/unibo/vespasianodabisticciletters) Digital library  digilibLT (http://www.digiliblt.unipmn.it) Geographic ontology Geolat project (http://www.geolat.it) Archival ontology Proles ontology (http://www.essepuntato.it/2013/10/politicalroles)
  • 4.
    ZERI PHOTO ARCHIVE “is a rich digital catalog, and is today considered one of the most important repertories of Italian art on the web”. Our mission is to convert the database in LOD:  reengineer the E/R model implemented by the database tables, which contain data according to the Scheda F, into OWL, so as to obtain a first version of an ontology;  iteratively enhance the ontology according to the specifications described by the Scheda F and CIDOC-CRM, (changing the whole conceptual organisation and entity naming of the existing model as less as possible);  by using appropriate scripts transform data originally stored in the database into RDF statements compliant to the OWL ontology developed;  apply automatic and semi-automatic mechanisms to generate links to existing datasets, such as DBpedia and Europeana.
  • 5.
    ZERI: THE PROCESSONTOLOGY REUSE AND LOD POPULATION Scheda F Photograph Scheda OA WorkOfArt describes describes describes has subject FRBR Work FRBR Expression FRBR Manifestation FRBR Item Database Fondazione Zeri Create the ontology from the E/R Model and the data in DB Add links to LOD FRBR
  • 6.
    VESPASIANO, LETTERS ADIGITAL EDITION A digital annoted (XML/TEI) collection of letters form the XV century sent/received to/by the florentine copyist Vespasiano da Bisticci. A web environment that focuses on: persons mentioned in the documents; classical latin and greek manuscripts requested/copied/proposed to/by Vespasiano da Bisticci’s school and their description. The purpose is to identify persons related to manuscripts in order to expose datasets of people related to manuscripts, these last described by technical words. The XML/TEI annotation (persons, manuscripts and technical terms) has been realized with embedded markup (@ref=”URI”) pointing to stand-off RDF file (with assertion) and controlled form of the names (VIAF, LCA, Geonames, etc.) for managing attributes values.
  • 7.
    VESPASIANO: THE MODELRDF SUPPORT TO STAND-OFF ANNOTATION SUBJECT PREDICATES OBJECT people.rdf#PdM URI: http://vespasianodabisticciletters/ people/PdM has_normalized_form Medici, Piero de’: Dbpedia: http://eu.dbpedia.org/page/Piero_de_Medici VIAF: http://viaf.org/viaf/25406033 has_variant_forms Piero, Piero di Cosimo de’ Medici, Principe di Firenze is_owner_of manuscripts.rdf#P_SN manuscripts.rdf#L_D_III manuscripts.rdf#L_D_IV_E SUBJECT PREDICATES OBJECT manuscripts.rdf#P_SN URI: http://vespasianodabisticciletters/ manuscripts/P_SN has_normalized_form Plinio, Storia naturale is_requested_by is_owned_by is_copied_by is_illuminated_by people.rdf#PdM people.rdf#PdM people.rdf#PS people.rdf#FT SUBJECT PREDICATES OBJECT lexicon.rdf#min URI: http://vespasianodabisticciletters/ lexicon/min has_normalized_form miniare, miniatura, miniato is_referred_to manuscripts.rdf#L_D_IV_E
  • 8.
    Work in progress Main aims: increasing the value of geographic references in latin texts enabling innovative access to latin works (e.g. through geography) contributing to the LOD cloud GEOLAT (PROJECT FUNDED BY COMPAGNIA DI SAN PAOLO)
  • 9.
    GEOLAT: THE FRAMEWORK digilibLT (XML/TEI Resources) Bibliographic Resources RDF data Annotations Geographic entities RDF data Bibliographic Resource Ontology (bro) Ancient World Geographic Ontology (awgo) automatic extraction computer-aided annotation (Geographic NER) specified according to specified according to specified according to Open Annotation Data Model (oa) bridges the gap Mappings to other datasets (e.g. Pleiades)
  • 10.
    rdf:type Primae frugiparosfetus mortalibus aegris dididerunt quondam praeclaro nomine Athenae et recreaverunt vitam legesque rogarunt [...] De rerum natura – Book VI GEOLAT:THE MODEL (SIMPLIFIED) athenaeWord bro:TextFragment bro:Book isPartOf rdf:type bro:LiteraryWork rdf:type isPartOf deRerumNatura athens awgo:GreekPolis rdf:type geographicSpace1 awgo:GeographicSpace awgo:locatedIn bro:identifies anno1 oa:Annotation oa:hasTarget trig:Graph rdf:type oa:hasBody rdf:type DRN_BookVI rdf:type pleiades: 579885 skos:closeMatch
  • 11.
    AN ARCHIVAL ONTOLOGY:PROLES The Political Roles (PRoles) Ontology is an OWL 2 DL ontology that allows one to represent political role attributions and their possible links to related events by means of particular classes and properties imported and used by several concepts from PRO, n-ary participation pattern and PROV-O. We are now managing an experiment on Andrea Costa fond, by exploiting the related authority record (http://archivi.ibc.regione.emilia-romagna.it/eac-cpf/IT- ER-IBC-SP00001-0000264), in collaboration with IBC, Soprintendenza per i Beni librari e documentari.
  • 12.
    PROLES: THE MODELONTOLOGY CREATION AND REUSE The first layer of the PRoles Ontology: role attribution The third layer of the PRoles Ontology: provenance information The second layer of the PRoles Ontology: participation to events
  • 13.
    FINAL REMARKS Thecommon method: Ontology reuse; Definition of new classes and predicates; Ontology as the basis for LOD creation; Stand-off markup and OA data model; LOD cloud population; Mapping to other datasets
  • 14.
    THANK YOU! FRANCESCA,FABIO C., MAURIZIO, DIEGO, SILVIO, FABIO V. THE GEOLAT RESEARCH IS FUNDED BY FONDAZIONE COMPAGNIA DI SANPAOLO