SlideShare a Scribd company logo
Validation of Europeana data:
application profile, OWL ontology, or else?
Antoine Isaac
Application Profiles as an alternative to OWL Ontologies
Dublin Core Conference
5 September 2013
EDM basic pattern
 A data provider submits to Europeana a “bundle” of an object
and its digital representation(s)
For a general presentation on Europeana and EDM rationale see http://pro.europeana.eu/edm-documentation
Europeana Data Model: an example
Provided Cultural Heritage Object (CHO)
and descriptive metadata
Web Resources – digital representations
Aggregations – Bundling it all together
EDM Specs
http://pro.europeana.eu/edm-documentation
- EDM Definition:
- Mapping Guidelines and templates
- XML Schema
- OWL ontology
EDM Definitions
High level definition of classes and properties
edm:aggregatedCHO
- Definition: This property associates an ORE aggregation with the cultural
heritage object(s) (CHO for short) it is about.
- Subproperty of: ore:aggregates, dc:subject, P129_is_about
- Domain: ore:Aggregation
- Range: edm:ProvidedCHO
EDM Definitions
 Avoids adding semantics to re-used classes and properties
Except for mapping purposes, hierarchies of classes and properties for inference
dc:contributor rdfs:subPropertyOf edm:hasMet .
 Borderline case of axioms not in formal version of original specs
ore:proxyIn
- Obligation & Occurrence: A proxy may be in 1 to many aggregations,
and an aggregation may have 0 to many proxies in it
EDM Definitions
First hints at data constraints
edm:dataProvider
- Obligation & Occurrence: Mandatory for Europeana (Minimum: 1,
Maximum: 1)
edm:currentLocation
- Domain: The set of cultural heritage objects that Europeana collects
descriptions about, represented in the EDM by ProvidedCHOs and ORE
proxies for these CHOs.
edm:aggregatedCHO
- Obligation & Occurrence: In Europeana, an aggregation aggregates
exactly one CHO
Data validation: Europeana requirements
EDM is RDF-oriented: unbounded web of information, etc.
But Europeana needs to enforce constraints on the data it receives
 Data that meets basic Europeana function requirements
• An Aggregation should always have an edm:aggregatedCHO
• There must be exactly one edm:type -- the value must be TEXT, VIDEO,
SOUND, IMAGE or 3D
 Data quality criteria
• A ProvidedCHO should have at least a dc:title or a dc:description
We need specs for validation that are easily shareable, both for
humans and machines
EDM Mapping Guidelines
 Document written after the EDM Definitions
 Tries to formulate clearer instructions for Europeana providers
 Template-based, e.g. for provider’s Aggregation:
property value type cardinality
Machine-readable specs as OWL ontologies?
OWL is good for writing constraints, but not for validation!
Quite OK
 “Value types” via owl:ObjectProperty owl:DatatypeProperty in
OWL(DL)
 Data ranges (TEXT-VIDEO-SOUND-IMAGE-3D)
Less ok:
 Object domain and ranges
 (qualified) cardinality axioms
Including combinations: (either isShownBy OR isShownAt is mandatory)
We’ve created an OWL version of EDM
[…]
<owl:equivalentClass>
<owl:Restriction>
<owl:cardinality rdf:datatype="&xsd;nonNegativeInteger">1</owl:cardinality>
<owl:onProperty rdf:resource="&edm;aggregatedCHO"/>
</owl:Restriction>
</owl:equivalentClass>
[…]
OWL?
https://github.com/europeana/corelib/blob/master/c
orelib-solr-definitions/src/main/resources/eu/rdf/
But these are not really validation axioms
And it’s bad practice to add semantics to classes and properties that
already exist, such as ore:Aggregation
(let’s be honest: we were not ready for full RDF/OWL compatibility
anyway…)
EDM is implemented by an XML Schema (for RDF data!)
[…]
<sequence>
<element ref="edm:aggregatedCHO" maxOccurs="1" minOccurs="1"/>
<element ref="edm:dataProvider" maxOccurs="1" minOccurs="1"/>
<element ref="edm:isShownAt" maxOccurs="1" minOccurs="0"/>
<element ref="edm:isShownBy" maxOccurs="1" minOccurs="0"/>
<element ref="edm:object" maxOccurs="1" minOccurs="0"/>
<element ref="edm:provider" maxOccurs="1" minOccurs="1"/>
<element ref="dc:rights" maxOccurs="unbounded" minOccurs="0"/>
<element ref="edm:rights" maxOccurs="1" minOccurs="1"/>
</sequence>
[…]
XML Schema
And Schematron rules:
[…]
<sch:pattern name="Either Is shownby or is shownat should be present">
<sch:rule context="ore:Aggregation">
<sch:assert test="edm:isShownAt or edm:isShownBy">
[Error message]
</sch:assert>
</sch:rule>
</sch:pattern>
XML Schema
XML Schema: not ideal!
 Specific to a syntax
 Document-centric approach to validation
Back to square one: records!
 Forces us to enumerate the attributes with extra constraints, especially
order of elements
It’s really a super-closed world
 Schematron does slightly better, but then we have two constraint
languages co-existing in a same implementation
EDM as a “real” application profile?
 It is an application profile, already: mixing several vocabularies,
adding specific constraints
 Documentation includes definitions with constraints and
examples
 Interpretation of constraint in APs fit quite well
• AP constraints are expressed on the data
• Europeana needs dataset-level validation, mostly
EDM as a real application profile?
A fragment in DSP XML
<DescriptionTemplate ID=”aggregation" standalone=”yes">
<ResourceClass>ore:Aggregation</ResourceClass>
<StatementTemplate minOccurs="1" maxOccurs="1”>
<Property>edm:aggregatedCHO</Property>
</StatementTemplate>
<StatementTemplate minOccurs="1”>
<Property>edm:isShownBy</Property>
<Property>edm:isShownAt</Property>
</StatementTemplate>
</DescriptionTemplate>
http://dublincore.org/documents/dc-dsp/
Could be converted to other constraint
checking formalisms (1/2): SPIN
SPARQL Inferencing Notation http://spinrdf.org
ore:Aggregation
spin:constraint
[ a sp:Ask ;
sp:text """
# either isShownBy or isShownAt must be present
ASK WHERE {
{?this isShownBy ?image } UNION {?this isShownBy ?page }
}"""
] .
Issue: still looks like adding semantics to ore:Aggregation in general…
Could be converted to other constraint
checking formalisms (1/2): Stardog ICV
Integrity Constraint Validation http://stardog.com/docs/sdp/
Class: ore:Aggregation
SubClassOf: exactly 1 edm:aggregatedCHO
Class: ore:Aggregation
SubClassOf: min 1 edm:isShownBy or min 1 edm:isShownAt
Note: this is OWL2’s ‘Manchester Syntax’
Stardog accepts OWL, SWRL and SPARQL, uses SPARQL as back-end
Issue: still looks like adding semantics to ore:Aggregation in general…
Conclusions
Europeana requirements seem to be met by the AP approach, if this AP
approach is matched with SPARQL constraints
Much better than to try to partially catch constraints in OWL and
XML+Schematron as isolated machine-readable specs
Needs further testing
incl. trying to express all constraints with DSP and SPARQL queries
(with or without the help of a higher-level language)
An area that needs maturation
Maintainers (like us) may have validation specs in various forms
DSP is not flying, RDF validation still worthy of workshops!
https://www.w3.org/2012/12/rdf-val/
Thank you!
aisaac@few.vu.nl

More Related Content

What's hot

W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011
Antoine Isaac
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
DeVonne Parks, CEM
 
Eun lre brussels_winer20100616
Eun lre brussels_winer20100616Eun lre brussels_winer20100616
Eun lre brussels_winer20100616
Dov Winer
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021
Antoine Isaac
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at Europeana
Antoine Isaac
 
The Europeana Data Model Principles, community and innovation
The Europeana Data Model  Principles, community and innovationThe Europeana Data Model  Principles, community and innovation
The Europeana Data Model Principles, community and innovation
Antoine Isaac
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
Antoine Isaac
 
Exploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceExploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practice
CARARE
 
Harmony project - JISC Synthesis meeting 2001
Harmony project - JISC Synthesis meeting 2001Harmony project - JISC Synthesis meeting 2001
Harmony project - JISC Synthesis meeting 2001
Dan Brickley
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018
Antoine Isaac
 
Europeana bergen may2010_dovwiner
Europeana bergen may2010_dovwinerEuropeana bergen may2010_dovwiner
Europeana bergen may2010_dovwiner
Dov Winer
 
EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital Heritage
Max Kaiser
 
LIBER and its EU projects
LIBER and its EU projectsLIBER and its EU projects
LIBER and its EU projects
LIBER Europe
 
Linked Open Europeana: Semantics for the Citizen
Linked Open Europeana: Semantics for the CitizenLinked Open Europeana: Semantics for the Citizen
Linked Open Europeana: Semantics for the Citizen
Stefan Gradmann
 
Calames :: CERL seminar (Paris, 2008)
Calames :: CERL seminar (Paris, 2008)Calames :: CERL seminar (Paris, 2008)
Calames :: CERL seminar (Paris, 2008)
Y. Nicolas
 
Uniting Digitization & Heritage Metadata : Calames Plus & other tracks
Uniting  Digitization & Heritage Metadata : Calames Plus & other tracksUniting  Digitization & Heritage Metadata : Calames Plus & other tracks
Uniting Digitization & Heritage Metadata : Calames Plus & other tracks
ABES
 
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with EuropeanaThe Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
Vangelis Banos
 

What's hot (17)

W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
 
Eun lre brussels_winer20100616
Eun lre brussels_winer20100616Eun lre brussels_winer20100616
Eun lre brussels_winer20100616
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at Europeana
 
The Europeana Data Model Principles, community and innovation
The Europeana Data Model  Principles, community and innovationThe Europeana Data Model  Principles, community and innovation
The Europeana Data Model Principles, community and innovation
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
 
Exploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceExploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practice
 
Harmony project - JISC Synthesis meeting 2001
Harmony project - JISC Synthesis meeting 2001Harmony project - JISC Synthesis meeting 2001
Harmony project - JISC Synthesis meeting 2001
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018
 
Europeana bergen may2010_dovwiner
Europeana bergen may2010_dovwinerEuropeana bergen may2010_dovwiner
Europeana bergen may2010_dovwiner
 
EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital Heritage
 
LIBER and its EU projects
LIBER and its EU projectsLIBER and its EU projects
LIBER and its EU projects
 
Linked Open Europeana: Semantics for the Citizen
Linked Open Europeana: Semantics for the CitizenLinked Open Europeana: Semantics for the Citizen
Linked Open Europeana: Semantics for the Citizen
 
Calames :: CERL seminar (Paris, 2008)
Calames :: CERL seminar (Paris, 2008)Calames :: CERL seminar (Paris, 2008)
Calames :: CERL seminar (Paris, 2008)
 
Uniting Digitization & Heritage Metadata : Calames Plus & other tracks
Uniting  Digitization & Heritage Metadata : Calames Plus & other tracksUniting  Digitization & Heritage Metadata : Calames Plus & other tracks
Uniting Digitization & Heritage Metadata : Calames Plus & other tracks
 
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with EuropeanaThe Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
 

Similar to Validation of Europeana data: application profile, OWL ontology, or else?

Report on Work of Joint DCMI/IEEE LTSC Task Force
Report on Work of Joint DCMI/IEEE LTSC Task ForceReport on Work of Joint DCMI/IEEE LTSC Task Force
Report on Work of Joint DCMI/IEEE LTSC Task Force
Eduserv Foundation
 
Innovate2011 Keys to Building OSLC Integrations
Innovate2011 Keys to Building OSLC IntegrationsInnovate2011 Keys to Building OSLC Integrations
Innovate2011 Keys to Building OSLC Integrations
Steve Speicher
 
Entity Framework 4
Entity Framework 4Entity Framework 4
Entity Framework 4
Stefano Paluello
 
Ado.Net Data Services (Astoria)
Ado.Net Data Services (Astoria)Ado.Net Data Services (Astoria)
Ado.Net Data Services (Astoria)
Igor Moochnick
 
EDM for Europeana Sounds
EDM for Europeana SoundsEDM for Europeana Sounds
EDM for Europeana Sounds
Europeana_Sounds
 
RESTful Services
RESTful ServicesRESTful Services
RESTful Services
Kurt Cagle
 
Facilitating Busines Interoperability from the Semantic Web
Facilitating Busines Interoperability from the Semantic WebFacilitating Busines Interoperability from the Semantic Web
Facilitating Busines Interoperability from the Semantic Web
Roberto García
 
Modern Database Development Oow2008 Lucas Jellema
Modern Database Development Oow2008 Lucas JellemaModern Database Development Oow2008 Lucas Jellema
Modern Database Development Oow2008 Lucas Jellema
Lucas Jellema
 
EDM for sounds
EDM for soundsEDM for sounds
EDM for sounds
Cecile Devarenne
 
oodb.ppt
oodb.pptoodb.ppt
oodb.ppt
ISHAAGARWAL75
 
Euclid Data Model 101 - Episode 01: Overview
Euclid Data Model 101 - Episode 01: OverviewEuclid Data Model 101 - Episode 01: Overview
Euclid Data Model 101 - Episode 01: Overview
euc-dm-test
 
The JISC DC Application Profiles: Some thoughts on requirements and scope
The JISC DC Application Profiles: Some thoughts on requirements and scopeThe JISC DC Application Profiles: Some thoughts on requirements and scope
The JISC DC Application Profiles: Some thoughts on requirements and scope
Eduserv Foundation
 
Building social and RESTful frameworks
Building social and RESTful frameworksBuilding social and RESTful frameworks
Building social and RESTful frameworks
brendonschwartz
 
Oodb
OodbOodb
Metadata Cloud
Metadata CloudMetadata Cloud
Metadata Cloud
Norm Friesen
 
Runtime Environment Of .Net Divya Rathore
Runtime Environment Of .Net Divya RathoreRuntime Environment Of .Net Divya Rathore
Runtime Environment Of .Net Divya Rathore
Esha Yadav
 
Eclipse Modeling Framework
Eclipse Modeling FrameworkEclipse Modeling Framework
Eclipse Modeling Framework
Ajay K
 
DC-2008 Architecture Forum Open session
DC-2008 Architecture Forum Open sessionDC-2008 Architecture Forum Open session
DC-2008 Architecture Forum Open session
Mikael Nilsson
 
Description Set profiles
Description Set profilesDescription Set profiles
Description Set profiles
Mikael Nilsson
 
Entity framework 4.0
Entity framework 4.0Entity framework 4.0
Entity framework 4.0
Abhishek Sur
 

Similar to Validation of Europeana data: application profile, OWL ontology, or else? (20)

Report on Work of Joint DCMI/IEEE LTSC Task Force
Report on Work of Joint DCMI/IEEE LTSC Task ForceReport on Work of Joint DCMI/IEEE LTSC Task Force
Report on Work of Joint DCMI/IEEE LTSC Task Force
 
Innovate2011 Keys to Building OSLC Integrations
Innovate2011 Keys to Building OSLC IntegrationsInnovate2011 Keys to Building OSLC Integrations
Innovate2011 Keys to Building OSLC Integrations
 
Entity Framework 4
Entity Framework 4Entity Framework 4
Entity Framework 4
 
Ado.Net Data Services (Astoria)
Ado.Net Data Services (Astoria)Ado.Net Data Services (Astoria)
Ado.Net Data Services (Astoria)
 
EDM for Europeana Sounds
EDM for Europeana SoundsEDM for Europeana Sounds
EDM for Europeana Sounds
 
RESTful Services
RESTful ServicesRESTful Services
RESTful Services
 
Facilitating Busines Interoperability from the Semantic Web
Facilitating Busines Interoperability from the Semantic WebFacilitating Busines Interoperability from the Semantic Web
Facilitating Busines Interoperability from the Semantic Web
 
Modern Database Development Oow2008 Lucas Jellema
Modern Database Development Oow2008 Lucas JellemaModern Database Development Oow2008 Lucas Jellema
Modern Database Development Oow2008 Lucas Jellema
 
EDM for sounds
EDM for soundsEDM for sounds
EDM for sounds
 
oodb.ppt
oodb.pptoodb.ppt
oodb.ppt
 
Euclid Data Model 101 - Episode 01: Overview
Euclid Data Model 101 - Episode 01: OverviewEuclid Data Model 101 - Episode 01: Overview
Euclid Data Model 101 - Episode 01: Overview
 
The JISC DC Application Profiles: Some thoughts on requirements and scope
The JISC DC Application Profiles: Some thoughts on requirements and scopeThe JISC DC Application Profiles: Some thoughts on requirements and scope
The JISC DC Application Profiles: Some thoughts on requirements and scope
 
Building social and RESTful frameworks
Building social and RESTful frameworksBuilding social and RESTful frameworks
Building social and RESTful frameworks
 
Oodb
OodbOodb
Oodb
 
Metadata Cloud
Metadata CloudMetadata Cloud
Metadata Cloud
 
Runtime Environment Of .Net Divya Rathore
Runtime Environment Of .Net Divya RathoreRuntime Environment Of .Net Divya Rathore
Runtime Environment Of .Net Divya Rathore
 
Eclipse Modeling Framework
Eclipse Modeling FrameworkEclipse Modeling Framework
Eclipse Modeling Framework
 
DC-2008 Architecture Forum Open session
DC-2008 Architecture Forum Open sessionDC-2008 Architecture Forum Open session
DC-2008 Architecture Forum Open session
 
Description Set profiles
Description Set profilesDescription Set profiles
Description Set profiles
 
Entity framework 4.0
Entity framework 4.0Entity framework 4.0
Entity framework 4.0
 

More from Antoine Isaac

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
Antoine Isaac
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'Europeana
Antoine Isaac
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plans
Antoine Isaac
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana mission
Antoine Isaac
 
Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018
Antoine Isaac
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...
Antoine Isaac
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
Antoine Isaac
 
Europeana et IIIF
Europeana et IIIFEuropeana et IIIF
Europeana et IIIF
Antoine Isaac
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at Europeana
Antoine Isaac
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Antoine Isaac
 
Europeana APIs
Europeana APIsEuropeana APIs
Europeana APIs
Antoine Isaac
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpedia
Antoine Isaac
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotations
Antoine Isaac
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015
Antoine Isaac
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Antoine Isaac
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Antoine Isaac
 
Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15
Antoine Isaac
 
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Antoine Isaac
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
Antoine Isaac
 
Europeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsEuropeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E results
Antoine Isaac
 

More from Antoine Isaac (20)

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'Europeana
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plans
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana mission
 
Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
 
Europeana et IIIF
Europeana et IIIFEuropeana et IIIF
Europeana et IIIF
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at Europeana
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
 
Europeana APIs
Europeana APIsEuropeana APIs
Europeana APIs
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpedia
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotations
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WS
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...
 
Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15
 
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
 
Europeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsEuropeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E results
 

Recently uploaded

LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
DanBrown980551
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
Fwdays
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
FilipTomaszewski5
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
LizaNolte
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
ScyllaDB
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
Enterprise Knowledge
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024
Vadym Kazulkin
 
Christine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptxChristine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptx
christinelarrosa
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
zjhamm304
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
christinelarrosa
 

Recently uploaded (20)

LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024
 
Christine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptxChristine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptx
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
 

Validation of Europeana data: application profile, OWL ontology, or else?

  • 1. Validation of Europeana data: application profile, OWL ontology, or else? Antoine Isaac Application Profiles as an alternative to OWL Ontologies Dublin Core Conference 5 September 2013
  • 2. EDM basic pattern  A data provider submits to Europeana a “bundle” of an object and its digital representation(s) For a general presentation on Europeana and EDM rationale see http://pro.europeana.eu/edm-documentation
  • 4. Provided Cultural Heritage Object (CHO) and descriptive metadata
  • 5. Web Resources – digital representations
  • 6. Aggregations – Bundling it all together
  • 7. EDM Specs http://pro.europeana.eu/edm-documentation - EDM Definition: - Mapping Guidelines and templates - XML Schema - OWL ontology
  • 8. EDM Definitions High level definition of classes and properties edm:aggregatedCHO - Definition: This property associates an ORE aggregation with the cultural heritage object(s) (CHO for short) it is about. - Subproperty of: ore:aggregates, dc:subject, P129_is_about - Domain: ore:Aggregation - Range: edm:ProvidedCHO
  • 9. EDM Definitions  Avoids adding semantics to re-used classes and properties Except for mapping purposes, hierarchies of classes and properties for inference dc:contributor rdfs:subPropertyOf edm:hasMet .  Borderline case of axioms not in formal version of original specs ore:proxyIn - Obligation & Occurrence: A proxy may be in 1 to many aggregations, and an aggregation may have 0 to many proxies in it
  • 10. EDM Definitions First hints at data constraints edm:dataProvider - Obligation & Occurrence: Mandatory for Europeana (Minimum: 1, Maximum: 1) edm:currentLocation - Domain: The set of cultural heritage objects that Europeana collects descriptions about, represented in the EDM by ProvidedCHOs and ORE proxies for these CHOs. edm:aggregatedCHO - Obligation & Occurrence: In Europeana, an aggregation aggregates exactly one CHO
  • 11. Data validation: Europeana requirements EDM is RDF-oriented: unbounded web of information, etc. But Europeana needs to enforce constraints on the data it receives  Data that meets basic Europeana function requirements • An Aggregation should always have an edm:aggregatedCHO • There must be exactly one edm:type -- the value must be TEXT, VIDEO, SOUND, IMAGE or 3D  Data quality criteria • A ProvidedCHO should have at least a dc:title or a dc:description We need specs for validation that are easily shareable, both for humans and machines
  • 12. EDM Mapping Guidelines  Document written after the EDM Definitions  Tries to formulate clearer instructions for Europeana providers  Template-based, e.g. for provider’s Aggregation: property value type cardinality
  • 13. Machine-readable specs as OWL ontologies? OWL is good for writing constraints, but not for validation! Quite OK  “Value types” via owl:ObjectProperty owl:DatatypeProperty in OWL(DL)  Data ranges (TEXT-VIDEO-SOUND-IMAGE-3D) Less ok:  Object domain and ranges  (qualified) cardinality axioms Including combinations: (either isShownBy OR isShownAt is mandatory)
  • 14. We’ve created an OWL version of EDM […] <owl:equivalentClass> <owl:Restriction> <owl:cardinality rdf:datatype="&xsd;nonNegativeInteger">1</owl:cardinality> <owl:onProperty rdf:resource="&edm;aggregatedCHO"/> </owl:Restriction> </owl:equivalentClass> […] OWL? https://github.com/europeana/corelib/blob/master/c orelib-solr-definitions/src/main/resources/eu/rdf/ But these are not really validation axioms And it’s bad practice to add semantics to classes and properties that already exist, such as ore:Aggregation (let’s be honest: we were not ready for full RDF/OWL compatibility anyway…)
  • 15. EDM is implemented by an XML Schema (for RDF data!) […] <sequence> <element ref="edm:aggregatedCHO" maxOccurs="1" minOccurs="1"/> <element ref="edm:dataProvider" maxOccurs="1" minOccurs="1"/> <element ref="edm:isShownAt" maxOccurs="1" minOccurs="0"/> <element ref="edm:isShownBy" maxOccurs="1" minOccurs="0"/> <element ref="edm:object" maxOccurs="1" minOccurs="0"/> <element ref="edm:provider" maxOccurs="1" minOccurs="1"/> <element ref="dc:rights" maxOccurs="unbounded" minOccurs="0"/> <element ref="edm:rights" maxOccurs="1" minOccurs="1"/> </sequence> […] XML Schema
  • 16. And Schematron rules: […] <sch:pattern name="Either Is shownby or is shownat should be present"> <sch:rule context="ore:Aggregation"> <sch:assert test="edm:isShownAt or edm:isShownBy"> [Error message] </sch:assert> </sch:rule> </sch:pattern> XML Schema
  • 17. XML Schema: not ideal!  Specific to a syntax  Document-centric approach to validation Back to square one: records!  Forces us to enumerate the attributes with extra constraints, especially order of elements It’s really a super-closed world  Schematron does slightly better, but then we have two constraint languages co-existing in a same implementation
  • 18. EDM as a “real” application profile?  It is an application profile, already: mixing several vocabularies, adding specific constraints  Documentation includes definitions with constraints and examples  Interpretation of constraint in APs fit quite well • AP constraints are expressed on the data • Europeana needs dataset-level validation, mostly
  • 19. EDM as a real application profile? A fragment in DSP XML <DescriptionTemplate ID=”aggregation" standalone=”yes"> <ResourceClass>ore:Aggregation</ResourceClass> <StatementTemplate minOccurs="1" maxOccurs="1”> <Property>edm:aggregatedCHO</Property> </StatementTemplate> <StatementTemplate minOccurs="1”> <Property>edm:isShownBy</Property> <Property>edm:isShownAt</Property> </StatementTemplate> </DescriptionTemplate> http://dublincore.org/documents/dc-dsp/
  • 20. Could be converted to other constraint checking formalisms (1/2): SPIN SPARQL Inferencing Notation http://spinrdf.org ore:Aggregation spin:constraint [ a sp:Ask ; sp:text """ # either isShownBy or isShownAt must be present ASK WHERE { {?this isShownBy ?image } UNION {?this isShownBy ?page } }""" ] . Issue: still looks like adding semantics to ore:Aggregation in general…
  • 21. Could be converted to other constraint checking formalisms (1/2): Stardog ICV Integrity Constraint Validation http://stardog.com/docs/sdp/ Class: ore:Aggregation SubClassOf: exactly 1 edm:aggregatedCHO Class: ore:Aggregation SubClassOf: min 1 edm:isShownBy or min 1 edm:isShownAt Note: this is OWL2’s ‘Manchester Syntax’ Stardog accepts OWL, SWRL and SPARQL, uses SPARQL as back-end Issue: still looks like adding semantics to ore:Aggregation in general…
  • 22. Conclusions Europeana requirements seem to be met by the AP approach, if this AP approach is matched with SPARQL constraints Much better than to try to partially catch constraints in OWL and XML+Schematron as isolated machine-readable specs Needs further testing incl. trying to express all constraints with DSP and SPARQL queries (with or without the help of a higher-level language) An area that needs maturation Maintainers (like us) may have validation specs in various forms DSP is not flying, RDF validation still worthy of workshops! https://www.w3.org/2012/12/rdf-val/

Editor's Notes

  1. Special session page: http://dcevents.dublincore.org/IntConf/index/pages/view/APaltOO
  2. View the object at: http://www.europeana.eu/portal/record/09102/_CM_0161930.html
  3. &quot;application profile” seen a bit as a binary relation between two vocabularies, ignoring what may happen beside the vocabulary that is being re-used (i.e. whether the AP uses other elements and whether these new elements are re-used from third-party contexts or newly coined).