Europeana and schema.org
Antoine Isaac
Dublin Core Conference
Schema.org special session
5 September 2013
Europeana Data Model: an example
For a general presentation on Europeana and EDM rationale see http://pro.europeana.eu/edm-documentation
Provided Cultural Heritage Object (CHO)
and descriptive metadata
Web Resources – digital representations
Aggregations – Bundling it all together
Why use schema.org?
Europeana tries to disseminate data to reach as many users as possible
Search engines
• Customization of result lists – rich snippets
• Knowledge Graph
• Search Engine Optimization
Developers more comfortable with parsing web pages
In fact: schema.org and RDFa
Europeana has been publishing structured metadata via its portal for some time
One application case: customization of public domain pages by
Creative Commons, with details on the work and Europeana
usage guidelines for public domain works
Europeana, Creative Commons and RDFa
http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html
The Creative Commons Public Domain page triggers a script that harvests mark-up on the Europeana object page
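To illustrate what harvesting mark-up from an object page can involve, here is a minimal Python sketch that collects property/value pairs from RDFa-style attributes using only the standard library. It is not the Creative Commons script: the class name, the `harvest` helper and the sample HTML (with its example.org address) are assumptions made for illustration, and a real harvester would use a full RDFa parser.

```python
from html.parser import HTMLParser


class PropertyHarvester(HTMLParser):
    """Collect (property, value) pairs from RDFa-style attributes (simplified)."""

    def __init__(self):
        super().__init__()
        self.pairs = []
        self._pending = None  # property whose value is the element's text content

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        # Properties are carried by @property, or by @rel on links.
        prop = attrs.get("property") or attrs.get("rel")
        if prop is None:
            return
        # The value is taken from @content, @href or @src when present ...
        value = attrs.get("content") or attrs.get("href") or attrs.get("src")
        if value is not None:
            self.pairs.append((prop, value))
        else:
            # ... otherwise from the next non-empty text node.
            self._pending = prop

    def handle_data(self, data):
        if self._pending is not None and data.strip():
            self.pairs.append((self._pending, data.strip()))
            self._pending = None


def harvest(html):
    """Return the (property, value) pairs found in an HTML string."""
    parser = PropertyHarvester()
    parser.feed(html)
    parser.close()
    return parser.pairs


# Hypothetical page fragment, invented for this sketch.
SAMPLE = """
<div about="http://example.org/record/1">
  <span property="dc:subject">Cofre-relicário</span>
  <a rel="xhv:license" href="http://creativecommons.org/publicdomain/mark/1.0/">Public Domain</a>
  <meta property="cc:useGuidelines" content="http://www.europeana.eu/rights/pd-usage-guide/"/>
</div>
"""
```

Calling `harvest(SAMPLE)` yields one pair per annotated element. The sketch ignores subjects (`about`), nesting and prefix declarations, which a conforming RDFa processor handles.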
Going further
Creative Commons uses 5-6 fields, agreed upon with the developer(s) there
What to publish further?
Schema.org as a standardized form for more web page-based data exchange
And it has a use case, in fact several
Still quite prototype-ish
Current EDM – schema.org mapping
EDM element Schema.org mapping
ProvidedCHO & Proxy schema:CreativeWork
dc:contributor schema:contributor
dc:coverage
dc:creator schema:creator
dc:date
dc:description schema:description
dc:format
dc:identifier
dc:language schema:inLanguage
dc:publisher schema:publisher
dc:relation
dc:rights
dc:source
dc:subject schema:about
dc:title schema:name
dc:type
dcterms:alternative schema:alternativeHeadline
dcterms:conformsTo
dcterms:created schema:dateCreated
dcterms:extent
dcterms:hasFormat
dcterms:hasPart
dcterms:hasVersion
dcterms:isFormatOf
dcterms:isPartOf
dcterms:isReferencedBy
dcterms:isReplacedBy
dcterms:isRequiredBy
dcterms:issued schema:datePublished
dcterms:isVersionOf
dcterms:medium
dcterms:provenance
dcterms:references schema:mentions
dcterms:replaces
dcterms:requires
dcterms:spatial
dcterms:tableOfContents
dcterms:temporal
edm:currentLocation schema:contentLocation
edm:hasMet
edm:hasType schema:additionalType
edm:incorporates
edm:isDerivativeOf
edm:isNextInSequence
edm:isRelatedTo
edm:isRepresentationOf schema:about
edm:isSimilarTo
edm:isSuccessorOf
edm:realizes
edm:type N/A
edm:unstored
edm:wasPresentAt
edm:europeanaProxy N/A
edm:userTag schema:keywords or schema:comment
edm:year
ore:proxyFor N/A
ore:proxyIn N/A
owl:sameAs
rdf:type
Aggregation and EuropeanaAggregation
ore:aggregates N/A
edm:aggregatedCHO N/A
edm:country schema:addressCountry
edm:dataProvider schema:provider
edm:hasView schema:url
edm:isShownAt schema:url
edm:isShownBy schema:contentUrl
edm:landingPage schema:url
edm:language
edm:object schema:image, if preview opt-out is NOT activated
edm:preview schema:thumbnailUrl, if preview opt-out is NOT activated
edm:provider schema:provider
dc:rights
edm:rights
edm:ugc
edm:unstored N/A
WebResource schema:WebPage or schema:MediaObject
schema:url
dc:description schema:description
dc:format
dc:rights
dc:source schema:encodesCreativeWork
dcterms:conformsTo
dcterms:created schema:dateCreated
dcterms:extent
dcterms:hasPart
dcterms:isFormatOf
dcterms:issued schema:datePublished
edm:isNextInSequence
edm:rights
Agent schema:Person or schema:Organization
skos:prefLabel schema:name
skos:altLabel, skos:hiddenLabel schema:additionalName
skos:note schema:description
dc:date
dc:identifier
edm:begin
edm:end schema:deathdate
edm:hasMet schema:knows
edm:isRelatedTo
edm:wasPresentAt
foaf:name schema:name
rdaGr2:biographicalInformation schema:description
rdaGr2:dateOfBirth schema:birthdate
rdaGr2:dateOfDeath schema:deathdate
rdaGr2:dateOfEstablishment schema:foundingDate
rdaGr2:dateOfTermination
rdaGr2:gender schema:gender
rdaGr2:professionOrOccupation schema:jobTitle
owl:sameAs
Place schema:Place
wgs84_pos:lat
wgs84_pos:long
wgs84_pos:alt
wgs84_pos:lat_long
skos:prefLabel schema:name
skos:altLabel, skos:hiddenLabel schema:additionalName
skos:note schema:description
dcterms:hasPart
dcterms:isPartOf schema:containedIn
owl:sameAs
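As a rough illustration of how such a mapping table can be applied, the Python sketch below converts a record, given as a dictionary from EDM property to a list of values, into schema.org properties. Only a subset of the rows above is included. The function name, the record layout and the `preview_opt_out` flag are assumptions for this sketch, not the portal's actual implementation.

```python
# Subset of the EDM -> schema.org rows for ProvidedCHO & Proxy.
CHO_MAPPING = {
    "dc:title": "schema:name",
    "dc:creator": "schema:creator",
    "dc:description": "schema:description",
    "dc:subject": "schema:about",
    "dc:language": "schema:inLanguage",
    "dcterms:created": "schema:dateCreated",
}

# Subset of the rows for Aggregation and EuropeanaAggregation.
AGGREGATION_MAPPING = {
    "edm:country": "schema:addressCountry",
    "edm:dataProvider": "schema:provider",
    "edm:provider": "schema:provider",
    "edm:isShownAt": "schema:url",
    "edm:isShownBy": "schema:contentUrl",
    "edm:landingPage": "schema:url",
}

# These two rows apply only if preview opt-out is NOT activated.
PREVIEW_MAPPING = {
    "edm:object": "schema:image",
    "edm:preview": "schema:thumbnailUrl",
}


def to_schema_org(record, preview_opt_out=False):
    """Map {EDM property: [values]} to {schema.org property: [values]}.

    Properties without a mapping (for example dc:date) are dropped.
    """
    mapping = {**CHO_MAPPING, **AGGREGATION_MAPPING}
    if not preview_opt_out:
        mapping.update(PREVIEW_MAPPING)

    out = {"rdf:type": ["schema:CreativeWork"]}
    for edm_prop, values in record.items():
        target = mapping.get(edm_prop)
        if target is None:
            continue  # no schema.org counterpart in the table
        out.setdefault(target, []).extend(values)
    return out
```

For a record holding `dc:title`, `dc:date`, `edm:dataProvider` and `edm:provider`, the result keeps the title as `schema:name`, drops the date, and merges both providers under `schema:provider`.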
Current implementation
A glimpse of an object’s full data
http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html?format=labels
Anatomy of results from an RDFa parser
http://www.w3.org/2012/pyRdfa/
Several flavors of data in it…
<http://www.europeana.eu/resolve/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B> a schema:CreativeWork;
    schema:name "Cofre de base rectangular com tampa de quatro face...";
    schema:about "Cofre-relicário";
    schema:addressCountry "Portugal";
    schema:contentUrl <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=2>;
    schema:creator "Desconhecido";
    schema:description "Cofre de base rectangular com tampa de quatro faces. A urna é decorada com um friso perlado que rodeia representações da Virgem com o Menino, do Calvário e da Virgem em Glória, nas quatro faces. As arestas da tampa são emolduradas por pequenas caneluras; nas duas faces principais um entrelaçado encerra uma arcaria polilobada.";
    schema:image <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=1>;
    schema:provider "Instituto dos Museus e da Conservação",
        "Museu de Alberto Sampaio";
    schema:url <http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html>,
        <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
Schema.org data
    dc11:creator "Desconhecido";
    dc11:date "XIII";
    dc11:description "Cofre de base rectangular com tampa de quatro faces. […].";
    dc11:format "Altura: 9,5 cm; Profundidade: 10 cm; Comprimento: 19,5 cm";
    dc11:identifier "MAS O 37";
    dc11:rights "Copyright © Instituto dos Museus e da Conservação";
    dc11:subject "Cofre-relicário";
    dc11:title "Cofre de base rectangular com tampa de quatro face...";
    dc11:type "Ourivesaria";
    edm:country "Portugal";
    edm:rights <http://creativecommons.org/publicdomain/mark/1.0/>;
    edm:dataProvider "Museu de Alberto Sampaio";
    edm:provider "Instituto dos Museus e da Conservação";
    edm:isShownAt <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
    edm:isShownBy <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=2>;
    edm:landingPage <http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html>;
    edm:object <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?
DC / EDM data
    og:description "Cofre de base rectangular com tampa de quatro faces. A urna é decorada […]";
    og:image "http://europeanastatic.eu/api/image?type=IMAGE&uri=http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=1&size=FULL_DOC";
    og:site_name "Europeana";
    og:title "Cofre de base rectangular com tampa de quatro face... | Desconhecido";
    og:type "website";
    og:url "http://preview.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html" .
FB OpenGraph data
    xhv:license <http://creativecommons.org/publicdomain/mark/1.0/>;
    cc:attributionURL <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
    cc:morePermissions <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
    cc:useGuidelines <http://www.europeana.eu/rights/pd-usage-guide/>;
Creative Commons data
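The four flavors shown on these slides can be told apart by the namespace prefix of each predicate. The Python sketch below groups predicate/value pairs accordingly. It assumes the parser output has already been reduced to prefixed names such as `schema:name`; the `FLAVORS` table and the function name are made up for this illustration.

```python
from collections import defaultdict

# Prefix -> flavor label, following the captions used on the slides.
FLAVORS = {
    "schema": "Schema.org data",
    "dc11": "DC / EDM data",
    "edm": "DC / EDM data",
    "og": "FB OpenGraph data",
    "xhv": "Creative Commons data",
    "cc": "Creative Commons data",
}


def group_by_flavor(pairs):
    """Group (predicate, value) pairs by the vocabulary of the predicate."""
    groups = defaultdict(list)
    for predicate, value in pairs:
        prefix = predicate.split(":", 1)[0]
        # Predicates from vocabularies not listed above end up under "other".
        groups[FLAVORS.get(prefix, "other")].append((predicate, value))
    return dict(groups)
```

Given pairs such as `("schema:name", ...)`, `("dc11:date", "XIII")` and `("og:type", "website")`, each lands in the group named after its slide caption.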
Observations
• Schema.org is simple
• Not everything can be mapped
• We’re losing granularity, including some of the core benefits of
Europeana’s move to the richer EDM!
• But it’s ok, because it matches needs
• And in fact it’s not entirely because of Schema.org
• And we can publish different flavors of the data in RDFa
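One way to make the loss of granularity concrete is to look for schema.org properties that receive more than one EDM property in the mapping. The Python sketch below does this for a handful of rows taken from the mapping table; the helper name and the choice of rows are assumptions for illustration.

```python
from collections import defaultdict

# A few rows from the EDM -> schema.org mapping table.
MAPPING = {
    "dc:subject": "schema:about",
    "edm:isRepresentationOf": "schema:about",
    "edm:dataProvider": "schema:provider",
    "edm:provider": "schema:provider",
    "edm:hasView": "schema:url",
    "edm:isShownAt": "schema:url",
    "edm:landingPage": "schema:url",
    "edm:isShownBy": "schema:contentUrl",
}


def collapsed_targets(mapping):
    """Return schema.org properties that several EDM properties map onto."""
    sources = defaultdict(list)
    for edm_prop, schema_prop in mapping.items():
        sources[schema_prop].append(edm_prop)
    # A target fed by more than one source no longer distinguishes them.
    return {target: props for target, props in sources.items() if len(props) > 1}
```

Here `schema:provider` collects both `edm:dataProvider` and `edm:provider`, so a consumer of the schema.org data can no longer tell the original institution from the aggregator.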
Thank you!
Questions?
aisaac@few.vu.nl
Antoine Isaac

Editor's Notes

  • #3 View the object at: http://www.europeana.eu/portal/record/09102/_CM_0161930.html
  • #4 TODO: update graph
  • #13 https://github.com/europeana/portal/blob/master/portal2/src/test/schema.org.mapping/schema.org.mapping.properties
  • #16 In case it is useful: Europeana has a commitment to output some provenance data