1. Linked (Open) Europeana : Vernetzte Daten in der Europäischen Digitalen Bibliothek Prof. Dr. Stefan Gradmann Unter Rückgriff auf M. Doerr, S. Hennicke , A. Isaac, C. Meghini, G. Schreiber, H. Van de Sompel Und Arbeiten von Europeana V1.0 und EuropeanaConnect
9. Alte und neue Begriffe (thanks, Karen Coyle!) Katalog Bestand 'Record' Dokument Suche Bibliothek Information Aggregation Exploration Navigation Verbindung Kontext Wissen
10. Nach dem Katalog: Objekte und Semantischer Kontext in Europeana Kontextualisierte Objektrepräsentationen (-> Wissensgenerierung): Europeana verbindet Objekt- und Wissensorganisationssysteme!
11. Die Semantische Datenschicht Überbrückt Informationsinseln und verbindet Objektrepräsentationen aus verschiedenen Gedächtnisdomänen. Bibliothek Archiv Museum
12. EDM und Linked Open Data Kontextoptionen DBpedia VIAF GND Geonames LCSH … Europeana-Objekte
47. Ereignisbasierte Modellierung Erhalt und Nutzung der Originaldaten impliziert Kompatibilität der Beschreibungen jenseits der einfachen Objektsicht!
51. ESE in EDM <...xmlns:europeana="http://www.europeana.eu/schemas/ese/“ ... <!--National Library of Poland--> <record> <dc:identifier>urn:repox.bn.org.pl:polona:oai:www.polona.pl:4445</dc:identifier> <europeana:uri>http://www.europeana.eu/resolve/record/01406/18A8F5DFCAE75694A8AAD18E2791861FBE92589A</europeana:uri> <dc:title>[Ośmiu powstańców uzbrojonych w kosy] [Dokument ikonograficzny]</dc:title> <dc:subject>Powstanie 1863 r. styczniowe - ikonografia</dc:subject> <dc:date>[1863-1864]</dc:date> <dc:rights>Biblioteka Narodowa</dc:rights> <europeana:object>http://193.59.172.16/szzz/IsShownBy.do?id=6481</europeana:object> <europeana:isShownAt>http://193.59.172.16/szzz/ShowStart.do?id=6481</europeana:isShownAt> </record> Europeana Audiovisual Community Meeting20100409
52. ESE in EDM 5.0 Europeana Audiovisual Community Meeting20100409
53. Les Fleurs du Mal: UNIMARC http://catalogue.bnf.fr/ark:/12148/cb37367035f 000 nam 22 450 001FRBNF373670350000003 009http://catalogue.bnf.fr/ark:/12148/cb37367035f 039 $oGEA$a000288182 100 $a19920409d1857 m y0frey50 ba 1010 $afre 102 $aFR 105 $a||||z 00||| 106 $ar 2001 $aˆLes ‰fleurs du mal$bTexte imprimé$fpar Charles Baudelaire 210 $aParis$cPoulet-Malassis et De Broise$d1857 215 $a248 p.$d19 cm 676 $a841.8$v22 686 $a840$2Cadre de classement de la Bibliographie nationale française 700 |$311890582$aBaudelaire$bCharles$4070 801 0$aFR$bBNF$c19920409$gAFNOR$2intermrc
54. Les Fleurs du Mal: Gallica http://gallica.bnf.fr/ark:/12148/bpt6k70861t
55. Les Fleurs du Mal: Digitalisiert http://gallica.bnf.fr/ark:/12148/bpt6k70861t.textePage.f1
65. Martin Doerr, Stefan Gradmann, Steffen Hennicke, Antoine Isaac, Carlo Meghini, Herbert van de Sompel: The Europeana Data Model. IFLA 2010 (Gothenburg). Session on „Libraries and the Semantic Web“. http://www.ifla.org/files/hq/papers/ifla76/149-doerr-en.pdf
68. RDA (Resource Description and Access) RDA-basierte Metadaten werden in EDM integrierbar sein, sobald die FRBR-Entitäten unterstützt werden.
69. Skalierbarkeit von RDF 'triple stores' ist kein kritischer Faktor. Relationale Datenhaltung mit Tripel-Aufsätzen wie D2R-Prozessoren sind eine gangbare Alternative zu nativen 'triple stores'.
70. LOD (Linked Open Data) EDM erlaubt die technische Integration von Europeana in die Welt der Linked Open Data – ersetzt aber keinesfalls eine entsprechend angepasste Politik der Datenlieferanten!
71. LoD Pilot : Ab Ende Mai 2011 … und heute in Leipzig!
86. Dem entsprechend zielen wir in Europeana explizit auf ein Open Linked Data (mehr in einem Statement der Europeana Foundation unter http://bit.ly/fe637P) und streben an, Europeana-Daten unter CC0 verfügbar zu machen!
87.
88. Zugang zu den Daten und deren Verwendung entfällt als Einkommensquelle
89.
90. -> Was ist der (kommerzielle!) Wert semantischer Kontextualisierung?
91. Europeana v1.0 WP3 EuropeanaConnect WP1 Prof. Dr. Stefan Gradmann Humboldt-Universität zu Berlin Institut für Bibliotheks- und Informationswissenschaft Unter den Linden 6 10099 Berlin Tel.: +49 30 2093-4481 e-mail: stefan.gradmann@ibi.hu-berlin.de Fragen ?
Editor's Notes
EDM re-uses three ontologies all of which are defined as a RDFS model. SKOS SKOS is an ontology to model KOS (vocabularies) in the Semantic Data Layer of Europeana. It specifically enables cross-vocabulary matching between concepts. Dublin Core Dublin Core is used to describe the core features of culture objects. ESE uses “old” Dublin Core Element Set. EDM uses “new” Dublin Core Metadata Terms which are specializations of the 15 “old” Dublin Core Elements. The use of DC Terms ensures backward compatibility to ESE. OAI ORE The typical record about an object provided to Europeana will included several information pieces: e.g. with descriptive metadata, views (thumbnails, video files, audio files, text documents etc.), links to landing pages etc. OAI ORE allows us to group and organize these information pieces: the abstract “provided object” (Object), the descriptive metadata (Proxy), any “view” of the provided object (Digital Representation).
The current data model of Europeana are the “Europeana Semantic Elements” (ESE). ESE addresses the issue of interoperability between the data from the different domains represented in Europeana by reducing the data to a “flat”, Dublin-Core like representation. This is a “simple and robust” approach but it has some drawbacks: The original metadata and information perspective are not visible anymore. And at the same time we can not specialize to finer-grained models or connect to external resources like LOD community. The EDM addresses exactly these shortcomings . It tries to transcend the different information perspectives which are represented in Europeana. It acts as a top-level ontology in order to make objects from different domains interoperable while still preserving the original data. The EDM is destined to replace ESE after the 2011 release of Europeana. The ESE will then be an „application profile“ of EDM. That means that all ESE data in Europeana will be still compatible with the new system.
First a few words about the envisioned information architecture of Europeana: This is how the information space of Europeana will be restructured : At the “bottom” we have the objects which are provided to Europeana. Above we have the “Semantic Data Layer” which is new. It contains various kinds of KOSs with knowledge about people, places, concepts, and so on. These concepts are linked to the objects below and thereby contextualize and enrich them.
The data provided to Europeana will come from many different kinds of domains like libraries, archives, or museums. They all will provide their specific collections and KOSs . That will naturally result in „isles of information“ . In order to make the data interoperable the concepts of the various KOSs in the Semantic Data Layer will be aligned , that means they will be connected via cross-vocabulary links . This technically enables applications to navigate through a semantic layer of concepts from different sources and to use it to access objects which are originally described by different but semantically related concepts.
Europeana intends to connect to the Linked Open Data community. In the Linked Open Data cloud we find many more knowledge sources like Dbpedia, Geonames, or Library of Congress Subject Headings. Europeana wants to use them to further contextualize and enrich the objects in its information space. At the same time Europeana wants to make its own data available to other communities. The EDM is crucial for realizing this vision. [ LOD cloud July 2009 ]
The current data model of Europeana are the “Europeana Semantic Elements” (ESE). ESE addresses the issue of interoperability between the data from the different domains represented in Europeana by reducing the data to a “flat”, Dublin-Core like representation. This is a “simple and robust” approach but it has some drawbacks: The original metadata and information perspective are not visible anymore. And at the same time we can not specialize to finer-grained models or connect to external resources like LOD community. The EDM addresses exactly these shortcomings . It tries to transcend the different information perspectives which are represented in Europeana. It acts as a top-level ontology in order to make objects from different domains interoperable while still preserving the original data. The EDM is destined to replace ESE after the 2011 release of Europeana. The ESE will then be an „application profile“ of EDM. That means that all ESE data in Europeana will be still compatible with the new system.
The current data model of Europeana are the “Europeana Semantic Elements” (ESE). ESE addresses the issue of interoperability between the data from the different domains represented in Europeana by reducing the data to a “flat”, Dublin-Core like representation. This is a “simple and robust” approach but it has some drawbacks: The original metadata and information perspective are not visible anymore. And at the same time we can not specialize to finer-grained models or connect to external resources like LOD community. The EDM addresses exactly these shortcomings . It tries to transcend the different information perspectives which are represented in Europeana. It acts as a top-level ontology in order to make objects from different domains interoperable while still preserving the original data. The EDM is destined to replace ESE after the 2011 release of Europeana. The ESE will then be an „application profile“ of EDM. That means that all ESE data in Europeana will be still compatible with the new system.
EDM re-uses three ontologies all of which are defined as a RDFS model. SKOS SKOS is an ontology to model KOS (vocabularies) in the Semantic Data Layer of Europeana. It specifically enables cross-vocabulary matching between concepts. Dublin Core Dublin Core is used to describe the core features of culture objects. ESE uses “old” Dublin Core Element Set. EDM uses “new” Dublin Core Metadata Terms which are specializations of the 15 “old” Dublin Core Elements. The use of DC Terms ensures backward compatibility to ESE. OAI ORE The typical record about an object provided to Europeana will included several information pieces: e.g. with descriptive metadata, views (thumbnails, video files, audio files, text documents etc.), links to landing pages etc. OAI ORE allows us to group and organize these information pieces: the abstract “provided object” (Object), the descriptive metadata (Proxy), any “view” of the provided object (Digital Representation).
In the physical world we create, use, and refer to aggregations of things all the time. We collect pictures in a photo album, read journals that are collections of articles, and burn CDs of our favorite songs. In this physical world these aggregations are frequently tangible - we can hold the photo album, journal, and CD. But, we also aggregate abstract entities – for instance on the WEb. OAI-ORE makes it possible to identify an aggregation .
Mona Lisa as described and depicted by the French ministry of culture (Directions des musees de France)
This is the metadata record of the French ministry of culture modeled in EDM. Each bubble represent a resource. In the bubble you have the class of the resource (its type) in italics and beneath the URI of the resource which identifies it. The arrows are the semantic links (the properties) between the resources. If there are two properties then the one below is the sub-property of the other one with a more specific meaning. First we have the Aggregation node which groups together all information pieces delivered by the Ministry. It aggregates the node representing the physical object “Mona Lisa”, the digital representations of the Mona Lisa, and the proxy node which is specific to a given provider, and is used to represent the description of the provided object, as seen from the perspective of that specific provider. This is how every metadata record provided to Europeana will look like in its basic form. Why manage central nodes for provided objects? The ORE model says so: an ORE proxy has to be proxy for some &quot;view- independent&quot; resource. Users are looking for (real world) objects (the painting Mona Lisa) and not for the specific view on it of Louvre, or Jaconde (of which they normally do not know anyway). So the approach is: Find the object first (PhysicalThing) and then proceed to the specific views on it. This is also the LOD approach.
Europeana wants to contextualize and enrich its objects by linking them to resources which contain additional knowledge. This enables richer functions, such as query expansion (e.g., using alternatives for a creator's name), recommendation of objects using semantic relations between them (objects created by connected artists), etc. This is the same Proxy from the slide before but now all the string values are converted to resources and typed. For example the subject of the painting Mona Lisa “femme” is now a resource typed as a concept and with the english and french spelling of the concept attached taken from a KOS in the Semantic Data Layer. And in the same KOS we could also properly find the broader term for this concept. Furthermore we could semantically align the concept femme with the concept femme in the Wikipedia (LOD cloud) and take all the information available there for this specific subject, including the many translations of the term itself. To increase the data value of its objects.
What we looked at so far can be understood as object-centric modeling. The second general modeling approach is event-centric which tries to tell a story about the object’s history. For this purpose EDM provides a simple “event-centric core” of one class and three properties: ens:Event: hub for event descriptions ens:wasPresentAt, holding between any resource and an event it is involved in; ens:happenedAt, holding between an event and a place; ens:occurredAt, holding between events and the time spans during which they occurred. This is to give you an impression of what is possible without going into details.
This is a (more or less fictional) example of three records about a translation of Edgar Allan Poe’s “ The Narrative of Arthur Gordon Pym of Nantucket ” to french: Record from BNF about an edition from 1868 Record from Gallica about an edition from 1868 (which offers a digital version of the book online: this the WebResource) Record from BNF with an edition from 2007 A few things I want to point out: Two records about the same thing and both point to the same object of interest, the 1868 edition. The user will look for this edition and not for the specific view of Gallica or BNF on this edition. So this node is the point of entry from which a user will proceed to a specific view on the object. It is also apparent now why Proxies for the descriptive metadata are helpful: Because this way we can keep the two views on the 1868 edition distinct. Finally the link „isDerivativeOf“ is an example of an inter-object link. So, for example, if a user found the 2007 edition he will be also hinted to the digital version of the 1868 edition in Gallica. With respect to FRBR one could start discussing now what and where is the work, expression, manifestation, and item here. Although the development of the EDM has been inspired by FRBR it is not implemented yet. That will happen after 2011.
EDM is still under development, and will continue to be refined until the end of 2010. It will be implemented during 2011, in the lead up to the Danube release of Europeana. Before, during and after the implementation of EDM, data that is compliant only with ESE will continue to be accepted. EDM is compatible with ESE and no data will need to be resubmitted. Europeana will make available a converter, and any provider who wishes to resubmit data, in order to increase its richness within Europeana, will be able to do so if they wish but will be under no obligation. How will EDM data be delivered to Europeana? Providers will have to create mapping to EDM and deliver it alongside their data which ideally are metadata records properly linked (IDs) to a vocabulary. The data has to be in XML or RDF. From this Europeana will create EDM data which includes enrichments and linking to external resources (vocabularies in the semantic data layer and/or the LOD cloud). Prototyping? At the end of the year we will start to produce first EDM data for the productive version of Europeana. This data will be taken from existing ESE data and from rich data delivered to Europeana by then.