Exploring Audiovisual Archives through Aligned Thesauri

Slides for the presentation given at the MTSR 2016 conference in Göttingen, Germany, for the paper "Exploring Audiovisual Archives through Aligned Thesauri" by Victor de Boer, Matthias Priem, Michiel Hildebrand, Nico Verplancke, Arjen de Vries, and Johan Oomen.

In this paper, we present a case study where partial collections of two audiovisual archives (Netherlands Institute for Sound and Vision and VIAA) are connected by aligning their thesauri. We report on the conversion of one of the thesauri to SKOS and on the subsequent application of an interactive alignment tool, CultuurLINK. Finally, we introduce a cross-collection browser which uses the produced alignment to allow users to explore connections between the two collections.
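
To make the conversion step concrete, here is a minimal sketch of turning a hierarchical term list into SKOS triples. It is not the authors' conversion code: the input format (tuples of id, label, parent id), the base URI `http://example.org/thesaurus/`, and the function names are all assumptions made for illustration. Only the SKOS and RDF property URIs are standard.

```python
# Illustrative only: input records, BASE and function names are hypothetical.
SKOS = "http://www.w3.org/2004/02/skos/core#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
BASE = "http://example.org/thesaurus/"  # assumed namespace, not a real one


def escape_literal(text):
    """Escape backslashes and double quotes for an N-Triples literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def to_skos_triples(terms, base=BASE, lang="nl"):
    """Map (term_id, label, parent_id or None) records to SKOS triples.

    Each term becomes a skos:Concept with a skos:prefLabel; a parent link
    becomes a skos:broader / skos:narrower pair.
    """
    triples = []
    for term_id, label, parent_id in terms:
        subject = f"<{base}{term_id}>"
        triples.append((subject, f"<{RDF_TYPE}>", f"<{SKOS}Concept>"))
        triples.append(
            (subject, f"<{SKOS}prefLabel>", f'"{escape_literal(label)}"@{lang}')
        )
        if parent_id is not None:
            parent = f"<{base}{parent_id}>"
            triples.append((subject, f"<{SKOS}broader>", parent))
            triples.append((parent, f"<{SKOS}narrower>", subject))
    return triples


def to_ntriples(triples):
    """Serialise the triples as N-Triples text, one statement per line."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


# Two made-up terms: "voetbal" (football) is narrower than "sport".
terms = [("1", "sport", None), ("2", "voetbal", "1")]
triples = to_skos_triples(terms)
print(to_ntriples(triples))
```

A real conversion would also need to assign concepts to concept schemes and carry over scope notes and related-term links, which this sketch leaves out.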


  1. Exploring Audiovisual Archives through Aligned Thesauri. Victor de Boer, Matthias Priem, Michiel Hildebrand, Nico Verplancke, Arjen de Vries and Johan Oomen
  2. The dangers of silos. CC-by-nc-nd https://www.flickr.com/photos/joina
  3. The modern archive
  4. ...TO CONTEXT: MUTUALLY CONNECTED COLLECTIONS... (23-11-2016) Connecting collections: topics, people, genres, etc. Catalogue, Photos, Wiki. Internal: Video hyperlinking
  5. External: Networked heritage. Links through vocabularies
  6. Case: Flemish Institute for Archiving (VIAA) and the Netherlands Institute for Sound and Vision (NISV)
     • pipeline of a real-world, international use case that illustrates the end-user benefit of aligned SKOS thesauri;
     • method and tools for converting XML thesauri to SKOS;
     • CultuurLINK, an interactive tool for thesaurus alignment;
     • application that enables cross-collection search and browsing using the aligned thesauri.
  7. Sound and Vision: Dutch AV heritage, > 1.000.000 hrs of TV (public broadcasters), radio, music, docu, film, commercials, etc. VIAA: Flemish archive, including Flemish broadcaster (VRT)
  8. Gemeenschappelijke Thesaurus Audiovisuele Archieven (GTAA): 184,484 terms (concepts, persons, geo, …); 19,695 terms in hierarchy; 9 conceptSchemes; 90,708 scopeNotes; 33,542 relations. Published as SKOS Linked Open Data: http://gtaa.beeldengeluid.nl/
  9. VRT Thesaurus: 100.000+ terms. Structured, but not SKOS yet. No concept schemes
  10. VRT Thesaurus: mapped to SKOS, hierarchies to skos:broader/narrower
  11. VRT Thesaurus: 102,172 terms; 97,744 in hierarchy; 4,429 top concepts; 212 scopeNotes; 6,828 relations. Conversion code available at https://github.com/viaacode/skoscreator and triples available at http://semanticweb.cs.vu.nl/test
  12. Collections
     VIAA:
     • Part of the VRT AV collection
     • +/- 35,000 items (out of ~1 million)
     • Annotated with VRT thesaurus
     • Not publicly available
     NISV:
     • Openimages.eu
     • +/- 3,000 items out of 800K hrs
     • Mostly news broadcasts
     • Annotated with GTAA
     • Publicly available (CC-by-SA)
  13. VRT Thesaurus, GTAA
  14. ALIGNMENT: VRT Thesaurus, GTAA
  15. ‘Happy alignments are all alike; every unhappy alignment is unhappy in its own way’ Jacco van Ossenbruggen (with apologies to Tolstoy)
  16. http://cultuurlink.beeldengeluid.nl/ Semi-automatic SKOS vocabulary alignment service. Successor of EuropeanaConnect’s Amalgame. Users can upload vocabularies and match them with existing vocabularies. Users can design, experiment with, and improve their alignment strategy: matching, selecting, excluding, sampling, evaluating
  17. Example alignment strategy: Concepts
  18. Example alignment strategy: Persons
  19. Four strategies. Nr of correspondences per type: Subjects 4,176; Names 2,197; Locations 4,011; Persons 11,265; Total 21,640
  20. ALIGNMENT: VRT Thesaurus, GTAA
  21. Demonstrator: Information Retrieval tool using Spinque search-by-strategy paradigm. No programming needed, just modelling the IR strategy. Keyword, vocabulary term or Related-Object search. Search on titles, description, vocabulary labels. Weight on collection (user input)
  22. Demonstrator http://link.spinque.com/VIAA-1.0/
  23. Input for keyword search or thesaurus concepts. Search results. Collection indicator. Thesaurus terms associated with video; terms may appear in one thesaurus or in both thesauri. Thesaurus terms associated with retrieval results (grouped by type). Slider used to indicate collection preference/weight. Per result, the thumbnail, title, description, identifier and thesaurus terms are shown
  24. The selected video appears in the search field. Thesaurus terms associated with search results and selection. Play screen. In this case, the user positioned the slider all the way to the right, indicating that he/she is interested in Open Images videos related to this VRT item. List of Open Images videos related to this VRT video. Matching terms are highlighted.
  25. Conclusions
     • Conversion of structured vocabularies to SKOS opens possibilities for connecting collections
     • Interactive alignment produces many useful links
     • Demonstrator shows possibilities of aligned collections
     • Demonstrator will be extended when more collections are available: complete NISV collection metadata (?), complete VIAA collection metadata (?)
  26. Thank you. vdboer@beeldengeluid.nl http://cultuurlink.beeldengeluid.nl http://link.spinque.com/VIAA-1.0/ http://semanticweb.cs.vu.nl/test https://github.com/viaacode/skoscreator
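
The alignment and cross-collection retrieval ideas in the slides can be sketched in a few lines. This is a toy illustration under stated assumptions, not CultuurLINK's matchers or the Spinque search strategy: the exact-label matcher, the scoring rule (term overlap multiplied by a collection weight), the identifiers such as `vrt:1` and `gtaa:10`, and the sample items are all hypothetical. The `weight` parameter mimics the slider: 1.0 favours Open Images items, 0.0 favours VIAA items.

```python
# Toy sketch: all identifiers, labels and the scoring rule are assumptions.

def normalize(label):
    """Lower-case a label and collapse whitespace before comparing."""
    return " ".join(label.lower().split())


def align_exact_labels(source, target):
    """Pair concepts from two vocabularies whose normalized labels are equal.

    source, target: dicts mapping concept id -> preferred label.
    Returns a list of (source_id, target_id) correspondences.
    """
    index = {}
    for target_id, label in target.items():
        index.setdefault(normalize(label), []).append(target_id)
    pairs = []
    for source_id, label in source.items():
        for target_id in index.get(normalize(label), []):
            pairs.append((source_id, target_id))
    return pairs


def related_items(query_terms, items, alignment, weight=0.5):
    """Rank items sharing terms with the query, directly or via the alignment.

    weight in [0, 1]: multiplier for "openimages" items; other items
    are multiplied by 1 - weight.
    """
    expanded = set(query_terms)
    for source_id, target_id in alignment:
        if source_id in query_terms:
            expanded.add(target_id)
        if target_id in query_terms:
            expanded.add(source_id)
    scored = []
    for item in items:
        overlap = len(expanded & item["terms"])
        factor = weight if item["collection"] == "openimages" else 1.0 - weight
        score = overlap * factor
        if score > 0:
            scored.append((item["id"], score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Made-up vocabularies and items.
vrt = {"vrt:1": "Voetbal", "vrt:2": "Koningshuis"}
gtaa = {"gtaa:10": "voetbal", "gtaa:11": "verkiezingen"}
alignment = align_exact_labels(vrt, gtaa)

items = [
    {"id": "viaa-1", "collection": "viaa", "terms": {"vrt:1"}},
    {"id": "oi-1", "collection": "openimages", "terms": {"gtaa:10"}},
    {"id": "oi-2", "collection": "openimages", "terms": {"gtaa:11"}},
]
print(related_items({"vrt:1"}, items, alignment, weight=1.0))
```

With the slider fully towards Open Images (`weight=1.0`), only items from that collection that share an aligned term with the selected VRT item remain in the ranking.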
