SlideShare a Scribd company logo
1 of 50
Eric Sieverts Media, information & communication Amsterdam University of Applied Sciences  / Section Innovation & Development University Library Utrecht A pair of shoes in the thesaurus reflexions on human and computer indexing Society of Indexers Conference 2010  The challenging future of indexing 30 September 2010, Middelburg
agenda ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
searching in the world of ,[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
searching in the world of ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
search, search, search, search, search, ......  searcher / query documents match the basic search-and-find paradigm Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
search, search, search, ......  validity for free-text matching ? match Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 ,[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],to what query is this Google's answer ?
Want to know something about " hallenkerken "  (Dutch for "hall church") thru Google Books? Google's first hit is a book about building thesauri, containing  the word in a single example of broader and narrower terms
searching in the world of ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
match classical situation with controlled human indexing searcher must enter the "term(s)" that have been used to characterize the subject indexer must assign “correct” terms to characterize the  document  in principle perfect match is possible Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 search, search, search, ......
match not user-friendly:  searcher has to  invent  the correct terms expensive:   indexers must  analyze  the document in order to assign the correct terms however Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 search, search, search, ......  classical situation with controlled human indexing
search in the world of searcher just types some words  (or often only one single word) search system contains (all) the words from the documents themselves often you don't find  all  you need - still satisfied ?  match Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 search, search, search, ......
why still user satisfaction ? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
language technology at searcher side original simple query expanded & disambiguated  statistics generate additional terms to refine queries search system contains just the words from the documents themselves improved queries will result in better  answers ? match Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 search, search, search, ......
language technology for better "query" ,[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
language technology for better "query" ,[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
language technology at the document search with "correct" or “important” terms language technology enriches document  with "correct" term  (from thesaurus)  or derives characteristic terms from the text in principle perfect match is possible match Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 search, search, search, ......
automatic classification
automatic classification or enrichment ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
The Calais Web Service automatically creates rich semantic metadata Named  Entities Facts Events
geographical recognition in Google Books
training a system thesaurus training documents analysis module “ finger- prints” training module enrichment of  thesaurus     Joop van Gent, Irion
classification with system enriched thesaurus new documents analysis module “ finger- prints” classification module    Joop van Gent, Irion enriched documents
endgame tips: checkmate with bishop  and knight (in Dutch:  "horse" ) chess equestrianism
knowledge  organization  systems metadata: more than  keywords or thesauri ?
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],knowledge organization systems Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],knowledge organization systems Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],ontologies Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
relations between some concepts  in a simple "wine ontology"
example of the relations  between concepts about the statue of Balzac by  Rodin  [in CIDOC-CRM]
semantic web
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],ontologies Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
ontology notation for semantic web ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.sieverts@library.uu.nl  |  http://www.library.uu.nl/medew/it/eric  |  [email_address] resource description framework
rdf triples ,[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 graphical representation of simple network of 4 RDF-triples
SKOS-representation of thesaurus term & relations can be described in RDF Term : Economic cooperation  Used For : Economic co-operation  Broader terms : Economic policy  Narrower terms : Economic integration,  European economic cooperation,  European industrial cooperation,  Industrial cooperation  Related terms : Interdependence  Scope Note : Includes cooperative measures  in banking, trade, industry etc., between  and among countries.   Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
SKOS representation in RDF ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
RDF and "linked data" Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
the "linked data cloud" - september 2010 - 24 billion RDF triples online
viaf: virtual  international authority file dbpedia: data  from Wikipedia last.fm: artists geonames: 6.2 M toponyms BBC: wildlife finder LCSH Reuters: openCalais IMDB
topic maps ,[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
topic maps ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
verdi puccini lucca italy italia italië italien tosca madame -butterfly madama -butterfly roma rome occurrences situated in influenced composed location for place of birth simple example of  opera topic-map adopted from  Pepper association types topic types composer opera city country
©  Antony Pitts, Kal Ahmed, MusicDNA Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 topic map application Royal Academy of Music in London developed a model to describe "everything" around music, from  work/composition   to  experience  of a particular performance conceptually similar to relational FRBR model in library world
©  Antony Pitts, Kal Ahmed, MusicDNA
semantic web ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
©   Guus Schreiber UvA / VU rdf annotation of web resource Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
iconclass annotation
"species ontology" Eric Sieverts  |  e.sieverts@library.uu.nl  |  http://www.library.uu.nl/medew/it/eric  |  [email_address] ©   Guus Schreiber UvA / VU
search, search, search, search, search, ......  match ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010

More Related Content

Similar to A pair of shoes in the thesaurus; some reflexions on human and computer indexing

Information research skills for projects and dissertations classics2015
Information research skills for projects and dissertations classics2015Information research skills for projects and dissertations classics2015
Information research skills for projects and dissertations classics2015Royal Holloway University of London
 
10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.BenitoSumpter862
 
10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.SantosConleyha
 
Classics Information research skills for projects and dissertations
Classics Information research skills for projects and dissertationsClassics Information research skills for projects and dissertations
Classics Information research skills for projects and dissertationsRoyal Holloway University of London
 
BIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URI
BIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URIBIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URI
BIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URINicolaie Constantinescu
 
eResources, Literature search and Reference Management Software
eResources, Literature search and Reference Management SoftwareeResources, Literature search and Reference Management Software
eResources, Literature search and Reference Management SoftwareVenkitachalam Sriram
 
Literature searching for your dissertation - Translation 2017
Literature searching for your dissertation - Translation 2017 Literature searching for your dissertation - Translation 2017
Literature searching for your dissertation - Translation 2017 Alex Asman
 
Hector, E. Developing a practical workflow for maintaining online learning ob...
Hector, E. Developing a practical workflow for maintaining online learning ob...Hector, E. Developing a practical workflow for maintaining online learning ob...
Hector, E. Developing a practical workflow for maintaining online learning ob...IL Group (CILIP Information Literacy Group)
 
Open, social and linked - what do current Web trends tell us about the future...
Open, social and linked - what do current Web trends tell us about the future...Open, social and linked - what do current Web trends tell us about the future...
Open, social and linked - what do current Web trends tell us about the future...Andy Powell
 
UNIGIS Annual Conference - Information management VU
UNIGIS Annual Conference - Information management VUUNIGIS Annual Conference - Information management VU
UNIGIS Annual Conference - Information management VUWilma Van Den Brink
 
Using the library for research
Using the library for researchUsing the library for research
Using the library for researchRoddy MacLeod
 
Search engines, e resources, and search strategy
Search engines, e resources, and search strategySearch engines, e resources, and search strategy
Search engines, e resources, and search strategyDr. Dirgha Raj joshi
 
Rscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsRscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsSusanMRob
 
Thinking about technology .... differently
Thinking about technology .... differentlyThinking about technology .... differently
Thinking about technology .... differentlylisld
 
Towards a digital library for York
Towards a digital library for YorkTowards a digital library for York
Towards a digital library for YorkJulie Allinson
 
Bibliotheek & Onderzoek 2.0?
Bibliotheek & Onderzoek 2.0?Bibliotheek & Onderzoek 2.0?
Bibliotheek & Onderzoek 2.0?Guus van den Brekel
 
Organise your life and create frameworks with a digital library (schoolnetsa11)
Organise your life and create frameworks with a digital library (schoolnetsa11)Organise your life and create frameworks with a digital library (schoolnetsa11)
Organise your life and create frameworks with a digital library (schoolnetsa11)Maggie Verster
 
Challenges for PLE research and development
Challenges for PLE research and developmentChallenges for PLE research and development
Challenges for PLE research and developmentMarco Kalz
 
The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...heila1
 

Similar to A pair of shoes in the thesaurus; some reflexions on human and computer indexing (20)

Information research skills for projects and dissertations classics2015
Information research skills for projects and dissertations classics2015Information research skills for projects and dissertations classics2015
Information research skills for projects and dissertations classics2015
 
020610
020610020610
020610
 
10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.
 
10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.
 
Classics Information research skills for projects and dissertations
Classics Information research skills for projects and dissertationsClassics Information research skills for projects and dissertations
Classics Information research skills for projects and dissertations
 
BIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URI
BIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URIBIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URI
BIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URI
 
eResources, Literature search and Reference Management Software
eResources, Literature search and Reference Management SoftwareeResources, Literature search and Reference Management Software
eResources, Literature search and Reference Management Software
 
Literature searching for your dissertation - Translation 2017
Literature searching for your dissertation - Translation 2017 Literature searching for your dissertation - Translation 2017
Literature searching for your dissertation - Translation 2017
 
Hector, E. Developing a practical workflow for maintaining online learning ob...
Hector, E. Developing a practical workflow for maintaining online learning ob...Hector, E. Developing a practical workflow for maintaining online learning ob...
Hector, E. Developing a practical workflow for maintaining online learning ob...
 
Open, social and linked - what do current Web trends tell us about the future...
Open, social and linked - what do current Web trends tell us about the future...Open, social and linked - what do current Web trends tell us about the future...
Open, social and linked - what do current Web trends tell us about the future...
 
UNIGIS Annual Conference - Information management VU
UNIGIS Annual Conference - Information management VUUNIGIS Annual Conference - Information management VU
UNIGIS Annual Conference - Information management VU
 
Using the library for research
Using the library for researchUsing the library for research
Using the library for research
 
Search engines, e resources, and search strategy
Search engines, e resources, and search strategySearch engines, e resources, and search strategy
Search engines, e resources, and search strategy
 
Rscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsRscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libs
 
Thinking about technology .... differently
Thinking about technology .... differentlyThinking about technology .... differently
Thinking about technology .... differently
 
Towards a digital library for York
Towards a digital library for YorkTowards a digital library for York
Towards a digital library for York
 
Bibliotheek & Onderzoek 2.0?
Bibliotheek & Onderzoek 2.0?Bibliotheek & Onderzoek 2.0?
Bibliotheek & Onderzoek 2.0?
 
Organise your life and create frameworks with a digital library (schoolnetsa11)
Organise your life and create frameworks with a digital library (schoolnetsa11)Organise your life and create frameworks with a digital library (schoolnetsa11)
Organise your life and create frameworks with a digital library (schoolnetsa11)
 
Challenges for PLE research and development
Challenges for PLE research and developmentChallenges for PLE research and development
Challenges for PLE research and development
 
The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...
 

More from Eric Sieverts

Automatische classificatie
Automatische classificatieAutomatische classificatie
Automatische classificatieEric Sieverts
 
Een andere blik op Google
Een andere blik op GoogleEen andere blik op Google
Een andere blik op GoogleEric Sieverts
 
Searching the internet - what patent searchers should know
Searching the internet - what patent searchers should knowSearching the internet - what patent searchers should know
Searching the internet - what patent searchers should knowEric Sieverts
 
Wij zullen vinden - ook in 2023
Wij zullen vinden - ook in 2023Wij zullen vinden - ook in 2023
Wij zullen vinden - ook in 2023Eric Sieverts
 
Zoekmachines weten het antwoord
Zoekmachines weten het antwoordZoekmachines weten het antwoord
Zoekmachines weten het antwoordEric Sieverts
 
Vertrouwen op semantische zoeksystemen of zelf aan het stuur
Vertrouwen op semantische zoeksystemen of zelf aan het stuurVertrouwen op semantische zoeksystemen of zelf aan het stuur
Vertrouwen op semantische zoeksystemen of zelf aan het stuurEric Sieverts
 
Semantisch zoeken in een webomgeving
Semantisch zoeken in een webomgevingSemantisch zoeken in een webomgeving
Semantisch zoeken in een webomgevingEric Sieverts
 
Information Retrieval: van specialisme tot commodity
Information Retrieval: van specialisme tot commodityInformation Retrieval: van specialisme tot commodity
Information Retrieval: van specialisme tot commodityEric Sieverts
 
Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...
Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...
Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...Eric Sieverts
 
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.Eric Sieverts
 
Searching the internet - better with Google / Google not always best
Searching the internet - better with Google / Google not always bestSearching the internet - better with Google / Google not always best
Searching the internet - better with Google / Google not always bestEric Sieverts
 
Searching the internet - what patent searchers should know
Searching the internet - what patent searchers should knowSearching the internet - what patent searchers should know
Searching the internet - what patent searchers should knowEric Sieverts
 
Zin en onzin van metadata
Zin en onzin van metadataZin en onzin van metadata
Zin en onzin van metadataEric Sieverts
 
40 jaar informatiegebruik
40 jaar informatiegebruik40 jaar informatiegebruik
40 jaar informatiegebruikEric Sieverts
 
UBU 3.0: semantisch web & linked data voor de UB?
UBU 3.0: semantisch web & linked data voor de UB?UBU 3.0: semantisch web & linked data voor de UB?
UBU 3.0: semantisch web & linked data voor de UB?Eric Sieverts
 
Metadata, standaarden, interoperabiliteit, semantisch web en linked data
Metadata, standaarden, interoperabiliteit, semantisch web en linked dataMetadata, standaarden, interoperabiliteit, semantisch web en linked data
Metadata, standaarden, interoperabiliteit, semantisch web en linked dataEric Sieverts
 
Een digitale bibliotheek of alleen Google?
Een digitale bibliotheek of alleen Google?Een digitale bibliotheek of alleen Google?
Een digitale bibliotheek of alleen Google?Eric Sieverts
 
Project Panorama: vistas on validated information
Project Panorama: vistas on validated informationProject Panorama: vistas on validated information
Project Panorama: vistas on validated informationEric Sieverts
 
Lifehacking met RSS en Netvibes? De strijd tegen informatie overload
Lifehacking met RSS en Netvibes? De strijd tegen informatie overloadLifehacking met RSS en Netvibes? De strijd tegen informatie overload
Lifehacking met RSS en Netvibes? De strijd tegen informatie overloadEric Sieverts
 

More from Eric Sieverts (20)

Automatische classificatie
Automatische classificatieAutomatische classificatie
Automatische classificatie
 
Een andere blik op Google
Een andere blik op GoogleEen andere blik op Google
Een andere blik op Google
 
Searching the internet - what patent searchers should know
Searching the internet - what patent searchers should knowSearching the internet - what patent searchers should know
Searching the internet - what patent searchers should know
 
Wij zullen vinden - ook in 2023
Wij zullen vinden - ook in 2023Wij zullen vinden - ook in 2023
Wij zullen vinden - ook in 2023
 
Zoekmachines weten het antwoord
Zoekmachines weten het antwoordZoekmachines weten het antwoord
Zoekmachines weten het antwoord
 
Vertrouwen op semantische zoeksystemen of zelf aan het stuur
Vertrouwen op semantische zoeksystemen of zelf aan het stuurVertrouwen op semantische zoeksystemen of zelf aan het stuur
Vertrouwen op semantische zoeksystemen of zelf aan het stuur
 
Semantisch zoeken in een webomgeving
Semantisch zoeken in een webomgevingSemantisch zoeken in een webomgeving
Semantisch zoeken in een webomgeving
 
Information Retrieval: van specialisme tot commodity
Information Retrieval: van specialisme tot commodityInformation Retrieval: van specialisme tot commodity
Information Retrieval: van specialisme tot commodity
 
Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...
Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...
Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...
 
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
 
Searching the internet - better with Google / Google not always best
Searching the internet - better with Google / Google not always bestSearching the internet - better with Google / Google not always best
Searching the internet - better with Google / Google not always best
 
Searching the internet - what patent searchers should know
Searching the internet - what patent searchers should knowSearching the internet - what patent searchers should know
Searching the internet - what patent searchers should know
 
Zin en onzin van metadata
Zin en onzin van metadataZin en onzin van metadata
Zin en onzin van metadata
 
40 jaar informatiegebruik
40 jaar informatiegebruik40 jaar informatiegebruik
40 jaar informatiegebruik
 
UBU 3.0: semantisch web & linked data voor de UB?
UBU 3.0: semantisch web & linked data voor de UB?UBU 3.0: semantisch web & linked data voor de UB?
UBU 3.0: semantisch web & linked data voor de UB?
 
Metadata, standaarden, interoperabiliteit, semantisch web en linked data
Metadata, standaarden, interoperabiliteit, semantisch web en linked dataMetadata, standaarden, interoperabiliteit, semantisch web en linked data
Metadata, standaarden, interoperabiliteit, semantisch web en linked data
 
Searchtrends
SearchtrendsSearchtrends
Searchtrends
 
Een digitale bibliotheek of alleen Google?
Een digitale bibliotheek of alleen Google?Een digitale bibliotheek of alleen Google?
Een digitale bibliotheek of alleen Google?
 
Project Panorama: vistas on validated information
Project Panorama: vistas on validated informationProject Panorama: vistas on validated information
Project Panorama: vistas on validated information
 
Lifehacking met RSS en Netvibes? De strijd tegen informatie overload
Lifehacking met RSS en Netvibes? De strijd tegen informatie overloadLifehacking met RSS en Netvibes? De strijd tegen informatie overload
Lifehacking met RSS en Netvibes? De strijd tegen informatie overload
 

Recently uploaded

Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
USPSÂź Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPSÂź Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPSÂź Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPSÂź Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
HỌC TỐT TIáșŸNG ANH 11 THEO CHÆŻÆ NG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIáșŸT - Cáșą NĂ...
HỌC TỐT TIáșŸNG ANH 11 THEO CHÆŻÆ NG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIáșŸT - Cáșą NĂ...HỌC TỐT TIáșŸNG ANH 11 THEO CHÆŻÆ NG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIáșŸT - Cáșą NĂ...
HỌC TỐT TIáșŸNG ANH 11 THEO CHÆŻÆ NG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIáșŸT - Cáșą NĂ...Nguyen Thanh Tu Collection
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 

Recently uploaded (20)

Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
USPSÂź Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPSÂź Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPSÂź Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPSÂź Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
HỌC TỐT TIáșŸNG ANH 11 THEO CHÆŻÆ NG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIáșŸT - Cáșą NĂ...
HỌC TỐT TIáșŸNG ANH 11 THEO CHÆŻÆ NG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIáșŸT - Cáșą NĂ...HỌC TỐT TIáșŸNG ANH 11 THEO CHÆŻÆ NG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIáșŸT - Cáșą NĂ...
HỌC TỐT TIáșŸNG ANH 11 THEO CHÆŻÆ NG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIáșŸT - Cáșą NĂ...
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 

A pair of shoes in the thesaurus; some reflexions on human and computer indexing

  • 1. Eric Sieverts Media, information & communication Amsterdam University of Applied Sciences / Section Innovation & Development University Library Utrecht A pair of shoes in the thesaurus reflexions on human and computer indexing Society of Indexers Conference 2010 The challenging future of indexing 30 September 2010, Middelburg
  • 2.
  • 3.
  • 4.
  • 5. search, search, search, search, search, ...... searcher / query documents match the basic search-and-find paradigm Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010
  • 6.
  • 7.
  • 8. Want to know something about " hallenkerken " (Dutch for "hall church") thru Google Books? Google's first hit is a book about building thesauri, containing the word in a single example of broader and narrower terms
  • 9.
  • 10. match classical situation with controlled human indexing searcher must enter the "term(s)" that have been used to characterize the subject indexer must assign “correct” terms to characterize the document in principle perfect match is possible Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 search, search, search, ......
  • 11. match not user-friendly: searcher has to invent the correct terms expensive: indexers must analyze the document in order to assign the correct terms however Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 search, search, search, ...... classical situation with controlled human indexing
  • 12. search in the world of searcher just types some words (or often only one single word) search system contains (all) the words from the documents themselves often you don't find all you need - still satisfied ? match Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 search, search, search, ......
  • 13.
  • 14. language technology at searcher side original simple query expanded & disambiguated statistics generate additional terms to refine queries search system contains just the words from the documents themselves improved queries will result in better answers ? match Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 search, search, search, ......
  • 15.
  • 16.
  • 17. language technology at the document search with "correct" or “important” terms language technology enriches document with "correct" term (from thesaurus) or derives characteristic terms from the text in principle perfect match is possible match Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 search, search, search, ......
  • 19.
  • 20. The Calais Web Service automatically creates rich semantic metadata Named Entities Facts Events
  • 22. training a system thesaurus training documents analysis module “ finger- prints” training module enrichment of thesaurus  Joop van Gent, Irion
  • 23. classification with system enriched thesaurus new documents analysis module “ finger- prints” classification module  Joop van Gent, Irion enriched documents
  • 24. endgame tips: checkmate with bishop and knight (in Dutch: "horse" ) chess equestrianism
  • 25. knowledge organization systems metadata: more than keywords or thesauri ?
  • 26.
  • 27.
  • 28.
  • 29. relations between some concepts in a simple "wine ontology"
  • 30. example of the relations between concepts about the statue of Balzac by Rodin [in CIDOC-CRM]
  • 32.
  • 33.
  • 34.
  • 35.
  • 36. SKOS-representation of thesaurus term & relations can be described in RDF Term : Economic cooperation Used For : Economic co-operation Broader terms : Economic policy Narrower terms : Economic integration, European economic cooperation, European industrial cooperation, Industrial cooperation Related terms : Interdependence Scope Note : Includes cooperative measures in banking, trade, industry etc., between and among countries. Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010
  • 37.
  • 38.
  • 39. the "linked data cloud" - september 2010 - 24 billion RDF triples online
  • 40. viaf: virtual international authority file dbpedia: data from Wikipedia last.fm: artists geonames: 6.2 M toponyms BBC: wildlife finder LCSH Reuters: openCalais IMDB
  • 41.
  • 42.
  • 43. verdi puccini lucca italy italia italiĂ« italien tosca madame -butterfly madama -butterfly roma rome occurrences situated in influenced composed location for place of birth simple example of opera topic-map adopted from Pepper association types topic types composer opera city country
  • 44. © Antony Pitts, Kal Ahmed, MusicDNA Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 topic map application Royal Academy of Music in London developed a model to describe "everything" around music, from work/composition to experience of a particular performance conceptually similar to relational FRBR model in library world
  • 45. © Antony Pitts, Kal Ahmed, MusicDNA
  • 46.
  • 47. © Guus Schreiber UvA / VU rdf annotation of web resource Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010
  • 49. "species ontology" Eric Sieverts | e.sieverts@library.uu.nl | http://www.library.uu.nl/medew/it/eric | [email_address] © Guus Schreiber UvA / VU
  • 50.