SlideShare a Scribd company logo
1 of 50
Eric Sieverts Media, information & communication Amsterdam University of Applied Sciences  / Section Innovation & Development University Library Utrecht A pair of shoes in the thesaurus reflexions on human and computer indexing Society of Indexers Conference 2010  The challenging future of indexing 30 September 2010, Middelburg
agenda ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
searching in the world of ,[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
searching in the world of ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
search, search, search, search, search, ......  searcher / query documents match the basic search-and-find paradigm Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
search, search, search, ......  validity for free-text matching ? match Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 ,[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],to what query is this Google's answer ?
Want to know something about " hallenkerken "  (Dutch for "hall church") thru Google Books? Google's first hit is a book about building thesauri, containing  the word in a single example of broader and narrower terms
searching in the world of ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
match classical situation with controlled human indexing searcher must enter the "term(s)" that have been used to characterize the subject indexer must assign “correct” terms to characterize the  document  in principle perfect match is possible Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 search, search, search, ......
match not user-friendly:  searcher has to  invent  the correct terms expensive:   indexers must  analyze  the document in order to assign the correct terms however Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 search, search, search, ......  classical situation with controlled human indexing
search in the world of searcher just types some words  (or often only one single word) search system contains (all) the words from the documents themselves often you don't find  all  you need - still satisfied ?  match Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 search, search, search, ......
why still user satisfaction ? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
language technology at searcher side original simple query expanded & disambiguated  statistics generate additional terms to refine queries search system contains just the words from the documents themselves improved queries will result in better  answers ? match Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 search, search, search, ......
language technology for better "query" ,[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
language technology for better "query" ,[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
language technology at the document search with "correct" or “important” terms language technology enriches document  with "correct" term  (from thesaurus)  or derives characteristic terms from the text in principle perfect match is possible match Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 search, search, search, ......
automatic classification
automatic classification or enrichment ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
The Calais Web Service automatically creates rich semantic metadata Named  Entities Facts Events
geographical recognition in Google Books
training a system thesaurus training documents analysis module “ finger- prints” training module enrichment of  thesaurus     Joop van Gent, Irion
classification with system enriched thesaurus new documents analysis module “ finger- prints” classification module    Joop van Gent, Irion enriched documents
endgame tips: checkmate with bishop  and knight (in Dutch:  "horse" ) chess equestrianism
knowledge  organization  systems metadata: more than  keywords or thesauri ?
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],knowledge organization systems Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],knowledge organization systems Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],ontologies Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
relations between some concepts  in a simple "wine ontology"
example of the relations  between concepts about the statue of Balzac by  Rodin  [in CIDOC-CRM]
semantic web
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],ontologies Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
ontology notation for semantic web ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.sieverts@library.uu.nl  |  http://www.library.uu.nl/medew/it/eric  |  [email_address] resource description framework
rdf triples ,[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 graphical representation of simple network of 4 RDF-triples
SKOS-representation of thesaurus term & relations can be described in RDF Term : Economic cooperation  Used For : Economic co-operation  Broader terms : Economic policy  Narrower terms : Economic integration,  European economic cooperation,  European industrial cooperation,  Industrial cooperation  Related terms : Interdependence  Scope Note : Includes cooperative measures  in banking, trade, industry etc., between  and among countries.   Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
SKOS representation in RDF ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
RDF and "linked data" Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
the "linked data cloud" - september 2010 - 24 billion RDF triples online
viaf: virtual  international authority file dbpedia: data  from Wikipedia last.fm: artists geonames: 6.2 M toponyms BBC: wildlife finder LCSH Reuters: openCalais IMDB
topic maps ,[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
topic maps ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
verdi puccini lucca italy italia italië italien tosca madame -butterfly madama -butterfly roma rome occurrences situated in influenced composed location for place of birth simple example of  opera topic-map adopted from  Pepper association types topic types composer opera city country
©  Antony Pitts, Kal Ahmed, MusicDNA Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010 topic map application Royal Academy of Music in London developed a model to describe "everything" around music, from  work/composition   to  experience  of a particular performance conceptually similar to relational FRBR model in library world
©  Antony Pitts, Kal Ahmed, MusicDNA
semantic web ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
©   Guus Schreiber UvA / VU rdf annotation of web resource Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010
iconclass annotation
"species ontology" Eric Sieverts  |  e.sieverts@library.uu.nl  |  http://www.library.uu.nl/medew/it/eric  |  [email_address] ©   Guus Schreiber UvA / VU
search, search, search, search, search, ......  match ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Eric Sieverts  |  e.g.sieverts@uu.nl  |  e.g.sieverts@hva.nl  |  http://www.library.uu.nl/medew/it/eric  |  Middelburg 30-9-2010

More Related Content

Similar to A pair of shoes in the thesaurus; some reflexions on human and computer indexing

10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.
BenitoSumpter862
 
10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.
SantosConleyha
 
Classics Information research skills for projects and dissertations
Classics Information research skills for projects and dissertationsClassics Information research skills for projects and dissertations
Classics Information research skills for projects and dissertations
Royal Holloway University of London
 
Organise your life and create frameworks with a digital library (schoolnetsa11)
Organise your life and create frameworks with a digital library (schoolnetsa11)Organise your life and create frameworks with a digital library (schoolnetsa11)
Organise your life and create frameworks with a digital library (schoolnetsa11)
Maggie Verster
 

Similar to A pair of shoes in the thesaurus; some reflexions on human and computer indexing (20)

Information research skills for projects and dissertations classics2015
Information research skills for projects and dissertations classics2015Information research skills for projects and dissertations classics2015
Information research skills for projects and dissertations classics2015
 
020610
020610020610
020610
 
10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.
 
10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.10242021 Printhttpscontent.uagc.eduprintWinckelman.
10242021 Printhttpscontent.uagc.eduprintWinckelman.
 
Classics Information research skills for projects and dissertations
Classics Information research skills for projects and dissertationsClassics Information research skills for projects and dissertations
Classics Information research skills for projects and dissertations
 
BIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URI
BIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URIBIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URI
BIBLIOTECARII MANAGERI AI DATELOR, BIBLIOTECILE API-URI
 
eResources, Literature search and Reference Management Software
eResources, Literature search and Reference Management SoftwareeResources, Literature search and Reference Management Software
eResources, Literature search and Reference Management Software
 
Literature searching for your dissertation - Translation 2017
Literature searching for your dissertation - Translation 2017 Literature searching for your dissertation - Translation 2017
Literature searching for your dissertation - Translation 2017
 
Hector, E. Developing a practical workflow for maintaining online learning ob...
Hector, E. Developing a practical workflow for maintaining online learning ob...Hector, E. Developing a practical workflow for maintaining online learning ob...
Hector, E. Developing a practical workflow for maintaining online learning ob...
 
Open, social and linked - what do current Web trends tell us about the future...
Open, social and linked - what do current Web trends tell us about the future...Open, social and linked - what do current Web trends tell us about the future...
Open, social and linked - what do current Web trends tell us about the future...
 
UNIGIS Annual Conference - Information management VU
UNIGIS Annual Conference - Information management VUUNIGIS Annual Conference - Information management VU
UNIGIS Annual Conference - Information management VU
 
Using the library for research
Using the library for researchUsing the library for research
Using the library for research
 
Search engines, e resources, and search strategy
Search engines, e resources, and search strategySearch engines, e resources, and search strategy
Search engines, e resources, and search strategy
 
Rscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsRscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libs
 
Thinking about technology .... differently
Thinking about technology .... differentlyThinking about technology .... differently
Thinking about technology .... differently
 
Towards a digital library for York
Towards a digital library for YorkTowards a digital library for York
Towards a digital library for York
 
Bibliotheek & Onderzoek 2.0?
Bibliotheek & Onderzoek 2.0?Bibliotheek & Onderzoek 2.0?
Bibliotheek & Onderzoek 2.0?
 
Organise your life and create frameworks with a digital library (schoolnetsa11)
Organise your life and create frameworks with a digital library (schoolnetsa11)Organise your life and create frameworks with a digital library (schoolnetsa11)
Organise your life and create frameworks with a digital library (schoolnetsa11)
 
Challenges for PLE research and development
Challenges for PLE research and developmentChallenges for PLE research and development
Challenges for PLE research and development
 
The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...
 

More from Eric Sieverts

Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
Eric Sieverts
 
Searching the internet - what patent searchers should know
Searching the internet - what patent searchers should knowSearching the internet - what patent searchers should know
Searching the internet - what patent searchers should know
Eric Sieverts
 
Lifehacking met RSS en Netvibes? De strijd tegen informatie overload
Lifehacking met RSS en Netvibes? De strijd tegen informatie overloadLifehacking met RSS en Netvibes? De strijd tegen informatie overload
Lifehacking met RSS en Netvibes? De strijd tegen informatie overload
Eric Sieverts
 

More from Eric Sieverts (20)

Automatische classificatie
Automatische classificatieAutomatische classificatie
Automatische classificatie
 
Een andere blik op Google
Een andere blik op GoogleEen andere blik op Google
Een andere blik op Google
 
Searching the internet - what patent searchers should know
Searching the internet - what patent searchers should knowSearching the internet - what patent searchers should know
Searching the internet - what patent searchers should know
 
Wij zullen vinden - ook in 2023
Wij zullen vinden - ook in 2023Wij zullen vinden - ook in 2023
Wij zullen vinden - ook in 2023
 
Zoekmachines weten het antwoord
Zoekmachines weten het antwoordZoekmachines weten het antwoord
Zoekmachines weten het antwoord
 
Vertrouwen op semantische zoeksystemen of zelf aan het stuur
Vertrouwen op semantische zoeksystemen of zelf aan het stuurVertrouwen op semantische zoeksystemen of zelf aan het stuur
Vertrouwen op semantische zoeksystemen of zelf aan het stuur
 
Semantisch zoeken in een webomgeving
Semantisch zoeken in een webomgevingSemantisch zoeken in een webomgeving
Semantisch zoeken in een webomgeving
 
Information Retrieval: van specialisme tot commodity
Information Retrieval: van specialisme tot commodityInformation Retrieval: van specialisme tot commodity
Information Retrieval: van specialisme tot commodity
 
Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...
Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...
Semantisch Zoeken - knowledge graph, semantisch web, linked data, rdf, ontolo...
 
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
Semantisch zoeken - over knowledge graph, semantisch web, rdf enz.
 
Searching the internet - better with Google / Google not always best
Searching the internet - better with Google / Google not always bestSearching the internet - better with Google / Google not always best
Searching the internet - better with Google / Google not always best
 
Searching the internet - what patent searchers should know
Searching the internet - what patent searchers should knowSearching the internet - what patent searchers should know
Searching the internet - what patent searchers should know
 
Zin en onzin van metadata
Zin en onzin van metadataZin en onzin van metadata
Zin en onzin van metadata
 
40 jaar informatiegebruik
40 jaar informatiegebruik40 jaar informatiegebruik
40 jaar informatiegebruik
 
UBU 3.0: semantisch web & linked data voor de UB?
UBU 3.0: semantisch web & linked data voor de UB?UBU 3.0: semantisch web & linked data voor de UB?
UBU 3.0: semantisch web & linked data voor de UB?
 
Metadata, standaarden, interoperabiliteit, semantisch web en linked data
Metadata, standaarden, interoperabiliteit, semantisch web en linked dataMetadata, standaarden, interoperabiliteit, semantisch web en linked data
Metadata, standaarden, interoperabiliteit, semantisch web en linked data
 
Searchtrends
SearchtrendsSearchtrends
Searchtrends
 
Een digitale bibliotheek of alleen Google?
Een digitale bibliotheek of alleen Google?Een digitale bibliotheek of alleen Google?
Een digitale bibliotheek of alleen Google?
 
Project Panorama: vistas on validated information
Project Panorama: vistas on validated informationProject Panorama: vistas on validated information
Project Panorama: vistas on validated information
 
Lifehacking met RSS en Netvibes? De strijd tegen informatie overload
Lifehacking met RSS en Netvibes? De strijd tegen informatie overloadLifehacking met RSS en Netvibes? De strijd tegen informatie overload
Lifehacking met RSS en Netvibes? De strijd tegen informatie overload
 

Recently uploaded

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Recently uploaded (20)

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 

A pair of shoes in the thesaurus; some reflexions on human and computer indexing

  • 1. Eric Sieverts Media, information & communication Amsterdam University of Applied Sciences / Section Innovation & Development University Library Utrecht A pair of shoes in the thesaurus reflexions on human and computer indexing Society of Indexers Conference 2010 The challenging future of indexing 30 September 2010, Middelburg
  • 2.
  • 3.
  • 4.
  • 5. search, search, search, search, search, ...... searcher / query documents match the basic search-and-find paradigm Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010
  • 6.
  • 7.
  • 8. Want to know something about " hallenkerken " (Dutch for "hall church") thru Google Books? Google's first hit is a book about building thesauri, containing the word in a single example of broader and narrower terms
  • 9.
  • 10. match classical situation with controlled human indexing searcher must enter the "term(s)" that have been used to characterize the subject indexer must assign “correct” terms to characterize the document in principle perfect match is possible Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 search, search, search, ......
  • 11. match not user-friendly: searcher has to invent the correct terms expensive: indexers must analyze the document in order to assign the correct terms however Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 search, search, search, ...... classical situation with controlled human indexing
  • 12. search in the world of searcher just types some words (or often only one single word) search system contains (all) the words from the documents themselves often you don't find all you need - still satisfied ? match Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 search, search, search, ......
  • 13.
  • 14. language technology at searcher side original simple query expanded & disambiguated statistics generate additional terms to refine queries search system contains just the words from the documents themselves improved queries will result in better answers ? match Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 search, search, search, ......
  • 15.
  • 16.
  • 17. language technology at the document search with "correct" or “important” terms language technology enriches document with "correct" term (from thesaurus) or derives characteristic terms from the text in principle perfect match is possible match Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 search, search, search, ......
  • 19.
  • 20. The Calais Web Service automatically creates rich semantic metadata Named Entities Facts Events
  • 22. training a system thesaurus training documents analysis module “ finger- prints” training module enrichment of thesaurus  Joop van Gent, Irion
  • 23. classification with system enriched thesaurus new documents analysis module “ finger- prints” classification module  Joop van Gent, Irion enriched documents
  • 24. endgame tips: checkmate with bishop and knight (in Dutch: "horse" ) chess equestrianism
  • 25. knowledge organization systems metadata: more than keywords or thesauri ?
  • 26.
  • 27.
  • 28.
  • 29. relations between some concepts in a simple "wine ontology"
  • 30. example of the relations between concepts about the statue of Balzac by Rodin [in CIDOC-CRM]
  • 32.
  • 33.
  • 34.
  • 35.
  • 36. SKOS-representation of thesaurus term & relations can be described in RDF Term : Economic cooperation Used For : Economic co-operation Broader terms : Economic policy Narrower terms : Economic integration, European economic cooperation, European industrial cooperation, Industrial cooperation Related terms : Interdependence Scope Note : Includes cooperative measures in banking, trade, industry etc., between and among countries. Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010
  • 37.
  • 38.
  • 39. the "linked data cloud" - september 2010 - 24 billion RDF triples online
  • 40. viaf: virtual international authority file dbpedia: data from Wikipedia last.fm: artists geonames: 6.2 M toponyms BBC: wildlife finder LCSH Reuters: openCalais IMDB
  • 41.
  • 42.
  • 43. verdi puccini lucca italy italia italië italien tosca madame -butterfly madama -butterfly roma rome occurrences situated in influenced composed location for place of birth simple example of opera topic-map adopted from Pepper association types topic types composer opera city country
  • 44. © Antony Pitts, Kal Ahmed, MusicDNA Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010 topic map application Royal Academy of Music in London developed a model to describe "everything" around music, from work/composition to experience of a particular performance conceptually similar to relational FRBR model in library world
  • 45. © Antony Pitts, Kal Ahmed, MusicDNA
  • 46.
  • 47. © Guus Schreiber UvA / VU rdf annotation of web resource Eric Sieverts | e.g.sieverts@uu.nl | e.g.sieverts@hva.nl | http://www.library.uu.nl/medew/it/eric | Middelburg 30-9-2010
  • 49. "species ontology" Eric Sieverts | e.sieverts@library.uu.nl | http://www.library.uu.nl/medew/it/eric | [email_address] © Guus Schreiber UvA / VU
  • 50.