How can library materials be ranked in the OPAC?

Dirk Lewandowski
Dirk LewandowskiProfessor at Hamburg University of Applied Sciences
How can library materials be ranked in the
OPAC?
Prof. Dr. Dirk Lewandowski
University of Applied Sciences Hamburg
dirk.lewandowski@haw-hamburg.de


9th International Bielefeld Conference
Bielefeld, 4 February 2009
Agenda




 The state of the OPAC and the importance of relevance ranking

 Ranking factors

 The composition of results lists

 Conclusions




1 | Dirk Lewandowski
Agenda




 The state of the OPAC and the importance of relevance ranking

 Ranking factors

 The composition of results lists

 Conclusions




2 | Dirk Lewandowski
What’s wrong with library catalogueues?




 • catalogueues are incomplete
     – Items from journal article collections, abstracting and indexing databases

 • “Electronic card catalogueue”?

 • User behaviour changed
    – Short queries, fast results, one set of results
    – Search engines strongly influence users’ demands

 • Known item vs. topic-based search
    – OPACs should accomodate both.




3 | Dirk Lewandowski
Some ideas to improve the OPAC (“catalogue 2.0”)




 • Let users participate
    – Write reviews
    – Rate titles

 • Enrich bibliographic data
    – Add reviews
    – Add TOC

 • Improve navigation
     – Drill-down menus on results pages to combine searching and browsing

 • Extend the database
    – Federated search

4 | Dirk Lewandowski
Core of all search appliances: Relevance ranking




 • While Web 2.0 features add value to the catalogue, search is still the core.

 • “Search must work”

 • Users’ needs
    – Users want results quickly.
    – Users are not willing to think too much about formulating their queries.
    – Users are not willing to search for the right database before conducting their
      search.
    – Users are only willing to view a few results on the first results page before
      deciding to continue.




5 | Dirk Lewandowski
Misconceptions about relevance ranking




 • A clear sorting criterion is better than relevance ranking.
    – Ranking does not reduce the number of results, but puts them in a certain order.
    – Other ordering options can be given.

 • Library catalogues do not apply any form of ranking.
     – Even conventional OPACs rank the results (according to publication date).

 • Relevance ranking is useless because it simply doesn’t work.
    – “Relevance” is hard to determine and depends on the context and on the
      individual user. However, a good relevance ranking can at least produce sufficient
      results lists.

 • Ranking is not that complicated. One must only apply standard measures such
   as TF/IDF.
     – For a good ranking, text matching alone is insufficient.

6 | Dirk Lewandowski
Agenda




 The state of the OPAC and the importance of relevance ranking

 Ranking factors

 The composition of results lists

 Conclusions




7 | Dirk Lewandowski
Ranking factors in web search engines



 • Text matching
    – Measures matching between query and document.
    – Term frequency, position of search terms within the documents, etc.
    – Text from document fulltexts, anchor texts.

 • Popularity
    – Measures popularity of the document (overall popularity or topic-based)
    – Link popularity (PageRank etc.), click popularity.

 • Freshness
     – Fresh documents can sometimes be very useful.
     – Derived from documents or from structural data (e.g., linkage)

 • Locality
    – Mainly expressed in differing rankings for country-specific search interfaces.

8 | Dirk Lewandowski: How can library materials be ranked in the OPAC?
         Lewandowski
Text matching




 • Factors
    – Term frequency, inverted document frequency
    – Fields: Title, subject headings, author, etc.

 • Availability of text elements as a ranking factor
    – Fulltext, TOC, reviews, user comments

 • Problems with text matching
    – Not enough text in metadata.
    – Amount of text varies considerably (from mere bibliographic data to hundreds of
      pages of fulltext).




9 | Dirk Lewandowski
Popularity




 • Popularity of
    – Item
    – Author/editor
    – Publisher
    – Book series

 • Measures
    – Number of items (by author, publisher, etc.)
    – Usage (circulation rate, download requests)
    – Average user rating
    – Citations




10 | Dirk Lewandowski
Freshness




 • Freshness is the most-used ranking criterion in catalogues today.

 • It is often difficult to determine whether fresh items will be relevant to a certain
   query.

 • Need for fresh items can be derived from
    – Circulation rate for the individual item
    – Circulation rates for items from a certain group (from broad disciplines to specific
      subject headings)




11 | Dirk Lewandowski
Locality




 • Availability of item
    – from the local library; within a certain distance.
    – Item currently available.

 • Physical location of the user
    – At home (electronic items strongly preferred)
    – At the library




12 | Dirk Lewandowski
Other ranking factors




 • Size of item (no. of pages)

 • Document types
    – Monograph, edited book, proceedings, etc.
    – Article vs. Book
    – Physical vs. online materials

 • User group
    – Professor, undergraduate student, graduate student, etc.

 • Personalization
    – Individual usage data
    – Click-stream data from navigation

13 | Dirk Lewandowski
Agenda




 The state of the OPAC and the importance of relevance ranking

 Ranking factors

 The composition of results lists

 Conclusions




14 | Dirk Lewandowski
Data needed




 • Data from the catalogue

 • Circulation data
    – Anonymous

 • Location data
    – From IP ranges

 • User data

 • Data from remote resources
    – Abstracts (and fulltexts) from publishers.


15 | Dirk Lewandowski
Collections and databases




 • Library controlled
     – catalogue
     – Local digital repositories
     – Course management systems
     – Institutional web sites

 • External collections
    – A&I databases
    – E-journal collections




16 | Dirk Lewandowski
Mixed results lists




 • Ranking algorithms prefer “more of the same”. This does not satisfy users’
   needs for a variety of results.

 • Example for a broad query
    – Reference works (from subject headings + items from reference collection)
    – Text books
    – Relevant databases
    – Some current items
    – Relevant journals




17 | Dirk Lewandowski
“Universal Search”


                                                                                  Additional databases

 • x




                                                                          One box results (e.g., news or images)




18 | Dirk Lewandowski: How can library materials be ranked in the OPAC?
          Lewandowski
Agenda




 The state of the OPAC and the importance of relevance ranking

 Ranking factors

 The composition of results lists

 Conclusions




19 | Dirk Lewandowski
Conclusions




 • Search is the core of the library catalogue.
    – However, other elements must be considered, too:
          – Usability
          – User guidance
          – Spelling corrections
          – etc.

 • A good ranking is always a mixture of ranking factors

 • In addition, results lists should be mixed.
     – Items from different collections.
     – Mixture of direct results and pointers to other collections.

 • Future: catalogue will become more like a search engines.
20 | Dirk Lewandowski
Thank you for your attention.
Prof. Dr.
Dirk Lewandowski

Hamburg University of Applied Sciences
Department Information
Berliner Tor 5
D - 20099 Hamburg
Germany



www.bui.haw-hamburg.de/lewandowski.html
E-Mail: dirk.lewandowski@haw-hamburg.de
1 of 22

Recommended

Search engine user behaviour: How can users be guided to quality content? by
Search engine user behaviour: How can users be guided to quality content?Search engine user behaviour: How can users be guided to quality content?
Search engine user behaviour: How can users be guided to quality content?Dirk Lewandowski
915 views17 slides
Managing discovery and linking services by
Managing discovery and linking servicesManaging discovery and linking services
Managing discovery and linking servicesNASIG
3.2K views59 slides
Text mining 101 what you should know by
Text mining 101 what you should knowText mining 101 what you should know
Text mining 101 what you should knowNASIG
414 views32 slides
Technology Evaluation and Meeting the Needs of People with Disabilities by
Technology Evaluation and Meeting the Needs of People with Disabilities Technology Evaluation and Meeting the Needs of People with Disabilities
Technology Evaluation and Meeting the Needs of People with Disabilities National Information Standards Organization (NISO)
4.7K views23 slides
Data-Informed Decision Making for Libraries - Athenaeum21 by
Data-Informed Decision Making for Libraries - Athenaeum21Data-Informed Decision Making for Libraries - Athenaeum21
Data-Informed Decision Making for Libraries - Athenaeum21Megan Hurst
576 views41 slides
Discovery Layer Strategies for Kuali OLE: Indiana University by
Discovery Layer Strategies for Kuali OLE: Indiana UniversityDiscovery Layer Strategies for Kuali OLE: Indiana University
Discovery Layer Strategies for Kuali OLE: Indiana UniversityCourtney McDonald
1.1K views48 slides

More Related Content

What's hot

ArchiveSpark Introduction @ WebSci' 2016 Hackathon by
ArchiveSpark Introduction @ WebSci' 2016 HackathonArchiveSpark Introduction @ WebSci' 2016 Hackathon
ArchiveSpark Introduction @ WebSci' 2016 HackathonHelge Holzmann
473 views16 slides
Role of libraries in accelerating research by
Role of libraries in accelerating researchRole of libraries in accelerating research
Role of libraries in accelerating researchNikesh Narayanan
404 views81 slides
WorldCat Presentation by
WorldCat PresentationWorldCat Presentation
WorldCat PresentationVal MacMillan
2.1K views26 slides
Federated to library discovery platfoms by
Federated to library discovery platfomsFederated to library discovery platfoms
Federated to library discovery platfomsNikesh Narayanan
361 views44 slides
Search & Recommendation: Birds of a Feather? by
Search & Recommendation: Birds of a Feather?Search & Recommendation: Birds of a Feather?
Search & Recommendation: Birds of a Feather?Toine Bogers
1.1K views38 slides
All Your Data Displayed in One Place: Scoping Research for a Library Assessme... by
All Your Data Displayed in One Place: Scoping Research for a Library Assessme...All Your Data Displayed in One Place: Scoping Research for a Library Assessme...
All Your Data Displayed in One Place: Scoping Research for a Library Assessme...Megan Hurst
37.3K views17 slides

What's hot(20)

ArchiveSpark Introduction @ WebSci' 2016 Hackathon by Helge Holzmann
ArchiveSpark Introduction @ WebSci' 2016 HackathonArchiveSpark Introduction @ WebSci' 2016 Hackathon
ArchiveSpark Introduction @ WebSci' 2016 Hackathon
Helge Holzmann473 views
Role of libraries in accelerating research by Nikesh Narayanan
Role of libraries in accelerating researchRole of libraries in accelerating research
Role of libraries in accelerating research
Nikesh Narayanan404 views
WorldCat Presentation by Val MacMillan
WorldCat PresentationWorldCat Presentation
WorldCat Presentation
Val MacMillan2.1K views
Federated to library discovery platfoms by Nikesh Narayanan
Federated to library discovery platfomsFederated to library discovery platfoms
Federated to library discovery platfoms
Nikesh Narayanan361 views
Search & Recommendation: Birds of a Feather? by Toine Bogers
Search & Recommendation: Birds of a Feather?Search & Recommendation: Birds of a Feather?
Search & Recommendation: Birds of a Feather?
Toine Bogers1.1K views
All Your Data Displayed in One Place: Scoping Research for a Library Assessme... by Megan Hurst
All Your Data Displayed in One Place: Scoping Research for a Library Assessme...All Your Data Displayed in One Place: Scoping Research for a Library Assessme...
All Your Data Displayed in One Place: Scoping Research for a Library Assessme...
Megan Hurst37.3K views
Semantic Search by sssw2012
Semantic SearchSemantic Search
Semantic Search
sssw20121.7K views
Shared Print in the Orbis Cascade Alliance and Colorado Alliance (Levine-Clark) by Charleston Conference
Shared Print in the Orbis Cascade Alliance and Colorado Alliance (Levine-Clark)Shared Print in the Orbis Cascade Alliance and Colorado Alliance (Levine-Clark)
Shared Print in the Orbis Cascade Alliance and Colorado Alliance (Levine-Clark)
Promoting Open Access and Open Educational Resources to Faculty by NASIG
Promoting Open Access and Open Educational Resources to FacultyPromoting Open Access and Open Educational Resources to Faculty
Promoting Open Access and Open Educational Resources to Faculty
NASIG452 views
E book acquisition discovery-delivery-support by Jeff Siemon
E book acquisition discovery-delivery-supportE book acquisition discovery-delivery-support
E book acquisition discovery-delivery-support
Jeff Siemon692 views
NISO Standards update: KBart and Demand Driven Acquisitions Best Practices by Jason Price, PhD
NISO Standards update: KBart and Demand Driven Acquisitions Best PracticesNISO Standards update: KBart and Demand Driven Acquisitions Best Practices
NISO Standards update: KBart and Demand Driven Acquisitions Best Practices
Jason Price, PhD892 views
Analyzing workflows and improving communication across departments by NASIG
Analyzing workflows and improving communication across departments Analyzing workflows and improving communication across departments
Analyzing workflows and improving communication across departments
NASIG259 views
Getting on with it (research support at an academic library) presented at Uni... by Reed Elsevier
Getting on with it (research support at an academic library) presented at Uni...Getting on with it (research support at an academic library) presented at Uni...
Getting on with it (research support at an academic library) presented at Uni...
Reed Elsevier977 views
Library Services Navigation by Brian Rennick
Library Services NavigationLibrary Services Navigation
Library Services Navigation
Brian Rennick234 views
Research Support Services ECU Library by Julia Gross
Research Support Services ECU LibraryResearch Support Services ECU Library
Research Support Services ECU Library
Julia Gross5.9K views
Virtual support_to_research_communities by СОБДиЮ
Virtual  support_to_research_communitiesVirtual  support_to_research_communities
Virtual support_to_research_communities
СОБДиЮ531 views
RDA Toolkit Essentials webinar 03.19.14 by jhennelly
RDA Toolkit Essentials webinar 03.19.14RDA Toolkit Essentials webinar 03.19.14
RDA Toolkit Essentials webinar 03.19.14
jhennelly1.6K views

Viewers also liked

Ordinary Search Engine Users Assessing Difficulty, Effort and Outcome for Sim... by
Ordinary Search Engine Users Assessing Difficulty, Effort and Outcome for Sim...Ordinary Search Engine Users Assessing Difficulty, Effort and Outcome for Sim...
Ordinary Search Engine Users Assessing Difficulty, Effort and Outcome for Sim...Dirk Lewandowski
1.3K views26 slides
Verwendung von Skalenbewertungen in der Evaluierung von Suchmaschinen by
Verwendung von Skalenbewertungen in der Evaluierung von SuchmaschinenVerwendung von Skalenbewertungen in der Evaluierung von Suchmaschinen
Verwendung von Skalenbewertungen in der Evaluierung von SuchmaschinenDirk Lewandowski
833 views23 slides
Perspektiven eines Open Web Index by
Perspektiven eines Open Web IndexPerspektiven eines Open Web Index
Perspektiven eines Open Web IndexDirk Lewandowski
1.6K views40 slides
Internet-Suchmaschinen: Aktueller Stand und Entwicklungsperspektiven by
Internet-Suchmaschinen: Aktueller Stand und EntwicklungsperspektivenInternet-Suchmaschinen: Aktueller Stand und Entwicklungsperspektiven
Internet-Suchmaschinen: Aktueller Stand und EntwicklungsperspektivenDirk Lewandowski
871 views49 slides
Neue Trends: Google, SEO und Co.? by
Neue Trends: Google, SEO und Co.?Neue Trends: Google, SEO und Co.?
Neue Trends: Google, SEO und Co.?Dirk Lewandowski
729 views24 slides
Wie entwickeln sich Suchmaschinen heute, was kommt morgen? by
Wie entwickeln sich Suchmaschinen heute, was kommt morgen?Wie entwickeln sich Suchmaschinen heute, was kommt morgen?
Wie entwickeln sich Suchmaschinen heute, was kommt morgen?Dirk Lewandowski
1.5K views35 slides

Viewers also liked(9)

Ordinary Search Engine Users Assessing Difficulty, Effort and Outcome for Sim... by Dirk Lewandowski
Ordinary Search Engine Users Assessing Difficulty, Effort and Outcome for Sim...Ordinary Search Engine Users Assessing Difficulty, Effort and Outcome for Sim...
Ordinary Search Engine Users Assessing Difficulty, Effort and Outcome for Sim...
Dirk Lewandowski1.3K views
Verwendung von Skalenbewertungen in der Evaluierung von Suchmaschinen by Dirk Lewandowski
Verwendung von Skalenbewertungen in der Evaluierung von SuchmaschinenVerwendung von Skalenbewertungen in der Evaluierung von Suchmaschinen
Verwendung von Skalenbewertungen in der Evaluierung von Suchmaschinen
Dirk Lewandowski833 views
Perspektiven eines Open Web Index by Dirk Lewandowski
Perspektiven eines Open Web IndexPerspektiven eines Open Web Index
Perspektiven eines Open Web Index
Dirk Lewandowski1.6K views
Internet-Suchmaschinen: Aktueller Stand und Entwicklungsperspektiven by Dirk Lewandowski
Internet-Suchmaschinen: Aktueller Stand und EntwicklungsperspektivenInternet-Suchmaschinen: Aktueller Stand und Entwicklungsperspektiven
Internet-Suchmaschinen: Aktueller Stand und Entwicklungsperspektiven
Dirk Lewandowski871 views
Wie entwickeln sich Suchmaschinen heute, was kommt morgen? by Dirk Lewandowski
Wie entwickeln sich Suchmaschinen heute, was kommt morgen?Wie entwickeln sich Suchmaschinen heute, was kommt morgen?
Wie entwickeln sich Suchmaschinen heute, was kommt morgen?
Dirk Lewandowski1.5K views
Wie Suchmaschinen die Inhalte des Web interpretieren by Dirk Lewandowski
Wie Suchmaschinen die Inhalte des Web interpretierenWie Suchmaschinen die Inhalte des Web interpretieren
Wie Suchmaschinen die Inhalte des Web interpretieren
Dirk Lewandowski1.5K views

Similar to How can library materials be ranked in the OPAC?

Measuring the quality of web search engines by
Measuring the quality of web search enginesMeasuring the quality of web search engines
Measuring the quality of web search enginesDirk Lewandowski
1.1K views40 slides
Snyman unisa battle to build an ebook collection by
Snyman unisa battle to build an ebook collectionSnyman unisa battle to build an ebook collection
Snyman unisa battle to build an ebook collectionFOTIM
316 views28 slides
Introducing the Open Discovery Initiative by
Introducing the Open Discovery InitiativeIntroducing the Open Discovery Initiative
Introducing the Open Discovery InitiativeNASIG
843 views21 slides
Andrew cox rdm rose by
Andrew cox   rdm roseAndrew cox   rdm rose
Andrew cox rdm rosesconul
939 views29 slides
RDA Toolkit Essentials 01.16 by
RDA Toolkit Essentials 01.16RDA Toolkit Essentials 01.16
RDA Toolkit Essentials 01.16jhennelly
1.7K views59 slides

Similar to How can library materials be ranked in the OPAC?(20)

Measuring the quality of web search engines by Dirk Lewandowski
Measuring the quality of web search enginesMeasuring the quality of web search engines
Measuring the quality of web search engines
Dirk Lewandowski1.1K views
Snyman unisa battle to build an ebook collection by FOTIM
Snyman unisa battle to build an ebook collectionSnyman unisa battle to build an ebook collection
Snyman unisa battle to build an ebook collection
FOTIM316 views
Introducing the Open Discovery Initiative by NASIG
Introducing the Open Discovery InitiativeIntroducing the Open Discovery Initiative
Introducing the Open Discovery Initiative
NASIG843 views
Andrew cox rdm rose by sconul
Andrew cox   rdm roseAndrew cox   rdm rose
Andrew cox rdm rose
sconul939 views
RDA Toolkit Essentials 01.16 by jhennelly
RDA Toolkit Essentials 01.16RDA Toolkit Essentials 01.16
RDA Toolkit Essentials 01.16
jhennelly1.7K views
09.19 rda toolkit essentials by jhennelly
09.19 rda toolkit essentials 09.19 rda toolkit essentials
09.19 rda toolkit essentials
jhennelly2K views
07.18 rda toolkit essentials by jhennelly
07.18 rda toolkit essentials07.18 rda toolkit essentials
07.18 rda toolkit essentials
jhennelly1.2K views
11.14 RDA Toolkit essentials by jhennelly
11.14 RDA Toolkit essentials 11.14 RDA Toolkit essentials
11.14 RDA Toolkit essentials
jhennelly1.3K views
Role of libraries in research and scholarly communication by Nikesh Narayanan
Role of libraries in research and scholarly communicationRole of libraries in research and scholarly communication
Role of libraries in research and scholarly communication
Nikesh Narayanan251 views
The OCLC Research Library Partnership by OCLC
The OCLC Research Library PartnershipThe OCLC Research Library Partnership
The OCLC Research Library Partnership
OCLC4K views
Web Scale Discovery Services: Google like search experience by Nikesh Narayanan
Web Scale Discovery Services: Google like search experienceWeb Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experience
Nikesh Narayanan1.4K views
RDA Toolkit Essentials 2015-06-11 by jhennelly
RDA Toolkit Essentials 2015-06-11RDA Toolkit Essentials 2015-06-11
RDA Toolkit Essentials 2015-06-11
jhennelly2.3K views
RDA Toolkit Essentials 2015-09-24 by jhennelly
RDA Toolkit Essentials 2015-09-24RDA Toolkit Essentials 2015-09-24
RDA Toolkit Essentials 2015-09-24
jhennelly4.5K views
Benchmarking Domain-specific Expert Search using Workshop Program Committees by Toine Bogers
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesBenchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Toine Bogers732 views
Discovery on a budget: Improved searching without a Web-scale discovery product by NASIG
Discovery on a budget: Improved searching without a Web-scale discovery productDiscovery on a budget: Improved searching without a Web-scale discovery product
Discovery on a budget: Improved searching without a Web-scale discovery product
NASIG746 views
Discovery on a budget by Chris Bulock
Discovery on a budgetDiscovery on a budget
Discovery on a budget
Chris Bulock358 views
RDA Toolkit Essentials 2014-12-17 by jhennelly
RDA Toolkit Essentials 2014-12-17RDA Toolkit Essentials 2014-12-17
RDA Toolkit Essentials 2014-12-17
jhennelly1.4K views
RDA Toolkit Essentials 2015-03-18 by jhennelly
RDA Toolkit Essentials 2015-03-18RDA Toolkit Essentials 2015-03-18
RDA Toolkit Essentials 2015-03-18
jhennelly1.4K views

More from Dirk Lewandowski

The Need for and fundamentals of an Open Web Index by
The Need for and fundamentals of an Open Web IndexThe Need for and fundamentals of an Open Web Index
The Need for and fundamentals of an Open Web IndexDirk Lewandowski
230 views17 slides
In a World of Biased Search Engines by
In a World of Biased Search EnginesIn a World of Biased Search Engines
In a World of Biased Search EnginesDirk Lewandowski
583 views48 slides
EIN ANDERER BLICK AUF GOOGLE: Wie interpretieren Nutzer/innen die Suchergebni... by
EIN ANDERER BLICK AUF GOOGLE: Wie interpretieren Nutzer/innen die Suchergebni...EIN ANDERER BLICK AUF GOOGLE: Wie interpretieren Nutzer/innen die Suchergebni...
EIN ANDERER BLICK AUF GOOGLE: Wie interpretieren Nutzer/innen die Suchergebni...Dirk Lewandowski
319 views45 slides
Künstliche Intelligenz bei Suchmaschinen by
Künstliche Intelligenz bei SuchmaschinenKünstliche Intelligenz bei Suchmaschinen
Künstliche Intelligenz bei SuchmaschinenDirk Lewandowski
466 views28 slides
Analysing search engine data on socially relevant topics by
Analysing search engine data on socially relevant topicsAnalysing search engine data on socially relevant topics
Analysing search engine data on socially relevant topicsDirk Lewandowski
191 views36 slides
Google Assistant, Alexa & Co.: Wie sich die Welt der Suche verändert by
Google Assistant, Alexa & Co.: Wie sich die Welt der Suche verändertGoogle Assistant, Alexa & Co.: Wie sich die Welt der Suche verändert
Google Assistant, Alexa & Co.: Wie sich die Welt der Suche verändertDirk Lewandowski
278 views27 slides

More from Dirk Lewandowski(20)

The Need for and fundamentals of an Open Web Index by Dirk Lewandowski
The Need for and fundamentals of an Open Web IndexThe Need for and fundamentals of an Open Web Index
The Need for and fundamentals of an Open Web Index
Dirk Lewandowski230 views
EIN ANDERER BLICK AUF GOOGLE: Wie interpretieren Nutzer/innen die Suchergebni... by Dirk Lewandowski
EIN ANDERER BLICK AUF GOOGLE: Wie interpretieren Nutzer/innen die Suchergebni...EIN ANDERER BLICK AUF GOOGLE: Wie interpretieren Nutzer/innen die Suchergebni...
EIN ANDERER BLICK AUF GOOGLE: Wie interpretieren Nutzer/innen die Suchergebni...
Dirk Lewandowski319 views
Künstliche Intelligenz bei Suchmaschinen by Dirk Lewandowski
Künstliche Intelligenz bei SuchmaschinenKünstliche Intelligenz bei Suchmaschinen
Künstliche Intelligenz bei Suchmaschinen
Dirk Lewandowski466 views
Analysing search engine data on socially relevant topics by Dirk Lewandowski
Analysing search engine data on socially relevant topicsAnalysing search engine data on socially relevant topics
Analysing search engine data on socially relevant topics
Dirk Lewandowski191 views
Google Assistant, Alexa & Co.: Wie sich die Welt der Suche verändert by Dirk Lewandowski
Google Assistant, Alexa & Co.: Wie sich die Welt der Suche verändertGoogle Assistant, Alexa & Co.: Wie sich die Welt der Suche verändert
Google Assistant, Alexa & Co.: Wie sich die Welt der Suche verändert
Dirk Lewandowski278 views
Suchverhalten und die Grenzen von Suchdiensten by Dirk Lewandowski
Suchverhalten und die Grenzen von SuchdienstenSuchverhalten und die Grenzen von Suchdiensten
Suchverhalten und die Grenzen von Suchdiensten
Dirk Lewandowski173 views
Können Nutzer echte Suchergebnisse von Werbung in Suchmaschinen unterscheiden? by Dirk Lewandowski
Können Nutzer echte Suchergebnisse von Werbung in Suchmaschinen unterscheiden?Können Nutzer echte Suchergebnisse von Werbung in Suchmaschinen unterscheiden?
Können Nutzer echte Suchergebnisse von Werbung in Suchmaschinen unterscheiden?
Dirk Lewandowski265 views
Are Ads on Google search engine results pages labeled clearly enough? by Dirk Lewandowski
Are Ads on Google search engine results pages labeled clearly enough?Are Ads on Google search engine results pages labeled clearly enough?
Are Ads on Google search engine results pages labeled clearly enough?
Dirk Lewandowski910 views
Search Engine Bias - sollen wir Googles Suchergebnissen vertrauen? by Dirk Lewandowski
Search Engine Bias - sollen wir Googles Suchergebnissen vertrauen?Search Engine Bias - sollen wir Googles Suchergebnissen vertrauen?
Search Engine Bias - sollen wir Googles Suchergebnissen vertrauen?
Dirk Lewandowski1.2K views
Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (3) by Dirk Lewandowski
Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (3)Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (3)
Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (3)
Dirk Lewandowski646 views
Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (2) by Dirk Lewandowski
Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (2)Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (2)
Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (2)
Dirk Lewandowski595 views
Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (1) by Dirk Lewandowski
Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (1)Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (1)
Neue Entwicklungen bei Suchmaschinen und deren Relevanz für Bibliotheken (1)
Dirk Lewandowski571 views
Medientage 2013: Die Zukunft der Suche by Dirk Lewandowski
Medientage 2013: Die Zukunft der SucheMedientage 2013: Die Zukunft der Suche
Medientage 2013: Die Zukunft der Suche
Dirk Lewandowski1.1K views
Suchmaschinen: Googlerisierung der Gesellschaft by Dirk Lewandowski
Suchmaschinen: Googlerisierung der GesellschaftSuchmaschinen: Googlerisierung der Gesellschaft
Suchmaschinen: Googlerisierung der Gesellschaft
Dirk Lewandowski1.5K views
Wie beeinflussen Suchmaschinen den Informationsmarkt? by Dirk Lewandowski
Wie beeinflussen Suchmaschinen den Informationsmarkt?Wie beeinflussen Suchmaschinen den Informationsmarkt?
Wie beeinflussen Suchmaschinen den Informationsmarkt?
Dirk Lewandowski543 views
Warum wir Alternativen zu Google benötigen by Dirk Lewandowski
Warum wir Alternativen zu Google benötigenWarum wir Alternativen zu Google benötigen
Warum wir Alternativen zu Google benötigen
Dirk Lewandowski1.2K views

Recently uploaded

Uni Systems for Power Platform.pptx by
Uni Systems for Power Platform.pptxUni Systems for Power Platform.pptx
Uni Systems for Power Platform.pptxUni Systems S.M.S.A.
50 views21 slides
SAP Automation Using Bar Code and FIORI.pdf by
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdfVirendra Rai, PMP
19 views38 slides
Top 10 Strategic Technologies in 2024: AI and Automation by
Top 10 Strategic Technologies in 2024: AI and AutomationTop 10 Strategic Technologies in 2024: AI and Automation
Top 10 Strategic Technologies in 2024: AI and AutomationAutomationEdge Technologies
14 views14 slides
How the World's Leading Independent Automotive Distributor is Reinventing Its... by
How the World's Leading Independent Automotive Distributor is Reinventing Its...How the World's Leading Independent Automotive Distributor is Reinventing Its...
How the World's Leading Independent Automotive Distributor is Reinventing Its...NUS-ISS
15 views25 slides
Report 2030 Digital Decade by
Report 2030 Digital DecadeReport 2030 Digital Decade
Report 2030 Digital DecadeMassimo Talia
14 views41 slides
.conf Go 2023 - Data analysis as a routine by
.conf Go 2023 - Data analysis as a routine.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routineSplunk
93 views12 slides

Recently uploaded(20)

SAP Automation Using Bar Code and FIORI.pdf by Virendra Rai, PMP
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdf
How the World's Leading Independent Automotive Distributor is Reinventing Its... by NUS-ISS
How the World's Leading Independent Automotive Distributor is Reinventing Its...How the World's Leading Independent Automotive Distributor is Reinventing Its...
How the World's Leading Independent Automotive Distributor is Reinventing Its...
NUS-ISS15 views
.conf Go 2023 - Data analysis as a routine by Splunk
.conf Go 2023 - Data analysis as a routine.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routine
Splunk93 views
Emerging & Future Technology - How to Prepare for the Next 10 Years of Radica... by NUS-ISS
Emerging & Future Technology - How to Prepare for the Next 10 Years of Radica...Emerging & Future Technology - How to Prepare for the Next 10 Years of Radica...
Emerging & Future Technology - How to Prepare for the Next 10 Years of Radica...
NUS-ISS16 views
Combining Orchestration and Choreography for a Clean Architecture by ThomasHeinrichs1
Combining Orchestration and Choreography for a Clean ArchitectureCombining Orchestration and Choreography for a Clean Architecture
Combining Orchestration and Choreography for a Clean Architecture
ThomasHeinrichs169 views
PharoJS - Zürich Smalltalk Group Meetup November 2023 by Noury Bouraqadi
PharoJS - Zürich Smalltalk Group Meetup November 2023PharoJS - Zürich Smalltalk Group Meetup November 2023
PharoJS - Zürich Smalltalk Group Meetup November 2023
Noury Bouraqadi120 views
Future of Learning - Khoong Chan Meng by NUS-ISS
Future of Learning - Khoong Chan MengFuture of Learning - Khoong Chan Meng
Future of Learning - Khoong Chan Meng
NUS-ISS33 views
AMAZON PRODUCT RESEARCH.pdf by JerikkLaureta
AMAZON PRODUCT RESEARCH.pdfAMAZON PRODUCT RESEARCH.pdf
AMAZON PRODUCT RESEARCH.pdf
JerikkLaureta15 views
AI: mind, matter, meaning, metaphors, being, becoming, life values by Twain Liu 刘秋艳
AI: mind, matter, meaning, metaphors, being, becoming, life valuesAI: mind, matter, meaning, metaphors, being, becoming, life values
AI: mind, matter, meaning, metaphors, being, becoming, life values
handbook for web 3 adoption.pdf by Liveplex
handbook for web 3 adoption.pdfhandbook for web 3 adoption.pdf
handbook for web 3 adoption.pdf
Liveplex19 views
Architecting CX Measurement Frameworks and Ensuring CX Metrics are fit for Pu... by NUS-ISS
Architecting CX Measurement Frameworks and Ensuring CX Metrics are fit for Pu...Architecting CX Measurement Frameworks and Ensuring CX Metrics are fit for Pu...
Architecting CX Measurement Frameworks and Ensuring CX Metrics are fit for Pu...
NUS-ISS37 views
Web Dev - 1 PPT.pdf by gdsczhcet
Web Dev - 1 PPT.pdfWeb Dev - 1 PPT.pdf
Web Dev - 1 PPT.pdf
gdsczhcet55 views
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV by Splunk
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
Splunk88 views

How can library materials be ranked in the OPAC?

  • 1. How can library materials be ranked in the OPAC? Prof. Dr. Dirk Lewandowski University of Applied Sciences Hamburg dirk.lewandowski@haw-hamburg.de 9th International Bielefeld Conference Bielefeld, 4 February 2009
  • 2. Agenda The state of the OPAC and the importance of relevance ranking Ranking factors The composition of results lists Conclusions 1 | Dirk Lewandowski
  • 3. Agenda The state of the OPAC and the importance of relevance ranking Ranking factors The composition of results lists Conclusions 2 | Dirk Lewandowski
  • 4. What’s wrong with library catalogueues? • catalogueues are incomplete – Items from journal article collections, abstracting and indexing databases • “Electronic card catalogueue”? • User behaviour changed – Short queries, fast results, one set of results – Search engines strongly influence users’ demands • Known item vs. topic-based search – OPACs should accomodate both. 3 | Dirk Lewandowski
  • 5. Some ideas to improve the OPAC (“catalogue 2.0”) • Let users participate – Write reviews – Rate titles • Enrich bibliographic data – Add reviews – Add TOC • Improve navigation – Drill-down menus on results pages to combine searching and browsing • Extend the database – Federated search 4 | Dirk Lewandowski
  • 6. Core of all search appliances: Relevance ranking • While Web 2.0 features add value to the catalogue, search is still the core. • “Search must work” • Users’ needs – Users want results quickly. – Users are not willing to think too much about formulating their queries. – Users are not willing to search for the right database before conducting their search. – Users are only willing to view a few results on the first results page before deciding to continue. 5 | Dirk Lewandowski
  • 7. Misconceptions about relevance ranking • A clear sorting criterion is better than relevance ranking. – Ranking does not reduce the number of results, but puts them in a certain order. – Other ordering options can be given. • Library catalogues do not apply any form of ranking. – Even conventional OPACs rank the results (according to publication date). • Relevance ranking is useless because it simply doesn’t work. – “Relevance” is hard to determine and depends on the context and on the individual user. However, a good relevance ranking can at least produce sufficient results lists. • Ranking is not that complicated. One must only apply standard measures such as TF/IDF. – For a good ranking, text matching alone is insufficient. 6 | Dirk Lewandowski
  • 8. Agenda The state of the OPAC and the importance of relevance ranking Ranking factors The composition of results lists Conclusions 7 | Dirk Lewandowski
  • 9. Ranking factors in web search engines • Text matching – Measures matching between query and document. – Term frequency, position of search terms within the documents, etc. – Text from document fulltexts, anchor texts. • Popularity – Measures popularity of the document (overall popularity or topic-based) – Link popularity (PageRank etc.), click popularity. • Freshness – Fresh documents can sometimes be very useful. – Derived from documents or from structural data (e.g., linkage) • Locality – Mainly expressed in differing rankings for country-specific search interfaces. 8 | Dirk Lewandowski: How can library materials be ranked in the OPAC? Lewandowski
  • 10. Text matching • Factors – Term frequency, inverted document frequency – Fields: Title, subject headings, author, etc. • Availability of text elements as a ranking factor – Fulltext, TOC, reviews, user comments • Problems with text matching – Not enough text in metadata. – Amount of text varies considerably (from mere bibliographic data to hundreds of pages of fulltext). 9 | Dirk Lewandowski
  • 11. Popularity • Popularity of – Item – Author/editor – Publisher – Book series • Measures – Number of items (by author, publisher, etc.) – Usage (circulation rate, download requests) – Average user rating – Citations 10 | Dirk Lewandowski
  • 12. Freshness • Freshness is the most-used ranking criterion in catalogues today. • It is often difficult to determine whether fresh items will be relevant to a certain query. • Need for fresh items can be derived from – Circulation rate for the individual item – Circulation rates for items from a certain group (from broad disciplines to specific subject headings) 11 | Dirk Lewandowski
  • 13. Locality • Availability of item – from the local library; within a certain distance. – Item currently available. • Physical location of the user – At home (electronic items strongly preferred) – At the library 12 | Dirk Lewandowski
  • 14. Other ranking factors • Size of item (no. of pages) • Document types – Monograph, edited book, proceedings, etc. – Article vs. Book – Physical vs. online materials • User group – Professor, undergraduate student, graduate student, etc. • Personalization – Individual usage data – Click-stream data from navigation 13 | Dirk Lewandowski
  • 15. Agenda The state of the OPAC and the importance of relevance ranking Ranking factors The composition of results lists Conclusions 14 | Dirk Lewandowski
  • 16. Data needed • Data from the catalogue • Circulation data – Anonymous • Location data – From IP ranges • User data • Data from remote resources – Abstracts (and fulltexts) from publishers. 15 | Dirk Lewandowski
  • 17. Collections and databases • Library controlled – catalogue – Local digital repositories – Course management systems – Institutional web sites • External collections – A&I databases – E-journal collections 16 | Dirk Lewandowski
  • 18. Mixed results lists • Ranking algorithms prefer “more of the same”. This does not satisfy users’ needs for a variety of results. • Example for a broad query – Reference works (from subject headings + items from reference collection) – Text books – Relevant databases – Some current items – Relevant journals 17 | Dirk Lewandowski
  • 19. “Universal Search” Additional databases • x One box results (e.g., news or images) 18 | Dirk Lewandowski: How can library materials be ranked in the OPAC? Lewandowski
  • 20. Agenda The state of the OPAC and the importance of relevance ranking Ranking factors The composition of results lists Conclusions 19 | Dirk Lewandowski
  • 21. Conclusions • Search is the core of the library catalogue. – However, other elements must be considered, too: – Usability – User guidance – Spelling corrections – etc. • A good ranking is always a mixture of ranking factors • In addition, results lists should be mixed. – Items from different collections. – Mixture of direct results and pointers to other collections. • Future: catalogue will become more like a search engines. 20 | Dirk Lewandowski
  • 22. Thank you for your attention. Prof. Dr. Dirk Lewandowski Hamburg University of Applied Sciences Department Information Berliner Tor 5 D - 20099 Hamburg Germany www.bui.haw-hamburg.de/lewandowski.html E-Mail: dirk.lewandowski@haw-hamburg.de