SlideShare a Scribd company logo
1 of 38
Download to read offline
Search & Recommendation: Birds of a Feather?
Toine Bogers
Aalborg University Copenhagen
Copenhagen, Denmark

‘Mind the Gap ’14’ workshop @ iConference 2014, Berlin
March 4, 2014
Outline
• Introduction
• Search vs. recommendation
- Use case
- Algorithms & evaluation
- Trends

• Conclusions

2
3
Success of search engines
• Search engines have had a huge impact on the information economy
- Academia
‣ Vibrant & growing research community with many dedicated conferences and journals
‣ Evaluation initiatives like TREC were shown to have a great impact on the performance
of Web search engines

- Industry
‣ Google → ~13 billion USD in profit in 2013

4
Amazon.com
Success of recommender systems
• Recommender systems have seen a meteoric rise to success in
past two decades
- Academia
‣ From specialized workshops to dedicated conference and journals

- Industry
‣ Amazon → 35% sales from recommendations
‣ Netflix → 75% of what its users watch comes from recommendations
‣ Google News → recommendations generate 38% more click-through

6
Different perspectives?
• Search and recommendation are commonly treated as different (but
related) research areas

• Search perspective → recommendation is a special type of search
problem

- Smaller research community with few dedicated venues
- Dedicated sessions at major IR conferences

• Recommendation perspective → field of its own
- Rapidly growing research community with s
- Strong industry support
- Separate data sets, experimental protocol, and evaluation

7
....but are they really that different?
• Looking at search and recommendation in isolation can be
counter-productive in many situations!

• Three aspects of where both fields are (growing) close(r)
- Use cases
- Algorithms & evaluation
- Trends

8
Use cases
Comparing use cases
• What are the characteristics of the information access paradigms?
- What problem are they trying to solve?
- What do we know about what the user wants?
- What do we know about the user?
- How do we know we have solved the user’s problem?

10
Comparing definitions

tions
“A recommender system is software that provides sugges
to users on which items could be of use to them.”
— Ricci et al. (2011)

retrieval (IR) is finding material of an
“Information
ation
ructured nature that satisfies an inform
unst
need from within large collections.”
— Manning et al. (2008)

11
Search characteristics
• Information need
- Explicit representation of user’s information need as a query (and
occasionally a description or narrative)
‣ Typically at Taylor’s last two stages

Taylor’s four stages
1. Visceral
2. Conscious
3. Formalized
4. Compromised

• Knowledge about the user
- User characteristics traditionally abstracted away
- More focus on the user in recent years (e.g., search history)
12
Search characteristics
• Matching criteria
- Relevance
‣ Assessment of perceived topicality, pertinence, usefulness or utility of an information
source by an actor or algorithm with reference to a task at a given point in time

- Relevance is a multi-dimensional concept → many different flavors!
‣ Topical relevance most common interpretation
‣ Textual similarity used as a proxy for topical relevance

Saracevic’s categories
• Algorithmic relevance
• Topical relevance
• Temporal relevance
• Situational relevance

- See Borlund (2003) for a comprehensive overview of relevance in IR
13
Recommendation characteristics
• Information need
- Implicit representation of user’s information need as a the user’s profile
‣ Typically at Taylor’s first two stages

• Knowledge about the user

Taylor’s four stages
1. Visceral
2. Conscious
3. Formalized
4. Compromised

- User profile representing the user’s interests
- Usage patterns, past interactions with the system, requirements

• Matching criteria
- Interest / Usefulness
14
No user profile
Classic IR

Explicit
need

Web search

PopularityBrowsing based
methods

Implicit
need
Information
filtering

?

Recommendation
User profile

15
Search & recommendation form a continuum
• Search (“Show me all books about X”)

• Focused recommendation (“Show me interesting books about X!”)

• Recommendation (“Show me interesting books!”)

Search

Focused
recommendation

Recommendation
16
LibraryThing forum topic
How prevalent is focused recommendation?
• Is there evidence for such a continuum?
- Search engines see millions of pure search requests every day
- Netflix and Amazon profit immensely from pure recommendation scenarios
- But how prevalent are these focused recommendation requests?

• Possible explanations for underrepresentation
- Perhaps we are looking in the wrong places?
- Interfaces offer little support for entering complex requests
18
INEX Social Book Search track
• Track running at INEX from 2011-2014 on book search
- Amazon/LibraryThing collection
‣ 2.8 million book metadata records
‣ Mix of metadata from Amazon, Librarything, Library of Congress, and British Library

- Realistic book requests & information needs from LibraryThing fora
‣ Highly varied set of requests that touch upon topics, genres, authors, engagment,
reading level, personal preferences, etc.

19
Topic title

Annotated LT topic
Narrative

Group name

Recommended
books

20
INEX Social Book Search track
• Track running at INEX from 2011-2014 on book search
- Amazon/LibraryThing collection
‣ 2.8 million book metadata records
‣ Mix of metadata from Amazon, Librarything, Library of Congress, and British Library

- Realistic book requests & information needs from LibraryThing fora
‣ Highly varied set of requests that touch upon topics, genres, authors, engagment,
reading level, personal preferences, etc.
‣ Collected & annotated 944 book requests from the LibraryThing fora

- Relevance judgments
‣ Member suggestions (Suggestions made by other Librarything members)
‣ Reading behavior (Has the original requester added any suggestions afterwards?)
21
Relevance aspects of book requests
• Eight LIS students annotated all requests on relevance aspects
Relevance aspects

%

Accessibility

16

Content

74

Engagement

23

Familiarity

36

Known-item

21

Metadata

28

Novelty

4

Socio-cultural

14
0

10 20

30

40

50

60

70

80

22
Continuum of search & recommendation
• How common are the different types of information needs?
Familiarity

No familiarity

Content

Focused
recommendation
(260 requests)

Search
(338 topics)

No content

Recommendation
(66 topics)

Context
(78 topics)

Sign up at https://inex.mmci.uni-saarland
.de/tracks/books/!
23
Not just true for the book domain!

24
What’s next?
• Focused recommendation deserves more attention!
- Combines aspects of search and recommendation

• Open questions
- How can we best address focused recommendation requests?
‣ Likely to require a combination of both search and recommendation approaches
‣ Early indications from INEX track that a combination indeed works best

- How can we support expressing these complex needs through the UI?

25
Algorithms & evaluation
Algorithms & evaluation
• Past decade has seen combination & mutual inspiration
- Both fields have borrowed techniques & metrics from each other
- Dedicated workshops & events
‣ CARR 2011-2014
‣ BARR 2013
‣ Mind The Gap 2014

27
Recommender systems → IR
• Collaborative filtering
- Automates the process of word-of-mouth recommendations by looking for
unseen items among other users with similar interests

• Used in IR for
- Collaborative search
‣ I-SPY search engine by Smyth et al. (2004)

- Query suggestion
- Improving 'More like this' functionality

28
IR → Recommender systems
• Recommender systems has borrowed from many different fields
- Artificial Intelligence (ML, CBR), IR, Natural Language Processing

• Inspiration from IR
- Algorithms
‣ TF·IDF weighting scheme for CF (Breese et al., 1998)
‣ Query expansion for recommender systems (Formosa et al., 2013)
‣ Probability ranking principle in recommender systems (Wang et al., 2006)
‣ Language modeling for recommender systems (Bellojin et al., 2013)

- Evaluation
‣ Increasing use of nDCG (and MAP) as metrics for ranked list recommendation
29
Trends
Context
• Incorporating contextual information into the search/
recommendation process

• Search
- IRiX workshop (2004-2005)
- CARR workshop (2011-2014)
- TREC Contextual Suggestion (2012-2013)

• Recommendation
- CARS workshop (2009-2012)
- CAMRA workshop (2010-2011)
- CARR workshop (2011-2014)
31
Diversity
• Ensuring a diverse range of relevant results/recommendations
• Search
- IDR workshop (2009)
- DDR workshop (2011-2012)
- Many publications addressing diversity in search results

• Recommendation
- DiveRS workshop (2011)
- Many publications addressing diversity in recommender systems
32
Privacy
• Protecting user privacy when generating results/recommendations
or releasing data sets

- Hot topic in the aftermath of release of AOL and Netflix data sets
- Many papers on how to (de-)anonymize of recommendation data sets and
search logs

• Search
- PIR workshop (2014)

• Recommendation
- RESSON workshop (2013)
33
Conclusions
Conclusions
• Search & recommendation form an information access continuum
- Pure search & recommendation needs are addressed well by the respective
research fields

- But many other information needs fall through the cracks!
‣ Need to look at the whole range of information needs
‣ Both in terms of algorithms and interface design

• Search & recommendation are already moving closer together
- Exchange of algorithms & techniques
- Shared evaluation metrics
- Similar research trends

• A continuum of requests requires a continuum of solutions!

35
Questions? Comments? Suggestions?

36
Backup slides

37
Example requests
CONTEXT REQUEST
I've just finished my undergraduate
work, and as I float into the real
world, I find myself missing books-and recommendations for books--in a
serious way. So, those of you in a
similar state (and those of you who
simply love reading, and sharing):
have any books that you find essential
for living? I'll post what I've been
reading, and you can as well...I'm
especially interested in books that are
a little older, a little less known, and
more prone to flying under the radar.
I read almost everything as well, a
sentiment I'm sure most of you are
familiar with.

SEARCH REQUEST
looking for heroine oriented love
triangle romances,
any recommendations appreciated.

RECOMMENDATION REQUEST
Just read and reviewed Moon in the
Water: Reflections on an Aging Parent.
I wonder if other early readers have
recommendations for similar
pieces...this makes me want to go
back and read The Summer of the
Great-Grandmother by Madeline
L'Engle. I glossed through it the first
time, but now that I am closer to that
stage of life I wonder if it will have
more meaning.
38

More Related Content

What's hot

What's hot (8)

Research Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the ChallengeResearch Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the Challenge
 
AAUP 2008: E-Journal Experience (H. McGregor)
AAUP 2008: E-Journal Experience (H. McGregor)AAUP 2008: E-Journal Experience (H. McGregor)
AAUP 2008: E-Journal Experience (H. McGregor)
 
Excerpts from NELA 2016
Excerpts from NELA 2016Excerpts from NELA 2016
Excerpts from NELA 2016
 
Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...
Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...
Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...
 
Creating Personas: For the Enrichment of the User Experience Community
Creating Personas: For the Enrichment of the User Experience CommunityCreating Personas: For the Enrichment of the User Experience Community
Creating Personas: For the Enrichment of the User Experience Community
 
Apo presentation research librarians day feb 2017
Apo presentation research librarians day feb 2017Apo presentation research librarians day feb 2017
Apo presentation research librarians day feb 2017
 
Using ACRL's New Framework for Information Literacy to Explore Teaching Strat...
Using ACRL's New Framework for Information Literacy to Explore Teaching Strat...Using ACRL's New Framework for Information Literacy to Explore Teaching Strat...
Using ACRL's New Framework for Information Literacy to Explore Teaching Strat...
 
Encore Presentation of “Using World Bank eLibrary for Country & Regional Rese...
Encore Presentation of “Using World Bank eLibrary for Country & Regional Rese...Encore Presentation of “Using World Bank eLibrary for Country & Regional Rese...
Encore Presentation of “Using World Bank eLibrary for Country & Regional Rese...
 

Similar to Search & Recommendation: Birds of a Feather?

How can library materials be ranked in the OPAC?
How can library materials be ranked in the OPAC?How can library materials be ranked in the OPAC?
How can library materials be ranked in the OPAC?
Dirk Lewandowski
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
enterprisesearchmeetup
 
DM2E Project meeting Bergen: WP3 Report on Task 3.4 (Steffen Hennicke)
DM2E Project meeting Bergen: WP3 Report on Task 3.4 (Steffen Hennicke)DM2E Project meeting Bergen: WP3 Report on Task 3.4 (Steffen Hennicke)
DM2E Project meeting Bergen: WP3 Report on Task 3.4 (Steffen Hennicke)
Digitised Manuscripts to Europeana
 

Similar to Search & Recommendation: Birds of a Feather? (20)

A Decade of Discovery: What We Know and Where We Will Go
A Decade of Discovery: What We Know and Where We Will GoA Decade of Discovery: What We Know and Where We Will Go
A Decade of Discovery: What We Know and Where We Will Go
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise Search
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 
Personalized search
Personalized searchPersonalized search
Personalized search
 
Library Science Emerging Career Trends 2016
Library Science Emerging Career Trends 2016Library Science Emerging Career Trends 2016
Library Science Emerging Career Trends 2016
 
Semantic Search
Semantic SearchSemantic Search
Semantic Search
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research Methods
 
How can library materials be ranked in the OPAC?
How can library materials be ranked in the OPAC?How can library materials be ranked in the OPAC?
How can library materials be ranked in the OPAC?
 
Personalizing Information Exploration with an Open User Model
Personalizing Information Exploration with an Open User ModelPersonalizing Information Exploration with an Open User Model
Personalizing Information Exploration with an Open User Model
 
Search in Research, Let's Make it More Complex!
Search in Research, Let's Make it More Complex!Search in Research, Let's Make it More Complex!
Search in Research, Let's Make it More Complex!
 
The OCLC Research Library Partnership
The OCLC Research Library PartnershipThe OCLC Research Library Partnership
The OCLC Research Library Partnership
 
Evaluating Search in Digital Cultural Heritage: Thinking Outside the (Search)...
Evaluating Search in Digital Cultural Heritage: Thinking Outside the (Search)...Evaluating Search in Digital Cultural Heritage: Thinking Outside the (Search)...
Evaluating Search in Digital Cultural Heritage: Thinking Outside the (Search)...
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
 
DM2E Project meeting Bergen: WP3 Report on Task 3.4 (Steffen Hennicke)
DM2E Project meeting Bergen: WP3 Report on Task 3.4 (Steffen Hennicke)DM2E Project meeting Bergen: WP3 Report on Task 3.4 (Steffen Hennicke)
DM2E Project meeting Bergen: WP3 Report on Task 3.4 (Steffen Hennicke)
 
Introduction to Information Architecture & Design - 3/19/16
Introduction to Information Architecture & Design - 3/19/16Introduction to Information Architecture & Design - 3/19/16
Introduction to Information Architecture & Design - 3/19/16
 
Introduction to Information Architecture & Design - 6/25/16
Introduction to Information Architecture & Design - 6/25/16Introduction to Information Architecture & Design - 6/25/16
Introduction to Information Architecture & Design - 6/25/16
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for Findability
 
Training and education of library staff for Research Data Services
Training and education of library staff for Research Data ServicesTraining and education of library staff for Research Data Services
Training and education of library staff for Research Data Services
 
Open Discovery Initiative Update - CNI, April 4, 2013
Open Discovery Initiative Update - CNI, April 4, 2013Open Discovery Initiative Update - CNI, April 4, 2013
Open Discovery Initiative Update - CNI, April 4, 2013
 
VIVO Team Builder - VIVO conference 2014
VIVO Team Builder - VIVO conference 2014VIVO Team Builder - VIVO conference 2014
VIVO Team Builder - VIVO conference 2014
 

More from Toine Bogers

Hands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
Hands-free but not Eyes-free: A Usability Evaluation of Siri while DrivingHands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
Hands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
Toine Bogers
 
A Longitudinal Analysis of Search Engine Index Size
A Longitudinal Analysis of Search Engine Index SizeA Longitudinal Analysis of Search Engine Index Size
A Longitudinal Analysis of Search Engine Index Size
Toine Bogers
 

More from Toine Bogers (15)

"If I like BLANK, what else will I like?": Analyzing a Human Recommendation C...
"If I like BLANK, what else will I like?": Analyzing a Human Recommendation C..."If I like BLANK, what else will I like?": Analyzing a Human Recommendation C...
"If I like BLANK, what else will I like?": Analyzing a Human Recommendation C...
 
Hands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
Hands-free but not Eyes-free: A Usability Evaluation of Siri while DrivingHands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
Hands-free but not Eyes-free: A Usability Evaluation of Siri while Driving
 
“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...
“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...
“Looking for an Amazing Game I Can Relax and Sink Hours into...”: A Study of ...
 
A Study of Usage and Usability of Intelligent Personal Assistants in Denmark
A Study of Usage and Usability of Intelligent Personal Assistants in DenmarkA Study of Usage and Usability of Intelligent Personal Assistants in Denmark
A Study of Usage and Usability of Intelligent Personal Assistants in Denmark
 
“What was this movie about this chick?”: A Comparative Study of Relevance Asp...
“What was this movie about this chick?”: A Comparative Study of Relevance Asp...“What was this movie about this chick?”: A Comparative Study of Relevance Asp...
“What was this movie about this chick?”: A Comparative Study of Relevance Asp...
 
"I just scroll through my stuff until I find it or give up": A Contextual Inq...
"I just scroll through my stuff until I find it or give up": A Contextual Inq..."I just scroll through my stuff until I find it or give up": A Contextual Inq...
"I just scroll through my stuff until I find it or give up": A Contextual Inq...
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Defining and Supporting Narrative-driven Recommendation
Defining and Supporting Narrative-driven RecommendationDefining and Supporting Narrative-driven Recommendation
Defining and Supporting Narrative-driven Recommendation
 
An In-depth Analysis of Tags and Controlled Metadata for Book Search
An In-depth Analysis of Tags and Controlled Metadata for Book SearchAn In-depth Analysis of Tags and Controlled Metadata for Book Search
An In-depth Analysis of Tags and Controlled Metadata for Book Search
 
A Longitudinal Analysis of Search Engine Index Size
A Longitudinal Analysis of Search Engine Index SizeA Longitudinal Analysis of Search Engine Index Size
A Longitudinal Analysis of Search Engine Index Size
 
Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?
Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?
Tagging vs. Controlled Vocabulary: Which is More Helpful for Book Search?
 
Measuring System Performance in Cultural Heritage Systems
Measuring System Performance in Cultural Heritage SystemsMeasuring System Performance in Cultural Heritage Systems
Measuring System Performance in Cultural Heritage Systems
 
How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...
How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...
How 'Social' are Social News Sites? Exploring the Motivations for Using Reddi...
 
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on TwitterMicro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
 
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesBenchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program Committees
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Recently uploaded (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

Search & Recommendation: Birds of a Feather?

  • 1. Search & Recommendation: Birds of a Feather? Toine Bogers Aalborg University Copenhagen Copenhagen, Denmark ‘Mind the Gap ’14’ workshop @ iConference 2014, Berlin March 4, 2014
  • 2. Outline • Introduction • Search vs. recommendation - Use case - Algorithms & evaluation - Trends • Conclusions 2
  • 3. 3
  • 4. Success of search engines • Search engines have had a huge impact on the information economy - Academia ‣ Vibrant & growing research community with many dedicated conferences and journals ‣ Evaluation initiatives like TREC were shown to have a great impact on the performance of Web search engines - Industry ‣ Google → ~13 billion USD in profit in 2013 4
  • 6. Success of recommender systems • Recommender systems have seen a meteoric rise to success in past two decades - Academia ‣ From specialized workshops to dedicated conference and journals - Industry ‣ Amazon → 35% sales from recommendations ‣ Netflix → 75% of what its users watch comes from recommendations ‣ Google News → recommendations generate 38% more click-through 6
  • 7. Different perspectives? • Search and recommendation are commonly treated as different (but related) research areas • Search perspective → recommendation is a special type of search problem - Smaller research community with few dedicated venues - Dedicated sessions at major IR conferences • Recommendation perspective → field of its own - Rapidly growing research community with s - Strong industry support - Separate data sets, experimental protocol, and evaluation 7
  • 8. ....but are they really that different? • Looking at search and recommendation in isolation can be counter-productive in many situations! • Three aspects of where both fields are (growing) close(r) - Use cases - Algorithms & evaluation - Trends 8
  • 10. Comparing use cases • What are the characteristics of the information access paradigms? - What problem are they trying to solve? - What do we know about what the user wants? - What do we know about the user? - How do we know we have solved the user’s problem? 10
  • 11. Comparing definitions tions “A recommender system is software that provides sugges to users on which items could be of use to them.” — Ricci et al. (2011) retrieval (IR) is finding material of an “Information ation ructured nature that satisfies an inform unst need from within large collections.” — Manning et al. (2008) 11
  • 12. Search characteristics • Information need - Explicit representation of user’s information need as a query (and occasionally a description or narrative) ‣ Typically at Taylor’s last two stages Taylor’s four stages 1. Visceral 2. Conscious 3. Formalized 4. Compromised • Knowledge about the user - User characteristics traditionally abstracted away - More focus on the user in recent years (e.g., search history) 12
  • 13. Search characteristics • Matching criteria - Relevance ‣ Assessment of perceived topicality, pertinence, usefulness or utility of an information source by an actor or algorithm with reference to a task at a given point in time - Relevance is a multi-dimensional concept → many different flavors! ‣ Topical relevance most common interpretation ‣ Textual similarity used as a proxy for topical relevance Saracevic’s categories • Algorithmic relevance • Topical relevance • Temporal relevance • Situational relevance - See Borlund (2003) for a comprehensive overview of relevance in IR 13
  • 14. Recommendation characteristics • Information need - Implicit representation of user’s information need as a the user’s profile ‣ Typically at Taylor’s first two stages • Knowledge about the user Taylor’s four stages 1. Visceral 2. Conscious 3. Formalized 4. Compromised - User profile representing the user’s interests - Usage patterns, past interactions with the system, requirements • Matching criteria - Interest / Usefulness 14
  • 15. No user profile Classic IR Explicit need Web search PopularityBrowsing based methods Implicit need Information filtering ? Recommendation User profile 15
  • 16. Search & recommendation form a continuum • Search (“Show me all books about X”) • Focused recommendation (“Show me interesting books about X!”) • Recommendation (“Show me interesting books!”) Search Focused recommendation Recommendation 16
  • 18. How prevalent is focused recommendation? • Is there evidence for such a continuum? - Search engines see millions of pure search requests every day - Netflix and Amazon profit immensely from pure recommendation scenarios - But how prevalent are these focused recommendation requests? • Possible explanations for underrepresentation - Perhaps we are looking in the wrong places? - Interfaces offer little support for entering complex requests 18
  • 19. INEX Social Book Search track • Track running at INEX from 2011-2014 on book search - Amazon/LibraryThing collection ‣ 2.8 million book metadata records ‣ Mix of metadata from Amazon, Librarything, Library of Congress, and British Library - Realistic book requests & information needs from LibraryThing fora ‣ Highly varied set of requests that touch upon topics, genres, authors, engagment, reading level, personal preferences, etc. 19
  • 20. Topic title Annotated LT topic Narrative Group name Recommended books 20
  • 21. INEX Social Book Search track • Track running at INEX from 2011-2014 on book search - Amazon/LibraryThing collection ‣ 2.8 million book metadata records ‣ Mix of metadata from Amazon, Librarything, Library of Congress, and British Library - Realistic book requests & information needs from LibraryThing fora ‣ Highly varied set of requests that touch upon topics, genres, authors, engagment, reading level, personal preferences, etc. ‣ Collected & annotated 944 book requests from the LibraryThing fora - Relevance judgments ‣ Member suggestions (Suggestions made by other Librarything members) ‣ Reading behavior (Has the original requester added any suggestions afterwards?) 21
  • 22. Relevance aspects of book requests • Eight LIS students annotated all requests on relevance aspects Relevance aspects % Accessibility 16 Content 74 Engagement 23 Familiarity 36 Known-item 21 Metadata 28 Novelty 4 Socio-cultural 14 0 10 20 30 40 50 60 70 80 22
  • 23. Continuum of search & recommendation • How common are the different types of information needs? Familiarity No familiarity Content Focused recommendation (260 requests) Search (338 topics) No content Recommendation (66 topics) Context (78 topics) Sign up at https://inex.mmci.uni-saarland .de/tracks/books/! 23
  • 24. Not just true for the book domain! 24
  • 25. What’s next? • Focused recommendation deserves more attention! - Combines aspects of search and recommendation • Open questions - How can we best address focused recommendation requests? ‣ Likely to require a combination of both search and recommendation approaches ‣ Early indications from INEX track that a combination indeed works best - How can we support expressing these complex needs through the UI? 25
  • 27. Algorithms & evaluation • Past decade has seen combination & mutual inspiration - Both fields have borrowed techniques & metrics from each other - Dedicated workshops & events ‣ CARR 2011-2014 ‣ BARR 2013 ‣ Mind The Gap 2014 27
  • 28. Recommender systems → IR • Collaborative filtering - Automates the process of word-of-mouth recommendations by looking for unseen items among other users with similar interests • Used in IR for - Collaborative search ‣ I-SPY search engine by Smyth et al. (2004) - Query suggestion - Improving 'More like this' functionality 28
  • 29. IR → Recommender systems • Recommender systems has borrowed from many different fields - Artificial Intelligence (ML, CBR), IR, Natural Language Processing • Inspiration from IR - Algorithms ‣ TF·IDF weighting scheme for CF (Breese et al., 1998) ‣ Query expansion for recommender systems (Formosa et al., 2013) ‣ Probability ranking principle in recommender systems (Wang et al., 2006) ‣ Language modeling for recommender systems (Bellojin et al., 2013) - Evaluation ‣ Increasing use of nDCG (and MAP) as metrics for ranked list recommendation 29
  • 31. Context • Incorporating contextual information into the search/ recommendation process • Search - IRiX workshop (2004-2005) - CARR workshop (2011-2014) - TREC Contextual Suggestion (2012-2013) • Recommendation - CARS workshop (2009-2012) - CAMRA workshop (2010-2011) - CARR workshop (2011-2014) 31
  • 32. Diversity • Ensuring a diverse range of relevant results/recommendations • Search - IDR workshop (2009) - DDR workshop (2011-2012) - Many publications addressing diversity in search results • Recommendation - DiveRS workshop (2011) - Many publications addressing diversity in recommender systems 32
  • 33. Privacy • Protecting user privacy when generating results/recommendations or releasing data sets - Hot topic in the aftermath of release of AOL and Netflix data sets - Many papers on how to (de-)anonymize of recommendation data sets and search logs • Search - PIR workshop (2014) • Recommendation - RESSON workshop (2013) 33
  • 35. Conclusions • Search & recommendation form an information access continuum - Pure search & recommendation needs are addressed well by the respective research fields - But many other information needs fall through the cracks! ‣ Need to look at the whole range of information needs ‣ Both in terms of algorithms and interface design • Search & recommendation are already moving closer together - Exchange of algorithms & techniques - Shared evaluation metrics - Similar research trends • A continuum of requests requires a continuum of solutions! 35
  • 38. Example requests CONTEXT REQUEST I've just finished my undergraduate work, and as I float into the real world, I find myself missing books-and recommendations for books--in a serious way. So, those of you in a similar state (and those of you who simply love reading, and sharing): have any books that you find essential for living? I'll post what I've been reading, and you can as well...I'm especially interested in books that are a little older, a little less known, and more prone to flying under the radar. I read almost everything as well, a sentiment I'm sure most of you are familiar with. SEARCH REQUEST looking for heroine oriented love triangle romances, any recommendations appreciated. RECOMMENDATION REQUEST Just read and reviewed Moon in the Water: Reflections on an Aging Parent. I wonder if other early readers have recommendations for similar pieces...this makes me want to go back and read The Summer of the Great-Grandmother by Madeline L'Engle. I glossed through it the first time, but now that I am closer to that stage of life I wonder if it will have more meaning. 38