SlideShare a Scribd company logo
Contextual Support for
Collaborative Information Retrieval
Shuguang Han, Daqing He, Zhen Yue
and Jiepu Jiang
1
Shuguang Han, Daqing He, Zhen Yue, and Jiepu Jiang. 2016. Contextual Support for Collaborative Information
Retrieval. In Proceedings of the 2016 ACM on Conference on Human Information Interaction and Retrieval (CHIIR
'16). ACM, New York, NY, USA, 33-42. DOI=http://dx.doi.org/10.1145/2854946.2854963
Motivation
Collaborative Information Retrieval (COL) VS. Individual Information Retrieval (IND)
Studies observed an increasing trend of COL, in which people search together for the same task
COL provides unique & interesting contextual factors for search support
team project
ski trip
students
family
Communication & Coordination
Shared Search History
Search
Search
2
Research Questions
RQ1: Are IND contextual support directly applicable in COL without
handling complex COL interactions?
3
RQ2: Can partners’ search histories be employed for contextual search
support in COL?
RQ3: Can team members’ chat content be applied for
contextual search support in COL?
Contextual Support for COL
Search contexts
Five types (since chat is highly
interactive and is not isolated,
we merge self- and partner-chat)
4
Contextual search support models
Document re-ranking based on search contexts
Experiment setup for contextual search support
Data about user search contexts
Ground-truth
Obtaining Data Collection
User study
A self-developed collaborative web search system
Search
Shared Workspace
Shared Query History
Communication & Coordination 5
Obtaining Data Collection
Search Tasks
T1: Academic Task
Writing a report to summarize the development of SNS
T2: Leisure Task
Planning a trip to Helsinki
User Study Procedure
Conditions: Collaborative search (COL) & Individual search (IND)
Within subject design
Each team (in COL) / individual(in IND) works 30 mins for each task
Post-task questionnaire for relevance rating of each saved document
User Study Data
18 pairs (for COL) and 18 individuals (for IND)
970 queries, 1,384 clicks and 909 saved documents
6
Ground-truth
● For each team {U, V}, pooling document relevance
○ Aggregating document relevance by averaging relevance judgement from other users
● Personalize ground-truth for each query q
○ Pooled relevant documents already-saved relevant documents
7
Utilizing Self Search Histories in COL
RQ1: Are IND contextual support directly applicable
in COL?
8
NDCG@NMAP
G: pure google results (baseline)
G + H_QUS: re-rank Google results using self query history
G + H_CLS: re-rank Google results using self click history
Take-away messages
● Self search history is also useful in COL
● QU > CL (different from [Shen et al. 2004])
○ may because of the task nature
○ may because of that self search behaviors in
COL are influenced by collaboration
Utilizing Self Search Histories in IND
Academic Leisure
#chat messages 21.56 (20.98) 38.86 (27.24)
#queries 12.97 (6.98) 9.25 (5.30)
#click-through 24.00 (13.23) 14.44 (7.25)
search-intensive task collaboration-intensive task 9
NDCG@NMAP
Take-away messages
● IND is consistent with COL for Academic task
● IND is inconsistent with COL for Leisure task
○ Hypothesis: Strong collaboration effect in leisure task
G: pure google results (baseline)
G + H_QUS: re-rank Google results using self query history
G + H_CLS: re-rank Google results using self click history
Utilizing Partners’ Search Histories in COLNDCG@N
RQ2: Can partners’ search histories be employed
for contextual search support in COL?
MAP
10
G + H_QU: re-rank Google by both (self + partner) query history
G + H_CL: re-rank Google by both (self + partner) click history
Take-away messages
● Incorporating search histories from partners
can provide a better contextual support
Utilizing Chat Histories in COL
Take-away messages
● Chat-based approach performs the best in the
leisure task
● Chat-based approach outperforms click-based
method in the academic task
● This finding goes beyond our expectation
○ Chat contains a lot noise (our previous study find
that 30% of chats are irrlevant to tasks)
NDCG@N
RQ3: Can team members’ chat content be applied
for contextual search support in COL?
G + H_CH: re-rank Google results by chat history
MAP
11
Category Description
Task social (TS) Chats concerning attitude to obtained resources
Task coordination (TC) Chats about the coordination of a task
Task content (TT) Chats related to the content of the search task
Non-task (NT) Chats that are not related to the search task
G + H_X: re-rank Google with {TS, TC, TT, NT} chat
history
NDCG@NMAP
Take-away messages
● Any type of chat (even for NT) helps improve search
● Involving all chat messages achieves the best
Categorization schema for manual chat message coding [2]
12
Utilizing Chat Histories in COL
Analyzing Non-Task Chat in COL
Task NT (×10-3) Others (×10-3)
T1 p(X|C) 0.520 1.936
p(X|R) 0.509 2.422
T2 p(X|C) 0.333 2.120
p(X|R) 0.346 3.139
Why NT works?
● They contain useful information. Each chat message refers to all the content a user typed in chat box
before she hits the submit button, which may cover multiple types of chat content.
● Noise in NT (e.g., lol, haha, okay) does not hurt result ranking
13
● p(NT|R) ≈ p(NT|C)
● p(Others|R) > p(Others|C)
p(X|C)
p(X|R)
corpus
relevant docs
C
R
Take-away Messages
Contextual support method developed in IND can be directly applied in COL
Task may affect the utility of different search contexts
Partners’ search histories can help for contextual support in COL
Combination of self and partners’ search histories provides better utility
Chat messages can help in for contextual support in COL
Noise in chat messages may not affect the utility of chat-based context
14
Q & A
15
Task Social
well, hello there
yeah! we are going to Helsinki!
everything looks great so far!
Task Coordination
you do stats and I’ll do impacts on students and professionals
have you done impact yet?
Task Content
ok so outdoor activities will be hard
16
Examples of Different Chat Types

More Related Content

Similar to Contextual support for collaborative information retrieval

Multimodal Analytics for Real-time Feedback in Co-located Collaboration, EC-T...
Multimodal Analytics for Real-time Feedback in Co-located Collaboration, EC-T...Multimodal Analytics for Real-time Feedback in Co-located Collaboration, EC-T...
Multimodal Analytics for Real-time Feedback in Co-located Collaboration, EC-T...
Sambit Praharaj
 
An Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search TechniqueAn Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search Technique
paperpublications3
 
Exploring the Strategic Impact of Online Communities: A Dynamic Capabilities ...
Exploring the Strategic Impact of Online Communities: A Dynamic Capabilities ...Exploring the Strategic Impact of Online Communities: A Dynamic Capabilities ...
Exploring the Strategic Impact of Online Communities: A Dynamic Capabilities ...
Dr. David Wagner
 
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
Peter Brusilovsky
 
Educational Technology Research Trends: Examining Six SSCI-indexed Refereed ...
Educational Technology Research Trends:  Examining Six SSCI-indexed Refereed ...Educational Technology Research Trends:  Examining Six SSCI-indexed Refereed ...
Educational Technology Research Trends: Examining Six SSCI-indexed Refereed ...
Yu-Chang Hsu
 
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Ed Chi
 
Article One must appropriately organize qualitative data collecting for analy...
Article One must appropriately organize qualitative data collecting for analy...Article One must appropriately organize qualitative data collecting for analy...
Article One must appropriately organize qualitative data collecting for analy...
4934bk
 
AFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag RecommendationsAFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag Recommendations
Dominik Kowald
 
DBR (Design-Based Research) in mobile learning-Mlearn2013 Doha A_Palalas C_G...
DBR (Design-Based Research) in mobile learning-Mlearn2013 Doha  A_Palalas C_G...DBR (Design-Based Research) in mobile learning-Mlearn2013 Doha  A_Palalas C_G...
DBR (Design-Based Research) in mobile learning-Mlearn2013 Doha A_Palalas C_G...Agnieszka (Aga) Palalas, Ed.D.
 
Continuous Improvement: How systems design can benefit the data-driven design...
Continuous Improvement: How systems design can benefit the data-driven design...Continuous Improvement: How systems design can benefit the data-driven design...
Continuous Improvement: How systems design can benefit the data-driven design...
RSD7 Symposium
 
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
IOSR Journals
 
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender SystemsDominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald
 
Research Design: Twitter and professional learning
Research Design: Twitter and professional learningResearch Design: Twitter and professional learning
Research Design: Twitter and professional learning
Peter Evans
 
Feedback in Scrum: Data-Informed Retrospectives
Feedback in Scrum: Data-Informed Retrospectives Feedback in Scrum: Data-Informed Retrospectives
Feedback in Scrum: Data-Informed Retrospectives
Christoph Matthies
 
CORE: co-author recommendation using network information and interest similarity
CORE: co-author recommendation using network information and interest similarityCORE: co-author recommendation using network information and interest similarity
CORE: co-author recommendation using network information and interest similarity
Rory Sie
 
SEMANTIC ANALYSIS OVER LESSONS LEARNED CONTAINED IN SOCIAL NETWORKS FOR GENER...
SEMANTIC ANALYSIS OVER LESSONS LEARNED CONTAINED IN SOCIAL NETWORKS FOR GENER...SEMANTIC ANALYSIS OVER LESSONS LEARNED CONTAINED IN SOCIAL NETWORKS FOR GENER...
SEMANTIC ANALYSIS OVER LESSONS LEARNED CONTAINED IN SOCIAL NETWORKS FOR GENER...
cscpconf
 
Semantic Analysis Over Lessons Learned Contained in Social Networks for Gener...
Semantic Analysis Over Lessons Learned Contained in Social Networks for Gener...Semantic Analysis Over Lessons Learned Contained in Social Networks for Gener...
Semantic Analysis Over Lessons Learned Contained in Social Networks for Gener...
csandit
 
Identifying semantics characteristics of user’s interactions datasets through...
Identifying semantics characteristics of user’s interactions datasets through...Identifying semantics characteristics of user’s interactions datasets through...
Identifying semantics characteristics of user’s interactions datasets through...
Fernando de Assis Rodrigues
 
32 using social bookmarking presentation
32 using social bookmarking presentation32 using social bookmarking presentation
32 using social bookmarking presentationCITE
 
Deep Semantic Learning for Conversational Agents
Deep Semantic Learning for Conversational AgentsDeep Semantic Learning for Conversational Agents
Deep Semantic Learning for Conversational Agents
Martino Mensio
 

Similar to Contextual support for collaborative information retrieval (20)

Multimodal Analytics for Real-time Feedback in Co-located Collaboration, EC-T...
Multimodal Analytics for Real-time Feedback in Co-located Collaboration, EC-T...Multimodal Analytics for Real-time Feedback in Co-located Collaboration, EC-T...
Multimodal Analytics for Real-time Feedback in Co-located Collaboration, EC-T...
 
An Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search TechniqueAn Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search Technique
 
Exploring the Strategic Impact of Online Communities: A Dynamic Capabilities ...
Exploring the Strategic Impact of Online Communities: A Dynamic Capabilities ...Exploring the Strategic Impact of Online Communities: A Dynamic Capabilities ...
Exploring the Strategic Impact of Online Communities: A Dynamic Capabilities ...
 
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
 
Educational Technology Research Trends: Examining Six SSCI-indexed Refereed ...
Educational Technology Research Trends:  Examining Six SSCI-indexed Refereed ...Educational Technology Research Trends:  Examining Six SSCI-indexed Refereed ...
Educational Technology Research Trends: Examining Six SSCI-indexed Refereed ...
 
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
 
Article One must appropriately organize qualitative data collecting for analy...
Article One must appropriately organize qualitative data collecting for analy...Article One must appropriately organize qualitative data collecting for analy...
Article One must appropriately organize qualitative data collecting for analy...
 
AFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag RecommendationsAFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag Recommendations
 
DBR (Design-Based Research) in mobile learning-Mlearn2013 Doha A_Palalas C_G...
DBR (Design-Based Research) in mobile learning-Mlearn2013 Doha  A_Palalas C_G...DBR (Design-Based Research) in mobile learning-Mlearn2013 Doha  A_Palalas C_G...
DBR (Design-Based Research) in mobile learning-Mlearn2013 Doha A_Palalas C_G...
 
Continuous Improvement: How systems design can benefit the data-driven design...
Continuous Improvement: How systems design can benefit the data-driven design...Continuous Improvement: How systems design can benefit the data-driven design...
Continuous Improvement: How systems design can benefit the data-driven design...
 
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
A Literature Survey on Ranking Tagged Web Documents in Social Bookmarking Sys...
 
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender SystemsDominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender Systems
 
Research Design: Twitter and professional learning
Research Design: Twitter and professional learningResearch Design: Twitter and professional learning
Research Design: Twitter and professional learning
 
Feedback in Scrum: Data-Informed Retrospectives
Feedback in Scrum: Data-Informed Retrospectives Feedback in Scrum: Data-Informed Retrospectives
Feedback in Scrum: Data-Informed Retrospectives
 
CORE: co-author recommendation using network information and interest similarity
CORE: co-author recommendation using network information and interest similarityCORE: co-author recommendation using network information and interest similarity
CORE: co-author recommendation using network information and interest similarity
 
SEMANTIC ANALYSIS OVER LESSONS LEARNED CONTAINED IN SOCIAL NETWORKS FOR GENER...
SEMANTIC ANALYSIS OVER LESSONS LEARNED CONTAINED IN SOCIAL NETWORKS FOR GENER...SEMANTIC ANALYSIS OVER LESSONS LEARNED CONTAINED IN SOCIAL NETWORKS FOR GENER...
SEMANTIC ANALYSIS OVER LESSONS LEARNED CONTAINED IN SOCIAL NETWORKS FOR GENER...
 
Semantic Analysis Over Lessons Learned Contained in Social Networks for Gener...
Semantic Analysis Over Lessons Learned Contained in Social Networks for Gener...Semantic Analysis Over Lessons Learned Contained in Social Networks for Gener...
Semantic Analysis Over Lessons Learned Contained in Social Networks for Gener...
 
Identifying semantics characteristics of user’s interactions datasets through...
Identifying semantics characteristics of user’s interactions datasets through...Identifying semantics characteristics of user’s interactions datasets through...
Identifying semantics characteristics of user’s interactions datasets through...
 
32 using social bookmarking presentation
32 using social bookmarking presentation32 using social bookmarking presentation
32 using social bookmarking presentation
 
Deep Semantic Learning for Conversational Agents
Deep Semantic Learning for Conversational AgentsDeep Semantic Learning for Conversational Agents
Deep Semantic Learning for Conversational Agents
 

Recently uploaded

Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 

Recently uploaded (20)

Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 

Contextual support for collaborative information retrieval

  • 1. Contextual Support for Collaborative Information Retrieval Shuguang Han, Daqing He, Zhen Yue and Jiepu Jiang 1 Shuguang Han, Daqing He, Zhen Yue, and Jiepu Jiang. 2016. Contextual Support for Collaborative Information Retrieval. In Proceedings of the 2016 ACM on Conference on Human Information Interaction and Retrieval (CHIIR '16). ACM, New York, NY, USA, 33-42. DOI=http://dx.doi.org/10.1145/2854946.2854963
  • 2. Motivation Collaborative Information Retrieval (COL) VS. Individual Information Retrieval (IND) Studies observed an increasing trend of COL, in which people search together for the same task COL provides unique & interesting contextual factors for search support team project ski trip students family Communication & Coordination Shared Search History Search Search 2
  • 3. Research Questions RQ1: Are IND contextual support directly applicable in COL without handling complex COL interactions? 3 RQ2: Can partners’ search histories be employed for contextual search support in COL? RQ3: Can team members’ chat content be applied for contextual search support in COL?
  • 4. Contextual Support for COL Search contexts Five types (since chat is highly interactive and is not isolated, we merge self- and partner-chat) 4 Contextual search support models Document re-ranking based on search contexts Experiment setup for contextual search support Data about user search contexts Ground-truth
  • 5. Obtaining Data Collection User study A self-developed collaborative web search system Search Shared Workspace Shared Query History Communication & Coordination 5
  • 6. Obtaining Data Collection Search Tasks T1: Academic Task Writing a report to summarize the development of SNS T2: Leisure Task Planning a trip to Helsinki User Study Procedure Conditions: Collaborative search (COL) & Individual search (IND) Within subject design Each team (in COL) / individual(in IND) works 30 mins for each task Post-task questionnaire for relevance rating of each saved document User Study Data 18 pairs (for COL) and 18 individuals (for IND) 970 queries, 1,384 clicks and 909 saved documents 6
  • 7. Ground-truth ● For each team {U, V}, pooling document relevance ○ Aggregating document relevance by averaging relevance judgement from other users ● Personalize ground-truth for each query q ○ Pooled relevant documents already-saved relevant documents 7
  • 8. Utilizing Self Search Histories in COL RQ1: Are IND contextual support directly applicable in COL? 8 NDCG@NMAP G: pure google results (baseline) G + H_QUS: re-rank Google results using self query history G + H_CLS: re-rank Google results using self click history Take-away messages ● Self search history is also useful in COL ● QU > CL (different from [Shen et al. 2004]) ○ may because of the task nature ○ may because of that self search behaviors in COL are influenced by collaboration
  • 9. Utilizing Self Search Histories in IND Academic Leisure #chat messages 21.56 (20.98) 38.86 (27.24) #queries 12.97 (6.98) 9.25 (5.30) #click-through 24.00 (13.23) 14.44 (7.25) search-intensive task collaboration-intensive task 9 NDCG@NMAP Take-away messages ● IND is consistent with COL for Academic task ● IND is inconsistent with COL for Leisure task ○ Hypothesis: Strong collaboration effect in leisure task G: pure google results (baseline) G + H_QUS: re-rank Google results using self query history G + H_CLS: re-rank Google results using self click history
  • 10. Utilizing Partners’ Search Histories in COLNDCG@N RQ2: Can partners’ search histories be employed for contextual search support in COL? MAP 10 G + H_QU: re-rank Google by both (self + partner) query history G + H_CL: re-rank Google by both (self + partner) click history Take-away messages ● Incorporating search histories from partners can provide a better contextual support
  • 11. Utilizing Chat Histories in COL Take-away messages ● Chat-based approach performs the best in the leisure task ● Chat-based approach outperforms click-based method in the academic task ● This finding goes beyond our expectation ○ Chat contains a lot noise (our previous study find that 30% of chats are irrlevant to tasks) NDCG@N RQ3: Can team members’ chat content be applied for contextual search support in COL? G + H_CH: re-rank Google results by chat history MAP 11
  • 12. Category Description Task social (TS) Chats concerning attitude to obtained resources Task coordination (TC) Chats about the coordination of a task Task content (TT) Chats related to the content of the search task Non-task (NT) Chats that are not related to the search task G + H_X: re-rank Google with {TS, TC, TT, NT} chat history NDCG@NMAP Take-away messages ● Any type of chat (even for NT) helps improve search ● Involving all chat messages achieves the best Categorization schema for manual chat message coding [2] 12 Utilizing Chat Histories in COL
  • 13. Analyzing Non-Task Chat in COL Task NT (×10-3) Others (×10-3) T1 p(X|C) 0.520 1.936 p(X|R) 0.509 2.422 T2 p(X|C) 0.333 2.120 p(X|R) 0.346 3.139 Why NT works? ● They contain useful information. Each chat message refers to all the content a user typed in chat box before she hits the submit button, which may cover multiple types of chat content. ● Noise in NT (e.g., lol, haha, okay) does not hurt result ranking 13 ● p(NT|R) ≈ p(NT|C) ● p(Others|R) > p(Others|C) p(X|C) p(X|R) corpus relevant docs C R
  • 14. Take-away Messages Contextual support method developed in IND can be directly applied in COL Task may affect the utility of different search contexts Partners’ search histories can help for contextual support in COL Combination of self and partners’ search histories provides better utility Chat messages can help in for contextual support in COL Noise in chat messages may not affect the utility of chat-based context 14
  • 16. Task Social well, hello there yeah! we are going to Helsinki! everything looks great so far! Task Coordination you do stats and I’ll do impacts on students and professionals have you done impact yet? Task Content ok so outdoor activities will be hard 16 Examples of Different Chat Types

Editor's Notes

  1. http://www.pitt.edu/~shh69/resources/ccir-preprint.pdf
  2. Recent studies have observed an increasing trend of COL, in which two or more people search together for the same search task. For example, a group of students may search and collaborate together on a course project and family members may search and plan together for a ski trip. Comparing to IND that only contains search activities, COL has several unique characteristics such as communication, coordination and shared search history. These are unique & interesting factors that could be useful for better contextual search support.
  3. To understand the usefulness of each type of search context, we study the following research questions. Suppose that we have two users working on a same search task. To support query q_{2n} for u_2, we can follow the method developed in IND that utilizes u_{2}’s self query and click histories. Although they have been demonstrated to be useful in IND, it remains unclear in COL since users’ search behaviors are affected Note that communication and collaboration will affect search behaviors of COL. This leads to RQ1. Besides self-search history, we can also obtain partners’ search history. This is a unique contextual information and we want to examine its usefulness in RQ2. Additionally, explicit communication, in this case chat, is another unique search context and we want to understand its effectiveness in RQ3.
  4. We mentioned several times about search contexts and contextual support. Here, we provide a detailed explanation. In this paper, we care about search contexts inferred from search history and chat history, where search history can be self and partner. Then, each type of search context will be used in contextual search support. In this study, we examine how each does search context support find relevant documents, in which we apply the document re-ranking approach. Then, we evaluate the effectiveness of each search context based on MAP and nDCG re-ranked search results. However, this requires the ground-truth of relevant documents.
  5. Since there lacks publicly-available data collections & collaborative web search systems. Our data collection is obtained through a user study based on a self-developed CIR system. Here, we show an interface to illustrate the major system functionalities. Users can search a query and system will, by default, return Google results. They can also chat, look at other users’ search queries and saved documents,
  6. Details (537 queries from T1 and 433 from T2)
  7. When building the ground-truth document relevance for a pair {U,V} Here, ground-truth is changed with the change of different teams and contexts - we only care about these new & relevant documents.
  8. Now, we talk about our experimental results. We firstly examine RQ1, in which we try to understand whether query and click history-based search contexts often applied in IND can also be used in COL. Note that users’ search behaviors will be different since collaboration can interrupt individuals’ search process. The MAP and nDCG evaluations are provided in the left diagram, where the gray means the performance for pure google-based ranking, red indicates re-ranking based on query history and blue means click history. We find that same as IND, self search history is also useful in COL. However, different from IND, QU > CL in COL which may due to: (1) the task nature, meaning that QU will also be better than CL even if in IND; (2) it may also due to that individual’s search behaviors will be affected by collaboration
  9. To test the above two reasons, we apply the same procedure as we did for COL but examine the effectiveness of self search history for IND. Again, the results are shown in this diagram, where we find that (1) QU > CL in academic task which is consistent with COL; (2) while CL has no difference with QU and seems to be even better. This is inconsistent with COL and we think it is because of the strong collaboration effect for leisure task.
  10. Red one uses BOTH query histories from self + partner, Blue one uses BOTH click histories from self + partner. Pink and light blue ones are self-based query and click histories. We find that including both search histories are useful.
  11. not every chat seems relevant to the task. we want to analyze different types of chat.. This motivates us to look further for the task
  12. find a good example for different chat types. Back up slides
  13. too much time. even pure noise, doesn’t matter reformulate the presentation of table & figure. Boundary.
  14. The whole conclusions & future work can be found in our paper. Here are two take-away messages we want to highlight. (Two or session -- chat & partners)
  15. pre-search knowledge