SlideShare a Scribd company logo
Tien-Thanh Vu
Computing and Communications Department
The Open University
Improving Search Personalisation
with Dynamic Group Formation
Table of Contents
 Motivation
 Methodology
 A pilot study
 Conclusions and Future Work
 Motivation
 Methodology
 A pilot study
 Conclusions and Future Work
What is search personalisation?
The search performance
depends on
the richness of a user profile
J. Teevan, M. R. Morris, and S. Bush. Discovering and using groups to improve personalized search. WSDM’09,
pages 15-24, USA, 2009. ACM
How can we enrich a user
profile?
 Use information of the group of users who share
common interests
R. W. White, W. Chu, A. Hassan, X. He, Y. Song, and H. Wang. Enhancing personalized search by mining and
modeling task behavior. WWW '13, pages 1411-1420, Switzerland, 2013. ACM
What is the main research problem?
 Construct groups statically using some
predetermined criterions such as common clicked
documents
Users in a group may have different interests on
different topics w.r.t the input query
Z. Dou, R. Song, and J.-R. Wen. A large-scale evaluation and analysis of personalized search strategies. WWW '07,
pages 581-590, NY, USA, 2007. ACM.
Our proposal
 The groups should be dynamically constructed
in response to the user’s input query
Research question
How can we improve the performance of search
personalisation with dynamic group formation?
 How can we dynamically group users who share
common interests?
 How can we enrich user profiles with group
information?
 Can enriched user profiles help to improve search
performance?
 Motivation
 Methodology
 A pilot study
 Conclusions and Future Work
Constructing a user profile
 Construct implicitly using the relevant data
extracted from each user’s search history (i.e.
query logs)
 Extract related topics from the data
 Build a user profile based on the topics
Constructing a user profile
 Average the relevant documents over topics
Query-dependent user grouping
 Construct shared user profiles
 Use the input query as an indicator for grouping
users
Constructing a shared user
profile
Query-dependent user grouping
Query-dependent user grouping
 CosineSimilarity sp, q =
sp.q
sp .|q|
= t∈T wpt.wqt
t∈T wpt
2
t∈T wqt
2
 E.g. sp = (1,2,5); q = (4,5,6)
CosineSimilarity sp, q =
1∗4+2∗5+5∗6
12+22+52 42+52+62
≈ 0.915
The 2-nearest users
Enriching a user profile
 Average all users in the group over topics
Re-ranking search results
 For each input query
 Download the top n ranked search results from the
search engine
 Compute a personalised score for each web page d
given the current user u
 Combine the personalised score p(d|u) and the
original rank r(q,d), to get a final score
),(
)|(
),|(
dqr
udp
qudf 
Re-ranking search results
 Motivation
 Methodology
 A pilot study
 Conclusions and Future Work
A pilot study
 Evaluation metrics
 Dataset
 Preliminary results
Evaluation metric
 Personalisation Gain (P-Gain)
worse#+better#
worse#-better#
 GainP
Dataset
 Query logs from Bing search engine for 15 days
from 1st to 15th July 2012, 106 anonymous users
 A relevant document is a click with dwell time of
at least 30 seconds or the last click in a session
(SAT click)
Item ALL Training Test
#days 15 10 5
#users 106 106 106
#queries 17947 11695 6252
#distinct queries 8008 5237 3102
#clicks 24041 15688 8353
#SAT clicks 16166 10607 5559
#SAT clicks/#queries 0.9008 0.9069 0.8892
Preliminary results
 Baseline and Personalisation Strategies
 Baseline: The original ranked results from Bing
 S_Profile: Use only the current user profile
 S_Group: Enrich the profile with static group
 D_Group: Enrich the profile with dynamic group
Preliminary results
 Overall Performance
Strategy #Better #Worse P-Gain
S_Profile 913 664 0.1579
S_Group 882 491 0.2848
D_Group 884 450 0.3253
 Motivation
 Methodology
 A pilot study
 Conclusions and Future Work
Conclusions
 Grouping improves search performance
 Dynamic grouping outperforms static grouping
Outlook
 Carry out evaluation on larger-scale data sets
 Extend the model to capture user’s interests that
change over time
Thank You!
Any Questions?
Constructing a user profile
 Use a topic modelling method to learn topics from
the data
How can we build a user profile?
 Ask the user explicitly to provide her interests
(e.g. questionnaires)
 Infer her interests implicitly using the user’s
search history

More Related Content

What's hot

2 ijmtst031002
2 ijmtst0310022 ijmtst031002
2 ijmtst031002
IJMTST Journal
 
Query formulation process
Query formulation processQuery formulation process
Query formulation processmalathimurugan
 
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTIONTEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
ijistjournal
 
Semantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with IdiomsSemantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with Idioms
Waqas Tariq
 
C017510717
C017510717C017510717
C017510717
IOSR Journals
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search Results
IRJET Journal
 
A Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie ReviewsA Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie Reviews
Editor IJMTER
 
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
cscpconf
 
Search Interface Feature Evaluation
Search Interface Feature EvaluationSearch Interface Feature Evaluation
Search Interface Feature Evaluation
Simona Galdikaite, B.B.A, MBA
 
Search Interface Feature Evaluation in Biosciences
Search Interface Feature Evaluation in BiosciencesSearch Interface Feature Evaluation in Biosciences
Search Interface Feature Evaluation in Biosciences
Zanda Mark
 
Feature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documentsFeature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documents
IJECEIAES
 
The comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search enginesThe comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search engines
fawad khan
 
Query formulation (chapter 1)
Query formulation (chapter 1)Query formulation (chapter 1)
Query formulation (chapter 1)Mohamed Rafique
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
Mounia Lalmas-Roelleke
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
inventionjournals
 
Information retrieval-systems notes
Information retrieval-systems notesInformation retrieval-systems notes
Information retrieval-systems notes
BAIRAVI T
 
An Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search TechniqueAn Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search Technique
paperpublications3
 
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
dannyijwest
 
An Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text ClassificationAn Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text Classification
IJCSIS Research Publications
 

What's hot (20)

2 ijmtst031002
2 ijmtst0310022 ijmtst031002
2 ijmtst031002
 
Query formulation process
Query formulation processQuery formulation process
Query formulation process
 
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTIONTEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
 
Semantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with IdiomsSemantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with Idioms
 
C017510717
C017510717C017510717
C017510717
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search Results
 
A Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie ReviewsA Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie Reviews
 
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
 
Lec 2
Lec 2Lec 2
Lec 2
 
Search Interface Feature Evaluation
Search Interface Feature EvaluationSearch Interface Feature Evaluation
Search Interface Feature Evaluation
 
Search Interface Feature Evaluation in Biosciences
Search Interface Feature Evaluation in BiosciencesSearch Interface Feature Evaluation in Biosciences
Search Interface Feature Evaluation in Biosciences
 
Feature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documentsFeature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documents
 
The comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search enginesThe comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search engines
 
Query formulation (chapter 1)
Query formulation (chapter 1)Query formulation (chapter 1)
Query formulation (chapter 1)
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
Information retrieval-systems notes
Information retrieval-systems notesInformation retrieval-systems notes
Information retrieval-systems notes
 
An Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search TechniqueAn Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search Technique
 
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
 
An Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text ClassificationAn Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text Classification
 

Viewers also liked

como crear un triptico
como crear un tripticocomo crear un triptico
como crear un triptico
roger david lozada delgado
 
De la cruz valencia cristian
De la cruz valencia cristianDe la cruz valencia cristian
De la cruz valencia cristian
cristian de la cruz valencia
 
INCHI 2016
INCHI 2016INCHI 2016
INCHI 2016
Christian Romero
 
id.net for game developers
id.net for game developersid.net for game developers
id.net for game developers
Edwin J.
 
Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)
Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)
Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)Nova Weng
 
id.net APIs for Hosts
id.net APIs for Hostsid.net APIs for Hosts
id.net APIs for Hosts
Edwin J.
 
Ford-Truck-Whats-New-2001
Ford-Truck-Whats-New-2001Ford-Truck-Whats-New-2001
Ford-Truck-Whats-New-2001Jason Weber
 
NACCL-Requirements-Specification-Template
NACCL-Requirements-Specification-TemplateNACCL-Requirements-Specification-Template
NACCL-Requirements-Specification-TemplateJason Weber
 
Publisher
PublisherPublisher
Facebook
FacebookFacebook
Comunidades virtuales
Comunidades virtualesComunidades virtuales
Comunidades virtuales
cristian de la cruz valencia
 
Equidad de genero 1
Equidad de genero 1Equidad de genero 1
Tarjeta
TarjetaTarjeta
Practica.n9
Practica.n9Practica.n9
Practica.n9
Andres Araque
 

Viewers also liked (20)

como crear un triptico
como crear un tripticocomo crear un triptico
como crear un triptico
 
DVT-Manual
DVT-ManualDVT-Manual
DVT-Manual
 
De la cruz valencia cristian
De la cruz valencia cristianDe la cruz valencia cristian
De la cruz valencia cristian
 
INCHI 2016
INCHI 2016INCHI 2016
INCHI 2016
 
id.net for game developers
id.net for game developersid.net for game developers
id.net for game developers
 
Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)
Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)
Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)
 
id.net APIs for Hosts
id.net APIs for Hostsid.net APIs for Hosts
id.net APIs for Hosts
 
Ford-Truck-Whats-New-2001
Ford-Truck-Whats-New-2001Ford-Truck-Whats-New-2001
Ford-Truck-Whats-New-2001
 
NACCL-Requirements-Specification-Template
NACCL-Requirements-Specification-TemplateNACCL-Requirements-Specification-Template
NACCL-Requirements-Specification-Template
 
Publisher
PublisherPublisher
Publisher
 
Facebook
FacebookFacebook
Facebook
 
flowcharts
flowchartsflowcharts
flowcharts
 
Lehman Syllabus
Lehman SyllabusLehman Syllabus
Lehman Syllabus
 
Comunidades virtuales
Comunidades virtualesComunidades virtuales
Comunidades virtuales
 
Equidad de genero 1
Equidad de genero 1Equidad de genero 1
Equidad de genero 1
 
Slide DEF
Slide DEFSlide DEF
Slide DEF
 
Tarjeta
TarjetaTarjeta
Tarjeta
 
Practica.n9
Practica.n9Practica.n9
Practica.n9
 
LGA Syllabus Spring2003
LGA Syllabus Spring2003LGA Syllabus Spring2003
LGA Syllabus Spring2003
 
LGA Syllabus Spring2002
LGA Syllabus Spring2002LGA Syllabus Spring2002
LGA Syllabus Spring2002
 

Similar to Improving search personalisation with dynamic group formation

WSDM 2011 - Nicolaas Matthijs and Filip Radlinski
WSDM 2011 - Nicolaas Matthijs and Filip RadlinskiWSDM 2011 - Nicolaas Matthijs and Filip Radlinski
WSDM 2011 - Nicolaas Matthijs and Filip Radlinski
Nicolaas Matthijs
 
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Ed Chi
 
Personalized Search at Sandia National Labs
Personalized Search at Sandia National LabsPersonalized Search at Sandia National Labs
Personalized Search at Sandia National Labs
Lucidworks
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
DataGathering-Qualitative and Quantitative
DataGathering-Qualitative and QuantitativeDataGathering-Qualitative and Quantitative
DataGathering-Qualitative and QuantitativeSreenivas Ravi
 
Predicting User Knowledge Gain in Informational Search Sessions
Predicting User Knowledge Gain in Informational Search SessionsPredicting User Knowledge Gain in Informational Search Sessions
Predicting User Knowledge Gain in Informational Search Sessions
Ran Yu
 
Tools and techniques in qualitative and quantitative research
Tools and techniques in qualitative and quantitative researchTools and techniques in qualitative and quantitative research
Tools and techniques in qualitative and quantitative research
Deepikakohli10
 
Needs Assessment and Program Planning
Needs Assessment and Program PlanningNeeds Assessment and Program Planning
Needs Assessment and Program Planning
Lynda Milne
 
A Study on Impact of Team Work on Organizational Productivity with Special Re...
A Study on Impact of Team Work on Organizational Productivity with Special Re...A Study on Impact of Team Work on Organizational Productivity with Special Re...
A Study on Impact of Team Work on Organizational Productivity with Special Re...
ijtsrd
 
Query Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering ApproachQuery Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering Approach
IRJET Journal
 
Beyond Collaborative Filtering: Learning to Rank Research Articles
Beyond Collaborative Filtering: Learning to Rank Research ArticlesBeyond Collaborative Filtering: Learning to Rank Research Articles
Beyond Collaborative Filtering: Learning to Rank Research Articles
Maya Hristakeva
 
Qualitative Research vs Quantitative Research - a QuestionPro Academic Webinar
Qualitative Research vs Quantitative Research - a QuestionPro Academic WebinarQualitative Research vs Quantitative Research - a QuestionPro Academic Webinar
Qualitative Research vs Quantitative Research - a QuestionPro Academic Webinar
QuestionPro
 
Experimental Result Analysis of Text Categorization using Clustering and Clas...
Experimental Result Analysis of Text Categorization using Clustering and Clas...Experimental Result Analysis of Text Categorization using Clustering and Clas...
Experimental Result Analysis of Text Categorization using Clustering and Clas...
ijtsrd
 
Process And Methodology Research
Process And Methodology ResearchProcess And Methodology Research
Process And Methodology ResearchMiles Price
 
McGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and ScalingMcGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and Scaling
National Information Standards Organization (NISO)
 
Steps for Preparing Research Methodology - Phdassistance
Steps for Preparing Research Methodology - PhdassistanceSteps for Preparing Research Methodology - Phdassistance
Steps for Preparing Research Methodology - Phdassistance
PhD Assistance
 
User Control in Adaptive Information Access
User Control in Adaptive Information AccessUser Control in Adaptive Information Access
User Control in Adaptive Information Access
Peter Brusilovsky
 
Evaluation Of A Research Method
Evaluation Of A Research MethodEvaluation Of A Research Method
Evaluation Of A Research Method
Susan Kennedy
 

Similar to Improving search personalisation with dynamic group formation (20)

Naresh sharma
Naresh sharmaNaresh sharma
Naresh sharma
 
WSDM 2011 - Nicolaas Matthijs and Filip Radlinski
WSDM 2011 - Nicolaas Matthijs and Filip RadlinskiWSDM 2011 - Nicolaas Matthijs and Filip Radlinski
WSDM 2011 - Nicolaas Matthijs and Filip Radlinski
 
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
 
Personalized Search at Sandia National Labs
Personalized Search at Sandia National LabsPersonalized Search at Sandia National Labs
Personalized Search at Sandia National Labs
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
DataGathering-Qualitative and Quantitative
DataGathering-Qualitative and QuantitativeDataGathering-Qualitative and Quantitative
DataGathering-Qualitative and Quantitative
 
Predicting User Knowledge Gain in Informational Search Sessions
Predicting User Knowledge Gain in Informational Search SessionsPredicting User Knowledge Gain in Informational Search Sessions
Predicting User Knowledge Gain in Informational Search Sessions
 
Tools and techniques in qualitative and quantitative research
Tools and techniques in qualitative and quantitative researchTools and techniques in qualitative and quantitative research
Tools and techniques in qualitative and quantitative research
 
Needs Assessment and Program Planning
Needs Assessment and Program PlanningNeeds Assessment and Program Planning
Needs Assessment and Program Planning
 
A Study on Impact of Team Work on Organizational Productivity with Special Re...
A Study on Impact of Team Work on Organizational Productivity with Special Re...A Study on Impact of Team Work on Organizational Productivity with Special Re...
A Study on Impact of Team Work on Organizational Productivity with Special Re...
 
Query Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering ApproachQuery Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering Approach
 
Beyond Collaborative Filtering: Learning to Rank Research Articles
Beyond Collaborative Filtering: Learning to Rank Research ArticlesBeyond Collaborative Filtering: Learning to Rank Research Articles
Beyond Collaborative Filtering: Learning to Rank Research Articles
 
Qualitative Research vs Quantitative Research - a QuestionPro Academic Webinar
Qualitative Research vs Quantitative Research - a QuestionPro Academic WebinarQualitative Research vs Quantitative Research - a QuestionPro Academic Webinar
Qualitative Research vs Quantitative Research - a QuestionPro Academic Webinar
 
Experimental Result Analysis of Text Categorization using Clustering and Clas...
Experimental Result Analysis of Text Categorization using Clustering and Clas...Experimental Result Analysis of Text Categorization using Clustering and Clas...
Experimental Result Analysis of Text Categorization using Clustering and Clas...
 
Process And Methodology Research
Process And Methodology ResearchProcess And Methodology Research
Process And Methodology Research
 
McGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and ScalingMcGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and Scaling
 
Steps for Preparing Research Methodology - Phdassistance
Steps for Preparing Research Methodology - PhdassistanceSteps for Preparing Research Methodology - Phdassistance
Steps for Preparing Research Methodology - Phdassistance
 
User Control in Adaptive Information Access
User Control in Adaptive Information AccessUser Control in Adaptive Information Access
User Control in Adaptive Information Access
 
Lecture 2
Lecture 2Lecture 2
Lecture 2
 
Evaluation Of A Research Method
Evaluation Of A Research MethodEvaluation Of A Research Method
Evaluation Of A Research Method
 

Recently uploaded

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
alex933524
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
AlejandraGmez176757
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
James Polillo
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 

Recently uploaded (20)

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 

Improving search personalisation with dynamic group formation

  • 1. Tien-Thanh Vu Computing and Communications Department The Open University Improving Search Personalisation with Dynamic Group Formation
  • 2. Table of Contents  Motivation  Methodology  A pilot study  Conclusions and Future Work
  • 3.  Motivation  Methodology  A pilot study  Conclusions and Future Work
  • 4. What is search personalisation?
  • 5. The search performance depends on the richness of a user profile J. Teevan, M. R. Morris, and S. Bush. Discovering and using groups to improve personalized search. WSDM’09, pages 15-24, USA, 2009. ACM
  • 6. How can we enrich a user profile?  Use information of the group of users who share common interests R. W. White, W. Chu, A. Hassan, X. He, Y. Song, and H. Wang. Enhancing personalized search by mining and modeling task behavior. WWW '13, pages 1411-1420, Switzerland, 2013. ACM
  • 7. What is the main research problem?  Construct groups statically using some predetermined criterions such as common clicked documents Users in a group may have different interests on different topics w.r.t the input query Z. Dou, R. Song, and J.-R. Wen. A large-scale evaluation and analysis of personalized search strategies. WWW '07, pages 581-590, NY, USA, 2007. ACM.
  • 8. Our proposal  The groups should be dynamically constructed in response to the user’s input query
  • 9. Research question How can we improve the performance of search personalisation with dynamic group formation?  How can we dynamically group users who share common interests?  How can we enrich user profiles with group information?  Can enriched user profiles help to improve search performance?
  • 10.  Motivation  Methodology  A pilot study  Conclusions and Future Work
  • 11. Constructing a user profile  Construct implicitly using the relevant data extracted from each user’s search history (i.e. query logs)  Extract related topics from the data  Build a user profile based on the topics
  • 12. Constructing a user profile  Average the relevant documents over topics
  • 13. Query-dependent user grouping  Construct shared user profiles  Use the input query as an indicator for grouping users
  • 14. Constructing a shared user profile
  • 16. Query-dependent user grouping  CosineSimilarity sp, q = sp.q sp .|q| = t∈T wpt.wqt t∈T wpt 2 t∈T wqt 2  E.g. sp = (1,2,5); q = (4,5,6) CosineSimilarity sp, q = 1∗4+2∗5+5∗6 12+22+52 42+52+62 ≈ 0.915 The 2-nearest users
  • 17. Enriching a user profile  Average all users in the group over topics
  • 18. Re-ranking search results  For each input query  Download the top n ranked search results from the search engine  Compute a personalised score for each web page d given the current user u  Combine the personalised score p(d|u) and the original rank r(q,d), to get a final score ),( )|( ),|( dqr udp qudf 
  • 20.  Motivation  Methodology  A pilot study  Conclusions and Future Work
  • 21. A pilot study  Evaluation metrics  Dataset  Preliminary results
  • 22. Evaluation metric  Personalisation Gain (P-Gain) worse#+better# worse#-better#  GainP
  • 23. Dataset  Query logs from Bing search engine for 15 days from 1st to 15th July 2012, 106 anonymous users  A relevant document is a click with dwell time of at least 30 seconds or the last click in a session (SAT click) Item ALL Training Test #days 15 10 5 #users 106 106 106 #queries 17947 11695 6252 #distinct queries 8008 5237 3102 #clicks 24041 15688 8353 #SAT clicks 16166 10607 5559 #SAT clicks/#queries 0.9008 0.9069 0.8892
  • 24. Preliminary results  Baseline and Personalisation Strategies  Baseline: The original ranked results from Bing  S_Profile: Use only the current user profile  S_Group: Enrich the profile with static group  D_Group: Enrich the profile with dynamic group
  • 25. Preliminary results  Overall Performance Strategy #Better #Worse P-Gain S_Profile 913 664 0.1579 S_Group 882 491 0.2848 D_Group 884 450 0.3253
  • 26.  Motivation  Methodology  A pilot study  Conclusions and Future Work
  • 27. Conclusions  Grouping improves search performance  Dynamic grouping outperforms static grouping
  • 28. Outlook  Carry out evaluation on larger-scale data sets  Extend the model to capture user’s interests that change over time
  • 30.
  • 31. Constructing a user profile  Use a topic modelling method to learn topics from the data
  • 32. How can we build a user profile?  Ask the user explicitly to provide her interests (e.g. questionnaires)  Infer her interests implicitly using the user’s search history