SlideShare a Scribd company logo
1 of 37
Thanh Vu, Alistair Willis,
Dawei Song
The Open University, UK
Temporal Latent Topic User Profiles for
Search Personalisation
Son N. Tran
City London University
The 37th European Conference on Information
Retrieval
st
Search Personalisation
2
 Return search results based on
 The input query
 The user searching interests
Different users submit the same input query will
probably get different search result lists
Even an individual user will get different search
results at different search times (e.g., Open US)
Temporal Latent Topic User Profiles for Search
Personalisation
The performance of search
personalisation
depends on
the richness of a user profile
J. Teevan, M. R. Morris, and S. Bush. Discovering and using groups to improve personalized search. In WSDM’2009
3
Temporal Latent Topic User Profiles for Search
Personalisation
Topic-based user profiles
4
 Use Human generated ontology (ODP –
dmoz.org) to extract topics from all
clicked/relevant documents of a specific user to
build her profile
1. R. W. White, et al., Enhancing Personalized Search by Mining and Modeling Task Behavior. In WWW’2013
2. P. N. Bennett, et al., Modeling the impact of short- and long-term behavior on search personalization. In SIGIR’2012
Temporal Latent Topic User Profiles for Search
Personalisation
Challenges for Human Generated
Ontology
5
 New topics which are not covered in the Ontology
will possibly emerge overtime
 Expensive human effort to classify/maintain each
document into correct categories
Temporal Latent Topic User Profiles for Search
Personalisation
Challenges for Time-awareness
6
 Previous methods use all the clicked/relevant
documents of a user to build her searching profile
 The documents are treated equally without
considering temporal features (i.e., the time of
documents being clicked and viewed)
The profile is too broad
Cannot fully express the current interest of the user
1. T. T. Vu, et al., Improving search personalisation with dynamic group formation. In SIGIR’2014
2. K. Raman, et al., Toward whole-session relevance: Exploring intrinsic diversity in web search. In SIGIR’2013
Temporal Latent Topic User Profiles for Search
Personalisation
Research Questions
7
1. How can we build user profiles with time-
awareness?
2. Do the time-aware profiles help improve search
performance?
Temporal Latent Topic User Profiles for Search
Personalisation
Applying Latent Dirichlet
Allocation
8
Temporal Latent Topic User Profiles for Search
Personalisation
Building temporal latent topic user
profiles (1)
9
 Non-temporal method
Temporal Latent Topic User Profiles for Search
Personalisation
4th 1st2nd3rd
Football
Law
Health
OS
0.51
0.33
0.11
0.05
Clicked documents
Football
Law
OS
Health
0.55
0.27
0.10
0.08
Law
OS
Health
Football
0.41
0.37
0.12
0.10
OS
Law
Football
Health
0.65
0.21
0.10
0.04
Distribution over topics
Football
Law
OS
Health
0.32
0.30
0.29
0.09
Means over topics
The topic-based user profile
Building temporal latent topic user
profiles (2)
10
 Our method
Temporal Latent Topic User Profiles for Search
Personalisation
1st
Football
Law
Health
OS
0.51
0.33
0.11
0.05
Football
Law
Health
OS
0.51
0.33
0.11
0.05
The temporal topic user profile
0.90
Football
Law
Health
OS
0.53
0.30
0.09
0.08
Building temporal latent topic user
profiles (2)
11
Temporal Latent Topic User Profiles for Search
Personalisation
2nd 1st
Football
Law
Health
OS
0.51
0.33
0.11
0.05
Football
Law
OS
Health
0.55
0.27
0.10
0.08
The temporal topic user profile
0.91 0.90
Football
Law
OS
Health
0.37
0.34
0.19
0.10
0.910.92
Building temporal latent topic user
profiles (2)
12
Temporal Latent Topic User Profiles for Search
Personalisation
3rd 1st2nd
Football
Law
Health
OS
0.51
0.33
0.11
0.05
Football
Health
OS
Law
0.55
0.27
0.10
0.08
Law
OS
Health
Football
0.41
0.37
0.12
0.10
The temporal topic user profile
0.90
OS
Law
Football
Health
0.32
0.30
0.29
0.09
Building temporal latent topic user
profiles (2)
13
Temporal Latent Topic User Profiles for Search
Personalisation
4th 1st2nd3rd
Football
Law
Health
OS
0.51
0.33
0.11
0.05
Football
Health
OS
Law
0.55
0.27
0.10
0.08
Law
OS
Health
Football
0.41
0.37
0.12
0.10
OS
Law
Football
Health
0.65
0.21
0.10
0.04
Temporal topic profile
0.93
0.92 0.91
0.90
Football
Law
OS
Health
0.32
0.30
0.29
0.09
Non-temporal topic profile
Building temporal latent topic user
profiles (3)
Temporal Latent Topic User Profiles for Search
Personalisation14
 Du = {d1, d2, …, dn} is a relevant document set of
the user u
 The user profile of u is a distribution over the
topic Z (extracted by LDA)
 tdi = n indicates that di is the nth most
relevant/clicked document of u
 α is the decay parameter; K is the normalisation
factor
Building temporal latent topic user
profiles (4)
15
 Long-term user profile
 Use relevant documents extracted from the user’s
whole search history
 Daily user profile
 Use relevant documents extracted from the search
history of the user in the current searching day
 Session user profile
 Use relevant documents extracted from the search
history of the user in the current search session
Temporal Latent Topic User Profiles for Search
Personalisation
Re-ranking search results (1)
16
Temporal Latent Topic User Profiles for Search
Personalisation
1 32
Health
Law
Football
OS
0.51
0.33
0.11
0.05
Football
Law
Health
OS
0.55
0.27
0.13
0.05
Football
OS
Health
Law
0.41
0.37
0.12
0.10
Original Rank
132
Health
Law
Football
OS
0.51
0.33
0.11
0.05
Football
Law
Health
OS
0.55
0.27
0.13
0.05
Football
OS
Health
Law
0.41
0.37
0.12
0.10
After re-ranking
Football
Law
OS
Health
0.47
0.24
0.16
0.12
The user profile (p)
Re-ranking search results (2)
17
 Personalised scores
 Use Jensen-Shannon divergence (DJS[d||p] )
Temporal Latent Topic User Profiles for Search
Personalisation
1 32
Health
Law
Football
OS
0.51
0.33
0.11
0.05
Football
Law
Health
OS
0.55
0.27
0.13
0.05
Football
OS
Health
Law
0.41
0.37
0.12
0.10
Football
Law
OS
Health
0.47
0.24
0.16
0.12
Returned documents (d)
The user profile (p)
Re-ranking search results (3)
18
 Re-ranking Features
 Re-Ranking Algorithm: LambdaMART[1]
1. C. J. Burges, et al., Learning to rank with non-smooth cost functions. In NIPS’2007.
Feature Description
Personalised Features
LongTermScore Personalised score between document and long-term
profile
DailyScore Personalised score between document and daily profile
SessionScore Personalised score between document and session
profile
Non-personalised Features
DocRank Rank of document on original returned list
QuerySim Cosine similarity score between current and previous
queries
QueryNo Total number of queries that have been submitted in the
current search session (included the current query)
Evaluation
19
Dataset
 The query logs of 1166 anonymous users in four
weeks, from 01st to 28th July 2012
 A log entity consists of an anonymous user
identifier, a query, top-10 returned URLs, and
clicked documents along with the user’s dwell
time
 Download all the URLs’ content for learning topics
 A search session is demarcated by 30 minutes of
user inactivity
 A relevant document is a click with dwell time of
at least 30 seconds or the last click in a session
(SAT click)Temporal Latent Topic User Profiles for Search
Personalisation
Evaluation methodology
20
 Assign a positive (relevant) label to a returned
URL if
 it is a SAT click in the current query
 it is a SAT click in one of the other repeated queries
in the same search session
 Assign negative (irrelevant) labels to the rest of
URLs
Temporal Latent Topic User Profiles for Search
Personalisation
Personalisation Methods and
Baselines
21
 Personalisation Methods
 LON uses only LongTermScore from long-term
profile
 DAI uses only DailyScore from daily profile
 SES uses SessionScore from session profile
 ALL uses all personalised scores from three
profiles (ALL)
 Baselines
 Default is the default ranking returned by the
search engine
 Static uses the LongTermScore from long-term
profile without time-awareness (i.e., not using decay
function)
Temporal Latent Topic User Profiles for Search
Personalisation
Results
22
 Evaluation metrics
 Mean Average Precision (MAP)
 Precision (P@k)
 Mean Reciprocal Rank (MRR)
 Normalized Discounted Cumulative Gain
(nDCG@k)
 For each evaluation metric, the higher value
indicates the better ranking
Temporal Latent Topic User Profiles for Search
Personalisation
Overall Performance
23
Temporal Latent Topic User Profiles for Search
Personalisation
• All the improvements over the baselines
are all significant with paired t-test of p <
0.001
• Three temporal profiles help to improve
search performance over default ranking
and the use of non-temporal profile
Conclusions (1)
24
Temporal Latent Topic User Profiles for Search
Personalisation
• Using all features (ALL) achieves the
highest performance
Conclusions (2)
25
Temporal Latent Topic User Profiles for Search
Personalisation
Conclusions (3)
26
Temporal Latent Topic User Profiles for Search
Personalisation
• The session profile achieves better
performance than the daily profile
• The daily profile gains advantages over
the long-term profile
Conclusions (4)
27
Temporal Latent Topic User Profiles for Search
Personalisation
• Without time-awareness, the long-term
profile gets no improvement over the
default ranking
Summary
Temporal Latent Topic User Profiles for Search
Personalisation28
 Build long-term, daily and session profiles with
time-awareness using topics extracted
automatically from relevant documents in different
time scales
 Use the three profiles to re-rank search results
returned by Bing and show the significant
improvement in search performances
Thank you!
Any questions?
29
Temporal Latent Topic User Profiles for Search
Personalisation
Dataset (2)
30
Temporal Latent Topic User Profiles for Search
Personalisation
Example of query logs
31
Temporal Latent Topic User Profiles for Search
Personalisation
Click Entropies
32
 P(d|q) is the percentage of the clicks on
document d among all the clicks for q
 A smaller query click entropy value indicates
more agreement between users on clicking a
small number of web pages
Temporal Latent Topic User Profiles for Search
Personalisation
Click entropies
33
Temporal Latent Topic User Profiles for Search
Personalisation
Query Positions in Search
Session
34
 Aim to study whether the position of a query has
any effect on the performance of the temporal
latent topic profiles
 Label the queries by their positions during the
search
Temporal Latent Topic User Profiles for Search
Personalisation
Temporal Latent Topic User Profiles for Search
Personalisation35
Footbal
l
Law
Health
OS
0.51
0.33
0.11
0.05
Clicked documents
Footbal
l
Health
OS
Law
0.55
0.27
0.13
0.05
Law
OS
Health
Footbal
l
0.41
0.37
0.12
0.10
OS
Law
Footbal
l
Health
0.65
0.15
0.11
0.09
Distribution over topics
Footbal
l
Law
OS
Health
0.32
0.29
0.28
0.11
Means over topics
The topic-based user
profile
Re-ranking search results (1)
Temporal Latent Topic User Profiles for Search
Personalisation36
Query: MU
Pre-processing
37
 Remove the queries whose positive label set is
empty from the dataset
 Discard the domain-related queries (e.g.,
Facebook, Youtube)
Temporal Latent Topic User Profiles for Search
Personalisation

More Related Content

What's hot

IRJET- Text Document Clustering using K-Means Algorithm
IRJET-  	  Text Document Clustering using K-Means Algorithm IRJET-  	  Text Document Clustering using K-Means Algorithm
IRJET- Text Document Clustering using K-Means Algorithm IRJET Journal
 
Query formulation process
Query formulation processQuery formulation process
Query formulation processmalathimurugan
 
Search Interface Feature Evaluation in Biosciences
Search Interface Feature Evaluation in BiosciencesSearch Interface Feature Evaluation in Biosciences
Search Interface Feature Evaluation in BiosciencesZanda Mark
 
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTIONTEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTIONijistjournal
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsIRJET Journal
 
Semantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with IdiomsSemantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with IdiomsWaqas Tariq
 
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION cscpconf
 
A Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie ReviewsA Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie ReviewsEditor IJMTER
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)inventionjournals
 
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...dannyijwest
 
Feature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documentsFeature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documentsIJECEIAES
 
The comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search enginesThe comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search enginesfawad khan
 
An Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text ClassificationAn Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text ClassificationIJCSIS Research Publications
 
Query formulation (chapter 1)
Query formulation (chapter 1)Query formulation (chapter 1)
Query formulation (chapter 1)Mohamed Rafique
 
Ethnograph 11 Jul07
Ethnograph 11 Jul07Ethnograph 11 Jul07
Ethnograph 11 Jul07Clara Kwan
 
Ethnograph 10 Jul07
Ethnograph 10 Jul07Ethnograph 10 Jul07
Ethnograph 10 Jul07Clara Kwan
 
An Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search TechniqueAn Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search Techniquepaperpublications3
 

What's hot (19)

IRJET- Text Document Clustering using K-Means Algorithm
IRJET-  	  Text Document Clustering using K-Means Algorithm IRJET-  	  Text Document Clustering using K-Means Algorithm
IRJET- Text Document Clustering using K-Means Algorithm
 
Query formulation process
Query formulation processQuery formulation process
Query formulation process
 
Search Interface Feature Evaluation
Search Interface Feature EvaluationSearch Interface Feature Evaluation
Search Interface Feature Evaluation
 
Search Interface Feature Evaluation in Biosciences
Search Interface Feature Evaluation in BiosciencesSearch Interface Feature Evaluation in Biosciences
Search Interface Feature Evaluation in Biosciences
 
C017510717
C017510717C017510717
C017510717
 
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTIONTEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search Results
 
Semantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with IdiomsSemantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with Idioms
 
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
 
A Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie ReviewsA Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie Reviews
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
 
Feature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documentsFeature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documents
 
The comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search enginesThe comparative study of information retrieval models used in search engines
The comparative study of information retrieval models used in search engines
 
An Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text ClassificationAn Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text Classification
 
Query formulation (chapter 1)
Query formulation (chapter 1)Query formulation (chapter 1)
Query formulation (chapter 1)
 
Ethnograph 11 Jul07
Ethnograph 11 Jul07Ethnograph 11 Jul07
Ethnograph 11 Jul07
 
Ethnograph 10 Jul07
Ethnograph 10 Jul07Ethnograph 10 Jul07
Ethnograph 10 Jul07
 
An Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search TechniqueAn Advanced IR System of Relational Keyword Search Technique
An Advanced IR System of Relational Keyword Search Technique
 

Viewers also liked (20)

LGA Syllabus Spring2002
LGA Syllabus Spring2002LGA Syllabus Spring2002
LGA Syllabus Spring2002
 
Lehman Syllabus
Lehman SyllabusLehman Syllabus
Lehman Syllabus
 
Equidad de genero 1
Equidad de genero 1Equidad de genero 1
Equidad de genero 1
 
id.net APIs for Hosts
id.net APIs for Hostsid.net APIs for Hosts
id.net APIs for Hosts
 
flowcharts
flowchartsflowcharts
flowcharts
 
Slide DEF
Slide DEFSlide DEF
Slide DEF
 
DVT-Manual
DVT-ManualDVT-Manual
DVT-Manual
 
INCHI 2016
INCHI 2016INCHI 2016
INCHI 2016
 
Presentation4
Presentation4Presentation4
Presentation4
 
NACCL-Requirements-Specification-Template
NACCL-Requirements-Specification-TemplateNACCL-Requirements-Specification-Template
NACCL-Requirements-Specification-Template
 
id.net for game developers
id.net for game developersid.net for game developers
id.net for game developers
 
LGA Syllabus Spring2003
LGA Syllabus Spring2003LGA Syllabus Spring2003
LGA Syllabus Spring2003
 
Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)
Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)
Nova Weng - %22Early Breast Cancer Awareness & Prevention in China%22 (2016)
 
LGA Syllabus Fall2001
LGA Syllabus Fall2001LGA Syllabus Fall2001
LGA Syllabus Fall2001
 
Practica.n9
Practica.n9Practica.n9
Practica.n9
 
Tarjeta
TarjetaTarjeta
Tarjeta
 
como crear un triptico
como crear un tripticocomo crear un triptico
como crear un triptico
 
Ford-Truck-Whats-New-2001
Ford-Truck-Whats-New-2001Ford-Truck-Whats-New-2001
Ford-Truck-Whats-New-2001
 
Publisher
PublisherPublisher
Publisher
 
De la cruz valencia cristian
De la cruz valencia cristianDe la cruz valencia cristian
De la cruz valencia cristian
 

Similar to Temporal Latent Topic Profiles Improve Search Personalization

IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Supporting Exploratory People Search: A Study of Factor Transparency and User...
Supporting Exploratory People Search: A Study of Factor Transparency and User...Supporting Exploratory People Search: A Study of Factor Transparency and User...
Supporting Exploratory People Search: A Study of Factor Transparency and User...Shuguang Han
 
Delving deep into personal photo and video search
Delving deep into personal photo and video searchDelving deep into personal photo and video search
Delving deep into personal photo and video searchJason Tang
 
Structure, Personalization, Scale: A Deep Dive into LinkedIn Search
Structure, Personalization, Scale: A Deep Dive into LinkedIn SearchStructure, Personalization, Scale: A Deep Dive into LinkedIn Search
Structure, Personalization, Scale: A Deep Dive into LinkedIn SearchC4Media
 
WSDM 2011 - Nicolaas Matthijs and Filip Radlinski
WSDM 2011 - Nicolaas Matthijs and Filip RadlinskiWSDM 2011 - Nicolaas Matthijs and Filip Radlinski
WSDM 2011 - Nicolaas Matthijs and Filip RadlinskiNicolaas Matthijs
 
Personalization of the Web Search
Personalization of the Web SearchPersonalization of the Web Search
Personalization of the Web SearchIJMER
 
Personalization of the Web Search
Personalization of the Web SearchPersonalization of the Web Search
Personalization of the Web SearchIJMER
 
Study and Implementation of a Personalized Mobile Search Engine for Secure Se...
Study and Implementation of a Personalized Mobile Search Engine for Secure Se...Study and Implementation of a Personalized Mobile Search Engine for Secure Se...
Study and Implementation of a Personalized Mobile Search Engine for Secure Se...IRJET Journal
 
User behavior model & recommendation on basis of social networks
User behavior model & recommendation on basis of social networks User behavior model & recommendation on basis of social networks
User behavior model & recommendation on basis of social networks Shah Alam Sabuj
 
Deep Recommender Systems - PAPIs.io LATAM 2018
Deep Recommender Systems - PAPIs.io LATAM 2018Deep Recommender Systems - PAPIs.io LATAM 2018
Deep Recommender Systems - PAPIs.io LATAM 2018Gabriel Moreira
 
SUPPORTING PRIVACY PROTECTION IN PERSONALIZED WEB SEARCH
SUPPORTING PRIVACY PROTECTION IN PERSONALIZED WEB SEARCHSUPPORTING PRIVACY PROTECTION IN PERSONALIZED WEB SEARCH
SUPPORTING PRIVACY PROTECTION IN PERSONALIZED WEB SEARCHnikhil421080
 
Optimizing Search User Interfaces and Interactions within Professional Social...
Optimizing Search User Interfaces and Interactions within Professional Social...Optimizing Search User Interfaces and Interactions within Professional Social...
Optimizing Search User Interfaces and Interactions within Professional Social...Nik Spirin
 
Enterprise Search Using SharePoint 2010 and FAST
Enterprise Search Using SharePoint 2010 and FASTEnterprise Search Using SharePoint 2010 and FAST
Enterprise Search Using SharePoint 2010 and FASTBert Johnson
 
Personalized Search at Sandia National Labs
Personalized Search at Sandia National LabsPersonalized Search at Sandia National Labs
Personalized Search at Sandia National LabsLucidworks
 
IDS Search Presentation to CUNY Libraries
IDS Search Presentation to CUNY LibrariesIDS Search Presentation to CUNY Libraries
IDS Search Presentation to CUNY Librariesshannon-pritting
 
Considering users' behaviours in improving the responses of an informacion base
Considering users' behaviours in improving the responses of an informacion baseConsidering users' behaviours in improving the responses of an informacion base
Considering users' behaviours in improving the responses of an informacion baseinscit2006
 
4 postsRe Topic 2 DQ 1Qualitative research produces a v.docx
4 postsRe Topic 2 DQ 1Qualitative research produces a v.docx4 postsRe Topic 2 DQ 1Qualitative research produces a v.docx
4 postsRe Topic 2 DQ 1Qualitative research produces a v.docxmeghanivkwserie
 

Similar to Temporal Latent Topic Profiles Improve Search Personalization (20)

G017415465
G017415465G017415465
G017415465
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Ac02411221125
Ac02411221125Ac02411221125
Ac02411221125
 
Supporting Exploratory People Search: A Study of Factor Transparency and User...
Supporting Exploratory People Search: A Study of Factor Transparency and User...Supporting Exploratory People Search: A Study of Factor Transparency and User...
Supporting Exploratory People Search: A Study of Factor Transparency and User...
 
Delving deep into personal photo and video search
Delving deep into personal photo and video searchDelving deep into personal photo and video search
Delving deep into personal photo and video search
 
Structure, Personalization, Scale: A Deep Dive into LinkedIn Search
Structure, Personalization, Scale: A Deep Dive into LinkedIn SearchStructure, Personalization, Scale: A Deep Dive into LinkedIn Search
Structure, Personalization, Scale: A Deep Dive into LinkedIn Search
 
WSDM 2011 - Nicolaas Matthijs and Filip Radlinski
WSDM 2011 - Nicolaas Matthijs and Filip RadlinskiWSDM 2011 - Nicolaas Matthijs and Filip Radlinski
WSDM 2011 - Nicolaas Matthijs and Filip Radlinski
 
Naresh sharma
Naresh sharmaNaresh sharma
Naresh sharma
 
Personalization of the Web Search
Personalization of the Web SearchPersonalization of the Web Search
Personalization of the Web Search
 
Personalization of the Web Search
Personalization of the Web SearchPersonalization of the Web Search
Personalization of the Web Search
 
Study and Implementation of a Personalized Mobile Search Engine for Secure Se...
Study and Implementation of a Personalized Mobile Search Engine for Secure Se...Study and Implementation of a Personalized Mobile Search Engine for Secure Se...
Study and Implementation of a Personalized Mobile Search Engine for Secure Se...
 
User behavior model & recommendation on basis of social networks
User behavior model & recommendation on basis of social networks User behavior model & recommendation on basis of social networks
User behavior model & recommendation on basis of social networks
 
Deep Recommender Systems - PAPIs.io LATAM 2018
Deep Recommender Systems - PAPIs.io LATAM 2018Deep Recommender Systems - PAPIs.io LATAM 2018
Deep Recommender Systems - PAPIs.io LATAM 2018
 
SUPPORTING PRIVACY PROTECTION IN PERSONALIZED WEB SEARCH
SUPPORTING PRIVACY PROTECTION IN PERSONALIZED WEB SEARCHSUPPORTING PRIVACY PROTECTION IN PERSONALIZED WEB SEARCH
SUPPORTING PRIVACY PROTECTION IN PERSONALIZED WEB SEARCH
 
Optimizing Search User Interfaces and Interactions within Professional Social...
Optimizing Search User Interfaces and Interactions within Professional Social...Optimizing Search User Interfaces and Interactions within Professional Social...
Optimizing Search User Interfaces and Interactions within Professional Social...
 
Enterprise Search Using SharePoint 2010 and FAST
Enterprise Search Using SharePoint 2010 and FASTEnterprise Search Using SharePoint 2010 and FAST
Enterprise Search Using SharePoint 2010 and FAST
 
Personalized Search at Sandia National Labs
Personalized Search at Sandia National LabsPersonalized Search at Sandia National Labs
Personalized Search at Sandia National Labs
 
IDS Search Presentation to CUNY Libraries
IDS Search Presentation to CUNY LibrariesIDS Search Presentation to CUNY Libraries
IDS Search Presentation to CUNY Libraries
 
Considering users' behaviours in improving the responses of an informacion base
Considering users' behaviours in improving the responses of an informacion baseConsidering users' behaviours in improving the responses of an informacion base
Considering users' behaviours in improving the responses of an informacion base
 
4 postsRe Topic 2 DQ 1Qualitative research produces a v.docx
4 postsRe Topic 2 DQ 1Qualitative research produces a v.docx4 postsRe Topic 2 DQ 1Qualitative research produces a v.docx
4 postsRe Topic 2 DQ 1Qualitative research produces a v.docx
 

Recently uploaded

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 

Recently uploaded (20)

E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 

Temporal Latent Topic Profiles Improve Search Personalization

  • 1. Thanh Vu, Alistair Willis, Dawei Song The Open University, UK Temporal Latent Topic User Profiles for Search Personalisation Son N. Tran City London University The 37th European Conference on Information Retrieval st
  • 2. Search Personalisation 2  Return search results based on  The input query  The user searching interests Different users submit the same input query will probably get different search result lists Even an individual user will get different search results at different search times (e.g., Open US) Temporal Latent Topic User Profiles for Search Personalisation
  • 3. The performance of search personalisation depends on the richness of a user profile J. Teevan, M. R. Morris, and S. Bush. Discovering and using groups to improve personalized search. In WSDM’2009 3 Temporal Latent Topic User Profiles for Search Personalisation
  • 4. Topic-based user profiles 4  Use Human generated ontology (ODP – dmoz.org) to extract topics from all clicked/relevant documents of a specific user to build her profile 1. R. W. White, et al., Enhancing Personalized Search by Mining and Modeling Task Behavior. In WWW’2013 2. P. N. Bennett, et al., Modeling the impact of short- and long-term behavior on search personalization. In SIGIR’2012 Temporal Latent Topic User Profiles for Search Personalisation
  • 5. Challenges for Human Generated Ontology 5  New topics which are not covered in the Ontology will possibly emerge overtime  Expensive human effort to classify/maintain each document into correct categories Temporal Latent Topic User Profiles for Search Personalisation
  • 6. Challenges for Time-awareness 6  Previous methods use all the clicked/relevant documents of a user to build her searching profile  The documents are treated equally without considering temporal features (i.e., the time of documents being clicked and viewed) The profile is too broad Cannot fully express the current interest of the user 1. T. T. Vu, et al., Improving search personalisation with dynamic group formation. In SIGIR’2014 2. K. Raman, et al., Toward whole-session relevance: Exploring intrinsic diversity in web search. In SIGIR’2013 Temporal Latent Topic User Profiles for Search Personalisation
  • 7. Research Questions 7 1. How can we build user profiles with time- awareness? 2. Do the time-aware profiles help improve search performance? Temporal Latent Topic User Profiles for Search Personalisation
  • 8. Applying Latent Dirichlet Allocation 8 Temporal Latent Topic User Profiles for Search Personalisation
  • 9. Building temporal latent topic user profiles (1) 9  Non-temporal method Temporal Latent Topic User Profiles for Search Personalisation 4th 1st2nd3rd Football Law Health OS 0.51 0.33 0.11 0.05 Clicked documents Football Law OS Health 0.55 0.27 0.10 0.08 Law OS Health Football 0.41 0.37 0.12 0.10 OS Law Football Health 0.65 0.21 0.10 0.04 Distribution over topics Football Law OS Health 0.32 0.30 0.29 0.09 Means over topics The topic-based user profile
  • 10. Building temporal latent topic user profiles (2) 10  Our method Temporal Latent Topic User Profiles for Search Personalisation 1st Football Law Health OS 0.51 0.33 0.11 0.05 Football Law Health OS 0.51 0.33 0.11 0.05 The temporal topic user profile 0.90
  • 11. Football Law Health OS 0.53 0.30 0.09 0.08 Building temporal latent topic user profiles (2) 11 Temporal Latent Topic User Profiles for Search Personalisation 2nd 1st Football Law Health OS 0.51 0.33 0.11 0.05 Football Law OS Health 0.55 0.27 0.10 0.08 The temporal topic user profile 0.91 0.90
  • 12. Football Law OS Health 0.37 0.34 0.19 0.10 0.910.92 Building temporal latent topic user profiles (2) 12 Temporal Latent Topic User Profiles for Search Personalisation 3rd 1st2nd Football Law Health OS 0.51 0.33 0.11 0.05 Football Health OS Law 0.55 0.27 0.10 0.08 Law OS Health Football 0.41 0.37 0.12 0.10 The temporal topic user profile 0.90
  • 13. OS Law Football Health 0.32 0.30 0.29 0.09 Building temporal latent topic user profiles (2) 13 Temporal Latent Topic User Profiles for Search Personalisation 4th 1st2nd3rd Football Law Health OS 0.51 0.33 0.11 0.05 Football Health OS Law 0.55 0.27 0.10 0.08 Law OS Health Football 0.41 0.37 0.12 0.10 OS Law Football Health 0.65 0.21 0.10 0.04 Temporal topic profile 0.93 0.92 0.91 0.90 Football Law OS Health 0.32 0.30 0.29 0.09 Non-temporal topic profile
  • 14. Building temporal latent topic user profiles (3) Temporal Latent Topic User Profiles for Search Personalisation14  Du = {d1, d2, …, dn} is a relevant document set of the user u  The user profile of u is a distribution over the topic Z (extracted by LDA)  tdi = n indicates that di is the nth most relevant/clicked document of u  α is the decay parameter; K is the normalisation factor
  • 15. Building temporal latent topic user profiles (4) 15  Long-term user profile  Use relevant documents extracted from the user’s whole search history  Daily user profile  Use relevant documents extracted from the search history of the user in the current searching day  Session user profile  Use relevant documents extracted from the search history of the user in the current search session Temporal Latent Topic User Profiles for Search Personalisation
  • 16. Re-ranking search results (1) 16 Temporal Latent Topic User Profiles for Search Personalisation 1 32 Health Law Football OS 0.51 0.33 0.11 0.05 Football Law Health OS 0.55 0.27 0.13 0.05 Football OS Health Law 0.41 0.37 0.12 0.10 Original Rank 132 Health Law Football OS 0.51 0.33 0.11 0.05 Football Law Health OS 0.55 0.27 0.13 0.05 Football OS Health Law 0.41 0.37 0.12 0.10 After re-ranking Football Law OS Health 0.47 0.24 0.16 0.12 The user profile (p)
  • 17. Re-ranking search results (2) 17  Personalised scores  Use Jensen-Shannon divergence (DJS[d||p] ) Temporal Latent Topic User Profiles for Search Personalisation 1 32 Health Law Football OS 0.51 0.33 0.11 0.05 Football Law Health OS 0.55 0.27 0.13 0.05 Football OS Health Law 0.41 0.37 0.12 0.10 Football Law OS Health 0.47 0.24 0.16 0.12 Returned documents (d) The user profile (p)
  • 18. Re-ranking search results (3) 18  Re-ranking Features  Re-Ranking Algorithm: LambdaMART[1] 1. C. J. Burges, et al., Learning to rank with non-smooth cost functions. In NIPS’2007. Feature Description Personalised Features LongTermScore Personalised score between document and long-term profile DailyScore Personalised score between document and daily profile SessionScore Personalised score between document and session profile Non-personalised Features DocRank Rank of document on original returned list QuerySim Cosine similarity score between current and previous queries QueryNo Total number of queries that have been submitted in the current search session (included the current query)
  • 19. Evaluation 19 Dataset  The query logs of 1166 anonymous users in four weeks, from 01st to 28th July 2012  A log entity consists of an anonymous user identifier, a query, top-10 returned URLs, and clicked documents along with the user’s dwell time  Download all the URLs’ content for learning topics  A search session is demarcated by 30 minutes of user inactivity  A relevant document is a click with dwell time of at least 30 seconds or the last click in a session (SAT click)Temporal Latent Topic User Profiles for Search Personalisation
  • 20. Evaluation methodology 20  Assign a positive (relevant) label to a returned URL if  it is a SAT click in the current query  it is a SAT click in one of the other repeated queries in the same search session  Assign negative (irrelevant) labels to the rest of URLs Temporal Latent Topic User Profiles for Search Personalisation
  • 21. Personalisation Methods and Baselines 21  Personalisation Methods  LON uses only LongTermScore from long-term profile  DAI uses only DailyScore from daily profile  SES uses SessionScore from session profile  ALL uses all personalised scores from three profiles (ALL)  Baselines  Default is the default ranking returned by the search engine  Static uses the LongTermScore from long-term profile without time-awareness (i.e., not using decay function) Temporal Latent Topic User Profiles for Search Personalisation
  • 22. Results 22  Evaluation metrics  Mean Average Precision (MAP)  Precision (P@k)  Mean Reciprocal Rank (MRR)  Normalized Discounted Cumulative Gain (nDCG@k)  For each evaluation metric, the higher value indicates the better ranking Temporal Latent Topic User Profiles for Search Personalisation
  • 23. Overall Performance 23 Temporal Latent Topic User Profiles for Search Personalisation • All the improvements over the baselines are all significant with paired t-test of p < 0.001
  • 24. • Three temporal profiles help to improve search performance over default ranking and the use of non-temporal profile Conclusions (1) 24 Temporal Latent Topic User Profiles for Search Personalisation
  • 25. • Using all features (ALL) achieves the highest performance Conclusions (2) 25 Temporal Latent Topic User Profiles for Search Personalisation
  • 26. Conclusions (3) 26 Temporal Latent Topic User Profiles for Search Personalisation • The session profile achieves better performance than the daily profile • The daily profile gains advantages over the long-term profile
  • 27. Conclusions (4) 27 Temporal Latent Topic User Profiles for Search Personalisation • Without time-awareness, the long-term profile gets no improvement over the default ranking
  • 28. Summary Temporal Latent Topic User Profiles for Search Personalisation28  Build long-term, daily and session profiles with time-awareness using topics extracted automatically from relevant documents in different time scales  Use the three profiles to re-rank search results returned by Bing and show the significant improvement in search performances
  • 29. Thank you! Any questions? 29 Temporal Latent Topic User Profiles for Search Personalisation
  • 30. Dataset (2) 30 Temporal Latent Topic User Profiles for Search Personalisation
  • 31. Example of query logs 31 Temporal Latent Topic User Profiles for Search Personalisation
  • 32. Click Entropies 32  P(d|q) is the percentage of the clicks on document d among all the clicks for q  A smaller query click entropy value indicates more agreement between users on clicking a small number of web pages Temporal Latent Topic User Profiles for Search Personalisation
  • 33. Click entropies 33 Temporal Latent Topic User Profiles for Search Personalisation
  • 34. Query Positions in Search Session 34  Aim to study whether the position of a query has any effect on the performance of the temporal latent topic profiles  Label the queries by their positions during the search Temporal Latent Topic User Profiles for Search Personalisation
  • 35. Temporal Latent Topic User Profiles for Search Personalisation35 Footbal l Law Health OS 0.51 0.33 0.11 0.05 Clicked documents Footbal l Health OS Law 0.55 0.27 0.13 0.05 Law OS Health Footbal l 0.41 0.37 0.12 0.10 OS Law Footbal l Health 0.65 0.15 0.11 0.09 Distribution over topics Footbal l Law OS Health 0.32 0.29 0.28 0.11 Means over topics The topic-based user profile
  • 36. Re-ranking search results (1) Temporal Latent Topic User Profiles for Search Personalisation36 Query: MU
  • 37. Pre-processing 37  Remove the queries whose positive label set is empty from the dataset  Discard the domain-related queries (e.g., Facebook, Youtube) Temporal Latent Topic User Profiles for Search Personalisation

Editor's Notes

  1. Use the rank positions of the positive label as the ground truth to evaluate the search performance before and after re-ranking
  2. The session profile (SES) achieves better performance than the daily profile (DAI). It also shows that the daily profile (DAI) gains advantage over the long-term profile (LON). This indicates that the short-term profiles capture more details of user interest than the longer ones. The combination of all features (ALL) achieves the highest performance.
  3. The session profile (SES) achieves better performance than the daily profile (DAI). It also shows that the daily profile (DAI) gains advantage over the long-term profile (LON). This indicates that the short-term profiles capture more details of user interest than the longer ones. The combination of all features (ALL) achieves the highest performance.
  4. The session profile (SES) achieves better performance than the daily profile (DAI). It also shows that the daily profile (DAI) gains advantage over the long-term profile (LON). This indicates that the short-term profiles capture more details of user interest than the longer ones. The combination of all features (ALL) achieves the highest performance.
  5. The session profile (SES) achieves better performance than the daily profile (DAI). It also shows that the daily profile (DAI) gains advantage over the long-term profile (LON). This indicates that the short-term profiles capture more details of user interest than the longer ones. The combination of all features (ALL) achieves the highest performance.
  6. The session profile (SES) achieves better performance than the daily profile (DAI). It also shows that the daily profile (DAI) gains advantage over the long-term profile (LON). This indicates that the short-term profiles capture more details of user interest than the longer ones. The combination of all features (ALL) achieves the highest performance.
  7. Show the improvement of the temporal profiles over the Default baseline using MAP metric for different magnitudes of click entropy we show the improvement of the temporal profiles over the Default ranking from the search engine in term of MAP metric for different magnitudes of click entropy. Here the statistical significance is also guaranteed with the use of paired t-test (p < 0:001). With smaller value of click entropy, the re-ranking performance is only slightly improved. For example, with click entropy between 0 and 0.5, the improvement of the MAP metric from long-term profile is of only 0.39%, in comparison with the original search engine. One may see that the effectiveness of the temporal pro les is increasing proportionally according to the value of click entropy. The highest improvements are achieved when click entropies are >= 2
  8. A query usually has a broader influence in a search session than only returning a list of URLs. The position of a query in a search session is also important because it may be fine-tuned by a user after the unsatisfactory results from previous queries. In this experiment we aim to study whether the position of a query has any effect on the performance of the temporal latent topic profiles. For each session, we label the queries by their positions during the search