SlideShare a Scribd company logo
Modeling and Detecting
Changes in User Satisfaction
Julia Kiseleva*, Eric Crestan, Riccardo Brigo, Roland Dittel
*Eindhoven University of Technology
Microsoft Bing
Want to go to CIKM
conference
QUERY SERP
What is User Satisfaction?
What is User Satisfaction?
QUERY SERP
,
What is User Satisfaction?
QUERY SERP
,
What is User Satisfaction?
QUERY SERP
,Pr (Ref.)
Assumption: If a “significant” amount of users
reformulate a query with a particular SERP it is an
indication of changing in user preferences
World May Change User Preferences
QUERY SERP
,
QUERY SERP
ti
ti+1
,
Timeline
Pr ti =
Pr ti+1 =
How Can We Detect the Changes?
QUERY SERP
,
QUERY SERP
ti
ti+1
,
| Pr ti - Pr ti+1 |
Timeline
Pr ti =
Pr ti+1 =
How Can We Detect the Changes?
• There are many definitions in the literature
• We use the query expansion
o new years wallpaper IS REFORMULATED WITH 2014
o medals Olympics IS REFORMULATED WITH 2014
o ct 40ez IS REFORMULATED WITH 2013
o march 31 holiday IS REFORMULATED WITH 2014
o …
Detecting Query Reformulation
An Example of the Drift in
Reformulation Signal
The Explanation of the Drift
Before November 2013 After November 2013
The Question:
“How to detect
this kind of
changes?”
• Change detection techniques
o In dynamically changing and non-stationary environments, the data distribution can
change over time yielding the phenomenon of concept drift
o The real concept drift refers to changes in the conditional distribution of the output
(i.e., target variable) given the input (input features)
• Concept drift types:
Change Detection Techniques
• Change detection techniques
o In dynamically changing and non-stationary environments, the data distribution can change over time
yielding the phenomenon of concept drift
o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable)
given the input (input features)
• Concept drift types:
Time
Datamean
Sudden/abrupt
Disambiguation
such as
“flawless Beyoncé”
Change Detection Techniques
• Change detection techniques
o In dynamically changing and non-stationary environments, the data distribution can change over time
yielding the phenomenon of concept drift
o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable)
given the input (input features)
• Concept drift types:
Time
Datamean
Incremental
Disambiguation
such as
“cikm conference
2014”
Change Detection Techniques
• Change detection techniques
o In dynamically changing and non-stationary environments, the data distribution can change over time
yielding the phenomenon of concept drift
o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable)
given the input (input features)
• Concept drift types:
Time
Datamean
Gradual
Breaking news
such as
“idaho bus crash
investigation”
Change Detection Techniques
• Change detection techniques
o In dynamically changing and non-stationary environments, the data distribution can change over time
yielding the phenomenon of concept drift
o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable)
given the input (input features)
• Concept drift types:
Time
Datamean
Reoccurring
Seasonal change
such as
“black Friday 2014”
Change Detection Techniques
• Change detection techniques
o In dynamically changing and non-stationary environments, the data distribution can change over time
yielding the phenomenon of concept drift
o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable)
given the input (input features)
• Concept drift types:
Time
Datamean
Change Detection Techniques
• Change detection techniques
o In dynamically changing and non-stationary environments, the data distribution can
change over time yielding the phenomenon of concept drift
o The real concept drift refers to changes in the conditional distribution of the output
(i.e., target variable) given the input (input features)
• Concept drift types:
Time
Datamean
Sudden/abru
pt
Incremental Gradual
Reoccurring
concepts
Outlier
(not concept drift)
Disambiguation
such as
“medal olympics
2014”
Seasonal change
such as
“black Friday
2014”
Breaking news
such as
“idaho bus crash
investigation”
Disambiguation
such as
“cikm conference
2014”
Change Detection Techniques
Detecting Drifts in
Reformulation Signal
Query: “cikm conference”
0.1
TimeLinet0
0.1 0.2 0.2 0.3
Reformulation: “2014”
Window W0
ti
Detecting Drifts in
Reformulation Signal
Query: “cikm conference”
0.1
TimeLinet0 ti+ t
0.1 0.2 0.2 0.3 0.7 0.8 0.8
Reformulation: “2014”
Window W0 Window W1
ti
E(W0) E(W1)
Size of Window W1 = n1Size of Window W0 = n0
The
upcoming
conference
event
If |E(W1) - E(W2)|> eout
Then Drift Detected
Calculating Threshold eout
Confidence
Variance at W = W0 U W1
m = 1/(1/n0 + 1/n1)
eout
Learn
reformulation
model M
User Behavior
Logs
t0 Timelineti+
Learn
reformulation
model M
User Behavior
Logs
t0
Incoming User
Behavior logs
Timeline
Detect changes in model M
If change detected
else Do Nothing
ti ti+ t
Learn
reformulation
model M
User Behavior
Logs
ti
Incoming User
Behavior logs
Timeline
Detect changes in model M
If change detected
else Do Nothing
ti+w1 ti+w1+w2
Alarm:
Change of user
satisfaction
detected
for pairs :
{<Qi,
SERPi>}1<i<n
Learn
reformulation
model M
User Behavior
Logs
t0
Incoming User
Behavior Logs
Timeline
Detect changes in model M
If change detected
else Do Nothing
ti ti+ t
1) List of reformulation terms
per query
2) List of URLs per
reformulation
Alarm:
Change of user
satisfaction
detected
for pairs :
{<Qi,
SERPi>}1<i<n
o Dataset consists of 6 months
of the behavioral log data
from a commercial search
engine
o The training window size is
one month
o The test window size is two
weeks
Experimentation
Evaluation
Results
o We successfully leveraged the concept drift detection
techniques to detect changes in user satisfaction
o The proposed technique works in unsupervised way
o Large scale evaluation has been performed
o Classification of the drift type is needed
o Prediction of the lifetime of the drift would help
Conclusion and Future Work
Questions?
Questions?
o We successfully leveraged the concept drift detection
techniques
o The proposed technique works in unsupervised way
o Large scale evaluation has been performed
o Classification of the drift type is needed
o Prediction of the lifetime of the drift would help
Conclusion and Future Work

More Related Content

Similar to Modelling and Detecting Changes in User Satisfaction

Search Accuracy Metrics and Predictive Analytics - A Big Data Use Case: Prese...
Search Accuracy Metrics and Predictive Analytics - A Big Data Use Case: Prese...Search Accuracy Metrics and Predictive Analytics - A Big Data Use Case: Prese...
Search Accuracy Metrics and Predictive Analytics - A Big Data Use Case: Prese...
Lucidworks
 

Similar to Modelling and Detecting Changes in User Satisfaction (20)

New Frontiers of Lean Practice
New Frontiers of Lean PracticeNew Frontiers of Lean Practice
New Frontiers of Lean Practice
 
Natural Intelligence the human factor in AI
Natural Intelligence the human factor in AINatural Intelligence the human factor in AI
Natural Intelligence the human factor in AI
 
How to Measure Important Data by fmr Booking.com Product Owner
How to Measure Important Data by fmr Booking.com Product OwnerHow to Measure Important Data by fmr Booking.com Product Owner
How to Measure Important Data by fmr Booking.com Product Owner
 
IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079
 
Monoliths, Migrations, and Microservices
Monoliths, Migrations, and MicroservicesMonoliths, Migrations, and Microservices
Monoliths, Migrations, and Microservices
 
Seminaire bigdata23102014
Seminaire bigdata23102014Seminaire bigdata23102014
Seminaire bigdata23102014
 
Contextualizing the Visualization of Climate Data
Contextualizing the Visualization of Climate DataContextualizing the Visualization of Climate Data
Contextualizing the Visualization of Climate Data
 
Walmart Big Data Expo
Walmart Big Data ExpoWalmart Big Data Expo
Walmart Big Data Expo
 
UCD WST February 20 2020
UCD WST February 20 2020UCD WST February 20 2020
UCD WST February 20 2020
 
I scope noisemodel
I scope noisemodelI scope noisemodel
I scope noisemodel
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Hadoop PDF
Hadoop PDFHadoop PDF
Hadoop PDF
 
Objectives of Production Management : INTRO TO CAPM
Objectives of Production Management : INTRO TO CAPMObjectives of Production Management : INTRO TO CAPM
Objectives of Production Management : INTRO TO CAPM
 
The Real-Time Communications Platform for the Internet of Things
The Real-Time Communications Platform for the Internet of ThingsThe Real-Time Communications Platform for the Internet of Things
The Real-Time Communications Platform for the Internet of Things
 
Search Accuracy Metrics and Predictive Analytics - A Big Data Use Case: Prese...
Search Accuracy Metrics and Predictive Analytics - A Big Data Use Case: Prese...Search Accuracy Metrics and Predictive Analytics - A Big Data Use Case: Prese...
Search Accuracy Metrics and Predictive Analytics - A Big Data Use Case: Prese...
 
Balancing Infrastructure with Optimization and Problem Formulation
Balancing Infrastructure with Optimization and Problem FormulationBalancing Infrastructure with Optimization and Problem Formulation
Balancing Infrastructure with Optimization and Problem Formulation
 
Improving the TV User Experience by Algorithms: Personalized Content Recommen...
Improving the TV User Experience by Algorithms: Personalized Content Recommen...Improving the TV User Experience by Algorithms: Personalized Content Recommen...
Improving the TV User Experience by Algorithms: Personalized Content Recommen...
 
Big data
Big dataBig data
Big data
 
sCorrecting for country skew: How APNIC adjusts for sample bias in the counts
sCorrecting for country skew: How APNIC adjusts for sample bias in the countssCorrecting for country skew: How APNIC adjusts for sample bias in the counts
sCorrecting for country skew: How APNIC adjusts for sample bias in the counts
 

More from Julia Kiseleva

Understanding User Satisfaction with Intelligent Assistants
Understanding User Satisfaction with Intelligent AssistantsUnderstanding User Satisfaction with Intelligent Assistants
Understanding User Satisfaction with Intelligent Assistants
Julia Kiseleva
 
Discovering Temporal Hidden Contexts in Web Sessions for User Trail Prediction
Discovering Temporal Hidden Contexts in Web Sessions for User Trail PredictionDiscovering Temporal Hidden Contexts in Web Sessions for User Trail Prediction
Discovering Temporal Hidden Contexts in Web Sessions for User Trail Prediction
Julia Kiseleva
 
Context Mining and Integration in Web Predictive Analytics
Context Mining and Integration in Web Predictive AnalyticsContext Mining and Integration in Web Predictive Analytics
Context Mining and Integration in Web Predictive Analytics
Julia Kiseleva
 

More from Julia Kiseleva (11)

Evaluation Personal Assistants
Evaluation Personal AssistantsEvaluation Personal Assistants
Evaluation Personal Assistants
 
From queries to dialogues
From queries to dialoguesFrom queries to dialogues
From queries to dialogues
 
Using Contextual Information to Understand Searching and Browsing Behavior
Using Contextual Information to Understand Searching and Browsing BehaviorUsing Contextual Information to Understand Searching and Browsing Behavior
Using Contextual Information to Understand Searching and Browsing Behavior
 
Understanding and Predicting User Satisfaction with Intelligent Assistants
Understanding and Predicting User Satisfaction with Intelligent AssistantsUnderstanding and Predicting User Satisfaction with Intelligent Assistants
Understanding and Predicting User Satisfaction with Intelligent Assistants
 
Understanding User Satisfaction with Intelligent Assistants
Understanding User Satisfaction with Intelligent AssistantsUnderstanding User Satisfaction with Intelligent Assistants
Understanding User Satisfaction with Intelligent Assistants
 
Using Contextual Information to Understand Searching and Browsing Behavior
Using Contextual Information to Understand Searching and Browsing BehaviorUsing Contextual Information to Understand Searching and Browsing Behavior
Using Contextual Information to Understand Searching and Browsing Behavior
 
Where to Go on Your Next Trip? Optimizing Travel Destinations Based on User P...
Where to Go on Your Next Trip? Optimizing Travel Destinations Based on User P...Where to Go on Your Next Trip? Optimizing Travel Destinations Based on User P...
Where to Go on Your Next Trip? Optimizing Travel Destinations Based on User P...
 
Predicting Current User Intent with Contextual Markov Models
Predicting Current User Intent with Contextual Markov ModelsPredicting Current User Intent with Contextual Markov Models
Predicting Current User Intent with Contextual Markov Models
 
The talk at Twente University on 28 July 2014
The talk at Twente University on 28 July 2014 The talk at Twente University on 28 July 2014
The talk at Twente University on 28 July 2014
 
Discovering Temporal Hidden Contexts in Web Sessions for User Trail Prediction
Discovering Temporal Hidden Contexts in Web Sessions for User Trail PredictionDiscovering Temporal Hidden Contexts in Web Sessions for User Trail Prediction
Discovering Temporal Hidden Contexts in Web Sessions for User Trail Prediction
 
Context Mining and Integration in Web Predictive Analytics
Context Mining and Integration in Web Predictive AnalyticsContext Mining and Integration in Web Predictive Analytics
Context Mining and Integration in Web Predictive Analytics
 

Recently uploaded

一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
MAQIB18
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
DilipVasan
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 

Recently uploaded (20)

Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 

Modelling and Detecting Changes in User Satisfaction

  • 1. Modeling and Detecting Changes in User Satisfaction Julia Kiseleva*, Eric Crestan, Riccardo Brigo, Roland Dittel *Eindhoven University of Technology Microsoft Bing
  • 2. Want to go to CIKM conference QUERY SERP What is User Satisfaction?
  • 3. What is User Satisfaction? QUERY SERP ,
  • 4. What is User Satisfaction? QUERY SERP ,
  • 5. What is User Satisfaction? QUERY SERP ,Pr (Ref.) Assumption: If a “significant” amount of users reformulate a query with a particular SERP it is an indication of changing in user preferences
  • 6. World May Change User Preferences
  • 7. QUERY SERP , QUERY SERP ti ti+1 , Timeline Pr ti = Pr ti+1 = How Can We Detect the Changes?
  • 8. QUERY SERP , QUERY SERP ti ti+1 , | Pr ti - Pr ti+1 | Timeline Pr ti = Pr ti+1 = How Can We Detect the Changes?
  • 9. • There are many definitions in the literature • We use the query expansion o new years wallpaper IS REFORMULATED WITH 2014 o medals Olympics IS REFORMULATED WITH 2014 o ct 40ez IS REFORMULATED WITH 2013 o march 31 holiday IS REFORMULATED WITH 2014 o … Detecting Query Reformulation
  • 10. An Example of the Drift in Reformulation Signal
  • 11. The Explanation of the Drift Before November 2013 After November 2013 The Question: “How to detect this kind of changes?”
  • 12. • Change detection techniques o In dynamically changing and non-stationary environments, the data distribution can change over time yielding the phenomenon of concept drift o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable) given the input (input features) • Concept drift types: Change Detection Techniques
  • 13. • Change detection techniques o In dynamically changing and non-stationary environments, the data distribution can change over time yielding the phenomenon of concept drift o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable) given the input (input features) • Concept drift types: Time Datamean Sudden/abrupt Disambiguation such as “flawless Beyoncé” Change Detection Techniques
  • 14. • Change detection techniques o In dynamically changing and non-stationary environments, the data distribution can change over time yielding the phenomenon of concept drift o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable) given the input (input features) • Concept drift types: Time Datamean Incremental Disambiguation such as “cikm conference 2014” Change Detection Techniques
  • 15. • Change detection techniques o In dynamically changing and non-stationary environments, the data distribution can change over time yielding the phenomenon of concept drift o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable) given the input (input features) • Concept drift types: Time Datamean Gradual Breaking news such as “idaho bus crash investigation” Change Detection Techniques
  • 16. • Change detection techniques o In dynamically changing and non-stationary environments, the data distribution can change over time yielding the phenomenon of concept drift o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable) given the input (input features) • Concept drift types: Time Datamean Reoccurring Seasonal change such as “black Friday 2014” Change Detection Techniques
  • 17. • Change detection techniques o In dynamically changing and non-stationary environments, the data distribution can change over time yielding the phenomenon of concept drift o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable) given the input (input features) • Concept drift types: Time Datamean Change Detection Techniques
  • 18. • Change detection techniques o In dynamically changing and non-stationary environments, the data distribution can change over time yielding the phenomenon of concept drift o The real concept drift refers to changes in the conditional distribution of the output (i.e., target variable) given the input (input features) • Concept drift types: Time Datamean Sudden/abru pt Incremental Gradual Reoccurring concepts Outlier (not concept drift) Disambiguation such as “medal olympics 2014” Seasonal change such as “black Friday 2014” Breaking news such as “idaho bus crash investigation” Disambiguation such as “cikm conference 2014” Change Detection Techniques
  • 19. Detecting Drifts in Reformulation Signal Query: “cikm conference” 0.1 TimeLinet0 0.1 0.2 0.2 0.3 Reformulation: “2014” Window W0 ti
  • 20. Detecting Drifts in Reformulation Signal Query: “cikm conference” 0.1 TimeLinet0 ti+ t 0.1 0.2 0.2 0.3 0.7 0.8 0.8 Reformulation: “2014” Window W0 Window W1 ti E(W0) E(W1) Size of Window W1 = n1Size of Window W0 = n0 The upcoming conference event If |E(W1) - E(W2)|> eout Then Drift Detected
  • 21. Calculating Threshold eout Confidence Variance at W = W0 U W1 m = 1/(1/n0 + 1/n1) eout
  • 23. Learn reformulation model M User Behavior Logs t0 Incoming User Behavior logs Timeline Detect changes in model M If change detected else Do Nothing ti ti+ t
  • 24. Learn reformulation model M User Behavior Logs ti Incoming User Behavior logs Timeline Detect changes in model M If change detected else Do Nothing ti+w1 ti+w1+w2 Alarm: Change of user satisfaction detected for pairs : {<Qi, SERPi>}1<i<n
  • 25. Learn reformulation model M User Behavior Logs t0 Incoming User Behavior Logs Timeline Detect changes in model M If change detected else Do Nothing ti ti+ t 1) List of reformulation terms per query 2) List of URLs per reformulation Alarm: Change of user satisfaction detected for pairs : {<Qi, SERPi>}1<i<n
  • 26. o Dataset consists of 6 months of the behavioral log data from a commercial search engine o The training window size is one month o The test window size is two weeks Experimentation
  • 29. o We successfully leveraged the concept drift detection techniques to detect changes in user satisfaction o The proposed technique works in unsupervised way o Large scale evaluation has been performed o Classification of the drift type is needed o Prediction of the lifetime of the drift would help Conclusion and Future Work
  • 32. o We successfully leveraged the concept drift detection techniques o The proposed technique works in unsupervised way o Large scale evaluation has been performed o Classification of the drift type is needed o Prediction of the lifetime of the drift would help Conclusion and Future Work

Editor's Notes

  1. In our work we use probability to reformulate a query as a sign of user satisfaction or dissatisfaction
  2. Let revisit our example where the user wanted to visit CIKM
  3. When we can this change becomes a drift
  4. Talk about applications