SlideShare a Scribd company logo
1 of 31
Download to read offline
Predicting User Engagement with Direct Displays Using
Mouse Cursor Information
Ioannis Arapakis (Eurecat), Luis Leiva (Sciling)
Contents
§Introduction & motivation
§Merits of mouse cursor analysis
§Experimental setup
§Predictive modelling
§Performance assessment
§Results
§Conclusions
Introduction
§ In recent years direct displays (DDs)
have become a standard component
on the SERPs of all major web search
engines
§ DDs serve two main purposes:
• Provide well-structured summary of
information which is difficult or time-
consuming to access
• Help tidy up the SERP section that contains
the universal search results
Knowledge Module
§ One such prominent example is the Knowledge Module (KM)
display which provides users with information about the named
entities they are searching
§ The content presented in the KM display is obtained in a semi-
structured format from curated entity databases (e.g., Freebase,
Wikipedia)
§ This raw information is further enriched by the search engine,
e.g., ranking of related entities, explanations of their relationship
or with related multimedia and social media content
Motivation
§ In this context, most research has focused on general backend
system tasks, the most important being knowledge base
construction, or more specific backend tasks such as related
entity recommendation
§ This work attempts to understand how users engage with a DD
like the KM display in entity-centric search tasks
§ We are interested in predicting user engagement with a DD in
the absence of explicit feedback (e.g., self-report data)
Addressing the gap
§ Existing modelling techniques make a simplifying assumption
when analysing web search log data: the user is assumed to
be equally engaged with all parts of the SERP
§ In practice this assumption is not always true:
• A user may click on certain links on the page, but not all links
• May read a certain result snippet in the SERP but not necessarily the entire
list of results
• May ignore the SERP content completely and focus only on the images
shown in the KM display or other DDs
Mouse cursor tracking
§ Navigation & interaction with a digital
environment usually involves the use of a
mouse (i.e., selecting, hovering, clicking)
§ Can be easily performed in a non-invasive
manner, without removing users from their
natural setting
§ Several works have shown that the mouse
cursor is a proxy of gaze (attention)
§ Low-cost, scalable alternative to eye-
tracking
Crowdsourcing study
§ We conducted a crowdsourcing study and examine how users
engage with DDs like the knowledge module (KM) display
§ We collected and analysed more than 115K mouse cursor
positions from 300 users
§ With this study we aim to predict:
• When a user notices the KM display on the SERP
• If it is perceived as a useful aid to their search tasks
• Whether interacting with the KM display alters the users’ perception of how
fast they complete the search tasks
Experimental design
§ Repeated-measures design
§ One independent variable: KM display (with two levels: “visible”
or “hidden”)
§ Three dependent variables: (i) KM display noticeability, (ii) KM
display usefulness and (iii) perceived task accomplishment
speed
§ Two short search tasks were completed using the Yahoo search
engine: one task with the KM display on the SERP and one
without it*
* The KM display visibility was controlled with client-side scripting.
Search UI
§ Participants accessed the search engine through a custom proxy
which did not alter the original look and feel of the SERPs
§ This allowed us to capture user interactions with the SERP
without interfering with the actual web search engine interface in
production
§ For each search task, participants were presented with a
question and were suggested a search query to begin with
Search query sample
§ Query set consisted of 32 unique query patterns (144 different queries in total)
§ The selected query patterns belonged to four different topics (famous people,
movies, athletes, sport teams) and required either single or multiple answers
Mouse cursor tracking tool
§ To collect mouse cursor data we used EVTRACK*, an open
source JavaScript event tracking library that is part of the smt2ε
system
§ EVTRACK allows to specify what browser events should be
captured and how they should be captured, i.e., via event
listeners (the event is captured as soon as it is fired) or via event
polling
* https://github.com/luileito/evtrack
Self-reported measures of engagement
§ A mini-questionnaire on the SERPs gathered
ground truth labels for the mouse cursor data
§ The mini-questionnaire was initially hidden
and was shown to the user just before
leaving the SERP
§ It comprised 3 questions:
• Did you notice the knowledge module?
• To what extent did you find the knowledge module
useful in answering the question?
• To what extent did the knowledge module help you
answer the question faster?
Procedure
§ Participants were asked to evaluate two different backend
systems of Yahoo web search by performing two search tasks
§ For each task, participants had to answer a question by
searching for relevant information on the proxified search engine
§ In one task the KM display would be hidden (control condition)
and in the other task it would be visible (experimental condition)
§ The order of the tasks was randomized for each participant
§ Participants were presented with a suggested query to begin
their search but were free to submit additional queries
§ We used informational, entity-centric queries to introduce a
common starting point across all participants
Modelling user engagement
§ Our final dataset consists of ~115K cursor positions, collected
during 600 search task sessions
§ Out of those 600 search task sessions we analysed the 300
cases that correspond to the experimental condition with the
visible KM in the SERP
§ Our dataset is generally balanced, with 176 users having
reported noticing the KM display
§ We normalised the values for each feature so that feature values
that those that fall in greater numeric ranges do not dominate
over those in smaller numeric ranges
Feature Engineering
* These functions are computed for most base and meta-features.
Users who did not engage with the KG Users who engaged with the KG
Feature Engineering (cont.)
Predictive Modelling
§ We trained 10 RF* models (90% of data) and used
them to obtain the predictions for each of the held-
out set (10% of data) among the ten folds**
§ Excluded highly correlated and linearly dependent
features
§ Performed feature selection using recursive
feature elimination
§ We used a subset of our training data for fine-
tuning the classifier’s hyperparameters
* R packages “Caret” and “randomForest”.
** With stratified sampling.
Performance evaluation
§ Baselines:
• If the user clicked on the KM display (hasClickedKM, binary)
• If the mouse cursor hovered over the KM display (hasHoveredKM, binary)
• Time spent on the page (dwellTime) as a feature to the RF classifier
§ Performance evaluation:
• Precision / Recall
• Accuracy
• F-Measure
• AUC
Results
Attention
Precision Recall F-Measure Accuracy AUC
0
0.2
0.4
0.6
0.8
1
Attention
Click
Hover
DwellTime
Ours
Usefulness
Precision Recall F-Measure Accuracy AUC
0
0.2
0.4
0.6
0.8
1
Usefulness
Click
Hover
DwellTime
Ours
Perceived Task Duration
Precision Recall F-Measure Accuracy AUC
0
0.2
0.4
0.6
0.8
1
Task Duration
Click
Hover
DwellTime
Ours
Computational complexity
§ Mouse gesture techniques that rely on PCA
preprocessing and k-means clustering
• Covariance matrix computation + eigenvalue
decomposition ☞ O(p2N + p3)
• K-means ☞ O(icN)
§ Cursor Motifs that use Dynamic Time Warping
(DTW) and k-nearest neighbours (kNN)
• DTW ☞ O(N2)
• kNN ☞ O(N2k2)
§ Proposed method has linear ☞ O(N) or
quasilinear cost ☞ O(NlogN)
Conclusions
§ We conducted a crowdsourcing study that revealed the potential
benefits of using mouse cursor data to predict user
engagement with DDs
§ We demonstrated that our feature selection model outperforms
the standard baselines to measure three user engagement
proxies with the KM display
§ Our initial results suggest that it is possible to predict when the
user attention is captured by a DD using only simple, yet highly
discriminative features derived from mouse cursor activity
Conclusions (cont.)
§ Predicting accurately if a DD was truly noticed can:
• Increase the true negative prediction rate
• Reduce the false negative rate
§ Knowing when a user finds a DD useful has important
implications on the methodology for understanding the impact
of launching a new DD, modifying its existing design, and how
that change may affect search UIs
Conclusions (cont.)
§ Information about perceived task duration can be combined with
the previous grounds truths to understand better how users
engage with ads or multimedia content
§ The main practical use of our models is perhaps to
automatically select or lay out the DDs
§ DDs are optional for the SERPs and the user behaviour could
provide signals about whether DDs should be shown or not in
particular queries
§ Our method offers a computationally efficient way to analyse
mouse cursor data
Acknowledgments
§ We thank B. Barla Cambazoglu and Marios Koulakis for fruitful
discussions
Thank you for your attention!
iarapakis
arapakis.ioannis@gmail.com
https://es.linkedin.com/in/ioannisarapakis
http://www.slideshare.net/iarapakis/sigir16

More Related Content

Similar to sigir16

Cloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big DataCloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big DataAbhishek M Shivalingaiah
 
A flexible recommenndation system for Cable TV
A flexible recommenndation system for Cable TVA flexible recommenndation system for Cable TV
A flexible recommenndation system for Cable TVIntoTheMinds
 
A Flexible Recommendation System for Cable TV
A Flexible Recommendation System for Cable TVA Flexible Recommendation System for Cable TV
A Flexible Recommendation System for Cable TVFrancisco Couto
 
DeepScan: Exploiting Deep Learning for Malicious Account Detection in Locatio...
DeepScan: Exploiting Deep Learning for Malicious Account Detection in Locatio...DeepScan: Exploiting Deep Learning for Malicious Account Detection in Locatio...
DeepScan: Exploiting Deep Learning for Malicious Account Detection in Locatio...yeung2000
 
Nose Dive into Apache Spark ML
Nose Dive into Apache Spark MLNose Dive into Apache Spark ML
Nose Dive into Apache Spark MLAhmet Bulut
 
Ed Snelson. Counterfactual Analysis
Ed Snelson. Counterfactual AnalysisEd Snelson. Counterfactual Analysis
Ed Snelson. Counterfactual AnalysisVolha Banadyseva
 
Cikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueCikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueXavier Amatriain
 
Building High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning ApplicationsBuilding High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning ApplicationsYalçın Yenigün
 
IEEE.BigData.Tutorial.2.slides
IEEE.BigData.Tutorial.2.slidesIEEE.BigData.Tutorial.2.slides
IEEE.BigData.Tutorial.2.slidesNish Parikh
 
Large scale Click-streaming and tranaction log mining
Large scale Click-streaming and tranaction log miningLarge scale Click-streaming and tranaction log mining
Large scale Click-streaming and tranaction log miningitstuff
 
Optimizing Observability Spend: Metrics
Optimizing Observability Spend: MetricsOptimizing Observability Spend: Metrics
Optimizing Observability Spend: MetricsEric D. Schabell
 
Why Distributed Tracing is Essential for Performance and Reliability
Why Distributed Tracing is Essential for Performance and ReliabilityWhy Distributed Tracing is Essential for Performance and Reliability
Why Distributed Tracing is Essential for Performance and ReliabilityDevOps.com
 
Big Data Day LA 2016/ Use Case Driven track - Shaping the Role of Data Scienc...
Big Data Day LA 2016/ Use Case Driven track - Shaping the Role of Data Scienc...Big Data Day LA 2016/ Use Case Driven track - Shaping the Role of Data Scienc...
Big Data Day LA 2016/ Use Case Driven track - Shaping the Role of Data Scienc...Data Con LA
 
RESUME SCREENING USING LSTM
RESUME SCREENING USING LSTMRESUME SCREENING USING LSTM
RESUME SCREENING USING LSTMIRJET Journal
 
Exploratory_Analysis_of_Data_ppt.pdf
Exploratory_Analysis_of_Data_ppt.pdfExploratory_Analysis_of_Data_ppt.pdf
Exploratory_Analysis_of_Data_ppt.pdfRushikeshKulkarni71
 
Thesis_presentation_arda_tasci
Thesis_presentation_arda_tasciThesis_presentation_arda_tasci
Thesis_presentation_arda_tasciArda Taşcı
 
User Behavior Hashing for Audience Expansion
User Behavior Hashing for Audience ExpansionUser Behavior Hashing for Audience Expansion
User Behavior Hashing for Audience ExpansionDatabricks
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE
 

Similar to sigir16 (20)

Cloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big DataCloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big Data
 
A flexible recommenndation system for Cable TV
A flexible recommenndation system for Cable TVA flexible recommenndation system for Cable TV
A flexible recommenndation system for Cable TV
 
A Flexible Recommendation System for Cable TV
A Flexible Recommendation System for Cable TVA Flexible Recommendation System for Cable TV
A Flexible Recommendation System for Cable TV
 
DeepScan: Exploiting Deep Learning for Malicious Account Detection in Locatio...
DeepScan: Exploiting Deep Learning for Malicious Account Detection in Locatio...DeepScan: Exploiting Deep Learning for Malicious Account Detection in Locatio...
DeepScan: Exploiting Deep Learning for Malicious Account Detection in Locatio...
 
resume
resumeresume
resume
 
Nose Dive into Apache Spark ML
Nose Dive into Apache Spark MLNose Dive into Apache Spark ML
Nose Dive into Apache Spark ML
 
Ed Snelson. Counterfactual Analysis
Ed Snelson. Counterfactual AnalysisEd Snelson. Counterfactual Analysis
Ed Snelson. Counterfactual Analysis
 
Cikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueCikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business Value
 
Building High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning ApplicationsBuilding High Available and Scalable Machine Learning Applications
Building High Available and Scalable Machine Learning Applications
 
Internship Presentation.pdf
Internship Presentation.pdfInternship Presentation.pdf
Internship Presentation.pdf
 
IEEE.BigData.Tutorial.2.slides
IEEE.BigData.Tutorial.2.slidesIEEE.BigData.Tutorial.2.slides
IEEE.BigData.Tutorial.2.slides
 
Large scale Click-streaming and tranaction log mining
Large scale Click-streaming and tranaction log miningLarge scale Click-streaming and tranaction log mining
Large scale Click-streaming and tranaction log mining
 
Optimizing Observability Spend: Metrics
Optimizing Observability Spend: MetricsOptimizing Observability Spend: Metrics
Optimizing Observability Spend: Metrics
 
Why Distributed Tracing is Essential for Performance and Reliability
Why Distributed Tracing is Essential for Performance and ReliabilityWhy Distributed Tracing is Essential for Performance and Reliability
Why Distributed Tracing is Essential for Performance and Reliability
 
Big Data Day LA 2016/ Use Case Driven track - Shaping the Role of Data Scienc...
Big Data Day LA 2016/ Use Case Driven track - Shaping the Role of Data Scienc...Big Data Day LA 2016/ Use Case Driven track - Shaping the Role of Data Scienc...
Big Data Day LA 2016/ Use Case Driven track - Shaping the Role of Data Scienc...
 
RESUME SCREENING USING LSTM
RESUME SCREENING USING LSTMRESUME SCREENING USING LSTM
RESUME SCREENING USING LSTM
 
Exploratory_Analysis_of_Data_ppt.pdf
Exploratory_Analysis_of_Data_ppt.pdfExploratory_Analysis_of_Data_ppt.pdf
Exploratory_Analysis_of_Data_ppt.pdf
 
Thesis_presentation_arda_tasci
Thesis_presentation_arda_tasciThesis_presentation_arda_tasci
Thesis_presentation_arda_tasci
 
User Behavior Hashing for Audience Expansion
User Behavior Hashing for Audience ExpansionUser Behavior Hashing for Audience Expansion
User Behavior Hashing for Audience Expansion
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
 

More from Telefonica Research

A Price-Per-Attention Auction Scheme Using Mouse Cursor Information
A Price-Per-Attention Auction Scheme Using Mouse Cursor InformationA Price-Per-Attention Auction Scheme Using Mouse Cursor Information
A Price-Per-Attention Auction Scheme Using Mouse Cursor InformationTelefonica Research
 
Learning Efficient Representations of Mouse Movements to Predict User Attention
Learning Efficient Representations of Mouse Movements to Predict User AttentionLearning Efficient Representations of Mouse Movements to Predict User Attention
Learning Efficient Representations of Mouse Movements to Predict User AttentionTelefonica Research
 
User Behaviour Modelling - Online and Offline Methods, Metrics, and Challenges
User Behaviour Modelling - Online and Offline Methods, Metrics, and ChallengesUser Behaviour Modelling - Online and Offline Methods, Metrics, and Challenges
User Behaviour Modelling - Online and Offline Methods, Metrics, and ChallengesTelefonica Research
 
System and User Aspects of Web Search Latency
System and User Aspects of Web Search LatencySystem and User Aspects of Web Search Latency
System and User Aspects of Web Search LatencyTelefonica Research
 
SocInfo14 - On the Feasibility of Predicting News Popularity at Cold Start
SocInfo14 - On the Feasibility of Predicting News Popularity at Cold StartSocInfo14 - On the Feasibility of Predicting News Popularity at Cold Start
SocInfo14 - On the Feasibility of Predicting News Popularity at Cold StartTelefonica Research
 
CIKM 2014 - Understanding Within-Content Engagement through Pattern Analysis ...
CIKM 2014 - Understanding Within-Content Engagement through Pattern Analysis ...CIKM 2014 - Understanding Within-Content Engagement through Pattern Analysis ...
CIKM 2014 - Understanding Within-Content Engagement through Pattern Analysis ...Telefonica Research
 
SIGIR2014 - Impact of Response Latency on User Behavior in Web Search
SIGIR2014 - Impact of Response Latency on User Behavior in Web SearchSIGIR2014 - Impact of Response Latency on User Behavior in Web Search
SIGIR2014 - Impact of Response Latency on User Behavior in Web SearchTelefonica Research
 

More from Telefonica Research (9)

A Price-Per-Attention Auction Scheme Using Mouse Cursor Information
A Price-Per-Attention Auction Scheme Using Mouse Cursor InformationA Price-Per-Attention Auction Scheme Using Mouse Cursor Information
A Price-Per-Attention Auction Scheme Using Mouse Cursor Information
 
Learning Efficient Representations of Mouse Movements to Predict User Attention
Learning Efficient Representations of Mouse Movements to Predict User AttentionLearning Efficient Representations of Mouse Movements to Predict User Attention
Learning Efficient Representations of Mouse Movements to Predict User Attention
 
User Behaviour Modelling - Online and Offline Methods, Metrics, and Challenges
User Behaviour Modelling - Online and Offline Methods, Metrics, and ChallengesUser Behaviour Modelling - Online and Offline Methods, Metrics, and Challenges
User Behaviour Modelling - Online and Offline Methods, Metrics, and Challenges
 
SIGIR15-NeuroIR
SIGIR15-NeuroIRSIGIR15-NeuroIR
SIGIR15-NeuroIR
 
Sigir15
Sigir15Sigir15
Sigir15
 
System and User Aspects of Web Search Latency
System and User Aspects of Web Search LatencySystem and User Aspects of Web Search Latency
System and User Aspects of Web Search Latency
 
SocInfo14 - On the Feasibility of Predicting News Popularity at Cold Start
SocInfo14 - On the Feasibility of Predicting News Popularity at Cold StartSocInfo14 - On the Feasibility of Predicting News Popularity at Cold Start
SocInfo14 - On the Feasibility of Predicting News Popularity at Cold Start
 
CIKM 2014 - Understanding Within-Content Engagement through Pattern Analysis ...
CIKM 2014 - Understanding Within-Content Engagement through Pattern Analysis ...CIKM 2014 - Understanding Within-Content Engagement through Pattern Analysis ...
CIKM 2014 - Understanding Within-Content Engagement through Pattern Analysis ...
 
SIGIR2014 - Impact of Response Latency on User Behavior in Web Search
SIGIR2014 - Impact of Response Latency on User Behavior in Web SearchSIGIR2014 - Impact of Response Latency on User Behavior in Web Search
SIGIR2014 - Impact of Response Latency on User Behavior in Web Search
 

Recently uploaded

Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 

Recently uploaded (20)

Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 

sigir16

  • 1. Predicting User Engagement with Direct Displays Using Mouse Cursor Information Ioannis Arapakis (Eurecat), Luis Leiva (Sciling)
  • 2. Contents §Introduction & motivation §Merits of mouse cursor analysis §Experimental setup §Predictive modelling §Performance assessment §Results §Conclusions
  • 3. Introduction § In recent years direct displays (DDs) have become a standard component on the SERPs of all major web search engines § DDs serve two main purposes: • Provide well-structured summary of information which is difficult or time- consuming to access • Help tidy up the SERP section that contains the universal search results
  • 4. Knowledge Module § One such prominent example is the Knowledge Module (KM) display which provides users with information about the named entities they are searching § The content presented in the KM display is obtained in a semi- structured format from curated entity databases (e.g., Freebase, Wikipedia) § This raw information is further enriched by the search engine, e.g., ranking of related entities, explanations of their relationship or with related multimedia and social media content
  • 5. Motivation § In this context, most research has focused on general backend system tasks, the most important being knowledge base construction, or more specific backend tasks such as related entity recommendation § This work attempts to understand how users engage with a DD like the KM display in entity-centric search tasks § We are interested in predicting user engagement with a DD in the absence of explicit feedback (e.g., self-report data)
  • 6. Addressing the gap § Existing modelling techniques make a simplifying assumption when analysing web search log data: the user is assumed to be equally engaged with all parts of the SERP § In practice this assumption is not always true: • A user may click on certain links on the page, but not all links • May read a certain result snippet in the SERP but not necessarily the entire list of results • May ignore the SERP content completely and focus only on the images shown in the KM display or other DDs
  • 7. Mouse cursor tracking § Navigation & interaction with a digital environment usually involves the use of a mouse (i.e., selecting, hovering, clicking) § Can be easily performed in a non-invasive manner, without removing users from their natural setting § Several works have shown that the mouse cursor is a proxy of gaze (attention) § Low-cost, scalable alternative to eye- tracking
  • 8.
  • 9. Crowdsourcing study § We conducted a crowdsourcing study and examine how users engage with DDs like the knowledge module (KM) display § We collected and analysed more than 115K mouse cursor positions from 300 users § With this study we aim to predict: • When a user notices the KM display on the SERP • If it is perceived as a useful aid to their search tasks • Whether interacting with the KM display alters the users’ perception of how fast they complete the search tasks
  • 10. Experimental design § Repeated-measures design § One independent variable: KM display (with two levels: “visible” or “hidden”) § Three dependent variables: (i) KM display noticeability, (ii) KM display usefulness and (iii) perceived task accomplishment speed § Two short search tasks were completed using the Yahoo search engine: one task with the KM display on the SERP and one without it* * The KM display visibility was controlled with client-side scripting.
  • 11. Search UI § Participants accessed the search engine through a custom proxy which did not alter the original look and feel of the SERPs § This allowed us to capture user interactions with the SERP without interfering with the actual web search engine interface in production § For each search task, participants were presented with a question and were suggested a search query to begin with
  • 12. Search query sample § Query set consisted of 32 unique query patterns (144 different queries in total) § The selected query patterns belonged to four different topics (famous people, movies, athletes, sport teams) and required either single or multiple answers
  • 13. Mouse cursor tracking tool § To collect mouse cursor data we used EVTRACK*, an open source JavaScript event tracking library that is part of the smt2ε system § EVTRACK allows to specify what browser events should be captured and how they should be captured, i.e., via event listeners (the event is captured as soon as it is fired) or via event polling * https://github.com/luileito/evtrack
  • 14. Self-reported measures of engagement § A mini-questionnaire on the SERPs gathered ground truth labels for the mouse cursor data § The mini-questionnaire was initially hidden and was shown to the user just before leaving the SERP § It comprised 3 questions: • Did you notice the knowledge module? • To what extent did you find the knowledge module useful in answering the question? • To what extent did the knowledge module help you answer the question faster?
  • 15. Procedure § Participants were asked to evaluate two different backend systems of Yahoo web search by performing two search tasks § For each task, participants had to answer a question by searching for relevant information on the proxified search engine § In one task the KM display would be hidden (control condition) and in the other task it would be visible (experimental condition) § The order of the tasks was randomized for each participant § Participants were presented with a suggested query to begin their search but were free to submit additional queries § We used informational, entity-centric queries to introduce a common starting point across all participants
  • 16. Modelling user engagement § Our final dataset consists of ~115K cursor positions, collected during 600 search task sessions § Out of those 600 search task sessions we analysed the 300 cases that correspond to the experimental condition with the visible KM in the SERP § Our dataset is generally balanced, with 176 users having reported noticing the KM display § We normalised the values for each feature so that feature values that those that fall in greater numeric ranges do not dominate over those in smaller numeric ranges
  • 17. Feature Engineering * These functions are computed for most base and meta-features.
  • 18. Users who did not engage with the KG Users who engaged with the KG
  • 20. Predictive Modelling § We trained 10 RF* models (90% of data) and used them to obtain the predictions for each of the held- out set (10% of data) among the ten folds** § Excluded highly correlated and linearly dependent features § Performed feature selection using recursive feature elimination § We used a subset of our training data for fine- tuning the classifier’s hyperparameters * R packages “Caret” and “randomForest”. ** With stratified sampling.
  • 21. Performance evaluation § Baselines: • If the user clicked on the KM display (hasClickedKM, binary) • If the mouse cursor hovered over the KM display (hasHoveredKM, binary) • Time spent on the page (dwellTime) as a feature to the RF classifier § Performance evaluation: • Precision / Recall • Accuracy • F-Measure • AUC
  • 23. Attention Precision Recall F-Measure Accuracy AUC 0 0.2 0.4 0.6 0.8 1 Attention Click Hover DwellTime Ours
  • 24. Usefulness Precision Recall F-Measure Accuracy AUC 0 0.2 0.4 0.6 0.8 1 Usefulness Click Hover DwellTime Ours
  • 25. Perceived Task Duration Precision Recall F-Measure Accuracy AUC 0 0.2 0.4 0.6 0.8 1 Task Duration Click Hover DwellTime Ours
  • 26. Computational complexity § Mouse gesture techniques that rely on PCA preprocessing and k-means clustering • Covariance matrix computation + eigenvalue decomposition ☞ O(p2N + p3) • K-means ☞ O(icN) § Cursor Motifs that use Dynamic Time Warping (DTW) and k-nearest neighbours (kNN) • DTW ☞ O(N2) • kNN ☞ O(N2k2) § Proposed method has linear ☞ O(N) or quasilinear cost ☞ O(NlogN)
  • 27. Conclusions § We conducted a crowdsourcing study that revealed the potential benefits of using mouse cursor data to predict user engagement with DDs § We demonstrated that our feature selection model outperforms the standard baselines to measure three user engagement proxies with the KM display § Our initial results suggest that it is possible to predict when the user attention is captured by a DD using only simple, yet highly discriminative features derived from mouse cursor activity
  • 28. Conclusions (cont.) § Predicting accurately if a DD was truly noticed can: • Increase the true negative prediction rate • Reduce the false negative rate § Knowing when a user finds a DD useful has important implications on the methodology for understanding the impact of launching a new DD, modifying its existing design, and how that change may affect search UIs
  • 29. Conclusions (cont.) § Information about perceived task duration can be combined with the previous grounds truths to understand better how users engage with ads or multimedia content § The main practical use of our models is perhaps to automatically select or lay out the DDs § DDs are optional for the SERPs and the user behaviour could provide signals about whether DDs should be shown or not in particular queries § Our method offers a computationally efficient way to analyse mouse cursor data
  • 30. Acknowledgments § We thank B. Barla Cambazoglu and Marios Koulakis for fruitful discussions
  • 31. Thank you for your attention! iarapakis arapakis.ioannis@gmail.com https://es.linkedin.com/in/ioannisarapakis http://www.slideshare.net/iarapakis/sigir16