SlideShare a Scribd company logo
1 of 19
Download to read offline
Predicting User Knowledge Gain
in Informational Search Sessions
Ran Yu1
, Ujwal Gadiraju1
, Peter Holtz2
, Markus Rokicki1
, Philipp Kemkes1
, Stefan Dietze1,3
1. L3S Research Center, Leibniz Universität Hannover; Hannover, Germany
2. Leibniz-Institut für Wissensmedien (IWM); Tübingen, Germany
3. GESIS - Leibniz-Institute for the Social Sciences; Köln, Germany
@ranyu_zh yu@l3s.de
Search Engine for Serving Learning Needs
● Web search is frequently used to acquire new
knowledge & satisfy learning-related
objectives
● How does the knowledge of a user evolve
through the course of informational search
sessions and can we predict user knowledge
(change)?
Web Search Queries : Navigational, transactional or informational intents
[Broder, 2002]
.. in informational web search sessions, the intent of a user is to acquire
some information assumed to be present on one or more web pages ..
Intentional learning - “learning that is motivated by intentions and is goal
directed”, “cognitive process that have learning as a goal rather than an
accidental outcome”
2
Goal
Build models to detect user knowledge & knowledge gain:
● classifying the knowledge state (KS) at the end of the informational session with
respect to a particular information need into: low, moderate, high
● classifying the knowledge gain (KG) during the informational session into: low,
moderate, high
Knowledge on
SAL at 9:00?
Knowledge on
SAL at 9:50?
How did learning
happen?
3
e.g.
1. Eickhoff et al. investigated the correlation between several query and search
mission-related metrics with learning progress [2014]
2. Wu et al. predicted the difficulty of search tasks from query and mission-related
features [2012]
3. Collins-Thompson et al. investigated the aspects of search interaction which are
effective for supporting superior learning outcomes in vocabulary learning
scenario [2016, 2017,2018]
4. Zhang et al. explored using search behavior as an indicator for the domain
knowledge of a user [2015]
Prior works
4
Prior works
Summary:
● The learning related notions: knowledge gain, knowledge state, user engagement,
learning stage (based on e.g. Anderson and Krathwohl's taxonomy), expertise of a
topic, task difficulty
● Features extracted from: search session, search engine result page (SERP),
document, user behavior, eye-tracking data
● limited set of features
● addressing specific learning scenarios
● the generalizability of knowledge gain measures have not been
investigated
● no automated method for predicting user knowledge/gain.
5
Data Collection
1. Recruit workers from
crowdsourcing
platform
2. Pre-test to assess
worker’s initial
knowledge state on
the given topic
3. Direct the worker to
the SearchWell
platform to search and
browse documents
4. Post-test to assess
worker’s post
knowledge
6
Dataset available at: https://sites.google.com/view/predicting-user-knowledge
Topics & Information Needs
TREC 2014 Web
Track
7

Data Analysis
Knowledge gain is measured as the difference
between pre- and post-test score.
● 70% of users exhibited a knowledge gain
(KG)
● Negative relationship between KG of users
and topic popularity (avg. accuracy of
workers in knowledge tests) (R= -.87)
● Amount of time users actively spent on
web pages describes 7% of the variance
in their KG
● Query complexity explains 25% of the
variance in the KG of users
More results in:
Ujwal Gadiraju, Ran Yu, Stefan Dietze, Peter Holtz. Analyzing Knowledge Gain of
Users in Informational Search Sessions on the Web. ACM CHIIR 2018.
8
Generating class label
Generating classes: Group user knowledge state
(KS) and knowledge gain (KG) into {low, moderate,
high} group using Standard Deviation
Classification approach.
mean ± 0.5 SD
9
Considered Features
● Session, e.g. session duration, duration per query, ...
● Query, e.g. query length, query number, max query complexity, …
● SERP (Search engine result page), e.g. time on SERP, number of clicks, ...
● Browsing, e.g. number of pages, avg time per page, …
● Mouse, e.g. max scroll distance, number of mouseovers, ...
10
Models & configurations
Classifiers - Naive Bayes, Logistic regression, SVM, random forest, multilayer
perceptron. (Applied grid search to find the best parameters.)
Feature analysis & selection
● Pearson correlation between feature and KG (KS)≥ β (γ)
● Correlation between features < τ
Correlation
between
features
Correlation between feature & KG
11
Result - KG prediction
Overall: meaningful evidence for predicting
knowledge gain
Accuracy:
Efficiency:
245 distinct configurations * 10-fold cross
validation * 10 repetitions
Metrics:
● Accuracy (Accu) across all classes
● Precision (P), Recall (R), F1 (F1) score of
each class
Macro average of precision (P), recall (R),
and F1 (F1)
● Runtime in milliseconds.
RF
SVM NB
NB
12
Feature importance - KG
Individual features
● Browsing time related
● Page title length
● Amount of mouse movement
● Rank of the clicked documents
● ...
13
Feature categories
Browsing
Mouse SERP
Result - KS prediction
140 distinct configurations * 10-fold cross
validation * 10 repetitions
Metrics:
● Accuracy (Accu) across all classes
● Precision (P), Recall (R), F1 (F1) score of
each class
Macro average of precision (P), recall (R),
and F1 (F1)
● Runtime in milliseconds.
Overall: meaningful evidence for predicting
knowledge state. Outperforms baseline.
Accuracy:
Efficiency:
KSZhang
14
RF
MP SVM
Feature importance - KS
15
Individual features
● Query complexity
● Page title length
● Page title unique terms
● Query length
● Browsed page size
● ...
Feature categories
Query
Browsing
Discussion & Conclusions
Conclusions
● knowledge gain (state) can be predicted during informational search sessions
with a certain level of accuracy,
● performance of the knowledge gain prediction appears to be generally better,
suggesting that the
task is easier given the nature of our data, and
● the performance of the prediction approach is better for more extreme classes
Limitations
● limited duration of the search sessions reduce the predictive power of certain
features
● topic descriptions provided central keywords for the first query, which makes
the query features less distinguishable
16
Future works
● Conducting experiments to get data for more varied search sessions
● reproduce and refine the findings in more varied search sessions (e.g.
longer duration, procedural knowledge)
● investigate features of (multimedia) resources that user interacted
with
● Use predicted KS & KG for optimizing retrieval algorithms
17
Thank you!
Q&A
18
A Few More Details . . .
TEST QUESTION SELECTION:
● Using 100 workers per topic, and a larger pool of items
(~30); filter items that were too easy (>80% of the workers
got it right) or too hard/ambiguous (<20% got it right).
SAL-SESSIONS:
● Participation of only Level-3 CrowdFlower workers from
primarily English-speaking countries
● 50 workers per topic, filtered out workers who entered no
queries, workers who selected the same option
‘TRUE/FALSE’ for all items, those who did not complete the
post-session test ⇒ 420 workers
19

More Related Content

Similar to Predicting User Knowledge Gain in Informational Search Sessions

Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement Roi Blanco
 
Data analysis – using computers
Data analysis – using computersData analysis – using computers
Data analysis – using computersNoonapau
 
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...Yongyao Jiang
 
Data analysis – using computers for presentation
Data analysis – using computers for presentationData analysis – using computers for presentation
Data analysis – using computers for presentationNoonapau
 
Application of fuzzy logic for user
Application of fuzzy logic for userApplication of fuzzy logic for user
Application of fuzzy logic for userIJCI JOURNAL
 
Data-Driven Learning Strategy
Data-Driven Learning StrategyData-Driven Learning Strategy
Data-Driven Learning StrategyJessie Chuang
 
Query formulation process
Query formulation processQuery formulation process
Query formulation processmalathimurugan
 
Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineSalford Systems
 
Data analysis – using computers
Data analysis – using computersData analysis – using computers
Data analysis – using computersNoonapau
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerIJERA Editor
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsIRJET Journal
 
Search in Research, Let's Make it More Complex!
Search in Research, Let's Make it More Complex!Search in Research, Let's Make it More Complex!
Search in Research, Let's Make it More Complex!Marijn Koolen
 
A Framework For Effective Content Strategy Based On Heuristic Evaluation (Res...
A Framework For Effective Content Strategy Based On Heuristic Evaluation (Res...A Framework For Effective Content Strategy Based On Heuristic Evaluation (Res...
A Framework For Effective Content Strategy Based On Heuristic Evaluation (Res...Nim Dvir
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE
 
Improving search personalisation with dynamic group formation
Improving search personalisation with dynamic group formationImproving search personalisation with dynamic group formation
Improving search personalisation with dynamic group formationThanh Vu
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsRebecca Grant
 
ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
ExperTwin: An Alter Ego in Cyberspace for Knowledge WorkersExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
ExperTwin: An Alter Ego in Cyberspace for Knowledge WorkersCarlos Toxtli
 

Similar to Predicting User Knowledge Gain in Informational Search Sessions (20)

Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement
 
Data analysis – using computers
Data analysis – using computersData analysis – using computers
Data analysis – using computers
 
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
 
Data analysis – using computers for presentation
Data analysis – using computers for presentationData analysis – using computers for presentation
Data analysis – using computers for presentation
 
Application of fuzzy logic for user
Application of fuzzy logic for userApplication of fuzzy logic for user
Application of fuzzy logic for user
 
Data-Driven Learning Strategy
Data-Driven Learning StrategyData-Driven Learning Strategy
Data-Driven Learning Strategy
 
Query formulation process
Query formulation processQuery formulation process
Query formulation process
 
Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search Engine
 
Data analysis – using computers
Data analysis – using computersData analysis – using computers
Data analysis – using computers
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
 
Role of Data Science in eCommerce
Role of Data Science in eCommerceRole of Data Science in eCommerce
Role of Data Science in eCommerce
 
8th sem (1)
8th sem (1)8th sem (1)
8th sem (1)
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search Results
 
Search in Research, Let's Make it More Complex!
Search in Research, Let's Make it More Complex!Search in Research, Let's Make it More Complex!
Search in Research, Let's Make it More Complex!
 
A Framework For Effective Content Strategy Based On Heuristic Evaluation (Res...
A Framework For Effective Content Strategy Based On Heuristic Evaluation (Res...A Framework For Effective Content Strategy Based On Heuristic Evaluation (Res...
A Framework For Effective Content Strategy Based On Heuristic Evaluation (Res...
 
Master's Seminar
Master's SeminarMaster's Seminar
Master's Seminar
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
 
Improving search personalisation with dynamic group formation
Improving search personalisation with dynamic group formationImproving search personalisation with dynamic group formation
Improving search personalisation with dynamic group formation
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research Methods
 
ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
ExperTwin: An Alter Ego in Cyberspace for Knowledge WorkersExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
 

Recently uploaded

Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts servicevipmodelshub1
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Sheetaleventcompany
 
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Delhi Call girls
 
AlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsAlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsThierry TROUIN ☁
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024APNIC
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxellan12
 
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls KolkataLow Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
VIP Kolkata Call Girls Salt Lake 8250192130 Available With Room
VIP Kolkata Call Girls Salt Lake 8250192130 Available With RoomVIP Kolkata Call Girls Salt Lake 8250192130 Available With Room
VIP Kolkata Call Girls Salt Lake 8250192130 Available With Roomgirls4nights
 
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts servicesonalikaur4
 
Russian Call Girls Thane Swara 8617697112 Independent Escort Service Thane
Russian Call Girls Thane Swara 8617697112 Independent Escort Service ThaneRussian Call Girls Thane Swara 8617697112 Independent Escort Service Thane
Russian Call Girls Thane Swara 8617697112 Independent Escort Service ThaneCall girls in Ahmedabad High profile
 
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Russian Call girls in Dubai +971563133746 Dubai Call girls
Russian  Call girls in Dubai +971563133746 Dubai  Call girlsRussian  Call girls in Dubai +971563133746 Dubai  Call girls
Russian Call girls in Dubai +971563133746 Dubai Call girlsstephieert
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...APNIC
 
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl ServiceRussian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl Servicegwenoracqe6
 
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 

Recently uploaded (20)

Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
 
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
 
AlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsAlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with Flows
 
Call Girls In South Ex 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In South Ex 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICECall Girls In South Ex 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In South Ex 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024
 
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
 
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls KolkataLow Rate Call Girls Kolkata Avani 🤌  8250192130 🚀 Vip Call Girls Kolkata
Low Rate Call Girls Kolkata Avani 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
VIP Kolkata Call Girls Salt Lake 8250192130 Available With Room
VIP Kolkata Call Girls Salt Lake 8250192130 Available With RoomVIP Kolkata Call Girls Salt Lake 8250192130 Available With Room
VIP Kolkata Call Girls Salt Lake 8250192130 Available With Room
 
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
 
Russian Call Girls Thane Swara 8617697112 Independent Escort Service Thane
Russian Call Girls Thane Swara 8617697112 Independent Escort Service ThaneRussian Call Girls Thane Swara 8617697112 Independent Escort Service Thane
Russian Call Girls Thane Swara 8617697112 Independent Escort Service Thane
 
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
 
Russian Call girls in Dubai +971563133746 Dubai Call girls
Russian  Call girls in Dubai +971563133746 Dubai  Call girlsRussian  Call girls in Dubai +971563133746 Dubai  Call girls
Russian Call girls in Dubai +971563133746 Dubai Call girls
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
 
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
 
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl ServiceRussian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
 
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 

Predicting User Knowledge Gain in Informational Search Sessions

  • 1. Predicting User Knowledge Gain in Informational Search Sessions Ran Yu1 , Ujwal Gadiraju1 , Peter Holtz2 , Markus Rokicki1 , Philipp Kemkes1 , Stefan Dietze1,3 1. L3S Research Center, Leibniz Universität Hannover; Hannover, Germany 2. Leibniz-Institut für Wissensmedien (IWM); Tübingen, Germany 3. GESIS - Leibniz-Institute for the Social Sciences; Köln, Germany @ranyu_zh yu@l3s.de
  • 2. Search Engine for Serving Learning Needs ● Web search is frequently used to acquire new knowledge & satisfy learning-related objectives ● How does the knowledge of a user evolve through the course of informational search sessions and can we predict user knowledge (change)? Web Search Queries : Navigational, transactional or informational intents [Broder, 2002] .. in informational web search sessions, the intent of a user is to acquire some information assumed to be present on one or more web pages .. Intentional learning - “learning that is motivated by intentions and is goal directed”, “cognitive process that have learning as a goal rather than an accidental outcome” 2
  • 3. Goal Build models to detect user knowledge & knowledge gain: ● classifying the knowledge state (KS) at the end of the informational session with respect to a particular information need into: low, moderate, high ● classifying the knowledge gain (KG) during the informational session into: low, moderate, high Knowledge on SAL at 9:00? Knowledge on SAL at 9:50? How did learning happen? 3
  • 4. e.g. 1. Eickhoff et al. investigated the correlation between several query and search mission-related metrics with learning progress [2014] 2. Wu et al. predicted the difficulty of search tasks from query and mission-related features [2012] 3. Collins-Thompson et al. investigated the aspects of search interaction which are effective for supporting superior learning outcomes in vocabulary learning scenario [2016, 2017,2018] 4. Zhang et al. explored using search behavior as an indicator for the domain knowledge of a user [2015] Prior works 4
  • 5. Prior works Summary: ● The learning related notions: knowledge gain, knowledge state, user engagement, learning stage (based on e.g. Anderson and Krathwohl's taxonomy), expertise of a topic, task difficulty ● Features extracted from: search session, search engine result page (SERP), document, user behavior, eye-tracking data ● limited set of features ● addressing specific learning scenarios ● the generalizability of knowledge gain measures have not been investigated ● no automated method for predicting user knowledge/gain. 5
  • 6. Data Collection 1. Recruit workers from crowdsourcing platform 2. Pre-test to assess worker’s initial knowledge state on the given topic 3. Direct the worker to the SearchWell platform to search and browse documents 4. Post-test to assess worker’s post knowledge 6 Dataset available at: https://sites.google.com/view/predicting-user-knowledge
  • 7. Topics & Information Needs TREC 2014 Web Track 7 
  • 8. Data Analysis Knowledge gain is measured as the difference between pre- and post-test score. ● 70% of users exhibited a knowledge gain (KG) ● Negative relationship between KG of users and topic popularity (avg. accuracy of workers in knowledge tests) (R= -.87) ● Amount of time users actively spent on web pages describes 7% of the variance in their KG ● Query complexity explains 25% of the variance in the KG of users More results in: Ujwal Gadiraju, Ran Yu, Stefan Dietze, Peter Holtz. Analyzing Knowledge Gain of Users in Informational Search Sessions on the Web. ACM CHIIR 2018. 8
  • 9. Generating class label Generating classes: Group user knowledge state (KS) and knowledge gain (KG) into {low, moderate, high} group using Standard Deviation Classification approach. mean ± 0.5 SD 9
  • 10. Considered Features ● Session, e.g. session duration, duration per query, ... ● Query, e.g. query length, query number, max query complexity, … ● SERP (Search engine result page), e.g. time on SERP, number of clicks, ... ● Browsing, e.g. number of pages, avg time per page, … ● Mouse, e.g. max scroll distance, number of mouseovers, ... 10
  • 11. Models & configurations Classifiers - Naive Bayes, Logistic regression, SVM, random forest, multilayer perceptron. (Applied grid search to find the best parameters.) Feature analysis & selection ● Pearson correlation between feature and KG (KS)≥ β (γ) ● Correlation between features < τ Correlation between features Correlation between feature & KG 11
  • 12. Result - KG prediction Overall: meaningful evidence for predicting knowledge gain Accuracy: Efficiency: 245 distinct configurations * 10-fold cross validation * 10 repetitions Metrics: ● Accuracy (Accu) across all classes ● Precision (P), Recall (R), F1 (F1) score of each class Macro average of precision (P), recall (R), and F1 (F1) ● Runtime in milliseconds. RF SVM NB NB 12
  • 13. Feature importance - KG Individual features ● Browsing time related ● Page title length ● Amount of mouse movement ● Rank of the clicked documents ● ... 13 Feature categories Browsing Mouse SERP
  • 14. Result - KS prediction 140 distinct configurations * 10-fold cross validation * 10 repetitions Metrics: ● Accuracy (Accu) across all classes ● Precision (P), Recall (R), F1 (F1) score of each class Macro average of precision (P), recall (R), and F1 (F1) ● Runtime in milliseconds. Overall: meaningful evidence for predicting knowledge state. Outperforms baseline. Accuracy: Efficiency: KSZhang 14 RF MP SVM
  • 15. Feature importance - KS 15 Individual features ● Query complexity ● Page title length ● Page title unique terms ● Query length ● Browsed page size ● ... Feature categories Query Browsing
  • 16. Discussion & Conclusions Conclusions ● knowledge gain (state) can be predicted during informational search sessions with a certain level of accuracy, ● performance of the knowledge gain prediction appears to be generally better, suggesting that the task is easier given the nature of our data, and ● the performance of the prediction approach is better for more extreme classes Limitations ● limited duration of the search sessions reduce the predictive power of certain features ● topic descriptions provided central keywords for the first query, which makes the query features less distinguishable 16
  • 17. Future works ● Conducting experiments to get data for more varied search sessions ● reproduce and refine the findings in more varied search sessions (e.g. longer duration, procedural knowledge) ● investigate features of (multimedia) resources that user interacted with ● Use predicted KS & KG for optimizing retrieval algorithms 17
  • 19. A Few More Details . . . TEST QUESTION SELECTION: ● Using 100 workers per topic, and a larger pool of items (~30); filter items that were too easy (>80% of the workers got it right) or too hard/ambiguous (<20% got it right). SAL-SESSIONS: ● Participation of only Level-3 CrowdFlower workers from primarily English-speaking countries ● 50 workers per topic, filtered out workers who entered no queries, workers who selected the same option ‘TRUE/FALSE’ for all items, those who did not complete the post-session test ⇒ 420 workers 19