SlideShare a Scribd company logo
1 of 49
Download to read offline
Improving user experience in
recommender systems
How latent feature diversification
can decrease choice difficulty and
improve choice satisfaction
Martijn Willemsen
Talk at Institute for Software
technology, Nov 3, 2015
Graz University of Technology
Information overload
Recommendation
Choice Overload in Recommenders
Recommenders reduce information
overload…
But large personalized sets cause choice
overload!
Top-N of all highly ranked items
What should I choose?
These are all very attractive!
Choice Overload
Seminal example of choice overload
Satisfaction decreases with larger sets as increased
attractiveness is counteracted by choice difficulty
More attractive
3% sales
Less attractive
30% sales
Higher purchase
satisfaction
From Iyengar and Lepper (2000)
Choice Overload in Recommenders
(Bollen, Knijnenburg, Willemsen & Graus, RecSys 2010)
perceived recommendation
variety
perceived recommendation
quality
Top-20
vs Top-5 recommendations
movie
expertise
choice
satisfaction
choice
difficulty
+
+
+
+
-+
.401 (.189)
p < .05
.170 (.069)
p < .05
.449 (.072)
p < .001
.346 (.125)
p < .01
.445 (.102)
p < .001
-.217 (.070)
p < .005
Objective System Aspects (OSA)
Subjective System Aspects (SSA)
Experience (EXP)
Personal Characteristics (PC)
Interaction (INT)
Lin-20
vs Top-5 recommendations
+
+ - +
.172 (.068)
p < .05
.938 (.249)
p < .001
-.540 (.196)
p < .01
-.633 (.177)
p < .001
.496 (.152)
p < .005
-0.1
0
0.1
0.2
0.3
0.4
0.5
Top-5 Top-20 Lin-20
Choice satisfaction
A solution: diversification
Tradeoff between similarity and diversity (Smyth &
McClave, 2001) in finding relevant items
Diversification remedies high similarity of Top-N lists
But diversification reduces the overall quality (accuracy) of the list
Many studies only look at data not at real users
Simulated users interact more efficiently with a diverse set (Bridge &
Kelly 2006)
Some studies look at how actual users perceive and
evaluate diversity
Ziegler et al. (2005): diversification based on external ontology
Diversity reduced the accuracy of the recommendations
But coverage and satisfaction increased!
Goal of the current research
Extend existing work by:
Diversification based on psychological mechanisms
Test the algorithm with real users and measure their perceptions and
experiences with the algorithm
Outline of the talk
Our user-centric evaluation framework
Psychology behind choice overload
Latent Feature diversification algorithm
Two user studies to test our diversification
User-Centric Framework
Computers Scientists (and marketing researchers) like to study
behavior…. (they hate asking the user or just cannot (AB tests))
User-Centric Framework
Psychologists and HCI people are mostly interested in experience…
User-Centric Framework
Though it helps to triangulate experience with behavior…
User-Centric Framework
Our framework adds the intermediate construct of perception that explains
why behavior and experiences changes due to our manipulations
User-Centric Framework
And adds personal
and situational
characteristics
Relations modeled
using factor analysis
and SEM
Knijnenburg, B.P., Willemsen, M.C., Gantner, Z., Soncu, H., Newell, C. (2012). Explaining
the User Experience of Recommender Systems. User Modeling and User-Adapted
Interaction (UMUAI), vol 22, p. 441-504 http://bit.ly/umuai
Latent feature diversification
from Psy to CS
Joint work with Mark Graus and
Bart Knijnenburg
Psychology behind choice overload
More options provide more benefits in terms of finding the
right option…
…but result in higher costs/effort
Objective effort:
More comparisons required
Cognitive effort:
Increased potential regret
Larger expectations for larger
sets
Many tradeoffs
Research on Choice overload
Choice overload is not omnipresent
Meta-analysis (Scheibehenne et al., JCR 2010)
suggests an overall effect size of zero
Choice overload stronger when:
No strong prior preferences
Little difference in attractiveness items
Prior studies did not control for
the diversity of the item set
Can we reduce choice difficulty and overload by using
personalized diversified item sets?
While controlling for attractiveness…
Diversification and attractiveness
Camera:
Suppose Peter thinks
resolution (MP) and Zoom
are equally important
user vector shows preference
direction
Equi-preference line:
Set of equally attractive options
(orthogonal on user vector)
Diversify on the equi-
preference line!
Choice Difficulty and Diversity
Larger sets are often more difficult because of the
increased uniformity of these sets (Fasolo et al., 2009;
Reutskaja et al., 2009)
Larger item sets have many
similar options
small inter-product distances
and small tradeoffs
High density!
Choice Difficulty related to
lack of justification 0
5
10
15
20
0 10 20Resolution(MP)
Zoom
High Density
small tradeoffs
Choice difficulty and trade-offs
As item sets become more diverse (less dense) tradeoff
size increases
Tradeoffs are effortful…
give up one aspect for another
But can be justified very easily!
0
5
10
15
20
0 10 20
Resolution(MP)
Zoom
High Density
small tradeoffs
0
5
10
15
20
0 10 20
Resolution(MP)
Zoom
Low Density
large tradeoffs
Double Mediation Model for difficulty
(Scholten and Sherman, JEP:G 2006)
U-shaped relation between diversity and difficulty:
Choosing from uniform set is
hard to justify but has no
difficult tradeoffs
Choosing from a diverse set
encompasses difficult tradeoffs
but is easy to justify
Does this also apply to personalized item sets?
How can we apply this to recommender system
algorithms? What features to diversify on?
Difficulty
diversity
uniform diverse
Features in Matrix Factorization
Latent Features as means
of diversification!
“Latent features are Preference
dimensions related to real world
concepts (e.g. Escapist/serious)”
(Koren, Bell and Volinsky,2009)
Users and items described as
vectors of latent features
Parallel to how choice sets are described in
MAUT(multi-attribute utility theory) in consumer
psychology
Explaining Matrix Factorization
Map users and items to a joint latent factor space of
dimensionality f
Each item is a vector qi
each user a vector pu
Predicted rating r of user u for item i:
How to find the dimensions?
SVD: singular value decomposition
u
T
iui pqr ˆ
Matrix Factorization algorithms
each user a vector pu Each item is a vector qi
Usual
Suspects
Titanic
DieHard
Godfather
Jack
Dylan
Olivia
Mark
?
?
?
? ? ?
?
pu
Dim1
Dim2
Jack 3 -1
Dylan 1.4 .2
Olivia -2.5 -.8
Mark -2 -1.5
qi
Usual
Suspects
Titanic
DieHard
Godfather
Dim 1 1.6 -1 5 0.2
Dim 2 1 1 .3 -.2
Two-dimensional Latent feature space and
diversification
Jack
Mark
Olivia Dylan
Greedy Diversity Algorithm
10-dimensional MF model
Take personalized top-200
Low: closest to centroid
Greedy algorithm
Select K items with
highest inter-item distance
(using city-block)
Medium:
select maximally diverse from
100 items closest to centroid
High: from all items in top-200
Greedy Diversity Algorithm
10-dimensional MF model
Take personalized top-200
Low: closest to centroid
Greedy algorithm
Select K items with
highest inter-item distance
(using city-block)
Medium:
select maximally diverse from
100 items closest to centroid
High: from all items in top-200
The algorithm
Measure of density: AFSR
Density/tradeoffs on the features: capture how items are distributed
in the feature space.
Average Factor Score Range (AFSR) based on the density metric
used by Fasolo et al. (2009)
X is set of items i
D is number of features
Captures the distribution of items in the feature space and their
tradeoffs better than standard similarity measures
Selection of initial set
Top-200 was selected as a balanced initial set:
Large differences in AFSR scores for the 3 levels of diversification
High average predicted rating: 4.48
Range in predicted ratings (0.546) lower than error of MF model:
MAE = .656 (RMSE = .854)
Final check: How does attractiveness vary within
and between the sets?
more diverse sets are by nature more likely to capture high-ranked
items…
Set Size Diversification Rating AFSR
5
Low 4.505 0.295
Medium 4.529 0.634
High 4.561 1.210
20
Low 4.537 0.586
Medium 4.558 1.005
High 4.604 1.615
System characteristics
MF recommender based on MyMedia project
10M MovieLens dataset: movies from 1994
5.6M ratings for 70k users and 5.4k movies
RMSE of 0.854, MAE of 0.656
Movies shown with title and predicted rating:
hovering the mouse over the title reveals additional information:
short synopsis, cast, director and image
Study 1a: Check diversification algorithm
Does our diversification affects the subjective experiences
with the recommendations?
Do people perceive the diversity?
Does it affects attractiveness and choice difficulty?
Each participant inspects 3 lists (low, mid and high
diversification), order counterbalanced
No choices made from the list
Study 1a: design/procedure
Pre-questionnaire
Personal characteristics
Rating task to train the system (10 ratings)
Assess three lists of recommendations
Within subjects: low / mid / high diversification
Between subjects: number of items (5,10,15,20,25)
After each list we measured:
Perceived Diversity & Attractiveness
Expected Trade-Off Difficulty & Choice Difficulty
Study 1a: Participants and Manipulation
checks
97 Participants from an online database
Paid for participation
Mean age: 29 years, 52 females and 45 males
Low, medium and high diversification
differed in the feature score range
Average predicted ratings of the sets were not different!
diversity
Average AFSR
(SE)
Avg. Predicted
rating
(SE)
Low 0.959 (0.015) 4.486 (0.042)
Medium 1.273 (0.016) 4.486 (0.041)
High 1.744 (0.024) 4.527 (0.039)
Study 1a: Structural Equation Model
Study 1a: Structural Equation Model
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
low mid high
standardizedscore
diversification
attractiveness diversity
-1
-0.8
-0.6
-0.4
-0.2
0
0.2
low mid high
scaledifference
diversification
choice diff. tradeoff diff.
Study 1a: Conclusions & Discussion
Diversifying on latent features
Increases attractiveness/diversity
Reduces trade-off difficulty (high)
Reduces choice difficulty (linearly)
No evidence for U-shaped difficulty model
High diversity does not result in trade-off conflicts
(perhaps due to the nature of the domain/MF?)
No effect of number on items
Small sets benefit as much from diversification
Diversification on MF features seems
promising to increase attractiveness!
-1
-0.8
-0.6
-0.4
-0.2
0
0.2
low mid high
scaledifference
diversification
choice diff. tradeoff diff.
Study 1b: No choice satisfaction
In Study 1a, no actual choice was made
Explains limited effect of number of items
We could not measure choice satisfaction or justification-based
processes
Diversification and list length as two factors in a new
experiment with choice (and choice satisfaction)
Item size: 5, 10 and 20
Low and high diversity (no medium)
We expect choice difficulty to be more prominent for low
diversity sets
Study 1b: design/procedure
Pre-questionnaire
Personal characteristics
Rating task to train the system (10 ratings)
Choose one item from a list of recommendations
Between subjects: 2 levels (low / high diversification)
X 3 lengths (5, 10 or 20 items)
Afterwards we measured:
Perceived Diversity & Attractiveness
Choice Difficulty and Choice satisfaction
87 Participants from an online database
Paid for participation Mean age: 29Y, 41F, 46M
Questionnaire-items
Perceived recommendation diversity
5 items, e.g. “The list of movies was varied”
Perceived recommendation attractiveness
5 items, e.g. “The list of recommendations was attractive”
Choice satisfaction
6 items, e.g. “I think I would enjoy watching the chosen movie”
Choice difficulty
5 items, e.g.: “It was easy to select a movie”
Study 1b: Structural Equation Model
Study 1b: Structural Equation Model
-0.5
0.0
0.5
1.0
1.5
2.0
5 10 20
standardizedscore
list length
Diversity
low diversity
high diversity
0.0
0.5
1.0
1.5
2.0
2.5
5 10 20
standardizedscore
list length
Satisfaction
low diversity high diversity
Study 1b: Results
Long list is more difficult (cognitive and objective effort
(hovers)) but also more satisfying in itself
Diversity influences choice satisfaction in important ways
Diversity increases attractiveness and reduces difficulty
These can increase satisfaction (but only for short lists)
Our diversification improved the 5-item list
5 diverse items are as satisfactory as
10 or 20 items and less difficult!
Less effort needed (hovers)
Using latent feature diversification
one does not need long item lists…
0.0
0.5
1.0
1.5
2.0
2.5
5 10 20
standardizedscore
list length
Satisfaction
low diversity high diversity
But…
Our studies show that low or high diversification from the
centroid of Top-200 works
However, these sets were optimized for diversity, not for prediction
accuracy
Most item sets thus not contain the best predicted items
So how does this compare to standard Top-N lists?
Slight modification of our algorithm: diversify starting from the best
predicted item in the top-N set (Top-1) rather than the centroid
Diversification and list length as two factors in a choice
overload experiment
list sizes: 5 and 20
Diversification: none (top 5/20), medium, high
Properties of the item sets
Set size diversification Avg. AFSR Avg. Rank
5
High (α = 1) 1.380 78.284
Medium (α = 0.3) 1.096 7.484
Top-N (α = 0) 0.774 3.000
20
High (α = 1) 1.793 89.380
Medium (α = 0.3) 1.486 17.849
Top-N (α = 0) 1.270 10.500
Algorithm balances between max diversity and highest rank
Every iteration: weigh (1-α) highest rank
against highest diversification (α)
α=0: top-N, α=1: max diverse.
(α=0.3 gives good medium diversity)
Design/procedure Study 2
159 Participants from an online database
Rating task to train the system (15 ratings)
Choose one item from a list of recommendations
Between subjects: 3 levels of diversification, 2 lengths
Afterwards we measured:
Perceptions: Perceived Diversity & Attractiveness
Experience: Choice Difficulty and Choice satisfaction
Behavior: total views / unique items considered
Structural Equation Model
Perceived Diversity & attractiveness
Perceived Diversity increases with
Diversification
Similarly for 5 and 20 items
Perc. Diversity increases attractiveness
Perceived difficulty goes down with
diversification
Perceived attractiveness goes up
with diversification
Diverse 5 item set excels…
Just as satisfying as 20 items
Less difficult to choose from
Less cognitive load…!
-0.5
0
0.5
1
1.5
none med high
standardizedscore
diversification
Perc. Diversity
5 items
20 items
-0.2
0
0.2
0.4
0.6
0.8
1
none med high
standardizedscore
diversification
Choice Satisfaction
5 items
20 items
Choice Characteristics
Chosen option (mean and std. err)
Set
Diversity
List Position Rating Rank
5 items
None (top 5) 3.60 (0.27) 4.51 (0.07) 3.60 (0.27)
Medium 4.41 (0.59) 4.41 (0.07) 14.52 (5.37)
High 4.19 (0.27) 4.30 (0.07) 77.59 (12.76)
20 items
None (top 20) 10.15 (0.92) 4.45 (0.05) 10.15 (0.92)
Medium 10.33 (1.18) 4.40 (0.08) 17.7 (2.68)
high 9.93 (1.07) 4.16 (0.07) 72.22 (11.84)
With higher diversity, no difference in position of chosen option
Resulting in less ‘optimal’ choice in terms of predicted rating
Without a reduction in choice satisfaction!
Conclusions
Reducing Choice difficulty and overload
Diversity reduces choice difficulty
Less uniform sets are easier to choose from
Latent feature diversification easy to implement
Diversity can improve choice satisfaction
Even when the diversified list has movies with lower
predicted ratings than standard top-N lists
No need for larger item sets
Offering personalized diversified small items sets might be the key
to help decision makers cope with too much choice!
Psychological theory can inform how to improve the
output of Recommender algorithms
What you should take away…
Psychological theory can inform new ways of diversifying
algorithm output or eliciting preferences
But also: working with recommenders and algorithms we could
enhance psychological theory: personalized item sets gives control
User-centric evaluation helps to assess the effectiveness
Lot of work… and we need user studies…
But linking subjective to objective measures might help future
studies that cannot do user studies
User-centric framework allows us to understand WHY
particular approaches work or not
Concept of mediation: user perception helps understanding..
Questions?
Contact:
Martijn Willemsen
@MCWillemsen
M.C.Willemsen@tue.nl
www.martijnwillemsen.nl
Thanks to my co-authors:
Mark Graus
Bart Knijnenburg

More Related Content

What's hot

(Gaurav sawant &amp; dhaval sawlani)bia 678 final project report
(Gaurav sawant &amp; dhaval sawlani)bia 678 final project report(Gaurav sawant &amp; dhaval sawlani)bia 678 final project report
(Gaurav sawant &amp; dhaval sawlani)bia 678 final project reportGaurav Sawant
 
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015Journal For Research
 
Recommender system
Recommender systemRecommender system
Recommender systemSaiguru P.v
 
Paper Presentation: Data Mining User Preference in Interactive Multimedia
Paper Presentation: Data Mining User Preference in Interactive MultimediaPaper Presentation: Data Mining User Preference in Interactive Multimedia
Paper Presentation: Data Mining User Preference in Interactive MultimediaJeanette Howe
 
Jeffrey xu yu large graph processing
Jeffrey xu yu large graph processingJeffrey xu yu large graph processing
Jeffrey xu yu large graph processingjins0618
 
Mentor Palokaj - Thesis Final Version v1.1 - Response of Bartle types to a me...
Mentor Palokaj - Thesis Final Version v1.1 - Response of Bartle types to a me...Mentor Palokaj - Thesis Final Version v1.1 - Response of Bartle types to a me...
Mentor Palokaj - Thesis Final Version v1.1 - Response of Bartle types to a me...Mentor Palokaj
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation SystemsZia Babar
 
Fairness-aware Learning through Regularization Approach
Fairness-aware Learning through Regularization ApproachFairness-aware Learning through Regularization Approach
Fairness-aware Learning through Regularization ApproachToshihiro Kamishima
 
Stated preference methods and analysis
Stated preference methods and analysisStated preference methods and analysis
Stated preference methods and analysisHabet Madoyan
 

What's hot (12)

(Gaurav sawant &amp; dhaval sawlani)bia 678 final project report
(Gaurav sawant &amp; dhaval sawlani)bia 678 final project report(Gaurav sawant &amp; dhaval sawlani)bia 678 final project report
(Gaurav sawant &amp; dhaval sawlani)bia 678 final project report
 
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
 
Recommender system
Recommender systemRecommender system
Recommender system
 
Recommender-technology-ReColl08
Recommender-technology-ReColl08Recommender-technology-ReColl08
Recommender-technology-ReColl08
 
Paper Presentation: Data Mining User Preference in Interactive Multimedia
Paper Presentation: Data Mining User Preference in Interactive MultimediaPaper Presentation: Data Mining User Preference in Interactive Multimedia
Paper Presentation: Data Mining User Preference in Interactive Multimedia
 
Jeffrey xu yu large graph processing
Jeffrey xu yu large graph processingJeffrey xu yu large graph processing
Jeffrey xu yu large graph processing
 
Recommender systems to help people move forward
Recommender systems to help people move forwardRecommender systems to help people move forward
Recommender systems to help people move forward
 
Developing Movie Recommendation System
Developing Movie Recommendation SystemDeveloping Movie Recommendation System
Developing Movie Recommendation System
 
Mentor Palokaj - Thesis Final Version v1.1 - Response of Bartle types to a me...
Mentor Palokaj - Thesis Final Version v1.1 - Response of Bartle types to a me...Mentor Palokaj - Thesis Final Version v1.1 - Response of Bartle types to a me...
Mentor Palokaj - Thesis Final Version v1.1 - Response of Bartle types to a me...
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
 
Fairness-aware Learning through Regularization Approach
Fairness-aware Learning through Regularization ApproachFairness-aware Learning through Regularization Approach
Fairness-aware Learning through Regularization Approach
 
Stated preference methods and analysis
Stated preference methods and analysisStated preference methods and analysis
Stated preference methods and analysis
 

Similar to Improving user experience in recommender systems

Machine Learning in e commerce - Reboot
Machine Learning in e commerce - RebootMachine Learning in e commerce - Reboot
Machine Learning in e commerce - RebootMarion DE SOUSA
 
Introduction to MaxDiff Scaling of Importance - Parametric Marketing Slides
Introduction to MaxDiff Scaling of Importance - Parametric Marketing SlidesIntroduction to MaxDiff Scaling of Importance - Parametric Marketing Slides
Introduction to MaxDiff Scaling of Importance - Parametric Marketing SlidesQuestionPro
 
acmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxacmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxdongchangim30
 
Interactively Evolving User Interfaces
Interactively Evolving User InterfacesInteractively Evolving User Interfaces
Interactively Evolving User InterfacesJuan Quiroz
 
Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011idoguy
 
Introduction to recommender systems
Introduction to recommender systemsIntroduction to recommender systems
Introduction to recommender systemsRami Alsalman
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemMilind Gokhale
 
typicality-based collaborative filtering recommendation
typicality-based collaborative filtering recommendationtypicality-based collaborative filtering recommendation
typicality-based collaborative filtering recommendationswathi78
 
Computational Biology, Part 4 Protein Coding Regions
Computational Biology, Part 4 Protein Coding RegionsComputational Biology, Part 4 Protein Coding Regions
Computational Biology, Part 4 Protein Coding Regionsbutest
 
Lecture 09(introduction to machine learning)
Lecture 09(introduction to machine learning)Lecture 09(introduction to machine learning)
Lecture 09(introduction to machine learning)Jeet Das
 
KnowMe and ShareMe: understanding automatically discovered personality traits...
KnowMe and ShareMe: understanding automatically discovered personality traits...KnowMe and ShareMe: understanding automatically discovered personality traits...
KnowMe and ShareMe: understanding automatically discovered personality traits...Leon Gou
 
Topic_6
Topic_6Topic_6
Topic_6butest
 
Introduction to Advanced Analytics (without equations!)
Introduction to Advanced Analytics (without equations!)Introduction to Advanced Analytics (without equations!)
Introduction to Advanced Analytics (without equations!)Bonamy Finch
 
Quality of Multimedia Experience: Past, Present and Future
Quality of Multimedia Experience: Past, Present and FutureQuality of Multimedia Experience: Past, Present and Future
Quality of Multimedia Experience: Past, Present and FutureTouradj Ebrahimi
 
Analytics for Virtual Worlds
Analytics for Virtual WorldsAnalytics for Virtual Worlds
Analytics for Virtual WorldsHilary Mason
 
Typicality-Based Collaborative Filtering Recommendation
Typicality-Based Collaborative Filtering RecommendationTypicality-Based Collaborative Filtering Recommendation
Typicality-Based Collaborative Filtering RecommendationPapitha Velumani
 
Recommender Systems and the Human Factor
Recommender Systems and the Human FactorRecommender Systems and the Human Factor
Recommender Systems and the Human FactorMark Graus
 

Similar to Improving user experience in recommender systems (20)

Machine Learning in e commerce - Reboot
Machine Learning in e commerce - RebootMachine Learning in e commerce - Reboot
Machine Learning in e commerce - Reboot
 
Introduction to MaxDiff Scaling of Importance - Parametric Marketing Slides
Introduction to MaxDiff Scaling of Importance - Parametric Marketing SlidesIntroduction to MaxDiff Scaling of Importance - Parametric Marketing Slides
Introduction to MaxDiff Scaling of Importance - Parametric Marketing Slides
 
acmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxacmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptx
 
Interactively Evolving User Interfaces
Interactively Evolving User InterfacesInteractively Evolving User Interfaces
Interactively Evolving User Interfaces
 
Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011
 
Introduction to recommender systems
Introduction to recommender systemsIntroduction to recommender systems
Introduction to recommender systems
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
typicality-based collaborative filtering recommendation
typicality-based collaborative filtering recommendationtypicality-based collaborative filtering recommendation
typicality-based collaborative filtering recommendation
 
Computational Biology, Part 4 Protein Coding Regions
Computational Biology, Part 4 Protein Coding RegionsComputational Biology, Part 4 Protein Coding Regions
Computational Biology, Part 4 Protein Coding Regions
 
Lecture 09(introduction to machine learning)
Lecture 09(introduction to machine learning)Lecture 09(introduction to machine learning)
Lecture 09(introduction to machine learning)
 
KnowMe and ShareMe: understanding automatically discovered personality traits...
KnowMe and ShareMe: understanding automatically discovered personality traits...KnowMe and ShareMe: understanding automatically discovered personality traits...
KnowMe and ShareMe: understanding automatically discovered personality traits...
 
Topic_6
Topic_6Topic_6
Topic_6
 
Introduction to Advanced Analytics (without equations!)
Introduction to Advanced Analytics (without equations!)Introduction to Advanced Analytics (without equations!)
Introduction to Advanced Analytics (without equations!)
 
Slalom
SlalomSlalom
Slalom
 
Quality of Multimedia Experience: Past, Present and Future
Quality of Multimedia Experience: Past, Present and FutureQuality of Multimedia Experience: Past, Present and Future
Quality of Multimedia Experience: Past, Present and Future
 
PPT s09-machine vision-s2
PPT s09-machine vision-s2PPT s09-machine vision-s2
PPT s09-machine vision-s2
 
Analytics for Virtual Worlds
Analytics for Virtual WorldsAnalytics for Virtual Worlds
Analytics for Virtual Worlds
 
Typicality-Based Collaborative Filtering Recommendation
Typicality-Based Collaborative Filtering RecommendationTypicality-Based Collaborative Filtering Recommendation
Typicality-Based Collaborative Filtering Recommendation
 
Abstract
AbstractAbstract
Abstract
 
Recommender Systems and the Human Factor
Recommender Systems and the Human FactorRecommender Systems and the Human Factor
Recommender Systems and the Human Factor
 

Recently uploaded

UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesPrabhanshu Chaturvedi
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsRussian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 

Recently uploaded (20)

UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsRussian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
 

Improving user experience in recommender systems

  • 1. Improving user experience in recommender systems How latent feature diversification can decrease choice difficulty and improve choice satisfaction Martijn Willemsen Talk at Institute for Software technology, Nov 3, 2015 Graz University of Technology
  • 3. Choice Overload in Recommenders Recommenders reduce information overload… But large personalized sets cause choice overload! Top-N of all highly ranked items What should I choose? These are all very attractive!
  • 4. Choice Overload Seminal example of choice overload Satisfaction decreases with larger sets as increased attractiveness is counteracted by choice difficulty More attractive 3% sales Less attractive 30% sales Higher purchase satisfaction From Iyengar and Lepper (2000)
  • 5. Choice Overload in Recommenders (Bollen, Knijnenburg, Willemsen & Graus, RecSys 2010) perceived recommendation variety perceived recommendation quality Top-20 vs Top-5 recommendations movie expertise choice satisfaction choice difficulty + + + + -+ .401 (.189) p < .05 .170 (.069) p < .05 .449 (.072) p < .001 .346 (.125) p < .01 .445 (.102) p < .001 -.217 (.070) p < .005 Objective System Aspects (OSA) Subjective System Aspects (SSA) Experience (EXP) Personal Characteristics (PC) Interaction (INT) Lin-20 vs Top-5 recommendations + + - + .172 (.068) p < .05 .938 (.249) p < .001 -.540 (.196) p < .01 -.633 (.177) p < .001 .496 (.152) p < .005 -0.1 0 0.1 0.2 0.3 0.4 0.5 Top-5 Top-20 Lin-20 Choice satisfaction
  • 6. A solution: diversification Tradeoff between similarity and diversity (Smyth & McClave, 2001) in finding relevant items Diversification remedies high similarity of Top-N lists But diversification reduces the overall quality (accuracy) of the list Many studies only look at data not at real users Simulated users interact more efficiently with a diverse set (Bridge & Kelly 2006) Some studies look at how actual users perceive and evaluate diversity Ziegler et al. (2005): diversification based on external ontology Diversity reduced the accuracy of the recommendations But coverage and satisfaction increased!
  • 7. Goal of the current research Extend existing work by: Diversification based on psychological mechanisms Test the algorithm with real users and measure their perceptions and experiences with the algorithm Outline of the talk Our user-centric evaluation framework Psychology behind choice overload Latent Feature diversification algorithm Two user studies to test our diversification
  • 8. User-Centric Framework Computers Scientists (and marketing researchers) like to study behavior…. (they hate asking the user or just cannot (AB tests))
  • 9. User-Centric Framework Psychologists and HCI people are mostly interested in experience…
  • 10. User-Centric Framework Though it helps to triangulate experience with behavior…
  • 11. User-Centric Framework Our framework adds the intermediate construct of perception that explains why behavior and experiences changes due to our manipulations
  • 12. User-Centric Framework And adds personal and situational characteristics Relations modeled using factor analysis and SEM Knijnenburg, B.P., Willemsen, M.C., Gantner, Z., Soncu, H., Newell, C. (2012). Explaining the User Experience of Recommender Systems. User Modeling and User-Adapted Interaction (UMUAI), vol 22, p. 441-504 http://bit.ly/umuai
  • 13. Latent feature diversification from Psy to CS Joint work with Mark Graus and Bart Knijnenburg
  • 14. Psychology behind choice overload More options provide more benefits in terms of finding the right option… …but result in higher costs/effort Objective effort: More comparisons required Cognitive effort: Increased potential regret Larger expectations for larger sets Many tradeoffs
  • 15. Research on Choice overload Choice overload is not omnipresent Meta-analysis (Scheibehenne et al., JCR 2010) suggests an overall effect size of zero Choice overload stronger when: No strong prior preferences Little difference in attractiveness items Prior studies did not control for the diversity of the item set Can we reduce choice difficulty and overload by using personalized diversified item sets? While controlling for attractiveness…
  • 16. Diversification and attractiveness Camera: Suppose Peter thinks resolution (MP) and Zoom are equally important user vector shows preference direction Equi-preference line: Set of equally attractive options (orthogonal on user vector) Diversify on the equi- preference line!
  • 17. Choice Difficulty and Diversity Larger sets are often more difficult because of the increased uniformity of these sets (Fasolo et al., 2009; Reutskaja et al., 2009) Larger item sets have many similar options small inter-product distances and small tradeoffs High density! Choice Difficulty related to lack of justification 0 5 10 15 20 0 10 20Resolution(MP) Zoom High Density small tradeoffs
  • 18. Choice difficulty and trade-offs As item sets become more diverse (less dense) tradeoff size increases Tradeoffs are effortful… give up one aspect for another But can be justified very easily! 0 5 10 15 20 0 10 20 Resolution(MP) Zoom High Density small tradeoffs 0 5 10 15 20 0 10 20 Resolution(MP) Zoom Low Density large tradeoffs
  • 19. Double Mediation Model for difficulty (Scholten and Sherman, JEP:G 2006) U-shaped relation between diversity and difficulty: Choosing from uniform set is hard to justify but has no difficult tradeoffs Choosing from a diverse set encompasses difficult tradeoffs but is easy to justify Does this also apply to personalized item sets? How can we apply this to recommender system algorithms? What features to diversify on? Difficulty diversity uniform diverse
  • 20. Features in Matrix Factorization Latent Features as means of diversification! “Latent features are Preference dimensions related to real world concepts (e.g. Escapist/serious)” (Koren, Bell and Volinsky,2009) Users and items described as vectors of latent features Parallel to how choice sets are described in MAUT(multi-attribute utility theory) in consumer psychology
  • 21. Explaining Matrix Factorization Map users and items to a joint latent factor space of dimensionality f Each item is a vector qi each user a vector pu Predicted rating r of user u for item i: How to find the dimensions? SVD: singular value decomposition u T iui pqr ˆ
  • 22. Matrix Factorization algorithms each user a vector pu Each item is a vector qi Usual Suspects Titanic DieHard Godfather Jack Dylan Olivia Mark ? ? ? ? ? ? ? pu Dim1 Dim2 Jack 3 -1 Dylan 1.4 .2 Olivia -2.5 -.8 Mark -2 -1.5 qi Usual Suspects Titanic DieHard Godfather Dim 1 1.6 -1 5 0.2 Dim 2 1 1 .3 -.2
  • 23. Two-dimensional Latent feature space and diversification Jack Mark Olivia Dylan
  • 24. Greedy Diversity Algorithm 10-dimensional MF model Take personalized top-200 Low: closest to centroid Greedy algorithm Select K items with highest inter-item distance (using city-block) Medium: select maximally diverse from 100 items closest to centroid High: from all items in top-200
  • 25. Greedy Diversity Algorithm 10-dimensional MF model Take personalized top-200 Low: closest to centroid Greedy algorithm Select K items with highest inter-item distance (using city-block) Medium: select maximally diverse from 100 items closest to centroid High: from all items in top-200
  • 26. The algorithm Measure of density: AFSR Density/tradeoffs on the features: capture how items are distributed in the feature space. Average Factor Score Range (AFSR) based on the density metric used by Fasolo et al. (2009) X is set of items i D is number of features Captures the distribution of items in the feature space and their tradeoffs better than standard similarity measures
  • 27. Selection of initial set Top-200 was selected as a balanced initial set: Large differences in AFSR scores for the 3 levels of diversification High average predicted rating: 4.48 Range in predicted ratings (0.546) lower than error of MF model: MAE = .656 (RMSE = .854) Final check: How does attractiveness vary within and between the sets? more diverse sets are by nature more likely to capture high-ranked items… Set Size Diversification Rating AFSR 5 Low 4.505 0.295 Medium 4.529 0.634 High 4.561 1.210 20 Low 4.537 0.586 Medium 4.558 1.005 High 4.604 1.615
  • 28. System characteristics MF recommender based on MyMedia project 10M MovieLens dataset: movies from 1994 5.6M ratings for 70k users and 5.4k movies RMSE of 0.854, MAE of 0.656 Movies shown with title and predicted rating: hovering the mouse over the title reveals additional information: short synopsis, cast, director and image
  • 29. Study 1a: Check diversification algorithm Does our diversification affects the subjective experiences with the recommendations? Do people perceive the diversity? Does it affects attractiveness and choice difficulty? Each participant inspects 3 lists (low, mid and high diversification), order counterbalanced No choices made from the list
  • 30. Study 1a: design/procedure Pre-questionnaire Personal characteristics Rating task to train the system (10 ratings) Assess three lists of recommendations Within subjects: low / mid / high diversification Between subjects: number of items (5,10,15,20,25) After each list we measured: Perceived Diversity & Attractiveness Expected Trade-Off Difficulty & Choice Difficulty
  • 31. Study 1a: Participants and Manipulation checks 97 Participants from an online database Paid for participation Mean age: 29 years, 52 females and 45 males Low, medium and high diversification differed in the feature score range Average predicted ratings of the sets were not different! diversity Average AFSR (SE) Avg. Predicted rating (SE) Low 0.959 (0.015) 4.486 (0.042) Medium 1.273 (0.016) 4.486 (0.041) High 1.744 (0.024) 4.527 (0.039)
  • 32. Study 1a: Structural Equation Model
  • 33. Study 1a: Structural Equation Model 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 low mid high standardizedscore diversification attractiveness diversity -1 -0.8 -0.6 -0.4 -0.2 0 0.2 low mid high scaledifference diversification choice diff. tradeoff diff.
  • 34. Study 1a: Conclusions & Discussion Diversifying on latent features Increases attractiveness/diversity Reduces trade-off difficulty (high) Reduces choice difficulty (linearly) No evidence for U-shaped difficulty model High diversity does not result in trade-off conflicts (perhaps due to the nature of the domain/MF?) No effect of number on items Small sets benefit as much from diversification Diversification on MF features seems promising to increase attractiveness! -1 -0.8 -0.6 -0.4 -0.2 0 0.2 low mid high scaledifference diversification choice diff. tradeoff diff.
  • 35. Study 1b: No choice satisfaction In Study 1a, no actual choice was made Explains limited effect of number of items We could not measure choice satisfaction or justification-based processes Diversification and list length as two factors in a new experiment with choice (and choice satisfaction) Item size: 5, 10 and 20 Low and high diversity (no medium) We expect choice difficulty to be more prominent for low diversity sets
  • 36. Study 1b: design/procedure Pre-questionnaire Personal characteristics Rating task to train the system (10 ratings) Choose one item from a list of recommendations Between subjects: 2 levels (low / high diversification) X 3 lengths (5, 10 or 20 items) Afterwards we measured: Perceived Diversity & Attractiveness Choice Difficulty and Choice satisfaction 87 Participants from an online database Paid for participation Mean age: 29Y, 41F, 46M
  • 37. Questionnaire-items Perceived recommendation diversity 5 items, e.g. “The list of movies was varied” Perceived recommendation attractiveness 5 items, e.g. “The list of recommendations was attractive” Choice satisfaction 6 items, e.g. “I think I would enjoy watching the chosen movie” Choice difficulty 5 items, e.g.: “It was easy to select a movie”
  • 38. Study 1b: Structural Equation Model
  • 39. Study 1b: Structural Equation Model -0.5 0.0 0.5 1.0 1.5 2.0 5 10 20 standardizedscore list length Diversity low diversity high diversity 0.0 0.5 1.0 1.5 2.0 2.5 5 10 20 standardizedscore list length Satisfaction low diversity high diversity
  • 40. Study 1b: Results Long list is more difficult (cognitive and objective effort (hovers)) but also more satisfying in itself Diversity influences choice satisfaction in important ways Diversity increases attractiveness and reduces difficulty These can increase satisfaction (but only for short lists) Our diversification improved the 5-item list 5 diverse items are as satisfactory as 10 or 20 items and less difficult! Less effort needed (hovers) Using latent feature diversification one does not need long item lists… 0.0 0.5 1.0 1.5 2.0 2.5 5 10 20 standardizedscore list length Satisfaction low diversity high diversity
  • 41. But… Our studies show that low or high diversification from the centroid of Top-200 works However, these sets were optimized for diversity, not for prediction accuracy Most item sets thus not contain the best predicted items So how does this compare to standard Top-N lists? Slight modification of our algorithm: diversify starting from the best predicted item in the top-N set (Top-1) rather than the centroid Diversification and list length as two factors in a choice overload experiment list sizes: 5 and 20 Diversification: none (top 5/20), medium, high
  • 42. Properties of the item sets Set size diversification Avg. AFSR Avg. Rank 5 High (α = 1) 1.380 78.284 Medium (α = 0.3) 1.096 7.484 Top-N (α = 0) 0.774 3.000 20 High (α = 1) 1.793 89.380 Medium (α = 0.3) 1.486 17.849 Top-N (α = 0) 1.270 10.500 Algorithm balances between max diversity and highest rank Every iteration: weigh (1-α) highest rank against highest diversification (α) α=0: top-N, α=1: max diverse. (α=0.3 gives good medium diversity)
  • 43. Design/procedure Study 2 159 Participants from an online database Rating task to train the system (15 ratings) Choose one item from a list of recommendations Between subjects: 3 levels of diversification, 2 lengths Afterwards we measured: Perceptions: Perceived Diversity & Attractiveness Experience: Choice Difficulty and Choice satisfaction Behavior: total views / unique items considered
  • 45. Perceived Diversity & attractiveness Perceived Diversity increases with Diversification Similarly for 5 and 20 items Perc. Diversity increases attractiveness Perceived difficulty goes down with diversification Perceived attractiveness goes up with diversification Diverse 5 item set excels… Just as satisfying as 20 items Less difficult to choose from Less cognitive load…! -0.5 0 0.5 1 1.5 none med high standardizedscore diversification Perc. Diversity 5 items 20 items -0.2 0 0.2 0.4 0.6 0.8 1 none med high standardizedscore diversification Choice Satisfaction 5 items 20 items
  • 46. Choice Characteristics Chosen option (mean and std. err) Set Diversity List Position Rating Rank 5 items None (top 5) 3.60 (0.27) 4.51 (0.07) 3.60 (0.27) Medium 4.41 (0.59) 4.41 (0.07) 14.52 (5.37) High 4.19 (0.27) 4.30 (0.07) 77.59 (12.76) 20 items None (top 20) 10.15 (0.92) 4.45 (0.05) 10.15 (0.92) Medium 10.33 (1.18) 4.40 (0.08) 17.7 (2.68) high 9.93 (1.07) 4.16 (0.07) 72.22 (11.84) With higher diversity, no difference in position of chosen option Resulting in less ‘optimal’ choice in terms of predicted rating Without a reduction in choice satisfaction!
  • 47. Conclusions Reducing Choice difficulty and overload Diversity reduces choice difficulty Less uniform sets are easier to choose from Latent feature diversification easy to implement Diversity can improve choice satisfaction Even when the diversified list has movies with lower predicted ratings than standard top-N lists No need for larger item sets Offering personalized diversified small items sets might be the key to help decision makers cope with too much choice! Psychological theory can inform how to improve the output of Recommender algorithms
  • 48. What you should take away… Psychological theory can inform new ways of diversifying algorithm output or eliciting preferences But also: working with recommenders and algorithms we could enhance psychological theory: personalized item sets gives control User-centric evaluation helps to assess the effectiveness Lot of work… and we need user studies… But linking subjective to objective measures might help future studies that cannot do user studies User-centric framework allows us to understand WHY particular approaches work or not Concept of mediation: user perception helps understanding..

Editor's Notes

  1. This is a problem that has been solved already… We can help us cope with information overload by using recommenders
  2. But once we get a large set of good recommendations, we might go from information overload to choice overload
  3. Iyengar and lepper set up two different types of taste booths in a big supermarket More people were attracted towards the boot with more jams, but less people visiting that booth actually bought jam, compared to the booth with fewer jams
  4. For the sake of time only the standaard choice situation, comparing 5 with 20 items People used a movie recommender, and got either a 5 or 20 item personalized list. The 20 item list (compared to the 5 item list) as perceived to more varied and attractive, and therefore more satisfactory, but also more difficult to choose from, reducing the satisfaction. In the end, the 5 and 20 items were just as satisfactory…
  5. Accurate recommendations often come at the problem of offering too similar options
  6. We do not search for the highest accuracy but for the underlying mechanisms! Our ultimate goal is to give users the highest satisfaction with the least difficulty, for this we need to manipulate underlying aspects of the stimulus set.
  7. A deeper look into the diversity / accuracy tradeoff and the advantages of diversification (using the right algorithm and the right Psy assumptions)
  8. If not personalized, it is not garantueed that the set is equally attractive to all users. Some users will experience difficulty, others might not. See Chernev
  9. User vector maximum difference in preference
  10. The key of manipulation density is by looking at the latent features as they are used in modern recommender systems: these are not just technical tricks to reduce the dimensionality of a set of sparse data, but as they arrive from preferences, they might be related to meaningful preference dimensions
  11. Try to fill in the blanks
  12. Use PCA like techniques to find mapping Extend this slide with the idea that there are many movies within these plains between which we can diversify…
  13. Need a set of items that are sufficiently attractive and with not too high variance in attractiveness, but large enough to diversify We take the top 200 predicted for this person, these are all movies they will like. Predicted ratings are typically around between 4 and 5 stars with a half star range.
  14. Avg pred. rating Top 100: 4.58, range = .431
  15. 3 levels of diversification because of U-shaped model Manipulate number of items to see if diversity increases with the number of items and perhaps the impact of diversification is stronger for smaller lists
  16. Maximizer not related to any concept in the model List length did not affect the variables (due to low N?) Discuss this later in study 2 Factor scores are standardized, the numbers reflect how a change in 1 SD of a factor A influences the score on another factor B Most inportant: our diversification algo is preceived to be more diverse by our participants. Higher perceived diversity increases the perceived attractiveness (and a PC of expertise: experts perceive the movies also to be more attractive) Show graph to see the total effects of the manipulations on diversity and attractiveness How do these perceptions change the experienced difficulty? Tradeoff difficulty goes down for high div. (rather than up, as the u-shaped model predicted) but is lower for participants with high preference strength Choice difficulty drops with diversity and with higher attractiveness, but increases with perceived tradeoff difficulty The net effect is that choice difficulty reduces linearly with diversity (see graph)
  17. Report the items that came out as related to the constructs in the factor analysis
  18. No relation of the personal characteristics to the concepts in the model (expertise, maximizer and strength of preference) The manipulations affect diversity and difficulty directly. The precise effects are somewhat harder to see so we use the total effect plots, as list length interacts with diversity The effect of high diversity is attenuated for larger sets. Same for attractiveness, which is related to diversity. Consistent with the existing literature and with Bollen et al. Choice satisfaction increases with attractiveness and diversity but decreases with choice difficulty Choice difficulty is function of diversity, attractiveness and list length. Longer lists are harder, and become more difficult if they are attractive, but less difficult for higher diversity. As only the 5 item list is perceived as more diverse, we see that for this list length the difficults is lower. (show graph) The choice satisfaction increaes with attractiveness, high diversity and list length, but decreases with difficulty. The 10 and 20 items lists are more satisfying, without much difference between high and low diversity, but for 5 items, we see that the high diversified 5 items are as satisfactory as the 10 and 20 items. (e.g. the high diversity makes this list more attractive and less difficult, ofsetting the negative effect of list length)