SlideShare a Scribd company logo
1 of 38
Download to read offline
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Should I Follow the Crowd?
A Probabilistic Analysis of the Effectiveness
of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research
and Development in Information Retrieval (SIGIR 2018)
Rocío Cañamares and Pablo Castells
Universidad Autónoma de Madrid
http://ir.ii.uam.es
Ann Arbor, USA, 10 July 2018
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Popularity in recommendation
Would you find it useful
to recommend these items?
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Outline
1. The popularity bias in recommender systems
2. Formal analysis
3. Experiments
4. Conclusions
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Outline
1. The popularity bias in recommender systems
2. Formal analysis
3. Experiments
4. Conclusions
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Recommender systems
Users who bought this also bought…
Music discovery
Related videos
People you may know…
. . .
now
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
The recommender system’s task
 Rating matrix with some available cell values, most cells empty
4 4 2 2 2
4 1 4
4 3 2 5 2
4 3 5 2
1 5 1
Users
Items
Abstraction of user-item
interaction
The “rating” matrix
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
The recommender system’s task
4 4 2 2 2
4 1 4
4 3 ? 2 5 ? 2
4 3 5 2
1 5 1
Users
Items
Abstraction of user-item
interaction
The “rating” matrix
 Rating matrix with some available cell values, most cells empty
 Rank items by predicting missing ratings
 Offline evaluation: split the data into training and test
 Evaluate with IR metrics  test ratings = relevance judgments
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
The popularity bias in recommender systems
Items
Users
In the data In algorithms
Mat. factorization
Nr. positive ratings
Nr.timestop10
0
400
800
0 1000 2000
Popularity
800
400
0
0 1000 2000
User-based kNN
Nr. positive ratings
2000
1000
0
0 1000 2000
Popular
items
Rest of items
(long tail)
Items
Nr.ratings
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
The popularity bias in recommender systems
Items
Users
In the data
In offline evaluation?
In algorithms
Mat. factorization
Nr. positive ratings
Nr.timestop10
0
400
800
0 1000 2000
Popularity
800
400
0
0 1000 2000
User-based kNN
Nr. positive ratings
2000
1000
0
0 1000 2000
Popular
items
Rest of items
(long tail)
Random
Nr. positive ratings
User-based kNN
Matrix factorization0.3
0.2
0.1
0
nDCG@10
MovieLens 1M
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Avoiding popularity
Popularity is not personalized, trivial, suspicious…
Metrics and algorithms have been proposed
that remove or cope with popularity biases
But… is popularity
“good” or “bad”?
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Should I follow the crowd?
?
 Lack of novelty is not a sufficient answer: degrees of popularity
 Majority provides useful default choices
 Technically simplest and cheap solution
– Most apps have majority listings
 Majority is not always right
– Randomness factors, conformity,
manipulation, etc.
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Outline
1. The popularity bias in recommender systems
2. Formal analysis
3. Experiments
4. Conclusions
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Our formal analysis
Research questions
 Is popularity really effective (accurate)
in recommendation?
– Which popularity: positive rating count / ratio?
 Are we measuring properly its effectiveness
in offline experiments?
Expected
precision
Observed vs. true
metric values
Random
variables
Rankings
Depend-
encies
Theoretical
findings
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Formalization: observed vs. true accuracy
Computed on available
user taste observations
Computed with full
knowledge of user tastes
Observed metric value True metric value
Items
Users Relevant
Non relevant
Missing ratings
≈
?
Items
Users
Expected
precision
Observed vs. true
metric values
Random
variables
Rankings
Depend-
encies
Theoretical
findings
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Formalization: expectation
 Analysis on the expected accuracy of popularity-
based recommendation
– How good is popularity compared to “something else”
– Do observed and true accuracy agree (in expectation)?
 A most simple metric: 𝔼 𝑃@1
 Expectation  random variables, probabilities
Expected
precision
Observed vs. true
metric values
Random
variables
Rankings
Depend-
encies
Theoretical
findings
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
𝑟𝑒𝑙
Formalization: key random variables
𝑟𝑎𝑡𝑒𝑑
Users × Items
Expected
precision
Random
variables
Rankings
Depend-
encies
Observed vs. true
metric values
Theoretical
findings
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
𝑟𝑒𝑙
Formalization: key random variables
Users × Items
𝑟𝑎𝑡𝑒𝑑
Expected
precision
Random
variables
Rankings
Depend-
encies
Observed vs. true
metric values
Theoretical
findings
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Optimal and popularity rankings
 Popularity variants as probability ranking functions
Nr. relevant ratings ⟶ 𝑝𝑜𝑝 𝑖 ∝ 𝑝(𝑟𝑒𝑙, 𝑡𝑟𝑎𝑖𝑛|𝑖)
Average rating value ⟶ 𝑎𝑣𝑔 𝑖 = 𝑝(𝑟𝑒𝑙|𝑡𝑟𝑎𝑖𝑛, 𝑖)
 Lemma: the optimal non-personalized rankings are
For true 𝔼 𝑃@1 ⟶ 𝑜𝑝𝑡 𝑖 = 𝑝 𝑟𝑒𝑙 ¬𝑡𝑟𝑎𝑖𝑛, 𝑖
For observed 𝔼 𝑃@1 ⟶ 𝑜𝑝𝑡 𝑖 = 𝑝 𝑟𝑒𝑙, 𝑡𝑒𝑠𝑡 ¬𝑡𝑟𝑎𝑖𝑛, 𝑖
 We reach several findings by reasoning on rank equivalence
Expected
precision
Random
variables
Rankings
Depend-
encies
Observed vs. true
metric values
Theoretical
findings
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Conditional (in)dependences between variables
Discover Rate
Like
User
Item
Items
Nr.ratings
Rating
distribution
𝑝 𝑟𝑎𝑡𝑒𝑑 𝑖
Expected
precision
Random
variables
Rankings
Depend-
encies
Observed vs. true
metric values
Theoretical
findings
𝑠𝑒𝑒𝑛
𝑖
𝑟𝑒𝑙
𝑟𝑎𝑡𝑒𝑑
𝑝 𝑟𝑎𝑡𝑒𝑑 𝑖 = 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑟𝑒𝑙, 𝑖 𝑝 𝑟𝑒𝑙 𝑖 + 𝑝 𝑟𝑎𝑡𝑒𝑑 ¬𝑟𝑒𝑙, 𝑖 𝑝(¬𝑟𝑒𝑙|𝑖)
𝑝 𝑟𝑎𝑡𝑒𝑑 𝑟𝑒𝑙, 𝑖 = 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑠𝑒𝑒𝑛, 𝑟𝑒𝑙, 𝑖 𝑝 𝑠𝑒𝑒𝑛 𝑟𝑒𝑙, 𝑖
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Conditional (in)dependences between variables
Discover Rate
Like
User
Item
Items
Nr.ratings
Rating
distribution
𝑝 𝑟𝑎𝑡𝑒𝑑 𝑖
Expected
precision
Random
variables
Rankings
Depend-
encies
Observed vs. true
metric values
Theoretical
findings
𝑝 𝑟𝑎𝑡𝑒𝑑 𝑟𝑒𝑙, 𝑖 = 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑠𝑒𝑒𝑛, 𝑟𝑒𝑙, 𝑖 𝑝 𝑠𝑒𝑒𝑛 𝑟𝑒𝑙, 𝑖
𝑠𝑒𝑒𝑛
𝑖
𝑟𝑒𝑙
𝑟𝑎𝑡𝑒𝑑
𝑠𝑒𝑒𝑛
𝑖
𝑟𝑒𝑙
𝑟𝑎𝑡𝑒𝑑
𝑠𝑒𝑒𝑛
𝑖
𝑟𝑒𝑙
𝑟𝑎𝑡𝑒𝑑
1. Rating depends
just on relevance
2. Rating independent
from relevance
3. Rating depends both
on items and relevance
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
0
0.5
1
Observed True
Theoretical findings
1. Rating depends only on relevance
Observed and true optimals agree: 𝒑𝒐𝒑 ∝ 𝒂𝒗𝒈 ∝ optimal
2. Rating independent from relevance
Observed precision: random < 𝒂𝒗𝒈 < 𝒑𝒐𝒑 ∝ optimal
True precision: random < 𝒑𝒐𝒑 < 𝒂𝒗𝒈 ∝ optimal
3. General case: no independence assumption
Monte Carlo
Observed precision: random ∼ 𝒂𝒗𝒈 < 𝒑𝒐𝒑 ∼ optimal
True precision: random ≼ 𝒑𝒐𝒑 ≼ 𝒂𝒗𝒈 ≺ optimal
𝔼 𝑃@1 𝜃 = න
Ω 𝑛
𝔼 𝑃@1 𝜃, 𝜔 𝑑𝜔
P@1
Random
Optimal
Popularity
Avg rating
Observed and true
precision agree
Observed and true
precision disagree
Expected
precision
Random
variables
Rankings
Depend-
encies
Observed vs. true
metric values
Theoretical
findings
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Outline
1. The popularity bias in recommender systems
2. Formal analysis
3. Experiments
4. Conclusions
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Experiments
We wish to…
1. Run recommendations by rating count,
average rating, random, optimal
2. Compute observed and true precision
3. See what comes out
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
 We build a dataset free of observational (popularity) bias
1. We sample 1,000 music tracks from deezer.com uniformly at random
2. We ask anonymous workers on CrowdFlower to rate 100 tracks each,
sampled uniformly at random
→ ~100 judgments per user
× ~100 judgments per track
= ~100,000 judgments total
 CM100k dataset available at
http://ir.ii.uam.es/cm100k
Data – Crowdsourced dataset
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Computing observed vs. true precision: judgments and ratings
100k judgments
Items
Seen
before
Never seen
before
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Computing observed vs. true precision: judgments and ratings
100k judgments
Items
Seen
before
“Ratings”
Never seen
before
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Observed precision
Computing observed vs. true precision: judgments and ratings
100k judgments
Items
“Ratings”
Random split
Input for algorithm
Relevance judgments
Training
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Computing observed vs. true precision: judgments and ratings
Items
True precision estimate
Biased input
for algorithm
Full relevance
judgment sample
Training
Test
Items
Training
Observed precision
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
0
0.02
0.04
0.06
Observed True
Results
MovieLens 1M CM100k
nDCG@10
Random
Optimal
Average rating
Popularity
0
0.1
0.2
Observed Observed TrueObserved
Similar qualitative outcome
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
0
0.02
0.04
0.06
Observed True
Results
nDCG@10
Random
Optimal
Average rating
Popularity
0
0.1
0.2
Observed Observed TrueObserved
• Popularity almost optimal • Popularity ∼ random• Popularity almost optimal • Popularity ∼ random
CM100kMovieLens 1M
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
0
0.02
0.04
0.06
Observed True
Results
nDCG@10
Random
Optimal
Average rating
Popularity
0
0.1
0.2
Observed Observed TrueObserved
• Popularity almost optimal
• Avg rating < popularity
• Popularity ∼ random
• Avg rating > popularity
• Popularity almost optimal • Popularity ∼ random
CM100kMovieLens 1M
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Test relevance dependence extremes
1. Full relevance dependence 2. Relevance independence
nDCG@10
0
0.02
0.04
0.06
Observed True
0
0.02
0.04
0.06
Observed True
Random
Optimal
Avg rating
Popularity
Observed TrueObserved True
Shuffling the
discovery distribution
in different ways… Discovery
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Discovery
Test relevance dependence extremes
1. Full relevance dependence 2. Relevance independence
nDCG@10
0
0.02
0.04
0.06
Observed True
0
0.02
0.04
0.06
Observed True
Random
Optimal
Avg rating
Popularity
TrueObserved True
Popularity
and avg rating
are just ok
Random < avg rating
< popularity < optimal
Observed
Roughly agree
Shuffling the
discovery distribution
in different ways…
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Test relevance dependence extremes
1. Full relevance dependence 2. Relevance independence
nDCG@10
0
0.02
0.04
0.06
Observed True
0
0.02
0.04
0.06
Observed True
Random
Optimal
Avg rating
Popularity
Observed TrueObserved True
• Popularity near optimal
• Avg rating near random
Random < popularity
< avg rating < optimal
Shuffling the
discovery distribution
in different ways… Discovery
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Implications on personalized algorithms: user-based kNN
0
0.01
0.02
0.03
Obs True
0
0.1
0.2
0.3
Obs
MovieLens 1M CM100k (full dependencies)
nDCG@10
0
0.01
0.02
0.03
Obs True
0
0.1
0.2
0.3
Obs
Non-normalized kNN
(biased to popularity)
Normalized kNN
(biased to avg rating)
Observed TrueObserved
Non-normalized > normalized Non-normalized < normalized
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Outline
1. The popularity bias in recommender systems
2. Formal analysis
3. Experiments
4. Conclusions
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Conclusions
 So… is it good or bad to recommend popular items?
– It depends on the relation between rating, discovery and relevance
– Tends to be good; worst case: complex simultaneous dependencies
– Weak relevance dependence can twist offline measurements
 The average rating can work better than the rating count
– Observed accuracy tends to be unfair to the average rating
 Implications on collaborative filtering algorithms
– Understanding popularity can help improve state of the art algorithms
 Evaluation with random samples can uncover new findings
IRGIRGroup @UAM
Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems
41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018)
Ann Arbor, MI, USA, 10 July 2018
Future work
 Further questions can be attempted upon a similar
formal approach
 Undiscovered accuracy stop by our poster!
 Other data split procedures, e.g. temporal
 Dynamic taste development
 Recommender system in the discovery loop
 Further research on personalized algorithms
 Denser (full?) unbiased judgment matrix


More Related Content

Similar to SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems

Homepage Personalization at Spotify
Homepage Personalization at SpotifyHomepage Personalization at Spotify
Homepage Personalization at SpotifyOguz Semerci
 
Enterprise Gamification by the Numbers
Enterprise Gamification by the NumbersEnterprise Gamification by the Numbers
Enterprise Gamification by the NumbersAxonify
 
Engineering with Open Source - Hyonjee Joo
Engineering with Open Source - Hyonjee JooEngineering with Open Source - Hyonjee Joo
Engineering with Open Source - Hyonjee JooTwo Sigma
 
Bias in algorithmic decision-making: Standards, Algorithmic Literacy and Gove...
Bias in algorithmic decision-making: Standards, Algorithmic Literacy and Gove...Bias in algorithmic decision-making: Standards, Algorithmic Literacy and Gove...
Bias in algorithmic decision-making: Standards, Algorithmic Literacy and Gove...Ansgar Koene
 
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...Ray Poynter
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Simplilearn
 
Social Aspects of Interactive Recommender Systems
Social Aspects of Interactive Recommender SystemsSocial Aspects of Interactive Recommender Systems
Social Aspects of Interactive Recommender SystemsDenis Parra Santander
 
Ansgar rcep algorithmic_bias_july2018
Ansgar rcep algorithmic_bias_july2018Ansgar rcep algorithmic_bias_july2018
Ansgar rcep algorithmic_bias_july2018Ansgar Koene
 
180719 테헤란로 런치클럽-실무자 관점에서 본 Spotify의 유저리서치 방식 (Spotify 백원희님)
180719 테헤란로 런치클럽-실무자 관점에서 본 Spotify의 유저리서치 방식 (Spotify 백원희님)180719 테헤란로 런치클럽-실무자 관점에서 본 Spotify의 유저리서치 방식 (Spotify 백원희님)
180719 테헤란로 런치클럽-실무자 관점에서 본 Spotify의 유저리서치 방식 (Spotify 백원희님)StartupAlliance
 
The Disappearing Data Scientist
The Disappearing Data ScientistThe Disappearing Data Scientist
The Disappearing Data ScientistDATAVERSITY
 
ECIR 2019 - Information Retrieval Models for Contact Recommendation in Social...
ECIR 2019 - Information Retrieval Models for Contact Recommendation in Social...ECIR 2019 - Information Retrieval Models for Contact Recommendation in Social...
ECIR 2019 - Information Retrieval Models for Contact Recommendation in Social...Javier Sanz-Cruzado Puig
 
Tendencias de Investigación de Mercado para el 2014 - GRIT
Tendencias de Investigación de Mercado para el 2014 - GRITTendencias de Investigación de Mercado para el 2014 - GRIT
Tendencias de Investigación de Mercado para el 2014 - GRITFuturo Labs
 
State of Analytics: Retail and Consumer Goods
State of Analytics: Retail and Consumer GoodsState of Analytics: Retail and Consumer Goods
State of Analytics: Retail and Consumer GoodsSPI Conference
 
Measures and mismeasures of algorithmic fairness
Measures and mismeasures of algorithmic fairnessMeasures and mismeasures of algorithmic fairness
Measures and mismeasures of algorithmic fairnessManojit Nandi
 
Data ethics and machine learning: discrimination, algorithmic bias, and how t...
Data ethics and machine learning: discrimination, algorithmic bias, and how t...Data ethics and machine learning: discrimination, algorithmic bias, and how t...
Data ethics and machine learning: discrimination, algorithmic bias, and how t...Data Driven Innovation
 
Introduction To Predictive Modelling
Introduction To Predictive ModellingIntroduction To Predictive Modelling
Introduction To Predictive ModellingSpotle.ai
 
AI and Journalism Talk City University
AI and Journalism Talk City UniversityAI and Journalism Talk City University
AI and Journalism Talk City UniversityPOLIS LSE
 

Similar to SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems (20)

Homepage Personalization at Spotify
Homepage Personalization at SpotifyHomepage Personalization at Spotify
Homepage Personalization at Spotify
 
Enterprise Gamification by the Numbers
Enterprise Gamification by the NumbersEnterprise Gamification by the Numbers
Enterprise Gamification by the Numbers
 
Engineering with Open Source - Hyonjee Joo
Engineering with Open Source - Hyonjee JooEngineering with Open Source - Hyonjee Joo
Engineering with Open Source - Hyonjee Joo
 
Bias in algorithmic decision-making: Standards, Algorithmic Literacy and Gove...
Bias in algorithmic decision-making: Standards, Algorithmic Literacy and Gove...Bias in algorithmic decision-making: Standards, Algorithmic Literacy and Gove...
Bias in algorithmic decision-making: Standards, Algorithmic Literacy and Gove...
 
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
 
Social Media Data Analytics
Social Media Data AnalyticsSocial Media Data Analytics
Social Media Data Analytics
 
Social Aspects of Interactive Recommender Systems
Social Aspects of Interactive Recommender SystemsSocial Aspects of Interactive Recommender Systems
Social Aspects of Interactive Recommender Systems
 
Ansgar rcep algorithmic_bias_july2018
Ansgar rcep algorithmic_bias_july2018Ansgar rcep algorithmic_bias_july2018
Ansgar rcep algorithmic_bias_july2018
 
180719 테헤란로 런치클럽-실무자 관점에서 본 Spotify의 유저리서치 방식 (Spotify 백원희님)
180719 테헤란로 런치클럽-실무자 관점에서 본 Spotify의 유저리서치 방식 (Spotify 백원희님)180719 테헤란로 런치클럽-실무자 관점에서 본 Spotify의 유저리서치 방식 (Spotify 백원희님)
180719 테헤란로 런치클럽-실무자 관점에서 본 Spotify의 유저리서치 방식 (Spotify 백원희님)
 
The Disappearing Data Scientist
The Disappearing Data ScientistThe Disappearing Data Scientist
The Disappearing Data Scientist
 
ECIR 2019 - Information Retrieval Models for Contact Recommendation in Social...
ECIR 2019 - Information Retrieval Models for Contact Recommendation in Social...ECIR 2019 - Information Retrieval Models for Contact Recommendation in Social...
ECIR 2019 - Information Retrieval Models for Contact Recommendation in Social...
 
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
 
Tendencias de Investigación de Mercado para el 2014 - GRIT
Tendencias de Investigación de Mercado para el 2014 - GRITTendencias de Investigación de Mercado para el 2014 - GRIT
Tendencias de Investigación de Mercado para el 2014 - GRIT
 
State of Analytics: Retail and Consumer Goods
State of Analytics: Retail and Consumer GoodsState of Analytics: Retail and Consumer Goods
State of Analytics: Retail and Consumer Goods
 
Measures and mismeasures of algorithmic fairness
Measures and mismeasures of algorithmic fairnessMeasures and mismeasures of algorithmic fairness
Measures and mismeasures of algorithmic fairness
 
Data ethics and machine learning: discrimination, algorithmic bias, and how t...
Data ethics and machine learning: discrimination, algorithmic bias, and how t...Data ethics and machine learning: discrimination, algorithmic bias, and how t...
Data ethics and machine learning: discrimination, algorithmic bias, and how t...
 
Introduction To Predictive Modelling
Introduction To Predictive ModellingIntroduction To Predictive Modelling
Introduction To Predictive Modelling
 
AI and Journalism Talk City University
AI and Journalism Talk City UniversityAI and Journalism Talk City University
AI and Journalism Talk City University
 
NewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big DataNewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big Data
 

Recently uploaded

6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfNicoChristianSunaryo
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfnikeshsingh56
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfPratikPatil591646
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etclalithasri22
 
Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are successPratikSingh115843
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformationAnnie Melnic
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 

Recently uploaded (17)

6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdf
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdf
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdf
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etc
 
Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are success
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformation
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 

SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems

  • 1. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Rocío Cañamares and Pablo Castells Universidad Autónoma de Madrid http://ir.ii.uam.es Ann Arbor, USA, 10 July 2018
  • 2. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Popularity in recommendation Would you find it useful to recommend these items?
  • 3. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Outline 1. The popularity bias in recommender systems 2. Formal analysis 3. Experiments 4. Conclusions
  • 4. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Outline 1. The popularity bias in recommender systems 2. Formal analysis 3. Experiments 4. Conclusions
  • 5. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Recommender systems Users who bought this also bought… Music discovery Related videos People you may know… . . . now
  • 6. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 The recommender system’s task  Rating matrix with some available cell values, most cells empty 4 4 2 2 2 4 1 4 4 3 2 5 2 4 3 5 2 1 5 1 Users Items Abstraction of user-item interaction The “rating” matrix
  • 7. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 The recommender system’s task 4 4 2 2 2 4 1 4 4 3 ? 2 5 ? 2 4 3 5 2 1 5 1 Users Items Abstraction of user-item interaction The “rating” matrix  Rating matrix with some available cell values, most cells empty  Rank items by predicting missing ratings  Offline evaluation: split the data into training and test  Evaluate with IR metrics  test ratings = relevance judgments
  • 8. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 The popularity bias in recommender systems Items Users In the data In algorithms Mat. factorization Nr. positive ratings Nr.timestop10 0 400 800 0 1000 2000 Popularity 800 400 0 0 1000 2000 User-based kNN Nr. positive ratings 2000 1000 0 0 1000 2000 Popular items Rest of items (long tail) Items Nr.ratings
  • 9. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 The popularity bias in recommender systems Items Users In the data In offline evaluation? In algorithms Mat. factorization Nr. positive ratings Nr.timestop10 0 400 800 0 1000 2000 Popularity 800 400 0 0 1000 2000 User-based kNN Nr. positive ratings 2000 1000 0 0 1000 2000 Popular items Rest of items (long tail) Random Nr. positive ratings User-based kNN Matrix factorization0.3 0.2 0.1 0 nDCG@10 MovieLens 1M
  • 10. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Avoiding popularity Popularity is not personalized, trivial, suspicious… Metrics and algorithms have been proposed that remove or cope with popularity biases But… is popularity “good” or “bad”?
  • 11. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Should I follow the crowd? ?  Lack of novelty is not a sufficient answer: degrees of popularity  Majority provides useful default choices  Technically simplest and cheap solution – Most apps have majority listings  Majority is not always right – Randomness factors, conformity, manipulation, etc.
  • 12. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Outline 1. The popularity bias in recommender systems 2. Formal analysis 3. Experiments 4. Conclusions
  • 13. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Our formal analysis Research questions  Is popularity really effective (accurate) in recommendation? – Which popularity: positive rating count / ratio?  Are we measuring properly its effectiveness in offline experiments? Expected precision Observed vs. true metric values Random variables Rankings Depend- encies Theoretical findings
  • 14. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Formalization: observed vs. true accuracy Computed on available user taste observations Computed with full knowledge of user tastes Observed metric value True metric value Items Users Relevant Non relevant Missing ratings ≈ ? Items Users Expected precision Observed vs. true metric values Random variables Rankings Depend- encies Theoretical findings
  • 15. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Formalization: expectation  Analysis on the expected accuracy of popularity- based recommendation – How good is popularity compared to “something else” – Do observed and true accuracy agree (in expectation)?  A most simple metric: 𝔼 𝑃@1  Expectation  random variables, probabilities Expected precision Observed vs. true metric values Random variables Rankings Depend- encies Theoretical findings
  • 16. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 𝑟𝑒𝑙 Formalization: key random variables 𝑟𝑎𝑡𝑒𝑑 Users × Items Expected precision Random variables Rankings Depend- encies Observed vs. true metric values Theoretical findings
  • 17. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 𝑟𝑒𝑙 Formalization: key random variables Users × Items 𝑟𝑎𝑡𝑒𝑑 Expected precision Random variables Rankings Depend- encies Observed vs. true metric values Theoretical findings
  • 18. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Optimal and popularity rankings  Popularity variants as probability ranking functions Nr. relevant ratings ⟶ 𝑝𝑜𝑝 𝑖 ∝ 𝑝(𝑟𝑒𝑙, 𝑡𝑟𝑎𝑖𝑛|𝑖) Average rating value ⟶ 𝑎𝑣𝑔 𝑖 = 𝑝(𝑟𝑒𝑙|𝑡𝑟𝑎𝑖𝑛, 𝑖)  Lemma: the optimal non-personalized rankings are For true 𝔼 𝑃@1 ⟶ 𝑜𝑝𝑡 𝑖 = 𝑝 𝑟𝑒𝑙 ¬𝑡𝑟𝑎𝑖𝑛, 𝑖 For observed 𝔼 𝑃@1 ⟶ 𝑜𝑝𝑡 𝑖 = 𝑝 𝑟𝑒𝑙, 𝑡𝑒𝑠𝑡 ¬𝑡𝑟𝑎𝑖𝑛, 𝑖  We reach several findings by reasoning on rank equivalence Expected precision Random variables Rankings Depend- encies Observed vs. true metric values Theoretical findings
  • 19. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Conditional (in)dependences between variables Discover Rate Like User Item Items Nr.ratings Rating distribution 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑖 Expected precision Random variables Rankings Depend- encies Observed vs. true metric values Theoretical findings 𝑠𝑒𝑒𝑛 𝑖 𝑟𝑒𝑙 𝑟𝑎𝑡𝑒𝑑 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑖 = 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑟𝑒𝑙, 𝑖 𝑝 𝑟𝑒𝑙 𝑖 + 𝑝 𝑟𝑎𝑡𝑒𝑑 ¬𝑟𝑒𝑙, 𝑖 𝑝(¬𝑟𝑒𝑙|𝑖) 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑟𝑒𝑙, 𝑖 = 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑠𝑒𝑒𝑛, 𝑟𝑒𝑙, 𝑖 𝑝 𝑠𝑒𝑒𝑛 𝑟𝑒𝑙, 𝑖
  • 20. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Conditional (in)dependences between variables Discover Rate Like User Item Items Nr.ratings Rating distribution 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑖 Expected precision Random variables Rankings Depend- encies Observed vs. true metric values Theoretical findings 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑟𝑒𝑙, 𝑖 = 𝑝 𝑟𝑎𝑡𝑒𝑑 𝑠𝑒𝑒𝑛, 𝑟𝑒𝑙, 𝑖 𝑝 𝑠𝑒𝑒𝑛 𝑟𝑒𝑙, 𝑖 𝑠𝑒𝑒𝑛 𝑖 𝑟𝑒𝑙 𝑟𝑎𝑡𝑒𝑑 𝑠𝑒𝑒𝑛 𝑖 𝑟𝑒𝑙 𝑟𝑎𝑡𝑒𝑑 𝑠𝑒𝑒𝑛 𝑖 𝑟𝑒𝑙 𝑟𝑎𝑡𝑒𝑑 1. Rating depends just on relevance 2. Rating independent from relevance 3. Rating depends both on items and relevance
  • 21. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 0 0.5 1 Observed True Theoretical findings 1. Rating depends only on relevance Observed and true optimals agree: 𝒑𝒐𝒑 ∝ 𝒂𝒗𝒈 ∝ optimal 2. Rating independent from relevance Observed precision: random < 𝒂𝒗𝒈 < 𝒑𝒐𝒑 ∝ optimal True precision: random < 𝒑𝒐𝒑 < 𝒂𝒗𝒈 ∝ optimal 3. General case: no independence assumption Monte Carlo Observed precision: random ∼ 𝒂𝒗𝒈 < 𝒑𝒐𝒑 ∼ optimal True precision: random ≼ 𝒑𝒐𝒑 ≼ 𝒂𝒗𝒈 ≺ optimal 𝔼 𝑃@1 𝜃 = න Ω 𝑛 𝔼 𝑃@1 𝜃, 𝜔 𝑑𝜔 P@1 Random Optimal Popularity Avg rating Observed and true precision agree Observed and true precision disagree Expected precision Random variables Rankings Depend- encies Observed vs. true metric values Theoretical findings
  • 22. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Outline 1. The popularity bias in recommender systems 2. Formal analysis 3. Experiments 4. Conclusions
  • 23. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Experiments We wish to… 1. Run recommendations by rating count, average rating, random, optimal 2. Compute observed and true precision 3. See what comes out
  • 24. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018  We build a dataset free of observational (popularity) bias 1. We sample 1,000 music tracks from deezer.com uniformly at random 2. We ask anonymous workers on CrowdFlower to rate 100 tracks each, sampled uniformly at random → ~100 judgments per user × ~100 judgments per track = ~100,000 judgments total  CM100k dataset available at http://ir.ii.uam.es/cm100k Data – Crowdsourced dataset
  • 25. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Computing observed vs. true precision: judgments and ratings 100k judgments Items Seen before Never seen before
  • 26. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Computing observed vs. true precision: judgments and ratings 100k judgments Items Seen before “Ratings” Never seen before
  • 27. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Observed precision Computing observed vs. true precision: judgments and ratings 100k judgments Items “Ratings” Random split Input for algorithm Relevance judgments Training
  • 28. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Computing observed vs. true precision: judgments and ratings Items True precision estimate Biased input for algorithm Full relevance judgment sample Training Test Items Training Observed precision
  • 29. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 0 0.02 0.04 0.06 Observed True Results MovieLens 1M CM100k nDCG@10 Random Optimal Average rating Popularity 0 0.1 0.2 Observed Observed TrueObserved Similar qualitative outcome
  • 30. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 0 0.02 0.04 0.06 Observed True Results nDCG@10 Random Optimal Average rating Popularity 0 0.1 0.2 Observed Observed TrueObserved • Popularity almost optimal • Popularity ∼ random• Popularity almost optimal • Popularity ∼ random CM100kMovieLens 1M
  • 31. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 0 0.02 0.04 0.06 Observed True Results nDCG@10 Random Optimal Average rating Popularity 0 0.1 0.2 Observed Observed TrueObserved • Popularity almost optimal • Avg rating < popularity • Popularity ∼ random • Avg rating > popularity • Popularity almost optimal • Popularity ∼ random CM100kMovieLens 1M
  • 32. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Test relevance dependence extremes 1. Full relevance dependence 2. Relevance independence nDCG@10 0 0.02 0.04 0.06 Observed True 0 0.02 0.04 0.06 Observed True Random Optimal Avg rating Popularity Observed TrueObserved True Shuffling the discovery distribution in different ways… Discovery
  • 33. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Discovery Test relevance dependence extremes 1. Full relevance dependence 2. Relevance independence nDCG@10 0 0.02 0.04 0.06 Observed True 0 0.02 0.04 0.06 Observed True Random Optimal Avg rating Popularity TrueObserved True Popularity and avg rating are just ok Random < avg rating < popularity < optimal Observed Roughly agree Shuffling the discovery distribution in different ways…
  • 34. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Test relevance dependence extremes 1. Full relevance dependence 2. Relevance independence nDCG@10 0 0.02 0.04 0.06 Observed True 0 0.02 0.04 0.06 Observed True Random Optimal Avg rating Popularity Observed TrueObserved True • Popularity near optimal • Avg rating near random Random < popularity < avg rating < optimal Shuffling the discovery distribution in different ways… Discovery
  • 35. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Implications on personalized algorithms: user-based kNN 0 0.01 0.02 0.03 Obs True 0 0.1 0.2 0.3 Obs MovieLens 1M CM100k (full dependencies) nDCG@10 0 0.01 0.02 0.03 Obs True 0 0.1 0.2 0.3 Obs Non-normalized kNN (biased to popularity) Normalized kNN (biased to avg rating) Observed TrueObserved Non-normalized > normalized Non-normalized < normalized
  • 36. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Outline 1. The popularity bias in recommender systems 2. Formal analysis 3. Experiments 4. Conclusions
  • 37. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Conclusions  So… is it good or bad to recommend popular items? – It depends on the relation between rating, discovery and relevance – Tends to be good; worst case: complex simultaneous dependencies – Weak relevance dependence can twist offline measurements  The average rating can work better than the rating count – Observed accuracy tends to be unfair to the average rating  Implications on collaborative filtering algorithms – Understanding popularity can help improve state of the art algorithms  Evaluation with random samples can uncover new findings
  • 38. IRGIRGroup @UAM Should I Follow the Crowd? A Probabilistic Analysis of the Effectiveness of Popularity in Recommender Systems 41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018) Ann Arbor, MI, USA, 10 July 2018 Future work  Further questions can be attempted upon a similar formal approach  Undiscovered accuracy stop by our poster!  Other data split procedures, e.g. temporal  Dynamic taste development  Recommender system in the discovery loop  Further research on personalized algorithms  Denser (full?) unbiased judgment matrix 