Estimating the Magic Barrier of Recommender Systems

This study estimated the "magic barrier" of recommender systems by collecting additional ratings ("opinions") from users on items they had previously rated. The magic barrier represents the lowest error that can be expected from any recommendation algorithm, given natural inconsistencies in human ratings. The researchers collected 6,299 new opinions from 306 users on movies they had rated previously. They calculated the standard deviation of the errors between original ratings and new opinions, finding a magic barrier of approximately 1.2 on the site's 0-10 rating scale. This suggests that recommender systems cannot achieve perfect predictions and that errors within 1.2 points are attributable to natural human inconsistencies rather than algorithm quality.

Estimating the Magic Barrier of Recommender
Systems: A User Study
Alan Said, Brijnesh J. Jain, Sascha Narr,
Till Plumbaum, Sahin Albayrak, Christian Scheel

SIGIR 2012 – Portland, OR, USA

Evaluating Recommender Systems
Recommender systems evaluation generally measures the quality of the algorithm based on some accuracy metric, e.g. precision, or error measure, e.g. root-mean-square error. However, these measures neglect the inherent inconsistencies that users, being people, are afflicted by.

These are the first results from a noise measurement user study for estimating the magic barrier of recommender systems, conducted on a commercial movie recommendation community.

The magic barrier is the expected squared error of the optimal recommendation algorithm, or, the lowest error we can expect from any recommendation algorithm. Our results show that the barrier can be estimated by collecting the opinions of users on already rated items.

The User Study
We asked users of www.moviepilot.de to provide new ratings (so-called opinions) for movies they had rated in the past. We specifically asked for opinions and not re-ratings so as not to suggest a change of heart.

The user interface for collecting opinions was created so that it resembled the regular rating page of moviepilot, in order to create a feeling of familiarity for the users and lower rating inconsistencies related to unfamiliarity with the system.

Data
The study ran in April and May 2011 and resulted in a dataset containing 6,299 opinions on 2,329 movies by 306 users, i.e. 6,299 rating-opinion pairs. All participating users had to have rated at least 50 movies on moviepilot.de and given at least 20 new opinions.

[Figure: The "rate new movies" page on moviepilot.de]
[Figure: Our interface for collecting new opinions]

The Magic Barrier
Root-mean-square error (RMSE) is commonly used for accuracy evaluation of a rating function on a set of ratings.

Having new opinions, we can express the error between an original rating and a new opinion on item i by user u as the difference between the two.

We can suppose there is an unknown true rating function that knows the true opinions of each user on each item. We can derive an estimate of the RMSE of this function, which is equal to the standard deviation of the error.

It is possible that there are rating functions with a lower RMSE than this estimate. These functions tend to overfit, and their lower RMSE does not mean they perform better: they perform within the boundaries of the magic barrier.

Calculated Magic Barrier
[Figure: bar chart with bars labeled all, r ≥ avg and r < avg; y-axis from 0 to 1.6; printed bar values 1.417, 1.201 and 1.043]
Standard deviation of the error, where all refers to the deviation over all opinions; r ≥ avg and r < avg refer to the deviation over all ratings above and below average.
Moviepilot's rating scale is 0-10 stars. A magic barrier of ±1.2 means that rating prediction errors within that boundary are part of users' rating inconsistencies.
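The quantities described in the magic barrier section can be written out as follows. This is a hedged reconstruction, not the poster's own notation: the symbols are assumptions, with f a rating function, R the set of rated user-item pairs, P the set of rating-opinion pairs, r_{ui} the original rating, o_{ui} the new opinion, and f^{*} the unknown true rating function. The last line is one plausible reading of the estimate, namely the root-mean-square of the errors, which coincides with their (population) standard deviation when the mean error is zero.

```latex
\begin{align*}
\mathrm{RMSE}(f) &= \sqrt{\frac{1}{|R|} \sum_{(u,i) \in R} \bigl(f(u,i) - r_{ui}\bigr)^{2}} \\
\varepsilon_{ui} &= r_{ui} - o_{ui} \\
\widehat{\mathrm{RMSE}}(f^{*}) &= \sqrt{\frac{1}{|P|} \sum_{(u,i) \in P} \varepsilon_{ui}^{2}}
  \;=\; \operatorname{std}(\varepsilon)
  \quad \text{when } \frac{1}{|P|} \sum_{(u,i) \in P} \varepsilon_{ui} = 0
\end{align*}
```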

Further Reading
Detailed explanation of the magic barrier: Users and Noise: The Magic Barrier of Recommender Systems [UMAP2012, Said et al.]
Paper version of the poster

Results & Conclusion
We presented a study on the inherent noise found in rating values given by users in a commercial recommendation system.

Our assumption, that the magic barrier of recommender systems can be better assessed by noise estimation, seems to hold.

We presented an early model for the magic barrier and the level of accuracy a recommender system can achieve without over-fitting on the noise in the data. Performing an estimate of the magic barrier of a system makes it possible to assess whether a system can be further improved or not.

We suggest that in order to estimate a system's prediction quality, opinion collection for magic barrier estimation should be conducted regularly.
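A minimal sketch of how such an assessment might look, assuming the barrier is estimated as the population standard deviation of rating-minus-opinion errors. The function names and the sample pairs are hypothetical and are not data or code from the study.

```python
import math
from statistics import mean, pstdev

# Hypothetical (original rating, new opinion) pairs on a 0-10 scale.
# These values are made up for illustration only.
pairs = [(8.0, 7.0), (6.5, 7.5), (9.0, 9.0), (4.0, 5.5), (7.0, 5.5)]


def estimate_magic_barrier(pairs):
    """Population standard deviation of the rating-minus-opinion errors."""
    errors = [rating - opinion for rating, opinion in pairs]
    return pstdev(errors)


def rmse(predictions, ratings):
    """Root-mean-square error of predictions against observed ratings."""
    return math.sqrt(mean((p - r) ** 2 for p, r in zip(predictions, ratings)))


def has_headroom(model_rmse, barrier):
    """True if the model's RMSE is still above the estimated barrier."""
    return model_rmse > barrier


barrier = estimate_magic_barrier(pairs)
print(f"estimated barrier: {barrier:.3f}")
print("room for improvement:", has_headroom(1.5, barrier))
```

Under this reading, a model whose RMSE is already at or below the estimated barrier is unlikely to benefit from further tuning, since the remaining error is of the same size as the users' own inconsistency.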

Technische Universität Berlin {alan, jain, narr, till, sahin, scheel}@dai‐lab.de www.dai‐lab.de
