Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

•

0 likes•67 views

Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets, SDM 2017

Education

Redundancies in Data and their Effect on the Evaluation of Recommendation Systems:
A case study on the Amazon Reviews Dataset
DANIEL BASARAN1, EIRINI NTOUTSI2, ARTHUR ZIMEK3
1Ludwig-Maximilians-Universität, Germany 2Leibniz Universität Hannover, Germany 3University of Southern Denmark
THE AMAZON REVIEWS DATASETS COLLECTION
• Crawled from Amazon website
- Huge (35 M reviews), temporal (spans over 18 years, up to March 2013),
diverse (#30 product categories), heterogeneous (review text, ratings)
• Contain duplicates for items that are variants of the same product
- Different size, color
- dvd and blue-ray versions
EFFECTS ON THE QUALITY OF RECOMMENDATIONS
ELIMINATION OF DUPLICATES
• We kept only 1 of the product variants
and its corresponding reviews and ratings
EVALUATING REDUNDANCY EFFECT ON RECOMMENDERS
• Original (with redundancies) vs cleaned dataset
• Experimentation procedure
- Randomly split each Amazon dataset D into training (80%) Dtr, testing (10%) Dt
and validation (10%) Dv sets
- Due to redundancies in the original dataset D, Dtr, Dt and Dv sets are not disjoint.
They are disjoint in the cleaned dataset.
• Compared methods
- Offset (only numerical ratings)
- Latent Factors (only numerical ratings)
- Hidden Factors (numerical ratings & review texts)
- Matrix Factorization (only numerical ratings)
- Biased Matrix Factorization (only numerical ratings)
OVERALL PICTURE REDUNDANCY VS MSE CONCLUSIONS AND OUTLOOK
• We evaluated the impact of redundancy on the evaluation of
recommendation models
- Conclusions may change when tested on cleaned data
- This effect is stronger for more complex methods
- Our results suggest that the collection should be used only after duplicate
elimination
- Our results do not suggest that the recent methods do not imply real
methodological improvements.
• So what?
- The quality of the data is critical for learning
- Can we automatically control datasets for potential problems?
- The community would benefit from having established benchmarks for different
tasks (common practice in other communities like IR, NLP)
- Evaluation and competitive comparison of existing methods is equally important
to the development of new methods.
This review goes to all
different colour variants
% of retained items per category % of retained reviews per category
• If there is redundancy, the performance gets worse in the clean data.
• The performance differences are not as large as they appear to be on the original data
• The performance degrades the stronger (higher y-values), the more redundancy was
contained in the original dataset (lower x-values).
• Different methods differ in their sensitivity to redundancy (higher potential to overfit).
- No sensitivity for simpler methods (offset), high sensitivity for complex methods (hidden
factors)

What's hot

Improving evaluations and utilization with statistical edge nested data desi...CesToronto

Issue-based metricsAndres Baravalle

Nonnegative matrix-factNecdet Türköz, M.Sc

[ADMA 2017] Identification of Grey Sheep Users By Histogram Intersection In R...YONG ZHENG

Ces national conference june 9-12 fairmont royal yorkDonna Smith-Moncrieffe

Carma internet research module n-biasSyracuse University

Malhotra02.....Binty Agarwal

ch5 Issues based metricsjessie52182

SENTIMENT ANALYSIS FOR DRUG DEVELOPMENT AND PROMOTIONIncedo

SamplingVijyata Singh

[WI 2017] Context Suggestion: Empirical Evaluations vs User StudiesYONG ZHENG

Marketing ResearchAlberto Jimenez

Lecture 6 measurement iBrittany Milbourn

Paper Presentation: Data Mining User Preference in Interactive MultimediaJeanette Howe

Structured Analysis Proposal for Managing HB 4087 Vendor Responses to RFISusan Tait, CSM

Selective Gradient Boosting for Effective Learning to Rank - SIGIR 2018Salvatore Trani

Learning-Based Evaluation of Visual Analytic Systems.BELIV Workshop

Project Data Incorporating Qualitative Factors for Improved Software Defect P...Tim Menzies

What's hot (18)

Improving evaluations and utilization with statistical edge nested data desi...

Issue-based metrics

Nonnegative matrix-fact

[ADMA 2017] Identification of Grey Sheep Users By Histogram Intersection In R...

Ces national conference june 9-12 fairmont royal york

Carma internet research module n-bias

Malhotra02.....

ch5 Issues based metrics

SENTIMENT ANALYSIS FOR DRUG DEVELOPMENT AND PROMOTION

Sampling

[WI 2017] Context Suggestion: Empirical Evaluations vs User Studies

Marketing Research

Lecture 6 measurement i

Paper Presentation: Data Mining User Preference in Interactive Multimedia

Structured Analysis Proposal for Managing HB 4087 Vendor Responses to RFI

Selective Gradient Boosting for Effective Learning to Rank - SIGIR 2018

Learning-Based Evaluation of Visual Analytic Systems.

Project Data Incorporating Qualitative Factors for Improved Software Defect P...

Similar to Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

Research 101: Inferential Quantitative AnalysisHarold Gamero

Basic Statistical Concepts.pdfKwangheeJung

Prote-OMIC Data Analysis and VisualizationDmitry Grapov

Introduction to Data Management in Human EcologyKern Rocke

validity and reliability ppt.pptrahulranjan215851

Multivariate Analysis and Visualization of Proteomic DataUC Davis

Evaluating Collaborative Filtering Recommender SystemsMegaVjohnson

Bmgt 311 chapter_13Chris Lovett

Influence of Timeline and Named-entity Components on User Engagement Roi Blanco

IFPRI- Impact Surveys 1International Food Policy Research Institute- South Asia Office

Online Learning to Rankewhuang3

Marketing researchDeep Gurung

GradTrack: Getting Started with Statistics September 20, 2018Nancy Garmer

GradTrack: Getting Started with Statistics September 20, 2018Evans Library at Florida Institute of Technology

Multi variate presentationArun Kumar

Overview of GRADE approach_selected slides for Cochrane Public Health.pptxakerdemoglu

Cochrane CollaborationNinian Peckitt

Sampling of Blooddrantopa

Sampling, measurement, and stats(2013)BarryCRNA

Metaanalysis copyAmandeep Kaur

Similar to Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets (20)

Research 101: Inferential Quantitative Analysis

Basic Statistical Concepts.pdf

Prote-OMIC Data Analysis and Visualization

Introduction to Data Management in Human Ecology

validity and reliability ppt.ppt

Multivariate Analysis and Visualization of Proteomic Data

Evaluating Collaborative Filtering Recommender Systems

Bmgt 311 chapter_13

Influence of Timeline and Named-entity Components on User Engagement

IFPRI- Impact Surveys 1

Online Learning to Rank

Marketing research

GradTrack: Getting Started with Statistics September 20, 2018

Multi variate presentation

Overview of GRADE approach_selected slides for Cochrane Public Health.pptx

Cochrane Collaboration

Sampling of Blood

Sampling, measurement, and stats(2013)

Metaanalysis copy

Recently uploaded

ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood

Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir

Full Stack Web Development Course for BeginnersSabitha Banu

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1

POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar

CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna

TataKelola dan KamSiber Kecerdasan Buatan v022.pdfSarwono Sutikno, Dr.Eng.,CISA,CISSP,CISM,CSX-F

ESSENTIAL of (CS/IT/IS) class 06 (database)Dr. Mazin Mohamed alkathiri

EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3

AmericanHighSchoolsprezentacijaoskolama.arsicmarija21

Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN

call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR

Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan

Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam

Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU

Difference Between Search & Browse Methods in Odoo 17Celine George

DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu

“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr

OS-operating systems- ch04 (Threads) ...Dr. Mazin Mohamed alkathiri

Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc

Recently uploaded (20)

ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx

Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf

Full Stack Web Development Course for Beginners

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...

POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx

CELL CYCLE Division Science 8 quarter IV.pptx

TataKelola dan KamSiber Kecerdasan Buatan v022.pdf

ESSENTIAL of (CS/IT/IS) class 06 (database)

EPANDING THE CONTENT OF AN OUTLINE using notes.pptx

AmericanHighSchoolsprezentacijaoskolama.

Solving Puzzles Benefits Everyone (English).pptx

call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️

Gas measurement O2,Co2,& ph) 04/2024.pptx

Pharmacognosy Flower 3. Compositae 2023.pdf

Capitol Tech U Doctoral Presentation - April 2024.pptx

Difference Between Search & Browse Methods in Odoo 17

DATA STRUCTURE AND ALGORITHM for beginners

“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...

OS-operating systems- ch04 (Threads) ...

Procuring digital preservation CAN be quick and painless with our new dynamic...

Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

1. Redundancies in Data and their Effect on the Evaluation of Recommendation Systems: A case study on the Amazon Reviews Dataset DANIEL BASARAN1, EIRINI NTOUTSI2, ARTHUR ZIMEK3 1Ludwig-Maximilians-Universität, Germany 2Leibniz Universität Hannover, Germany 3University of Southern Denmark THE AMAZON REVIEWS DATASETS COLLECTION • Crawled from Amazon website - Huge (35 M reviews), temporal (spans over 18 years, up to March 2013), diverse (#30 product categories), heterogeneous (review text, ratings) • Contain duplicates for items that are variants of the same product - Different size, color - dvd and blue-ray versions EFFECTS ON THE QUALITY OF RECOMMENDATIONS ELIMINATION OF DUPLICATES • We kept only 1 of the product variants and its corresponding reviews and ratings EVALUATING REDUNDANCY EFFECT ON RECOMMENDERS • Original (with redundancies) vs cleaned dataset • Experimentation procedure - Randomly split each Amazon dataset D into training (80%) Dtr, testing (10%) Dt and validation (10%) Dv sets - Due to redundancies in the original dataset D, Dtr, Dt and Dv sets are not disjoint. They are disjoint in the cleaned dataset. • Compared methods - Offset (only numerical ratings) - Latent Factors (only numerical ratings) - Hidden Factors (numerical ratings & review texts) - Matrix Factorization (only numerical ratings) - Biased Matrix Factorization (only numerical ratings) OVERALL PICTURE REDUNDANCY VS MSE CONCLUSIONS AND OUTLOOK • We evaluated the impact of redundancy on the evaluation of recommendation models - Conclusions may change when tested on cleaned data - This effect is stronger for more complex methods - Our results suggest that the collection should be used only after duplicate elimination - Our results do not suggest that the recent methods do not imply real methodological improvements. • So what? - The quality of the data is critical for learning - Can we automatically control datasets for potential problems? - The community would benefit from having established benchmarks for different tasks (common practice in other communities like IR, NLP) - Evaluation and competitive comparison of existing methods is equally important to the development of new methods. This review goes to all different colour variants % of retained items per category % of retained reviews per category • If there is redundancy, the performance gets worse in the clean data. • The performance differences are not as large as they appear to be on the original data • The performance degrades the stronger (higher y-values), the more redundancy was contained in the original dataset (lower x-values). • Different methods differ in their sensitivity to redundancy (higher potential to overfit). - No sensitivity for simpler methods (offset), high sensitivity for complex methods (hidden factors)

Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

Recommended

Recommended

More Related Content

What's hot

What's hot (18)

Similar to Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

Similar to Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets (20)

More from Eirini Ntoutsi

More from Eirini Ntoutsi (12)

Recently uploaded

Recently uploaded (20)

Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

Redundancies in Data and their E ect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

Recommended

Recommended

More Related Content

What's hot

What's hot (18)

Similar to Redundancies in Data and their E ect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

Similar to Redundancies in Data and their E ect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets (20)

More from Eirini Ntoutsi

More from Eirini Ntoutsi (12)

Recently uploaded

Recently uploaded (20)

Redundancies in Data and their E ect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

Similar to Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets

Similar to Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets (20)

Redundancies in Data and their Eect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets