Redundancies in Data and their Effect on the Evaluation of Recommendation Systems:
A case study on the Amazon Reviews Dataset
DANIEL BASARAN (1), EIRINI NTOUTSI (2), ARTHUR ZIMEK (3)
(1) Ludwig-Maximilians-Universität, Germany; (2) Leibniz Universität Hannover, Germany; (3) University of Southern Denmark
THE AMAZON REVIEWS DATASETS COLLECTION
• Crawled from Amazon website
- Huge (35 M reviews), temporal (spans over 18 years, up to March 2013),
diverse (30 product categories), heterogeneous (review text, ratings)
• Contain duplicates for items that are variants of the same product
- Different sizes and colors
- DVD and Blu-ray versions
EFFECTS ON THE QUALITY OF RECOMMENDATIONS
ELIMINATION OF DUPLICATES
• We kept only one variant of each product,
together with its corresponding reviews and ratings
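The elimination step can be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: it assumes that two items are variants of the same product when they carry exactly the same set of reviews, and the function name `deduplicate` and the tuple layout are hypothetical.

```python
from collections import defaultdict

def deduplicate(reviews):
    """Keep one item per group of items that carry an identical set of reviews.

    `reviews` is a list of (item_id, user_id, rating, text) tuples.
    Grouping by identical review sets is an assumption made for illustration.
    """
    # Collect the review "fingerprint" of every item.
    by_item = defaultdict(set)
    for item_id, user_id, rating, text in reviews:
        by_item[item_id].add((user_id, rating, text))

    # Items with the same fingerprint are treated as variants of one product;
    # the first item id in sorted order becomes the representative.
    representative = {}
    for item_id in sorted(by_item):
        fingerprint = frozenset(by_item[item_id])
        representative.setdefault(fingerprint, item_id)

    kept_items = set(representative.values())
    return [r for r in reviews if r[0] in kept_items]

# Toy data: two colour variants share the same two reviews.
reviews = [
    ("shirt-red", "u1", 5.0, "Great fit"),
    ("shirt-blue", "u1", 5.0, "Great fit"),
    ("shirt-red", "u2", 3.0, "Runs small"),
    ("shirt-blue", "u2", 3.0, "Runs small"),
    ("dvd-1", "u1", 4.0, "Good film"),
]
cleaned = deduplicate(reviews)
print(len(reviews), "reviews before,", len(cleaned), "after")
```

Which variant is kept is arbitrary here; any rule that picks one variant per product and drops the copies would serve the same purpose.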
EVALUATING REDUNDANCY EFFECT ON RECOMMENDERS
• Original (with redundancies) vs cleaned dataset
• Experimentation procedure
- Randomly split each Amazon dataset D into training (Dtr, 80%), test (Dt, 10%)
and validation (Dv, 10%) sets
- Because the original dataset D contains redundancies, the sets Dtr, Dt and Dv are not
disjoint: copies of the same review can land in different sets.
In the cleaned dataset they are disjoint.
• Compared methods
- Offset (only numerical ratings)
- Latent Factors (only numerical ratings)
- Hidden Factors (numerical ratings & review texts)
- Matrix Factorization (only numerical ratings)
- Biased Matrix Factorization (only numerical ratings)
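The split in the experimentation procedure above, and the overlap it produces on redundant data, can be sketched as follows. This is a hedged illustration only: the helper names `split_80_10_10` and `leaked`, the tuple layout and the toy data are assumptions, not part of the poster.

```python
import random

def split_80_10_10(reviews, seed=0):
    """Randomly split reviews into training (80%), test (10%) and validation (10%)."""
    shuffled = list(reviews)
    random.Random(seed).shuffle(shuffled)
    n_train = len(shuffled) * 8 // 10
    n_test = len(shuffled) // 10
    train = shuffled[:n_train]
    test = shuffled[n_train:n_train + n_test]
    valid = shuffled[n_train + n_test:]
    return train, test, valid

def leaked(train, held_out):
    """Count held-out reviews whose (user, rating, text) also occurs in training."""
    seen = {r[1:] for r in train}
    return sum(1 for r in held_out if r[1:] in seen)

# Toy data: every review is attached to two colour variants of the same item.
original = []
for i in range(100):
    for colour in ("red", "blue"):
        original.append((f"item{i}-{colour}", f"user{i}", 4.0, f"text {i}"))
# Cleaned version: keep only one variant per item.
cleaned = [r for r in original if r[0].endswith("-red")]

train, test, valid = split_80_10_10(original)
print("original:", leaked(train, test), "of", len(test), "test reviews also occur in training")

train_c, test_c, valid_c = split_80_10_10(cleaned)
print("cleaned: ", leaked(train_c, test_c), "of", len(test_c), "test reviews also occur in training")
```

On the cleaned toy data the count is zero by construction, because every review content occurs once; on the original toy data a test review overlaps whenever its twin was drawn into the training set.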
[Figures not reproduced: "Overall picture"; "Redundancy vs MSE"]
CONCLUSIONS AND OUTLOOK
• We evaluated the impact of redundancy on the evaluation of
recommendation models
- Conclusions may change when tested on cleaned data
- This effect is stronger for more complex methods
- Our results suggest that the collection should be used only after duplicate
elimination
- Our results do not imply that the recent methods lack real
methodological improvements.
• So what?
- The quality of the data is critical for learning
- Can we automatically check datasets for potential problems?
- The community would benefit from having established benchmarks for different
tasks (common practice in other communities like IR, NLP)
- Evaluation and competitive comparison of existing methods are as important
as the development of new methods.
[Figure annotation: "This review goes to all different colour variants"]
[Figures not reproduced: "% of retained items per category"; "% of retained reviews per category"]
• Where the original data contain redundancy, performance is worse on the cleaned data.
• The performance differences between methods are not as large as they appear to be on the original data.
• The more redundancy the original dataset contained (lower x-values), the more the
performance degrades (higher y-values).
• Methods differ in their sensitivity to redundancy (higher potential to overfit).
- No sensitivity for simpler methods (offset), high sensitivity for complex methods (hidden
factors)
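The quantities behind these observations can be sketched as follows. The poster does not define them precisely, so this is an illustration under assumptions: the x-value is taken to be the percentage of reviews retained after cleaning, the y-value the increase in MSE from the original to the cleaned data, and "offset" a predictor that outputs the mean training rating for every review. All function names and numbers are hypothetical.

```python
def mse(predictions, ratings):
    """Mean squared error between predicted and observed ratings."""
    return sum((p - r) ** 2 for p, r in zip(predictions, ratings)) / len(ratings)

def offset_predictions(train_ratings, n):
    """Assumed 'offset' baseline: predict the mean training rating n times."""
    mean = sum(train_ratings) / len(train_ratings)
    return [mean] * n

def retained_share(n_cleaned, n_original):
    """Percentage of reviews (or items) that survive duplicate elimination."""
    return 100 * n_cleaned / n_original

def mse_increase(mse_original, mse_cleaned):
    """How much worse a method scores on the cleaned data (positive = worse)."""
    return mse_cleaned - mse_original

# Toy ratings, invented for illustration only.
train_ratings = [5.0, 3.0, 4.0, 4.0]
test_ratings = [5.0, 3.0]
error = mse(offset_predictions(train_ratings, len(test_ratings)), test_ratings)
print("offset MSE on toy data:", error)
print("retained share:", retained_share(50, 200), "%")
```

Under these assumptions, a category with a low retained share and a method with a large `mse_increase` would appear in the upper-left region of a redundancy-vs-MSE plot.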
