SlideShare a Scribd company logo
1 of 29
Comparative Recommender System Evaluation: 
Benchmarking Recommendation Frameworks 
Alan Said 
@alansaid 
TU Delft 
Alejandro Bellogín 
@abellogin 
Universidad Autónoma de Madrid 
ACM RecSys 2014 
Foster City, CA, USA
A RecSys paper outline 
– We have a new model – it’s 
great 
– We used %DATASET% 100k 
to evaluate it 
– It’s 10% better than our 
baseline 
– It’s 12% better than 
[Authors, 2010] 
2
Benchmarking 
3
LibRec 
mrec 
Python-recsys 
4
What are the differences? 
Some things just work differently 
• Data splitting 
• Algorithm design (implementation) 
• Algorithm optimization 
• Parameter values 
• Evaluation 
• Relevance/ranking 
• Software architecture 
• etc 
Different design choices!! 
How do these choices affect 
evaluation results? 
5
Evaluate evaluation 
• Comparison of frameworks 
• Comparison of implementation 
• Comparison of results 
• Objective benchmarking 
6
Algorithmic Implementation 
Framework Class Similarity 
Item-based 
LensKit ItemItemScorer CosineVectorSimilarity, PearsonCorrelation 
Mahout GenericItemBasedRecommender UncenteredCosineSimilarity, PearsonCorrelationSimilarity 
MyMediaLite ItemKNN Cosine, Pearson 
User-based Parameters 
LensKit UserUserItemScorer 
CosineVectorSimilarity, 
PearsonCorrelation 
SimpleNeighborhoodFinder, 
NeighborhoodSize 
Mahout GenericUserBasedRecommender 
UncenteredCosineSimilarity, 
PearsonCorrelationSimilarity 
NearestNUserNeighborhood, 
neighborhoodsize 
MyMediaLite UserKNN Cosine, Pearson neighborhoodsize 
Matrix Factorization 
LensKit FunkSVDItemScorer IterationsCountStoppingCondition, factors, iterations 
Mahout SVDRecommender FunkSVDFactorizer, factors, iterations 
MyMediaLite SVDPlusPlus factors, iterations 
7
There’s more than 
algorithms though 
There’s the data, evaluation, and more 
Data splits 
• 80-20 Cross-validation 
• Random Cross-validation 
• User-based cross validation 
• Per-user splits 
• Per-item splits 
• Etc. 
Evaluation 
• Metrics 
• Relevance 
• Strategies 
8
Real world examples 
Movielens 1M 
[Cremonesi et al, 2010] 
Movielens 1M 
[Yin et al, 2012] 
9
Evaluation 
Dataset 
Training / 
Test 
Framework Evaluation 
Results 
Algorithm 
10
Internal Evaluation 
Dataset 
Training / 
Test 
Framework Evaluation 
Results 
Algorithm 
11
Internal Evaluation Results 
Algorithm Framework nDCG 
IB Cosine Mahout 0,00041478 
IB Cosine Lenskit 0,94219205 
IB Pearson Mahout 0,00516923 
IB Pearson Lenskit 0,92454613 
SVD50 Mahout 0,10542729 
SVD50 Lenskit 0,94346409 
UB Cosine Mahout 0,16929545 
UB Cosine Lenskit 0,94841356 
UB Pearson Mahout 0,16929545 
UB Pearson Lenskit 0,94841356 
Algorithm Framework RMSE 
IB Cosine Lenskit 1,01390931 
IB Cosine MyMediaLite 0,92476162 
IB Pearson Lenskit 1,05018614 
IB Pearson MyMediaLite 0,92933246 
SVD50 Lenskit 1,01209290 
SVD50 MyMediaLite 0,93074012 
UB Cosine Lenskit 1,02545490 
UB Cosine MyMediaLite 0,93419026 
12
We need a fair and common evaluation protocol! 
13
Reproducible evaluation - 
Benchmarking 
Control all parts of the process 
- Data Splitting strategy 
- Recommendation (black box) 
- Candidate items generation (what 
items to test) 
- Evaluation 
Select strategy 
• By time 
• Cross validation 
• Random 
• Ratio 
Select framework 
• Apache Mahout 
• LensKit 
• MyMediaLite 
Select algorithm 
• Tune settings 
Recommend 
Define strategy 
• What is the 
ground truth 
• What users to 
evaluate 
• What items to 
evaluate 
Select error 
metrics 
• RMSE, MAE 
Select ranking 
metrics 
• nDCG, 
Precision/Recall 
, MAP 
Split Recommend 
Candidate 
items 
Evaluate 
http://rival.recommenders.net 
14
Controlled Evaluation 
Dataset 
Training / 
Test 
Framework Evaluation 
Results 
Algorithm 
15
Lenskit vs. Mahout vs. MyMediaLite 
Movielens 100k (additional datasets in the paper) 
AN OBJECTIVE BENCHMARK 
16
The Frameworks 
AM: Apache Mahout 
LK: Lenskit 
MML: MyMediaLite 
The Candidate Items 
RPN: Relevant + N [Koren, KDD 2008] 
TI: TrainItems 
UT: UserTest 
Split Point 
gl: Global 
pu: Per-user 
Split Strategy 
cv: 5-fold cross-validation 
rt: 80-20 random ratio 
Algorithms nDCG@10 
17
User Coverage 
18
Catalog Coverage 
19
Time 
20
Good accuracy? 
21
Yes, at the cost of coverage 
22
What’s the best result? 
23
Difficult to say … depends on what you’re evaluating!! 
24
In conclusion 
• Design choices matter! 
– Some more than others 
• Evaluation needs to be documented 
• Cross-framework comparison is not easy 
– You need to have control! 
25
What have we learnt? 
26
How did we do this? 
RiVal – an evaluation toolkit for RecSys 
• http://rival.recommenders.net 
• http://github.com/recommenders/rival 
• RiVal demo later today 
• On Maven central! 
• RiVal was also used for this year’s RecSys 
Challenge 
– www.recsyschallenge.com 
27
Thanks! 
QUESTIONS? 
Special thanks: 
• Zeno Gantner 
• Michael Ekstrand 
28
• https://www.flickr.com/photos/13698839@N00/3001363490/in/photostream/ 
• http://rick--hunter.deviantart.com/art/Unfair-scale-1-149667590 
29

More Related Content

What's hot

Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesAlan Said
 
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...PyData
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisitedXavier Amatriain
 
1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptop1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptopRising Media, Inc.
 
Contextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender SystemsContextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender SystemsMatthias Braunhofer
 
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...David Zibriczky
 
Starke2017 - Effective User Interface Designs to Increase Energy-efficient Be...
Starke2017 - Effective User Interface Designs to Increase Energy-efficient Be...Starke2017 - Effective User Interface Designs to Increase Energy-efficient Be...
Starke2017 - Effective User Interface Designs to Increase Energy-efficient Be...Alain Starke
 
Efficient Top-N Recommendation by Linear Regression
Efficient Top-N Recommendation by Linear RegressionEfficient Top-N Recommendation by Linear Regression
Efficient Top-N Recommendation by Linear RegressionMark Levy
 
A Scalable, High-performance Algorithm for Hybrid Job Recommendations
A Scalable, High-performance Algorithm for Hybrid Job RecommendationsA Scalable, High-performance Algorithm for Hybrid Job Recommendations
A Scalable, High-performance Algorithm for Hybrid Job RecommendationsToon De Pessemier
 
yelp data challenge
yelp data challengeyelp data challenge
yelp data challengeAMR koura
 
Case Study Research in Software Engineering
Case Study Research in Software EngineeringCase Study Research in Software Engineering
Case Study Research in Software Engineeringalessio_ferrari
 
Facebook Talk at Netflix ML Platform meetup Sep 2019
Facebook Talk at Netflix ML Platform meetup Sep 2019Facebook Talk at Netflix ML Platform meetup Sep 2019
Facebook Talk at Netflix ML Platform meetup Sep 2019Faisal Siddiqi
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemMilind Gokhale
 
Survey Research In Empirical Software Engineering
Survey Research In Empirical Software EngineeringSurvey Research In Empirical Software Engineering
Survey Research In Empirical Software Engineeringalessio_ferrari
 
Temporal Learning and Sequence Modeling for a Job Recommender System
Temporal Learning and Sequence Modeling for a Job Recommender SystemTemporal Learning and Sequence Modeling for a Job Recommender System
Temporal Learning and Sequence Modeling for a Job Recommender SystemAnoop Kumar
 
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...alessio_ferrari
 
Brief Introduction to the 12 Steps of Evaluation Data Cleaning
Brief Introduction to the 12 Steps of Evaluation Data CleaningBrief Introduction to the 12 Steps of Evaluation Data Cleaning
Brief Introduction to the 12 Steps of Evaluation Data CleaningJennifer Morrow
 
Systematic Literature Reviews and Systematic Mapping Studies
Systematic Literature Reviews and Systematic Mapping StudiesSystematic Literature Reviews and Systematic Mapping Studies
Systematic Literature Reviews and Systematic Mapping Studiesalessio_ferrari
 
LinkedIn talk at Netflix ML Platform meetup Sep 2019
LinkedIn talk at Netflix ML Platform meetup Sep 2019LinkedIn talk at Netflix ML Platform meetup Sep 2019
LinkedIn talk at Netflix ML Platform meetup Sep 2019Faisal Siddiqi
 
Cold-Start Management with Cross-Domain Collaborative Filtering and Tags
Cold-Start Management with Cross-Domain Collaborative Filtering and TagsCold-Start Management with Cross-Domain Collaborative Filtering and Tags
Cold-Start Management with Cross-Domain Collaborative Filtering and TagsMatthias Braunhofer
 

What's hot (20)

Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System Challenges
 
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisited
 
1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptop1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptop
 
Contextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender SystemsContextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender Systems
 
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
 
Starke2017 - Effective User Interface Designs to Increase Energy-efficient Be...
Starke2017 - Effective User Interface Designs to Increase Energy-efficient Be...Starke2017 - Effective User Interface Designs to Increase Energy-efficient Be...
Starke2017 - Effective User Interface Designs to Increase Energy-efficient Be...
 
Efficient Top-N Recommendation by Linear Regression
Efficient Top-N Recommendation by Linear RegressionEfficient Top-N Recommendation by Linear Regression
Efficient Top-N Recommendation by Linear Regression
 
A Scalable, High-performance Algorithm for Hybrid Job Recommendations
A Scalable, High-performance Algorithm for Hybrid Job RecommendationsA Scalable, High-performance Algorithm for Hybrid Job Recommendations
A Scalable, High-performance Algorithm for Hybrid Job Recommendations
 
yelp data challenge
yelp data challengeyelp data challenge
yelp data challenge
 
Case Study Research in Software Engineering
Case Study Research in Software EngineeringCase Study Research in Software Engineering
Case Study Research in Software Engineering
 
Facebook Talk at Netflix ML Platform meetup Sep 2019
Facebook Talk at Netflix ML Platform meetup Sep 2019Facebook Talk at Netflix ML Platform meetup Sep 2019
Facebook Talk at Netflix ML Platform meetup Sep 2019
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Survey Research In Empirical Software Engineering
Survey Research In Empirical Software EngineeringSurvey Research In Empirical Software Engineering
Survey Research In Empirical Software Engineering
 
Temporal Learning and Sequence Modeling for a Job Recommender System
Temporal Learning and Sequence Modeling for a Job Recommender SystemTemporal Learning and Sequence Modeling for a Job Recommender System
Temporal Learning and Sequence Modeling for a Job Recommender System
 
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
 
Brief Introduction to the 12 Steps of Evaluation Data Cleaning
Brief Introduction to the 12 Steps of Evaluation Data CleaningBrief Introduction to the 12 Steps of Evaluation Data Cleaning
Brief Introduction to the 12 Steps of Evaluation Data Cleaning
 
Systematic Literature Reviews and Systematic Mapping Studies
Systematic Literature Reviews and Systematic Mapping StudiesSystematic Literature Reviews and Systematic Mapping Studies
Systematic Literature Reviews and Systematic Mapping Studies
 
LinkedIn talk at Netflix ML Platform meetup Sep 2019
LinkedIn talk at Netflix ML Platform meetup Sep 2019LinkedIn talk at Netflix ML Platform meetup Sep 2019
LinkedIn talk at Netflix ML Platform meetup Sep 2019
 
Cold-Start Management with Cross-Domain Collaborative Filtering and Tags
Cold-Start Management with Cross-Domain Collaborative Filtering and TagsCold-Start Management with Cross-Domain Collaborative Filtering and Tags
Cold-Start Management with Cross-Domain Collaborative Filtering and Tags
 

Viewers also liked

The Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just RatingsThe Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just RatingsAlan Said
 
Sistemas de Recomendação Usando Mahout - Dia 2
Sistemas de Recomendação Usando Mahout - Dia 2Sistemas de Recomendação Usando Mahout - Dia 2
Sistemas de Recomendação Usando Mahout - Dia 2Alison Marczewski
 
Sistemas de Recomendação Usando Mahout - Dia 1
Sistemas de Recomendação Usando Mahout - Dia 1Sistemas de Recomendação Usando Mahout - Dia 1
Sistemas de Recomendação Usando Mahout - Dia 1Alison Marczewski
 
Des maths et des recommandations - Devoxx 2014
Des maths et des recommandations - Devoxx 2014Des maths et des recommandations - Devoxx 2014
Des maths et des recommandations - Devoxx 2014Loïc Knuchel
 
The good the bad and the ugly - final
The good the bad and the ugly - finalThe good the bad and the ugly - final
The good the bad and the ugly - finalAndre Verschelling
 
Le temps réel au coeur de toutes les stratégies digitales
Le temps réel au coeur de toutes les stratégies digitales Le temps réel au coeur de toutes les stratégies digitales
Le temps réel au coeur de toutes les stratégies digitales Netwave
 
Example: movielens data with mahout
Example: movielens data with mahoutExample: movielens data with mahout
Example: movielens data with mahoutGregg Barrett
 
Co-occurrence Based Recommendations with Mahout, Scala and Spark
Co-occurrence Based Recommendations with Mahout, Scala and SparkCo-occurrence Based Recommendations with Mahout, Scala and Spark
Co-occurrence Based Recommendations with Mahout, Scala and Sparksscdotopen
 
Slope one recommender on hadoop
Slope one recommender on hadoopSlope one recommender on hadoop
Slope one recommender on hadoopYONG ZHENG
 
Solving the AL Chicken-and-Egg Corpus and Model Problem
Solving the AL Chicken-and-Egg Corpus and Model ProblemSolving the AL Chicken-and-Egg Corpus and Model Problem
Solving the AL Chicken-and-Egg Corpus and Model ProblemDain Kaplan
 
Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014rpbrehm
 
Recommendation Engine using Apache Mahout
Recommendation Engine using Apache MahoutRecommendation Engine using Apache Mahout
Recommendation Engine using Apache MahoutAmbarish Hazarnis
 
Multi Criteria Recommender Systems - Overview
Multi Criteria Recommender Systems - OverviewMulti Criteria Recommender Systems - Overview
Multi Criteria Recommender Systems - OverviewDavide Giannico
 
Your own recommendation engine with neo4j and reco4php - DPC16
Your own recommendation engine with neo4j and reco4php - DPC16Your own recommendation engine with neo4j and reco4php - DPC16
Your own recommendation engine with neo4j and reco4php - DPC16Christophe Willemsen
 
Summary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperSummary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperChangsung Moon
 
Profile injection attack detection in recommender system
Profile injection attack detection in recommender systemProfile injection attack detection in recommender system
Profile injection attack detection in recommender systemASHISH PANNU
 
Mahout Workshop on Google Cloud Platform
Mahout Workshop on Google Cloud PlatformMahout Workshop on Google Cloud Platform
Mahout Workshop on Google Cloud PlatformIMC Institute
 
Recommendation Engine Project Presentation
Recommendation Engine Project PresentationRecommendation Engine Project Presentation
Recommendation Engine Project Presentation19Divya
 
Recommender Systems and Active Learning
Recommender Systems and Active LearningRecommender Systems and Active Learning
Recommender Systems and Active LearningDain Kaplan
 
Online recommendations at scale using matrix factorisation
Online recommendations at scale using matrix factorisationOnline recommendations at scale using matrix factorisation
Online recommendations at scale using matrix factorisationMarcus Ljungblad
 

Viewers also liked (20)

The Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just RatingsThe Magic Barrier of Recommender Systems - No Magic, Just Ratings
The Magic Barrier of Recommender Systems - No Magic, Just Ratings
 
Sistemas de Recomendação Usando Mahout - Dia 2
Sistemas de Recomendação Usando Mahout - Dia 2Sistemas de Recomendação Usando Mahout - Dia 2
Sistemas de Recomendação Usando Mahout - Dia 2
 
Sistemas de Recomendação Usando Mahout - Dia 1
Sistemas de Recomendação Usando Mahout - Dia 1Sistemas de Recomendação Usando Mahout - Dia 1
Sistemas de Recomendação Usando Mahout - Dia 1
 
Des maths et des recommandations - Devoxx 2014
Des maths et des recommandations - Devoxx 2014Des maths et des recommandations - Devoxx 2014
Des maths et des recommandations - Devoxx 2014
 
The good the bad and the ugly - final
The good the bad and the ugly - finalThe good the bad and the ugly - final
The good the bad and the ugly - final
 
Le temps réel au coeur de toutes les stratégies digitales
Le temps réel au coeur de toutes les stratégies digitales Le temps réel au coeur de toutes les stratégies digitales
Le temps réel au coeur de toutes les stratégies digitales
 
Example: movielens data with mahout
Example: movielens data with mahoutExample: movielens data with mahout
Example: movielens data with mahout
 
Co-occurrence Based Recommendations with Mahout, Scala and Spark
Co-occurrence Based Recommendations with Mahout, Scala and SparkCo-occurrence Based Recommendations with Mahout, Scala and Spark
Co-occurrence Based Recommendations with Mahout, Scala and Spark
 
Slope one recommender on hadoop
Slope one recommender on hadoopSlope one recommender on hadoop
Slope one recommender on hadoop
 
Solving the AL Chicken-and-Egg Corpus and Model Problem
Solving the AL Chicken-and-Egg Corpus and Model ProblemSolving the AL Chicken-and-Egg Corpus and Model Problem
Solving the AL Chicken-and-Egg Corpus and Model Problem
 
Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014
 
Recommendation Engine using Apache Mahout
Recommendation Engine using Apache MahoutRecommendation Engine using Apache Mahout
Recommendation Engine using Apache Mahout
 
Multi Criteria Recommender Systems - Overview
Multi Criteria Recommender Systems - OverviewMulti Criteria Recommender Systems - Overview
Multi Criteria Recommender Systems - Overview
 
Your own recommendation engine with neo4j and reco4php - DPC16
Your own recommendation engine with neo4j and reco4php - DPC16Your own recommendation engine with neo4j and reco4php - DPC16
Your own recommendation engine with neo4j and reco4php - DPC16
 
Summary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperSummary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paper
 
Profile injection attack detection in recommender system
Profile injection attack detection in recommender systemProfile injection attack detection in recommender system
Profile injection attack detection in recommender system
 
Mahout Workshop on Google Cloud Platform
Mahout Workshop on Google Cloud PlatformMahout Workshop on Google Cloud Platform
Mahout Workshop on Google Cloud Platform
 
Recommendation Engine Project Presentation
Recommendation Engine Project PresentationRecommendation Engine Project Presentation
Recommendation Engine Project Presentation
 
Recommender Systems and Active Learning
Recommender Systems and Active LearningRecommender Systems and Active Learning
Recommender Systems and Active Learning
 
Online recommendations at scale using matrix factorisation
Online recommendations at scale using matrix factorisationOnline recommendations at scale using matrix factorisation
Online recommendations at scale using matrix factorisation
 

Similar to Comparative Recommender System Evaluation: Benchmarking Recommendation Frameworks

Critical steps in Determining Your Value Stream Management Solution
Critical steps in Determining Your Value Stream Management SolutionCritical steps in Determining Your Value Stream Management Solution
Critical steps in Determining Your Value Stream Management SolutionDevOps.com
 
Toward a Traceable, Explainable and fair JD/Resume Recommendation System
Toward a Traceable, Explainable and fair JD/Resume Recommendation SystemToward a Traceable, Explainable and fair JD/Resume Recommendation System
Toward a Traceable, Explainable and fair JD/Resume Recommendation SystemAmine Barrak
 
A New Model for Building Business Process Quality
A New Model for Building Business Process QualityA New Model for Building Business Process Quality
A New Model for Building Business Process QualityWorksoft
 
Quality for DevOps teams - Quality engineering in the DevOps culture
Quality for DevOps teams - Quality engineering in the DevOps cultureQuality for DevOps teams - Quality engineering in the DevOps culture
Quality for DevOps teams - Quality engineering in the DevOps cultureRik Marselis
 
Phil Green - We're migrating to the cloud - Who needs service management
Phil Green - We're migrating to the cloud - Who needs service managementPhil Green - We're migrating to the cloud - Who needs service management
Phil Green - We're migrating to the cloud - Who needs service managementitSMF UK
 
2cee Master Cocomo20071
2cee Master Cocomo200712cee Master Cocomo20071
2cee Master Cocomo20071CS, NcState
 
JOB EVALUATION
JOB EVALUATIONJOB EVALUATION
JOB EVALUATIONHRM751
 
A Guide to Effective Benchmarking of Applications Development
A Guide to Effective Benchmarking of Applications DevelopmentA Guide to Effective Benchmarking of Applications Development
A Guide to Effective Benchmarking of Applications DevelopmentComputer Aid, Inc
 
Software Engineering Plan & Methodology Recommendation
Software Engineering Plan & Methodology RecommendationSoftware Engineering Plan & Methodology Recommendation
Software Engineering Plan & Methodology RecommendationDhatri Misra
 
“MLPerf: An Industry Standard Performance Benchmark Suite for Machine Learnin...
“MLPerf: An Industry Standard Performance Benchmark Suite for Machine Learnin...“MLPerf: An Industry Standard Performance Benchmark Suite for Machine Learnin...
“MLPerf: An Industry Standard Performance Benchmark Suite for Machine Learnin...Edge AI and Vision Alliance
 
Quality Engineering in today's cross-functTeams with TMAP
Quality Engineering in today's cross-functTeams with TMAPQuality Engineering in today's cross-functTeams with TMAP
Quality Engineering in today's cross-functTeams with TMAPRik Marselis
 
Downloads abc 2006 presentation downloads-ramesh_babu
Downloads abc 2006   presentation downloads-ramesh_babuDownloads abc 2006   presentation downloads-ramesh_babu
Downloads abc 2006 presentation downloads-ramesh_babuHem Rana
 
Day 1 1620 - 1705 - maple - pranabendu bhattacharyya
Day 1   1620 - 1705 - maple - pranabendu bhattacharyyaDay 1   1620 - 1705 - maple - pranabendu bhattacharyya
Day 1 1620 - 1705 - maple - pranabendu bhattacharyyaPMI2011
 
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02PMI_IREP_TP
 
[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation SystemsAxel de Romblay
 
Isabel Evans - Working Ourselves out of a Job: A Passion For Improvement - Eu...
Isabel Evans - Working Ourselves out of a Job: A Passion For Improvement - Eu...Isabel Evans - Working Ourselves out of a Job: A Passion For Improvement - Eu...
Isabel Evans - Working Ourselves out of a Job: A Passion For Improvement - Eu...TEST Huddle
 
Udacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsUdacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsAxel de Romblay
 

Similar to Comparative Recommender System Evaluation: Benchmarking Recommendation Frameworks (20)

Critical steps in Determining Your Value Stream Management Solution
Critical steps in Determining Your Value Stream Management SolutionCritical steps in Determining Your Value Stream Management Solution
Critical steps in Determining Your Value Stream Management Solution
 
Toward a Traceable, Explainable and fair JD/Resume Recommendation System
Toward a Traceable, Explainable and fair JD/Resume Recommendation SystemToward a Traceable, Explainable and fair JD/Resume Recommendation System
Toward a Traceable, Explainable and fair JD/Resume Recommendation System
 
Bd bmk e
Bd bmk eBd bmk e
Bd bmk e
 
A New Model for Building Business Process Quality
A New Model for Building Business Process QualityA New Model for Building Business Process Quality
A New Model for Building Business Process Quality
 
Benchmarking
BenchmarkingBenchmarking
Benchmarking
 
Apsec 2014 Presentation
Apsec 2014 PresentationApsec 2014 Presentation
Apsec 2014 Presentation
 
Quality for DevOps teams - Quality engineering in the DevOps culture
Quality for DevOps teams - Quality engineering in the DevOps cultureQuality for DevOps teams - Quality engineering in the DevOps culture
Quality for DevOps teams - Quality engineering in the DevOps culture
 
Phil Green - We're migrating to the cloud - Who needs service management
Phil Green - We're migrating to the cloud - Who needs service managementPhil Green - We're migrating to the cloud - Who needs service management
Phil Green - We're migrating to the cloud - Who needs service management
 
2cee Master Cocomo20071
2cee Master Cocomo200712cee Master Cocomo20071
2cee Master Cocomo20071
 
JOB EVALUATION
JOB EVALUATIONJOB EVALUATION
JOB EVALUATION
 
A Guide to Effective Benchmarking of Applications Development
A Guide to Effective Benchmarking of Applications DevelopmentA Guide to Effective Benchmarking of Applications Development
A Guide to Effective Benchmarking of Applications Development
 
Software Engineering Plan & Methodology Recommendation
Software Engineering Plan & Methodology RecommendationSoftware Engineering Plan & Methodology Recommendation
Software Engineering Plan & Methodology Recommendation
 
“MLPerf: An Industry Standard Performance Benchmark Suite for Machine Learnin...
“MLPerf: An Industry Standard Performance Benchmark Suite for Machine Learnin...“MLPerf: An Industry Standard Performance Benchmark Suite for Machine Learnin...
“MLPerf: An Industry Standard Performance Benchmark Suite for Machine Learnin...
 
Quality Engineering in today's cross-functTeams with TMAP
Quality Engineering in today's cross-functTeams with TMAPQuality Engineering in today's cross-functTeams with TMAP
Quality Engineering in today's cross-functTeams with TMAP
 
Downloads abc 2006 presentation downloads-ramesh_babu
Downloads abc 2006   presentation downloads-ramesh_babuDownloads abc 2006   presentation downloads-ramesh_babu
Downloads abc 2006 presentation downloads-ramesh_babu
 
Day 1 1620 - 1705 - maple - pranabendu bhattacharyya
Day 1   1620 - 1705 - maple - pranabendu bhattacharyyaDay 1   1620 - 1705 - maple - pranabendu bhattacharyya
Day 1 1620 - 1705 - maple - pranabendu bhattacharyya
 
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
 
[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems
 
Isabel Evans - Working Ourselves out of a Job: A Passion For Improvement - Eu...
Isabel Evans - Working Ourselves out of a Job: A Passion For Improvement - Eu...Isabel Evans - Working Ourselves out of a Job: A Passion For Improvement - Eu...
Isabel Evans - Working Ourselves out of a Job: A Passion For Improvement - Eu...
 
Udacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsUdacity webinar on Recommendation Systems
Udacity webinar on Recommendation Systems
 

More from Alan Said

Information Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System EvaluationInformation Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System EvaluationAlan Said
 
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...Alan Said
 
A 3D Approach to Recommender System Evaluation
A 3D Approach to Recommender System EvaluationA 3D Approach to Recommender System Evaluation
A 3D Approach to Recommender System EvaluationAlan Said
 
State of RecSys: Recap of RecSys 2012
State of RecSys: Recap of RecSys 2012State of RecSys: Recap of RecSys 2012
State of RecSys: Recap of RecSys 2012Alan Said
 
RecSysChallenge Opening
RecSysChallenge OpeningRecSysChallenge Opening
RecSysChallenge OpeningAlan Said
 
Estimating the Magic Barrier of Recommender Systems: A User Study
Estimating the Magic Barrier of Recommender Systems: A User StudyEstimating the Magic Barrier of Recommender Systems: A User Study
Estimating the Magic Barrier of Recommender Systems: A User StudyAlan Said
 
Users and Noise: The Magic Barrier of Recommender Systems
Users and Noise: The Magic Barrier of Recommender SystemsUsers and Noise: The Magic Barrier of Recommender Systems
Users and Noise: The Magic Barrier of Recommender SystemsAlan Said
 
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...Alan Said
 
CaRR 2012 Opening Presentation
CaRR 2012 Opening PresentationCaRR 2012 Opening Presentation
CaRR 2012 Opening PresentationAlan Said
 
Personalizing Tags: A Folksonomy-like Approach for Recommending Movies
Personalizing Tags: A Folksonomy-like Approach for Recommending MoviesPersonalizing Tags: A Folksonomy-like Approach for Recommending Movies
Personalizing Tags: A Folksonomy-like Approach for Recommending MoviesAlan Said
 
Inferring Contextual User Profiles - Improving Recommender Performance
Inferring Contextual User Profiles - Improving Recommender PerformanceInferring Contextual User Profiles - Improving Recommender Performance
Inferring Contextual User Profiles - Improving Recommender PerformanceAlan Said
 
Using Social- and Pseudo-Social Networks to Improve Recommendation Quality
Using Social- and Pseudo-Social Networks to Improve Recommendation QualityUsing Social- and Pseudo-Social Networks to Improve Recommendation Quality
Using Social- and Pseudo-Social Networks to Improve Recommendation QualityAlan Said
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender SystemsAlan Said
 

More from Alan Said (13)

Information Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System EvaluationInformation Retrieval and User-centric Recommender System Evaluation
Information Retrieval and User-centric Recommender System Evaluation
 
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
User-Centric Evaluation of a K-Furthest Neighbor Collaborative Filtering Reco...
 
A 3D Approach to Recommender System Evaluation
A 3D Approach to Recommender System EvaluationA 3D Approach to Recommender System Evaluation
A 3D Approach to Recommender System Evaluation
 
State of RecSys: Recap of RecSys 2012
State of RecSys: Recap of RecSys 2012State of RecSys: Recap of RecSys 2012
State of RecSys: Recap of RecSys 2012
 
RecSysChallenge Opening
RecSysChallenge OpeningRecSysChallenge Opening
RecSysChallenge Opening
 
Estimating the Magic Barrier of Recommender Systems: A User Study
Estimating the Magic Barrier of Recommender Systems: A User StudyEstimating the Magic Barrier of Recommender Systems: A User Study
Estimating the Magic Barrier of Recommender Systems: A User Study
 
Users and Noise: The Magic Barrier of Recommender Systems
Users and Noise: The Magic Barrier of Recommender SystemsUsers and Noise: The Magic Barrier of Recommender Systems
Users and Noise: The Magic Barrier of Recommender Systems
 
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
Analyzing Weighting Schemes in Collaborative Filtering: Cold Start, Post Cold...
 
CaRR 2012 Opening Presentation
CaRR 2012 Opening PresentationCaRR 2012 Opening Presentation
CaRR 2012 Opening Presentation
 
Personalizing Tags: A Folksonomy-like Approach for Recommending Movies
Personalizing Tags: A Folksonomy-like Approach for Recommending MoviesPersonalizing Tags: A Folksonomy-like Approach for Recommending Movies
Personalizing Tags: A Folksonomy-like Approach for Recommending Movies
 
Inferring Contextual User Profiles - Improving Recommender Performance
Inferring Contextual User Profiles - Improving Recommender PerformanceInferring Contextual User Profiles - Improving Recommender Performance
Inferring Contextual User Profiles - Improving Recommender Performance
 
Using Social- and Pseudo-Social Networks to Improve Recommendation Quality
Using Social- and Pseudo-Social Networks to Improve Recommendation QualityUsing Social- and Pseudo-Social Networks to Improve Recommendation Quality
Using Social- and Pseudo-Social Networks to Improve Recommendation Quality
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 

Recently uploaded

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationBoston Institute of Analytics
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 

Recently uploaded (20)

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 

Comparative Recommender System Evaluation: Benchmarking Recommendation Frameworks

  • 1. Comparative Recommender System Evaluation: Benchmarking Recommendation Frameworks Alan Said @alansaid TU Delft Alejandro Bellogín @abellogin Universidad Autónoma de Madrid ACM RecSys 2014 Foster City, CA, USA
  • 2. A RecSys paper outline – We have a new model – it’s great – We used %DATASET% 100k to evaluate it – It’s 10% better than our baseline – It’s 12% better than [Authors, 2010] 2
  • 5. What are the differences? Some things just work differently • Data splitting • Algorithm design (implementation) • Algorithm optimization • Parameter values • Evaluation • Relevance/ranking • Software architecture • etc Different design choices!! How do these choices affect evaluation results? 5
  • 6. Evaluate evaluation • Comparison of frameworks • Comparison of implementation • Comparison of results • Objective benchmarking 6
  • 7. Algorithmic Implementation Framework Class Similarity Item-based LensKit ItemItemScorer CosineVectorSimilarity, PearsonCorrelation Mahout GenericItemBasedRecommender UncenteredCosineSimilarity, PearsonCorrelationSimilarity MyMediaLite ItemKNN Cosine, Pearson User-based Parameters LensKit UserUserItemScorer CosineVectorSimilarity, PearsonCorrelation SimpleNeighborhoodFinder, NeighborhoodSize Mahout GenericUserBasedRecommender UncenteredCosineSimilarity, PearsonCorrelationSimilarity NearestNUserNeighborhood, neighborhoodsize MyMediaLite UserKNN Cosine, Pearson neighborhoodsize Matrix Factorization LensKit FunkSVDItemScorer IterationsCountStoppingCondition, factors, iterations Mahout SVDRecommender FunkSVDFactorizer, factors, iterations MyMediaLite SVDPlusPlus factors, iterations 7
  • 8. There’s more than algorithms though There’s the data, evaluation, and more Data splits • 80-20 Cross-validation • Random Cross-validation • User-based cross validation • Per-user splits • Per-item splits • Etc. Evaluation • Metrics • Relevance • Strategies 8
  • 9. Real world examples Movielens 1M [Cremonesi et al, 2010] Movielens 1M [Yin et al, 2012] 9
  • 10. Evaluation Dataset Training / Test Framework Evaluation Results Algorithm 10
  • 11. Internal Evaluation Dataset Training / Test Framework Evaluation Results Algorithm 11
  • 12. Internal Evaluation Results Algorithm Framework nDCG IB Cosine Mahout 0,00041478 IB Cosine Lenskit 0,94219205 IB Pearson Mahout 0,00516923 IB Pearson Lenskit 0,92454613 SVD50 Mahout 0,10542729 SVD50 Lenskit 0,94346409 UB Cosine Mahout 0,16929545 UB Cosine Lenskit 0,94841356 UB Pearson Mahout 0,16929545 UB Pearson Lenskit 0,94841356 Algorithm Framework RMSE IB Cosine Lenskit 1,01390931 IB Cosine MyMediaLite 0,92476162 IB Pearson Lenskit 1,05018614 IB Pearson MyMediaLite 0,92933246 SVD50 Lenskit 1,01209290 SVD50 MyMediaLite 0,93074012 UB Cosine Lenskit 1,02545490 UB Cosine MyMediaLite 0,93419026 12
  • 13. We need a fair and common evaluation protocol! 13
  • 14. Reproducible evaluation - Benchmarking Control all parts of the process - Data Splitting strategy - Recommendation (black box) - Candidate items generation (what items to test) - Evaluation Select strategy • By time • Cross validation • Random • Ratio Select framework • Apache Mahout • LensKit • MyMediaLite Select algorithm • Tune settings Recommend Define strategy • What is the ground truth • What users to evaluate • What items to evaluate Select error metrics • RMSE, MAE Select ranking metrics • nDCG, Precision/Recall , MAP Split Recommend Candidate items Evaluate http://rival.recommenders.net 14
  • 15. Controlled Evaluation Dataset Training / Test Framework Evaluation Results Algorithm 15
  • 16. Lenskit vs. Mahout vs. MyMediaLite Movielens 100k (additional datasets in the paper) AN OBJECTIVE BENCHMARK 16
  • 17. The Frameworks AM: Apache Mahout LK: Lenskit MML: MyMediaLite The Candidate Items RPN: Relevant + N [Koren, KDD 2008] TI: TrainItems UT: UserTest Split Point gl: Global pu: Per-user Split Strategy cv: 5-fold cross-validation rt: 80-20 random ratio Algorithms nDCG@10 17
  • 22. Yes, at the cost of coverage 22
  • 23. What’s the best result? 23
  • 24. Difficult to say … depends on what you’re evaluating!! 24
  • 25. In conclusion • Design choices matter! – Some more than others • Evaluation needs to be documented • Cross-framework comparison is not easy – You need to have control! 25
  • 26. What have we learnt? 26
  • 27. How did we do this? RiVal – an evaluation toolkit for RecSys • http://rival.recommenders.net • http://github.com/recommenders/rival • RiVal demo later today • On Maven central! • RiVal was also used for this year’s RecSys Challenge – www.recsyschallenge.com 27
  • 28. Thanks! QUESTIONS? Special thanks: • Zeno Gantner • Michael Ekstrand 28
  • 29. • https://www.flickr.com/photos/13698839@N00/3001363490/in/photostream/ • http://rick--hunter.deviantart.com/art/Unfair-scale-1-149667590 29

Editor's Notes

  1. What did they actually evaluate?
  2. Let’s start with talking about what benchmarking is. Mainly, we want to focus on the difference between benchmarking and evaluation. When we evaluate, we evaluate one system in the specific context of our evaluation. But what about benchmarking? Benchmarking is when we compare systems against each other.
  3. But, we need to know what it is that we are comparing – or benchmarking – and we need to know that we’re doing it correctly.
  4. Currently, there are several popular recommendation frameworks. The three that are popularly used at RecSys, Mahout, MyMediaLite and LensKit offer very similar recommendation setups (and evaluation solutions). In this paper, we ran experiments to see how comparable the frameworks are to each other.
  5. What we did in this paper How comparable are the frameworks and results
  6. The only thing we know here is that they used ml1m and puresvd. We don’t know the topics, we don’t know the software – The actual implementation of the algorithm Because of that we compared the three most popular frameworks to each other, using three basic algorithms
  7. Using each framework’s internal evaluators!!! Common input: the same dataset, the same algorithms. Mahout does not have an evaluator which is capable of calculating RMSE and nDCG at the same time MyMediaLite does either ranking or rating prediction
  8. You control the data split, the evaluation, the metric When you can control this, you can see all factors, i.e. different splits in the same framework, different evaluation strategies, etc.
  9. Zoom in on one set of results (first quadrant) Candidate items: what am I going to rank TI: rank all items for each user (except those in the training set for that user) UT: only rank items in the user’s test set – this is what you do when you do rating prediction
  10. User coverage is the percentage of users in the dataset that we can recommend for
  11. The percentage of items that we can recommend
  12. Difficult to say Depends on the strategy!!! Need to set the evaluation context beforehand