SlideShare a Scribd company logo
1 of 1
Download to read offline
Predicting Performance in Recommender Systems
                                                                                                  Alejandro Bellogín
                                                                     Supervised by Pablo Castells and Iván Cantador
                                                           Information Retrieval Group, Universidad Autónoma de Madrid, Spain
                                                                                alejandro.bellogin@uam.es
                                                                                                                                                                                                                                                            IRG
                                                                                                                                                                                                                                                            IR Group @ UAM


          Motivation                                                                                                    Research questions
 Is it possible to predict the accuracy of a recommendation?                                             1. Is it possible to define a performance prediction theory for recommender
 E.g., we can decide whether to deliver a recommendation or not,                                            systems in a sound, formal way?
 depending on such prediction. Or, even, to combine different
                                                                                                         2. Is it possible to adapt query performance techniques (from IR) to the
 recommenders according to the expected performance of each one.
                                                                                                            recommendation task?
                                                                                                         3. What kind of evaluation should be performed? Is IR evaluation still valid in our
          Hypothesis                                                                                        problem?
 Data that are commonly available to a Recommender System could
 contain signals that enable an a priori estimation of the success of
                                                                                                         4. What kind of recommendation problems can these models be applied to?
 the recommendation


          Research question 1                                                                 Research question 2
 Is it possible to define a performance prediction theory                                                                                                                                                                     User clarity
                                                                            Is it possible to adapt query performance
 for recommender systems in a sound, formal way?                            techniques (from IR) to the recommendation task? It captures the uncertainty in user’s data
                                                                            • In Information Retrieval: “Estimation of the system’s
                                                                                                                                                                       p  x | u   user’s model
 a) Define a predictor of performance                                          performance in response to a specific query”         clarity  u    p  x | u  log 
                                                                                                                                                                       p  x     
                                                                                                                                                    x X                                                                                                                   c                    system’s model
                         = (u, i, r, …)                                   •       Several predictors proposed                                                                                                               Distance between the user’s and the
 b) Agree on a performance metric                                           •       We focus on query clarity  user clarity                                                                                                  system’s probability model
                         = (u, i, r, …)
                                                                                        0.7                                                           RatItem                                       p_c(x)
                                                                                        0.6                                                                                                         p(x|u1)
                                                                                                                                                                                                    p(x|u2)
                                                                                                                                                                                                                              We propose 3 formulations (for space X):
 c) Check predictive power by measuring correlation
                                                                                        0.5



                                                                                                                                                                                                                              • Based on ratings
                                                                                        0.4




         corr([(x1), …, (xn)], [(x1), …, (xn)])
                                                                                        0.3

                                                                                        0.2
                                                                                                                                                                                                                              • Based on items
                                                                                                                                                                                                                              • Based on ratings and items
                                                                                        0.1

 d) Evaluate final performance: dynamic vs static                                       0.0
                                                                                              4           3         5           2         1        Ratings 4         3       5       2          1




Correlation coefficients
                                                        r ~ 0.57
                                                                                              Research question 4
                                                                            What kind of recommendation problems can these models be applied to?
Pearson: linear correlation                                                 •      Whenever a combination of strategies is available
Spearman: rank correlation
                                                                            • Example 1: dynamic neighbor weighting                                                                                             • Example 2: dynamic ensemble recommendation
                                                                            • The user’s neighbors are                                                                                weighted                  • Weight is the same for every item and user
                                                                              according to their similarity                                                                                                       (learnt from training)
                                                                            • Can we take into account the uncertainty in                                                                                       • What about boosting those users predicted to
          Research question 3                                                 neighbor’s data?                                                                                                                    perform better for some recommender?
 What kind of evaluation should be performed? Is IR                         • User neighbor weighting:                                                                                                          • Hybrid recommendation:
 evaluation still valid in our problem?                                            • Static:                                g  u, i   C            sim u, v   rat  v, i                                  • Static: g u, i     gR1 u, i   1     gR2 u, i 
                                                                                                                                                   vN [ u ]
 •    In IR: Mean Average Precision + correlation                             • Dynamic: g  u, i   C  γ  v   sim u, v   rat  v, i  • Dynamic: g u, i   γ u   gR1 u, i   1  γ u   gR2 u, i 
                                                                                                       vN [ u ]
           50 points (queries) vs 1000+ points (users)                      • Correlation analysis:                                           • Correlation analysis:
 •    Performance metric is not clear:
             error-based?
                                                                            • Performance:
             precision-based?                                                   0,98                               Standard CF                            0,88 b) Neighbourhood size: 500
                                                                                                                                                                                  Standard CF                            b) 80% training
                                                                                 0,96                               Clarity-enhanced CF                    0,87                    Clarity-enhanced CF


 • What is performance?                                                          0,94                                                                      0,86
                                                                                                                                                                                                                                                                 nDCG@50
                                                                                 0,92
                                                                                 0,90
                                                                                                                                                           0,85
                                                                                                                                                                                                                • Performance:
                                                                                                                                                     MAE




                                                                                                                                                                                                                                                      0.2
                                                                           MAE




          It may depend on the final application
                                                                                                                                                           0,84
                                                                                 0,88
                                                                                                                                                           0,83                                                                                      0.15
                                                                                 0,86
                                                                                 0,84                                                                      0,82

 • Possible bias                                                                 0,82
                                                                                 0,80
                                                                                                                                                           0,81
                                                                                                                                                           0,80
                                                                                                                                                                                                                                                      0.1


                                                                                                                                                                                                                                                     0.05

          E.g., towards users or items with larger profiles
                                                                                         10       20     30    40    50    60       70   80   90       10      20 100 150 200 250 300 350 400 450 500
                                                                                                                                                                   30 40 50 60 70 80 90                        100 150 200 250 300 350 400 450 500
                                                                                                       % of ratings for training                                         Neighbourhood size                                                            0
                                                                                                                                                                                                                                                            H1   H2                      H3           H4

                                                                                                                                                                                                                                                                 Adaptive       Static




          Future Work                                                                                                                                                                                                 Publications
                  Explore other input sources                              We need a theoretical background                                                                                                   • A Performance Prediction Aproach to Enhance
                                                                           Why do some predictors work better?                                                                                                  Collaborative Filtering Performance. A. Bellogín
      Implicit data (with time)       Social links
                                                                                                                                                                                                                and P. Castells. In ECIR 2010.
      Item predictors                 o Use graph-based measures
                                                                                                                                                                                                              • Predicting the Performance of Recommender
      o Different recommender          as indicators of user strength
                                                                                                                                                                                                                Systems: An Information Theoretic Approach. A.
      behavior depending on item       o   First results are positive                                                                                                                                           Bellogín, P. Castells, and I. Cantador. In ICTIR
      attributes                                                                                                                                                                                                2011.
      o They would allow to capture                                                                                                                                                                           • Self-adjusting Hybrid Recommenders based on
      popularity, diversity, etc.                                                                                                                                                                               Social Network Analysis. A. Bellogín, P. Castells,
                                                                           Larger datasets
                                                                                                                                                                                                                and I. Cantador. In SIGIR 2011.
                                                 5th ACM Conference on Recommender Systems (RecSys2011) – Doctoral Symposium Acknowledgments to the National Science Foundation
                                                                        Chicago, USA, 23-27 October 2011                            for the funding to attend the conference

More Related Content

What's hot

97cogsci.doc
97cogsci.doc97cogsci.doc
97cogsci.docbutest
 
Calling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA PresentationCalling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA PresentationNick van Terheyden
 
Meta analysis-sloan
Meta analysis-sloanMeta analysis-sloan
Meta analysis-sloanJohn Turner
 
IRJET - Ensembling Reinforcement Learning for Portfolio Management
IRJET -  	  Ensembling Reinforcement Learning for Portfolio ManagementIRJET -  	  Ensembling Reinforcement Learning for Portfolio Management
IRJET - Ensembling Reinforcement Learning for Portfolio ManagementIRJET Journal
 
Performance analysis of machine learning algorithms on self localization system1
Performance analysis of machine learning algorithms on self localization system1Performance analysis of machine learning algorithms on self localization system1
Performance analysis of machine learning algorithms on self localization system1Venkat Projects
 
Paradigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learningParadigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learningSalford Systems
 
New challenges monolixday2011
New challenges monolixday2011New challenges monolixday2011
New challenges monolixday2011blaudez
 
5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objects5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objectsYogeeswar Reddy
 
Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..Accenture
 
Secondary mathematics wednesday august 22 2012
Secondary mathematics   wednesday august 22 2012Secondary mathematics   wednesday august 22 2012
Secondary mathematics wednesday august 22 2012brearatliff
 
Chapter6.doc
Chapter6.docChapter6.doc
Chapter6.docbutest
 

What's hot (12)

97cogsci.doc
97cogsci.doc97cogsci.doc
97cogsci.doc
 
G
GG
G
 
Calling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA PresentationCalling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA Presentation
 
Meta analysis-sloan
Meta analysis-sloanMeta analysis-sloan
Meta analysis-sloan
 
IRJET - Ensembling Reinforcement Learning for Portfolio Management
IRJET -  	  Ensembling Reinforcement Learning for Portfolio ManagementIRJET -  	  Ensembling Reinforcement Learning for Portfolio Management
IRJET - Ensembling Reinforcement Learning for Portfolio Management
 
Performance analysis of machine learning algorithms on self localization system1
Performance analysis of machine learning algorithms on self localization system1Performance analysis of machine learning algorithms on self localization system1
Performance analysis of machine learning algorithms on self localization system1
 
Paradigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learningParadigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learning
 
New challenges monolixday2011
New challenges monolixday2011New challenges monolixday2011
New challenges monolixday2011
 
5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objects5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objects
 
Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..
 
Secondary mathematics wednesday august 22 2012
Secondary mathematics   wednesday august 22 2012Secondary mathematics   wednesday august 22 2012
Secondary mathematics wednesday august 22 2012
 
Chapter6.doc
Chapter6.docChapter6.doc
Chapter6.doc
 

Viewers also liked

2012 Digital Opportunities
2012 Digital Opportunities2012 Digital Opportunities
2012 Digital OpportunitiesCLOSEUP Agency
 
Leergang Opleidingsmanagement
Leergang OpleidingsmanagementLeergang Opleidingsmanagement
Leergang OpleidingsmanagementMarvinMoentadj
 
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...EXPOO
 
PEShare.co.uk Shared Resource
PEShare.co.uk Shared ResourcePEShare.co.uk Shared Resource
PEShare.co.uk Shared Resourcepeshare.co.uk
 

Viewers also liked (6)

2012 Digital Opportunities
2012 Digital Opportunities2012 Digital Opportunities
2012 Digital Opportunities
 
Leergang Opleidingsmanagement
Leergang OpleidingsmanagementLeergang Opleidingsmanagement
Leergang Opleidingsmanagement
 
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
 
Segundo examen
Segundo examenSegundo examen
Segundo examen
 
PEShare.co.uk Shared Resource
PEShare.co.uk Shared ResourcePEShare.co.uk Shared Resource
PEShare.co.uk Shared Resource
 
Pratica i
Pratica iPratica i
Pratica i
 

Similar to Predicting performance in Recommender Systems - Poster

Predicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesPredicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesAlejandro Bellogin
 
Predicting Employee Attrition
Predicting Employee AttritionPredicting Employee Attrition
Predicting Employee AttritionShruti Mohan
 
2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern Recognition2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern RecognitionDongseo University
 
LAK13 linkedup tutorial_evaluation_framework
LAK13 linkedup tutorial_evaluation_frameworkLAK13 linkedup tutorial_evaluation_framework
LAK13 linkedup tutorial_evaluation_frameworkHendrik Drachsler
 
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...Kalle
 
Neural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningNeural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningFrancisco E. Figueroa-Nigaglioni
 
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...
IRJET -  	  A Novel Approach for Software Defect Prediction based on Dimensio...IRJET -  	  A Novel Approach for Software Defect Prediction based on Dimensio...
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...IRJET Journal
 
Adaptive Neural Fuzzy Inference System for Employability Assessment
Adaptive Neural Fuzzy Inference System for Employability AssessmentAdaptive Neural Fuzzy Inference System for Employability Assessment
Adaptive Neural Fuzzy Inference System for Employability AssessmentEditor IJCATR
 
A Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid ReasoningA Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid ReasoningBRNSSPublicationHubI
 
Screening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptxScreening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptxNitishChoudhary23
 
Chi1417 Richardson
Chi1417 RichardsonChi1417 Richardson
Chi1417 Richardsonmerichar
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET Journal
 
IRJET - House Price Predictor using ML through Artificial Neural Network
IRJET - House Price Predictor using ML through Artificial Neural NetworkIRJET - House Price Predictor using ML through Artificial Neural Network
IRJET - House Price Predictor using ML through Artificial Neural NetworkIRJET Journal
 
An Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart HotelsAn Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart HotelsEduardo Castillejo Gil
 
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET Journal
 
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Francesca Lazzeri, PhD
 
IRJET- A Survey on Image Retrieval using Machine Learning
IRJET- A Survey on Image Retrieval using Machine LearningIRJET- A Survey on Image Retrieval using Machine Learning
IRJET- A Survey on Image Retrieval using Machine LearningIRJET Journal
 
CFAR-m Presentation English
CFAR-m Presentation EnglishCFAR-m Presentation English
CFAR-m Presentation Englishbusinessangeleu
 

Similar to Predicting performance in Recommender Systems - Poster (20)

Predicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesPredicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - Slides
 
Predicting Employee Attrition
Predicting Employee AttritionPredicting Employee Attrition
Predicting Employee Attrition
 
2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern Recognition2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern Recognition
 
LAK13 linkedup tutorial_evaluation_framework
LAK13 linkedup tutorial_evaluation_frameworkLAK13 linkedup tutorial_evaluation_framework
LAK13 linkedup tutorial_evaluation_framework
 
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
 
14
1414
14
 
14
1414
14
 
Neural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningNeural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learning
 
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...
IRJET -  	  A Novel Approach for Software Defect Prediction based on Dimensio...IRJET -  	  A Novel Approach for Software Defect Prediction based on Dimensio...
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...
 
Adaptive Neural Fuzzy Inference System for Employability Assessment
Adaptive Neural Fuzzy Inference System for Employability AssessmentAdaptive Neural Fuzzy Inference System for Employability Assessment
Adaptive Neural Fuzzy Inference System for Employability Assessment
 
A Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid ReasoningA Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
 
Screening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptxScreening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptx
 
Chi1417 Richardson
Chi1417 RichardsonChi1417 Richardson
Chi1417 Richardson
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
 
IRJET - House Price Predictor using ML through Artificial Neural Network
IRJET - House Price Predictor using ML through Artificial Neural NetworkIRJET - House Price Predictor using ML through Artificial Neural Network
IRJET - House Price Predictor using ML through Artificial Neural Network
 
An Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart HotelsAn Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart Hotels
 
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
 
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
 
IRJET- A Survey on Image Retrieval using Machine Learning
IRJET- A Survey on Image Retrieval using Machine LearningIRJET- A Survey on Image Retrieval using Machine Learning
IRJET- A Survey on Image Retrieval using Machine Learning
 
CFAR-m Presentation English
CFAR-m Presentation EnglishCFAR-m Presentation English
CFAR-m Presentation English
 

More from Alejandro Bellogin

Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Alejandro Bellogin
 
Revisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosRevisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosAlejandro Bellogin
 
Evaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsEvaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsAlejandro Bellogin
 
Replicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsReplicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsAlejandro Bellogin
 
Implicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationImplicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationAlejandro Bellogin
 
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluationRiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluationAlejandro Bellogin
 
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...Alejandro Bellogin
 
CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013Alejandro Bellogin
 
CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013Alejandro Bellogin
 
Probabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross EntropyProbabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross EntropyAlejandro Bellogin
 
Understanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsUnderstanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsAlejandro Bellogin
 
Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Alejandro Bellogin
 
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Alejandro Bellogin
 
Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Alejandro Bellogin
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Alejandro Bellogin
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Alejandro Bellogin
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Alejandro Bellogin
 
Predicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamPredicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamAlejandro Bellogin
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Alejandro Bellogin
 

More from Alejandro Bellogin (19)

Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?
 
Revisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosRevisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenarios
 
Evaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsEvaluating decision-aware recommender systems
Evaluating decision-aware recommender systems
 
Replicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsReplicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender Systems
 
Implicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationImplicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix Factorization
 
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluationRiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
 
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
 
CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013
 
CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013
 
Probabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross EntropyProbabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross Entropy
 
Understanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsUnderstanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender Systems
 
Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?
 
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
 
Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
 
Predicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamPredicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slam
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
 

Recently uploaded

Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 

Recently uploaded (20)

Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 

Predicting performance in Recommender Systems - Poster

  • 1. Predicting Performance in Recommender Systems Alejandro Bellogín Supervised by Pablo Castells and Iván Cantador Information Retrieval Group, Universidad Autónoma de Madrid, Spain alejandro.bellogin@uam.es IRG IR Group @ UAM Motivation Research questions Is it possible to predict the accuracy of a recommendation? 1. Is it possible to define a performance prediction theory for recommender E.g., we can decide whether to deliver a recommendation or not, systems in a sound, formal way? depending on such prediction. Or, even, to combine different 2. Is it possible to adapt query performance techniques (from IR) to the recommenders according to the expected performance of each one. recommendation task? 3. What kind of evaluation should be performed? Is IR evaluation still valid in our Hypothesis problem? Data that are commonly available to a Recommender System could contain signals that enable an a priori estimation of the success of 4. What kind of recommendation problems can these models be applied to? the recommendation Research question 1 Research question 2 Is it possible to define a performance prediction theory User clarity Is it possible to adapt query performance for recommender systems in a sound, formal way? techniques (from IR) to the recommendation task? It captures the uncertainty in user’s data • In Information Retrieval: “Estimation of the system’s  p  x | u   user’s model a) Define a predictor of performance performance in response to a specific query” clarity  u    p  x | u  log   p  x   x X  c  system’s model  = (u, i, r, …) • Several predictors proposed Distance between the user’s and the b) Agree on a performance metric • We focus on query clarity  user clarity system’s probability model  = (u, i, r, …) 0.7 RatItem p_c(x) 0.6 p(x|u1) p(x|u2) We propose 3 formulations (for space X): c) Check predictive power by measuring correlation 0.5 • Based on ratings 0.4 corr([(x1), …, (xn)], [(x1), …, (xn)]) 0.3 0.2 • Based on items • Based on ratings and items 0.1 d) Evaluate final performance: dynamic vs static 0.0 4 3 5 2 1 Ratings 4 3 5 2 1 Correlation coefficients r ~ 0.57 Research question 4 What kind of recommendation problems can these models be applied to? Pearson: linear correlation • Whenever a combination of strategies is available Spearman: rank correlation • Example 1: dynamic neighbor weighting • Example 2: dynamic ensemble recommendation • The user’s neighbors are weighted • Weight is the same for every item and user according to their similarity (learnt from training) • Can we take into account the uncertainty in • What about boosting those users predicted to Research question 3 neighbor’s data? perform better for some recommender? What kind of evaluation should be performed? Is IR • User neighbor weighting: • Hybrid recommendation: evaluation still valid in our problem? • Static: g  u, i   C  sim u, v   rat  v, i  • Static: g u, i     gR1 u, i   1     gR2 u, i  vN [ u ] • In IR: Mean Average Precision + correlation • Dynamic: g  u, i   C  γ  v   sim u, v   rat  v, i  • Dynamic: g u, i   γ u   gR1 u, i   1  γ u   gR2 u, i  vN [ u ] 50 points (queries) vs 1000+ points (users) • Correlation analysis: • Correlation analysis: • Performance metric is not clear:  error-based? • Performance:  precision-based? 0,98 Standard CF 0,88 b) Neighbourhood size: 500 Standard CF b) 80% training 0,96 Clarity-enhanced CF 0,87 Clarity-enhanced CF • What is performance? 0,94 0,86 nDCG@50 0,92 0,90 0,85 • Performance: MAE 0.2 MAE  It may depend on the final application 0,84 0,88 0,83 0.15 0,86 0,84 0,82 • Possible bias 0,82 0,80 0,81 0,80 0.1 0.05  E.g., towards users or items with larger profiles 10 20 30 40 50 60 70 80 90 10 20 100 150 200 250 300 350 400 450 500 30 40 50 60 70 80 90 100 150 200 250 300 350 400 450 500 % of ratings for training Neighbourhood size 0 H1 H2 H3 H4 Adaptive Static Future Work Publications Explore other input sources We need a theoretical background • A Performance Prediction Aproach to Enhance Why do some predictors work better? Collaborative Filtering Performance. A. Bellogín  Implicit data (with time)  Social links and P. Castells. In ECIR 2010.  Item predictors o Use graph-based measures • Predicting the Performance of Recommender o Different recommender as indicators of user strength Systems: An Information Theoretic Approach. A. behavior depending on item o First results are positive Bellogín, P. Castells, and I. Cantador. In ICTIR attributes 2011. o They would allow to capture • Self-adjusting Hybrid Recommenders based on popularity, diversity, etc. Social Network Analysis. A. Bellogín, P. Castells, Larger datasets and I. Cantador. In SIGIR 2011. 5th ACM Conference on Recommender Systems (RecSys2011) – Doctoral Symposium Acknowledgments to the National Science Foundation Chicago, USA, 23-27 October 2011 for the funding to attend the conference