SlideShare a Scribd company logo
Predicting Performance in Recommender Systems
                                                                                                  Alejandro Bellogín
                                                                     Supervised by Pablo Castells and Iván Cantador
                                                           Information Retrieval Group, Universidad Autónoma de Madrid, Spain
                                                                                alejandro.bellogin@uam.es
                                                                                                                                                                                                                                                            IRG
                                                                                                                                                                                                                                                            IR Group @ UAM


          Motivation                                                                                                    Research questions
 Is it possible to predict the accuracy of a recommendation?                                             1. Is it possible to define a performance prediction theory for recommender
 E.g., we can decide whether to deliver a recommendation or not,                                            systems in a sound, formal way?
 depending on such prediction. Or, even, to combine different
                                                                                                         2. Is it possible to adapt query performance techniques (from IR) to the
 recommenders according to the expected performance of each one.
                                                                                                            recommendation task?
                                                                                                         3. What kind of evaluation should be performed? Is IR evaluation still valid in our
          Hypothesis                                                                                        problem?
 Data that are commonly available to a Recommender System could
 contain signals that enable an a priori estimation of the success of
                                                                                                         4. What kind of recommendation problems can these models be applied to?
 the recommendation


          Research question 1                                                                 Research question 2
 Is it possible to define a performance prediction theory                                                                                                                                                                     User clarity
                                                                            Is it possible to adapt query performance
 for recommender systems in a sound, formal way?                            techniques (from IR) to the recommendation task? It captures the uncertainty in user’s data
                                                                            • In Information Retrieval: “Estimation of the system’s
                                                                                                                                                                       p  x | u   user’s model
 a) Define a predictor of performance                                          performance in response to a specific query”         clarity  u    p  x | u  log 
                                                                                                                                                                       p  x     
                                                                                                                                                    x X                                                                                                                   c                    system’s model
                         = (u, i, r, …)                                   •       Several predictors proposed                                                                                                               Distance between the user’s and the
 b) Agree on a performance metric                                           •       We focus on query clarity  user clarity                                                                                                  system’s probability model
                         = (u, i, r, …)
                                                                                        0.7                                                           RatItem                                       p_c(x)
                                                                                        0.6                                                                                                         p(x|u1)
                                                                                                                                                                                                    p(x|u2)
                                                                                                                                                                                                                              We propose 3 formulations (for space X):
 c) Check predictive power by measuring correlation
                                                                                        0.5



                                                                                                                                                                                                                              • Based on ratings
                                                                                        0.4




         corr([(x1), …, (xn)], [(x1), …, (xn)])
                                                                                        0.3

                                                                                        0.2
                                                                                                                                                                                                                              • Based on items
                                                                                                                                                                                                                              • Based on ratings and items
                                                                                        0.1

 d) Evaluate final performance: dynamic vs static                                       0.0
                                                                                              4           3         5           2         1        Ratings 4         3       5       2          1




Correlation coefficients
                                                        r ~ 0.57
                                                                                              Research question 4
                                                                            What kind of recommendation problems can these models be applied to?
Pearson: linear correlation                                                 •      Whenever a combination of strategies is available
Spearman: rank correlation
                                                                            • Example 1: dynamic neighbor weighting                                                                                             • Example 2: dynamic ensemble recommendation
                                                                            • The user’s neighbors are                                                                                weighted                  • Weight is the same for every item and user
                                                                              according to their similarity                                                                                                       (learnt from training)
                                                                            • Can we take into account the uncertainty in                                                                                       • What about boosting those users predicted to
          Research question 3                                                 neighbor’s data?                                                                                                                    perform better for some recommender?
 What kind of evaluation should be performed? Is IR                         • User neighbor weighting:                                                                                                          • Hybrid recommendation:
 evaluation still valid in our problem?                                            • Static:                                g  u, i   C            sim u, v   rat  v, i                                  • Static: g u, i     gR1 u, i   1     gR2 u, i 
                                                                                                                                                   vN [ u ]
 •    In IR: Mean Average Precision + correlation                             • Dynamic: g  u, i   C  γ  v   sim u, v   rat  v, i  • Dynamic: g u, i   γ u   gR1 u, i   1  γ u   gR2 u, i 
                                                                                                       vN [ u ]
           50 points (queries) vs 1000+ points (users)                      • Correlation analysis:                                           • Correlation analysis:
 •    Performance metric is not clear:
             error-based?
                                                                            • Performance:
             precision-based?                                                   0,98                               Standard CF                            0,88 b) Neighbourhood size: 500
                                                                                                                                                                                  Standard CF                            b) 80% training
                                                                                 0,96                               Clarity-enhanced CF                    0,87                    Clarity-enhanced CF


 • What is performance?                                                          0,94                                                                      0,86
                                                                                                                                                                                                                                                                 nDCG@50
                                                                                 0,92
                                                                                 0,90
                                                                                                                                                           0,85
                                                                                                                                                                                                                • Performance:
                                                                                                                                                     MAE




                                                                                                                                                                                                                                                      0.2
                                                                           MAE




          It may depend on the final application
                                                                                                                                                           0,84
                                                                                 0,88
                                                                                                                                                           0,83                                                                                      0.15
                                                                                 0,86
                                                                                 0,84                                                                      0,82

 • Possible bias                                                                 0,82
                                                                                 0,80
                                                                                                                                                           0,81
                                                                                                                                                           0,80
                                                                                                                                                                                                                                                      0.1


                                                                                                                                                                                                                                                     0.05

          E.g., towards users or items with larger profiles
                                                                                         10       20     30    40    50    60       70   80   90       10      20 100 150 200 250 300 350 400 450 500
                                                                                                                                                                   30 40 50 60 70 80 90                        100 150 200 250 300 350 400 450 500
                                                                                                       % of ratings for training                                         Neighbourhood size                                                            0
                                                                                                                                                                                                                                                            H1   H2                      H3           H4

                                                                                                                                                                                                                                                                 Adaptive       Static




          Future Work                                                                                                                                                                                                 Publications
                  Explore other input sources                              We need a theoretical background                                                                                                   • A Performance Prediction Aproach to Enhance
                                                                           Why do some predictors work better?                                                                                                  Collaborative Filtering Performance. A. Bellogín
      Implicit data (with time)       Social links
                                                                                                                                                                                                                and P. Castells. In ECIR 2010.
      Item predictors                 o Use graph-based measures
                                                                                                                                                                                                              • Predicting the Performance of Recommender
      o Different recommender          as indicators of user strength
                                                                                                                                                                                                                Systems: An Information Theoretic Approach. A.
      behavior depending on item       o   First results are positive                                                                                                                                           Bellogín, P. Castells, and I. Cantador. In ICTIR
      attributes                                                                                                                                                                                                2011.
      o They would allow to capture                                                                                                                                                                           • Self-adjusting Hybrid Recommenders based on
      popularity, diversity, etc.                                                                                                                                                                               Social Network Analysis. A. Bellogín, P. Castells,
                                                                           Larger datasets
                                                                                                                                                                                                                and I. Cantador. In SIGIR 2011.
                                                 5th ACM Conference on Recommender Systems (RecSys2011) – Doctoral Symposium Acknowledgments to the National Science Foundation
                                                                        Chicago, USA, 23-27 October 2011                            for the funding to attend the conference

More Related Content

What's hot

97cogsci.doc
97cogsci.doc97cogsci.doc
97cogsci.doc
butest
 
Calling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA PresentationCalling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA Presentation
Nick van Terheyden
 
Meta analysis-sloan
Meta analysis-sloanMeta analysis-sloan
Meta analysis-sloan
John Turner
 
Paradigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learningParadigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learning
Salford Systems
 
New challenges monolixday2011
New challenges monolixday2011New challenges monolixday2011
New challenges monolixday2011
blaudez
 
5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objects5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objects
Yogeeswar Reddy
 
Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..
Accenture
 
Chapter6.doc
Chapter6.docChapter6.doc
Chapter6.doc
butest
 

What's hot (12)

97cogsci.doc
97cogsci.doc97cogsci.doc
97cogsci.doc
 
G
GG
G
 
Calling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA PresentationCalling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA Presentation
 
Meta analysis-sloan
Meta analysis-sloanMeta analysis-sloan
Meta analysis-sloan
 
IRJET - Ensembling Reinforcement Learning for Portfolio Management
IRJET -  	  Ensembling Reinforcement Learning for Portfolio ManagementIRJET -  	  Ensembling Reinforcement Learning for Portfolio Management
IRJET - Ensembling Reinforcement Learning for Portfolio Management
 
Performance analysis of machine learning algorithms on self localization system1
Performance analysis of machine learning algorithms on self localization system1Performance analysis of machine learning algorithms on self localization system1
Performance analysis of machine learning algorithms on self localization system1
 
Paradigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learningParadigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learning
 
New challenges monolixday2011
New challenges monolixday2011New challenges monolixday2011
New challenges monolixday2011
 
5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objects5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objects
 
Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..
 
Secondary mathematics wednesday august 22 2012
Secondary mathematics   wednesday august 22 2012Secondary mathematics   wednesday august 22 2012
Secondary mathematics wednesday august 22 2012
 
Chapter6.doc
Chapter6.docChapter6.doc
Chapter6.doc
 

Viewers also liked (6)

2012 Digital Opportunities
2012 Digital Opportunities2012 Digital Opportunities
2012 Digital Opportunities
 
Leergang Opleidingsmanagement
Leergang OpleidingsmanagementLeergang Opleidingsmanagement
Leergang Opleidingsmanagement
 
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
 
Segundo examen
Segundo examenSegundo examen
Segundo examen
 
PEShare.co.uk Shared Resource
PEShare.co.uk Shared ResourcePEShare.co.uk Shared Resource
PEShare.co.uk Shared Resource
 
Pratica i
Pratica iPratica i
Pratica i
 

Similar to Predicting performance in Recommender Systems - Poster

2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern Recognition2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern Recognition
Dongseo University
 
A Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid ReasoningA Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
BRNSSPublicationHubI
 
Screening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptxScreening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptx
NitishChoudhary23
 
An Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart HotelsAn Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart Hotels
Eduardo Castillejo Gil
 

Similar to Predicting performance in Recommender Systems - Poster (20)

Predicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesPredicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - Slides
 
Predicting Employee Attrition
Predicting Employee AttritionPredicting Employee Attrition
Predicting Employee Attrition
 
2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern Recognition2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern Recognition
 
LAK13 linkedup tutorial_evaluation_framework
LAK13 linkedup tutorial_evaluation_frameworkLAK13 linkedup tutorial_evaluation_framework
LAK13 linkedup tutorial_evaluation_framework
 
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
 
14
1414
14
 
14
1414
14
 
Neural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningNeural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learning
 
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...
IRJET -  	  A Novel Approach for Software Defect Prediction based on Dimensio...IRJET -  	  A Novel Approach for Software Defect Prediction based on Dimensio...
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...
 
Adaptive Neural Fuzzy Inference System for Employability Assessment
Adaptive Neural Fuzzy Inference System for Employability AssessmentAdaptive Neural Fuzzy Inference System for Employability Assessment
Adaptive Neural Fuzzy Inference System for Employability Assessment
 
A Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid ReasoningA Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
 
Screening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptxScreening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptx
 
Chi1417 Richardson
Chi1417 RichardsonChi1417 Richardson
Chi1417 Richardson
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
 
IRJET - House Price Predictor using ML through Artificial Neural Network
IRJET - House Price Predictor using ML through Artificial Neural NetworkIRJET - House Price Predictor using ML through Artificial Neural Network
IRJET - House Price Predictor using ML through Artificial Neural Network
 
An Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart HotelsAn Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart Hotels
 
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
 
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
 
IRJET- A Survey on Image Retrieval using Machine Learning
IRJET- A Survey on Image Retrieval using Machine LearningIRJET- A Survey on Image Retrieval using Machine Learning
IRJET- A Survey on Image Retrieval using Machine Learning
 
CFAR-m Presentation English
CFAR-m Presentation EnglishCFAR-m Presentation English
CFAR-m Presentation English
 

More from Alejandro Bellogin

Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Alejandro Bellogin
 
Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...
Alejandro Bellogin
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Alejandro Bellogin
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Alejandro Bellogin
 

More from Alejandro Bellogin (19)

Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?
 
Revisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosRevisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenarios
 
Evaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsEvaluating decision-aware recommender systems
Evaluating decision-aware recommender systems
 
Replicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsReplicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender Systems
 
Implicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationImplicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix Factorization
 
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluationRiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
 
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
 
CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013
 
CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013
 
Probabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross EntropyProbabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross Entropy
 
Understanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsUnderstanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender Systems
 
Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?
 
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
 
Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
 
Predicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamPredicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slam
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
 

Recently uploaded

Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 

Recently uploaded (20)

Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 

Predicting performance in Recommender Systems - Poster

  • 1. Predicting Performance in Recommender Systems Alejandro Bellogín Supervised by Pablo Castells and Iván Cantador Information Retrieval Group, Universidad Autónoma de Madrid, Spain alejandro.bellogin@uam.es IRG IR Group @ UAM Motivation Research questions Is it possible to predict the accuracy of a recommendation? 1. Is it possible to define a performance prediction theory for recommender E.g., we can decide whether to deliver a recommendation or not, systems in a sound, formal way? depending on such prediction. Or, even, to combine different 2. Is it possible to adapt query performance techniques (from IR) to the recommenders according to the expected performance of each one. recommendation task? 3. What kind of evaluation should be performed? Is IR evaluation still valid in our Hypothesis problem? Data that are commonly available to a Recommender System could contain signals that enable an a priori estimation of the success of 4. What kind of recommendation problems can these models be applied to? the recommendation Research question 1 Research question 2 Is it possible to define a performance prediction theory User clarity Is it possible to adapt query performance for recommender systems in a sound, formal way? techniques (from IR) to the recommendation task? It captures the uncertainty in user’s data • In Information Retrieval: “Estimation of the system’s  p  x | u   user’s model a) Define a predictor of performance performance in response to a specific query” clarity  u    p  x | u  log   p  x   x X  c  system’s model  = (u, i, r, …) • Several predictors proposed Distance between the user’s and the b) Agree on a performance metric • We focus on query clarity  user clarity system’s probability model  = (u, i, r, …) 0.7 RatItem p_c(x) 0.6 p(x|u1) p(x|u2) We propose 3 formulations (for space X): c) Check predictive power by measuring correlation 0.5 • Based on ratings 0.4 corr([(x1), …, (xn)], [(x1), …, (xn)]) 0.3 0.2 • Based on items • Based on ratings and items 0.1 d) Evaluate final performance: dynamic vs static 0.0 4 3 5 2 1 Ratings 4 3 5 2 1 Correlation coefficients r ~ 0.57 Research question 4 What kind of recommendation problems can these models be applied to? Pearson: linear correlation • Whenever a combination of strategies is available Spearman: rank correlation • Example 1: dynamic neighbor weighting • Example 2: dynamic ensemble recommendation • The user’s neighbors are weighted • Weight is the same for every item and user according to their similarity (learnt from training) • Can we take into account the uncertainty in • What about boosting those users predicted to Research question 3 neighbor’s data? perform better for some recommender? What kind of evaluation should be performed? Is IR • User neighbor weighting: • Hybrid recommendation: evaluation still valid in our problem? • Static: g  u, i   C  sim u, v   rat  v, i  • Static: g u, i     gR1 u, i   1     gR2 u, i  vN [ u ] • In IR: Mean Average Precision + correlation • Dynamic: g  u, i   C  γ  v   sim u, v   rat  v, i  • Dynamic: g u, i   γ u   gR1 u, i   1  γ u   gR2 u, i  vN [ u ] 50 points (queries) vs 1000+ points (users) • Correlation analysis: • Correlation analysis: • Performance metric is not clear:  error-based? • Performance:  precision-based? 0,98 Standard CF 0,88 b) Neighbourhood size: 500 Standard CF b) 80% training 0,96 Clarity-enhanced CF 0,87 Clarity-enhanced CF • What is performance? 0,94 0,86 nDCG@50 0,92 0,90 0,85 • Performance: MAE 0.2 MAE  It may depend on the final application 0,84 0,88 0,83 0.15 0,86 0,84 0,82 • Possible bias 0,82 0,80 0,81 0,80 0.1 0.05  E.g., towards users or items with larger profiles 10 20 30 40 50 60 70 80 90 10 20 100 150 200 250 300 350 400 450 500 30 40 50 60 70 80 90 100 150 200 250 300 350 400 450 500 % of ratings for training Neighbourhood size 0 H1 H2 H3 H4 Adaptive Static Future Work Publications Explore other input sources We need a theoretical background • A Performance Prediction Aproach to Enhance Why do some predictors work better? Collaborative Filtering Performance. A. Bellogín  Implicit data (with time)  Social links and P. Castells. In ECIR 2010.  Item predictors o Use graph-based measures • Predicting the Performance of Recommender o Different recommender as indicators of user strength Systems: An Information Theoretic Approach. A. behavior depending on item o First results are positive Bellogín, P. Castells, and I. Cantador. In ICTIR attributes 2011. o They would allow to capture • Self-adjusting Hybrid Recommenders based on popularity, diversity, etc. Social Network Analysis. A. Bellogín, P. Castells, Larger datasets and I. Cantador. In SIGIR 2011. 5th ACM Conference on Recommender Systems (RecSys2011) – Doctoral Symposium Acknowledgments to the National Science Foundation Chicago, USA, 23-27 October 2011 for the funding to attend the conference