Context-aware preference modeling with factorization

Balázs Hidasi
Balázs HidasiHead of Data Mining and Research at Gravity R&D
Context-aware Preference
Modeling with Factorization
Balázs Hidasi
RecSys’15, Doctoral Symposium
19. September 2015
Outline
• I. Background
a. Implicit feedback
b. Context
c. Factorization
• II. Finished research
a. Context-aware algorithms (iTALS, iTALSx)
b. Speeding-up ALS
c. General Factorization Framework
• III. Future research
a. Automatic preference model learning
b. Context-related research
Background
Implicit feedback
• + The practical scenario
• + Collected by passive monitoring
• + Available in large quantities
• - Preferences are not explicit
• - Noisy positive feedback
• - No negative feedback
• - Missing feedback needs to be handled
Context
• Context: Additional side information that can help refining
the recommendations and tailoring them in order to fit the
users’ actual needs better.
• Context helps:
Dealing with context related effects during training
Adapting recommendation lists during recommendation time
• Types
User side information: user metadata, social networks, etc.
Item side information: item metadata, etc.
Context of transactions: time, location, device, etc.
Factorization
• Project entities into a low dimensional latent feature
space
• The interaction between the representations estimate
the preferences
Research
Context-aware algorithms [1,2]
• iTALS / iTALSx
Pointwise preference estimation
ALS learning
Scales linearly with the number of transactions
Different models
• Models for different problems
Low number of features, sparser data
iTALSx
Denser data, using higher number of
features is possible iTALS
User-
item-
context
relation
Item feature
matrix
Userfeature
matrix
≈ Preference
tensor
(R)
Item feature
matrixUserfeature
matrix
≈
Items
Users
User-item
realtion
Item-context
relation
User-context
relation
N-way model (iTALS) Pairwise interaction model (iTALSx)
Speeding up ALS [3]
• ALS scales cubically (quadratically in practice) with the
number of features
Bottleneck: solving a system of linear equations
Highly impractical to use high factor models
• Approximate solutions for speed-up
ALS-CG: conjugate gradient based direct approximation of ALS
o Efficiency depends on matrix-vector multiplication
ALS-CD: optimize on a feature-by-feature basis (instead of
computing whole feature vectors)
o Implicit case: lots of negative examples compression
Speed-up results
• Accuracy similar to ALS
• Significant speed-up
Better trade-offs (accuracy vs. time)
More efficient resource usage
• Linear scaling with the number of
features (in practice)
High factor models are usable
• CG or CD? 0
500
1000
1500
2000
20 40 60 80 100 120 140 160 180 200
Time(s)
Number of features
iTALS iTALS - CG iTALS - CD
Method Similar Worse Better
ALS-CG 62 of 75 (82.67%) 10 of 75 (13.33%) 3 of 75 (4%)
ALS-CD 57 of 75 (76%) 16 of 75 (21.33%) 2 of 75 (2.67%)
GFF: General Factorization Framework [4]
• An algorithm that allows experimentation with novel models for the context-
aware recommendation problem, that are not restricted to the two main model
classes used by the state-of-the-art.
• Motivation
dimensions lots of different possible preference models
Standard models not necessarily fit the problem (e.q. asymmetry)
Lack of tool that has this flexibility
• Features
No restriction on the context
Large preference model class
Data type independence
Flexibility
Scalability
Novel preference models with GFF (1)
• Interactions with context
User-item
User-context-item
(reweighting)
User-context (bias)
Item-context (bias)
Context-context?
• A 4D problem
Users (U)
Items (I)
Seasonality (S)
Sequentiality (Q)
• Traditional models
N-way (USQI)
Pairwise (UI+US+IS+UQ+IQ+SQ)
• Novel models
Interaction (UI+USI+UQI)
Context-interaction (USI+UQI)
Reduced pairwise
(UI+US+IS+UQ+IQ)
User bias (UI+US+UQ)
Item bias (UI+UQ+IQ)
(Other interesting ones: UI+USQI;
UI+USI+UQI+USQI; USI+UQI+USQI)
Novel preference models with GFF (2)
Performance of novel models
Dataset Best model Improvement
(over traditional)
Novel better
than traditional
Grocery UI+USI+UQI +20.14% 3 of 5
TV1 USI+UQI +15.37% 2 of 5
TV2 UI+USI+UQI +30.30% 4 of 5
LastFM UI+USI+UQI +12.40% 3 of 5
VoD UI+USI+UQI +19.02% 2 of 5
Grocery VoD
USQI
UI+US+IS+UQ+IQ+SQ
USI+UQI
UI+USI+UQI
UI+US+IS+UQ+IQ
UI+US+UQ
UI+IS+IQ
USQI
UI+US+IS+UQ+IQ+SQ
USI+UQI
UI+USI+UQI
UI+US+IS+UQ+IQ
UI+US+UQ
UI+IS+IQ
+8.23%
+20.14% +16.50% +17.74% +19.02%
Future research
Automatic model learning for GFF
• Flexibility of GFF
Useful for experimentation
Finding the best (or fairly good) model requires lots of
experiments for a new setup
• Automatize model selection
Which contexts should be used?
Which interactions should be used?
Model selection with LARS
4/22/2015
• Model: UI+US+IS+USI+UQ+IQ+UQI+USQI+USQ+ISQ+SQ
• Each term contributes to the prediction of the
preferences
• Terms are the features
• Inferred preferences (0/1) are the target
For every possible (u,i,s,q) combination
Weighting: multiply examples of positive feedback by the
weight
Efficiency of the model selection
4/22/2015
• Lot of examples efficiency?
• Efficient LARS implementations require only the
Covariance of features
Correlation of features with the target
• E.g.: ∑ , , , 1 ∘ 1 ∘ ∘, , ,
Sum has many members
Can be computed efficiently
Precomputed covariance matrices and sums of vectors required
Interaction of dimensions
4/22/2015
• When to use the model selection?
• Dimension interact
One ALS epoch modifies a certain feature to be optimal with the current
model
Different terms optimize for different aspects (e.g. USI and IS)
Shared features will be suboptimal to either but may lean to one side
o Problems with unbiased selection
• Handle terms or groups of terms separately
Hard to integrate into solution
Requires multiple instances of feature matrices
Increases model complexity
Selection strategies
4/22/2015
• Joint pretraining (few epochs), model selection, training
selected model
• Multiple iterations of pretraining and selecting
• Joint training of a few terms, extend to full model using the
trained features, (additional training), selection, train
• Separate training, model selection, (merge separate feature
matrices for the same dimension), (training)
• Separate training, model selection, train non selected
members on the residual
Context-related research
• Non-conventional context
Standard context: entity based
Other types
o Hierarchical
o Composite
o Ordered
o Continuous
• Context quality
General quality
Suitability for a model or interaction type
Improving quality by splitting/combining context-states
Thank you!
References (papers can be downloaded from http://hidasi.eu)
[1] Balázs Hidasi and Domonkos Tikk: Fast ALS-based tensor factorization for context-aware recommendation from implicit
feedback. ECML-PKDD (2012)
[2] Balázs Hidasi: Factorization models for context-aware recommendations. Infocommunications Journal VI(4) (2014)
[3] Balázs Hidasi and Domonkos Tikk: Speeding up ALS learning via approximate methods for context-aware
recommendations. Knowledge and Information Systems (2015)
[4] Balázs Hidasi and Domonkos Tikk: General factorization framework for context-aware recommendations. Data Mining and
Knowledge Discovery (2015)
1 of 22

Recommended

Utilizing additional information in factorization methods (research overview,... by
Utilizing additional information in factorization methods (research overview,...Utilizing additional information in factorization methods (research overview,...
Utilizing additional information in factorization methods (research overview,...Balázs Hidasi
628 views24 slides
Deep learning to the rescue - solving long standing problems of recommender ... by
Deep learning to the rescue - solving long standing problems of recommender ...Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...Balázs Hidasi
13.6K views17 slides
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base... by
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Balázs Hidasi
2.3K views15 slides
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec... by
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...Balázs Hidasi
3.2K views30 slides
Talk@rmit 09112017 by
Talk@rmit 09112017Talk@rmit 09112017
Talk@rmit 09112017Shuai Zhang
660 views50 slides
Generative Models for General Audiences by
Generative Models for General AudiencesGenerative Models for General Audiences
Generative Models for General AudiencesSangwoo Mo
588 views25 slides

More Related Content

What's hot

Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2... by
Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...
Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...민진 최
171 views31 slides
Deep Learning Models for Question Answering by
Deep Learning Models for Question AnsweringDeep Learning Models for Question Answering
Deep Learning Models for Question AnsweringSujit Pal
14.4K views44 slides
Evolving a Medical Image Similarity Search by
Evolving a Medical Image Similarity SearchEvolving a Medical Image Similarity Search
Evolving a Medical Image Similarity SearchSujit Pal
1.2K views23 slides
Icml2017 overview by
Icml2017 overviewIcml2017 overview
Icml2017 overviewTatsuya Shirakawa
1.4K views33 slides
Learning to learn unlearned feature for segmentation by
Learning to learn unlearned feature for segmentationLearning to learn unlearned feature for segmentation
Learning to learn unlearned feature for segmentationNAVER Engineering
1.2K views45 slides
林守德/Practical Issues in Machine Learning by
林守德/Practical Issues in Machine Learning林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning台灣資料科學年會
7.8K views55 slides

What's hot(20)

Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2... by 민진 최
Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...
Session-aware Linear Item-Item Models for Session-based Recommendation (WWW 2...
민진 최171 views
Deep Learning Models for Question Answering by Sujit Pal
Deep Learning Models for Question AnsweringDeep Learning Models for Question Answering
Deep Learning Models for Question Answering
Sujit Pal14.4K views
Evolving a Medical Image Similarity Search by Sujit Pal
Evolving a Medical Image Similarity SearchEvolving a Medical Image Similarity Search
Evolving a Medical Image Similarity Search
Sujit Pal1.2K views
Learning to learn unlearned feature for segmentation by NAVER Engineering
Learning to learn unlearned feature for segmentationLearning to learn unlearned feature for segmentation
Learning to learn unlearned feature for segmentation
NAVER Engineering1.2K views
Local collaborative autoencoders (WSDM2021) by 민진 최
Local collaborative autoencoders (WSDM2021)Local collaborative autoencoders (WSDM2021)
Local collaborative autoencoders (WSDM2021)
민진 최232 views
Machine learning for_finance by Stefan Duprey
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
Stefan Duprey2K views
Mathematical Background for Artificial Intelligence by ananth
Mathematical Background for Artificial IntelligenceMathematical Background for Artificial Intelligence
Mathematical Background for Artificial Intelligence
ananth1.7K views
Kaggle Days Madrid - Alberto Danese by Alberto Danese
Kaggle Days Madrid - Alberto DaneseKaggle Days Madrid - Alberto Danese
Kaggle Days Madrid - Alberto Danese
Alberto Danese815 views
Introduction to cyclical learning rates for training neural nets by Sayak Paul
Introduction to cyclical learning rates for training neural netsIntroduction to cyclical learning rates for training neural nets
Introduction to cyclical learning rates for training neural nets
Sayak Paul40 views
Understanding Basics of Machine Learning by Pranav Ainavolu
Understanding Basics of Machine LearningUnderstanding Basics of Machine Learning
Understanding Basics of Machine Learning
Pranav Ainavolu473 views
Tutorial on Theory and Application of Generative Adversarial Networks by MLReview
Tutorial on Theory and Application of Generative Adversarial NetworksTutorial on Theory and Application of Generative Adversarial Networks
Tutorial on Theory and Application of Generative Adversarial Networks
MLReview3.8K views
Learning to Compose Domain-Specific Transformations for Data Augmentation by Tatsuya Shirakawa
Learning to Compose Domain-Specific Transformations for Data AugmentationLearning to Compose Domain-Specific Transformations for Data Augmentation
Learning to Compose Domain-Specific Transformations for Data Augmentation
Tatsuya Shirakawa785 views
Recommendation system using collaborative deep learning by Ritesh Sawant
Recommendation system using collaborative deep learningRecommendation system using collaborative deep learning
Recommendation system using collaborative deep learning
Ritesh Sawant433 views
End to-end semi-supervised object detection with soft teacher ver.1.0 by taeseon ryu
End to-end semi-supervised object detection with soft teacher ver.1.0End to-end semi-supervised object detection with soft teacher ver.1.0
End to-end semi-supervised object detection with soft teacher ver.1.0
taeseon ryu557 views
Generative Adversarial Networks and Their Applications in Medical Imaging by Sanghoon Hong
Generative Adversarial Networks  and Their Applications in Medical ImagingGenerative Adversarial Networks  and Their Applications in Medical Imaging
Generative Adversarial Networks and Their Applications in Medical Imaging
Sanghoon Hong972 views
Semi-supervised Learning by butest
Semi-supervised LearningSemi-supervised Learning
Semi-supervised Learning
butest4.2K views

Similar to Context-aware preference modeling with factorization

MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification... by
MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...
MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...The Statistical and Applied Mathematical Sciences Institute
164 views32 slides
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S... by
Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...Lauri Eloranta
5.9K views53 slides
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf by
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdfVincenzo Lomonaco
105 views74 slides
Goal Decomposition and Abductive Reasoning for Policy Analysis and Refinement by
Goal Decomposition and Abductive Reasoning for Policy Analysis and RefinementGoal Decomposition and Abductive Reasoning for Policy Analysis and Refinement
Goal Decomposition and Abductive Reasoning for Policy Analysis and RefinementEmil Lupu
2.1K views14 slides
QoE++: Shifting from Ego- to Eco-System? QCMan 2015 Keynote by
QoE++: Shifting from Ego- to Eco-System? QCMan 2015 KeynoteQoE++: Shifting from Ego- to Eco-System? QCMan 2015 Keynote
QoE++: Shifting from Ego- to Eco-System? QCMan 2015 KeynoteTobias Hoßfeld
1K views28 slides
Learning Content and Usage Factors Simultaneously by
Learning Content and Usage Factors SimultaneouslyLearning Content and Usage Factors Simultaneously
Learning Content and Usage Factors SimultaneouslyArnab Bhadury
355 views19 slides

Similar to Context-aware preference modeling with factorization(20)

Simulation in Social Sciences - Lecture 6 in Introduction to Computational S... by Lauri Eloranta
Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...
Lauri Eloranta5.9K views
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf by Vincenzo Lomonaco
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
Vincenzo Lomonaco105 views
Goal Decomposition and Abductive Reasoning for Policy Analysis and Refinement by Emil Lupu
Goal Decomposition and Abductive Reasoning for Policy Analysis and RefinementGoal Decomposition and Abductive Reasoning for Policy Analysis and Refinement
Goal Decomposition and Abductive Reasoning for Policy Analysis and Refinement
Emil Lupu2.1K views
QoE++: Shifting from Ego- to Eco-System? QCMan 2015 Keynote by Tobias Hoßfeld
QoE++: Shifting from Ego- to Eco-System? QCMan 2015 KeynoteQoE++: Shifting from Ego- to Eco-System? QCMan 2015 Keynote
QoE++: Shifting from Ego- to Eco-System? QCMan 2015 Keynote
Tobias Hoßfeld1K views
Learning Content and Usage Factors Simultaneously by Arnab Bhadury
Learning Content and Usage Factors SimultaneouslyLearning Content and Usage Factors Simultaneously
Learning Content and Usage Factors Simultaneously
Arnab Bhadury355 views
Simulation Models as a Research Method.ppt by QidiwQidiwQidiw
Simulation Models as a Research Method.pptSimulation Models as a Research Method.ppt
Simulation Models as a Research Method.ppt
QidiwQidiwQidiw79 views
[SEBD2020] OLAP Querying of Document Stores in the Presence of Schema Variety by University of Bologna
[SEBD2020] OLAP Querying of Document Stores in the Presence of Schema Variety[SEBD2020] OLAP Querying of Document Stores in the Presence of Schema Variety
[SEBD2020] OLAP Querying of Document Stores in the Presence of Schema Variety
Demography basedhybridrecommendersystemformovierecommendation by UmmeSalmaM1
Demography basedhybridrecommendersystemformovierecommendationDemography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendation
UmmeSalmaM180 views
AGRIS toolkit by FAO
AGRIS toolkitAGRIS toolkit
AGRIS toolkit
FAO266 views
A flexible recommenndation system for Cable TV by IntoTheMinds
A flexible recommenndation system for Cable TVA flexible recommenndation system for Cable TV
A flexible recommenndation system for Cable TV
IntoTheMinds693 views
A Flexible Recommendation System for Cable TV by Francisco Couto
A Flexible Recommendation System for Cable TVA Flexible Recommendation System for Cable TV
A Flexible Recommendation System for Cable TV
Francisco Couto300 views
Artificial Intelligence for Automated Software Testing by Lionel Briand
Artificial Intelligence for Automated Software TestingArtificial Intelligence for Automated Software Testing
Artificial Intelligence for Automated Software Testing
Lionel Briand3.3K views
Top 10 Data Science Practitioner Pitfalls by Sri Ambati
Top 10 Data Science Practitioner PitfallsTop 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner Pitfalls
Sri Ambati3.6K views
Statistics in the age of data science, issues you can not ignore by Turi, Inc.
Statistics in the age of data science, issues you can not ignoreStatistics in the age of data science, issues you can not ignore
Statistics in the age of data science, issues you can not ignore
Turi, Inc.804 views
Software Engineering Research: Leading a Double-Agent Life. by Lionel Briand
Software Engineering Research: Leading a Double-Agent Life.Software Engineering Research: Leading a Double-Agent Life.
Software Engineering Research: Leading a Double-Agent Life.
Lionel Briand13.9K views
Approaches to Preservation Storage Technologies by Micah Altman
Approaches to Preservation Storage Technologies Approaches to Preservation Storage Technologies
Approaches to Preservation Storage Technologies
Micah Altman1.2K views
June brownbagpressurvey by Micah Altman
June brownbagpressurveyJune brownbagpressurvey
June brownbagpressurvey
Micah Altman383 views

More from Balázs Hidasi

Egyedi termék kreatívok tömeges gyártása generatív AI segítségével by
Egyedi termék kreatívok tömeges gyártása generatív AI segítségévelEgyedi termék kreatívok tömeges gyártása generatív AI segítségével
Egyedi termék kreatívok tömeges gyártása generatív AI segítségévelBalázs Hidasi
45 views57 slides
The Effect of Third Party Implementations on Reproducibility by
The Effect of Third Party Implementations on ReproducibilityThe Effect of Third Party Implementations on Reproducibility
The Effect of Third Party Implementations on ReproducibilityBalázs Hidasi
116 views41 slides
Deep Learning in Recommender Systems - RecSys Summer School 2017 by
Deep Learning in Recommender Systems - RecSys Summer School 2017Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017Balázs Hidasi
11.3K views77 slides
Context aware factorization methods for implicit feedback based recommendatio... by
Context aware factorization methods for implicit feedback based recommendatio...Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...Balázs Hidasi
524 views34 slides
Deep learning: the future of recommendations by
Deep learning: the future of recommendationsDeep learning: the future of recommendations
Deep learning: the future of recommendationsBalázs Hidasi
15.1K views23 slides
Approximate modeling of continuous context in factorization algorithms (CaRR1... by
Approximate modeling of continuous context in factorization algorithms (CaRR1...Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...Balázs Hidasi
1.4K views17 slides

More from Balázs Hidasi(13)

Egyedi termék kreatívok tömeges gyártása generatív AI segítségével by Balázs Hidasi
Egyedi termék kreatívok tömeges gyártása generatív AI segítségévelEgyedi termék kreatívok tömeges gyártása generatív AI segítségével
Egyedi termék kreatívok tömeges gyártása generatív AI segítségével
Balázs Hidasi45 views
The Effect of Third Party Implementations on Reproducibility by Balázs Hidasi
The Effect of Third Party Implementations on ReproducibilityThe Effect of Third Party Implementations on Reproducibility
The Effect of Third Party Implementations on Reproducibility
Balázs Hidasi116 views
Deep Learning in Recommender Systems - RecSys Summer School 2017 by Balázs Hidasi
Deep Learning in Recommender Systems - RecSys Summer School 2017Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017
Balázs Hidasi11.3K views
Context aware factorization methods for implicit feedback based recommendatio... by Balázs Hidasi
Context aware factorization methods for implicit feedback based recommendatio...Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...
Balázs Hidasi524 views
Deep learning: the future of recommendations by Balázs Hidasi
Deep learning: the future of recommendationsDeep learning: the future of recommendations
Deep learning: the future of recommendations
Balázs Hidasi15.1K views
Approximate modeling of continuous context in factorization algorithms (CaRR1... by Balázs Hidasi
Approximate modeling of continuous context in factorization algorithms (CaRR1...Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...
Balázs Hidasi1.4K views
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad... by Balázs Hidasi
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...
Balázs Hidasi383 views
Context-aware similarities within the factorization framework (CaRR 2013 pres... by Balázs Hidasi
Context-aware similarities within the factorization framework (CaRR 2013 pres...Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...
Balázs Hidasi1.2K views
iTALS: implicit tensor factorization for context-aware recommendations (ECML/... by Balázs Hidasi
iTALS: implicit tensor factorization for context-aware recommendations (ECML/...iTALS: implicit tensor factorization for context-aware recommendations (ECML/...
iTALS: implicit tensor factorization for context-aware recommendations (ECML/...
Balázs Hidasi637 views
Initialization of matrix factorization (CaRR 2012 presentation) by Balázs Hidasi
Initialization of matrix factorization (CaRR 2012 presentation)Initialization of matrix factorization (CaRR 2012 presentation)
Initialization of matrix factorization (CaRR 2012 presentation)
Balázs Hidasi737 views
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás) by Balázs Hidasi
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)
Balázs Hidasi327 views
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012) by Balázs Hidasi
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)
Balázs Hidasi357 views
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation) by Balázs Hidasi
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)
Balázs Hidasi726 views

Recently uploaded

What is Authentication Active Directory_.pptx by
What is Authentication Active Directory_.pptxWhat is Authentication Active Directory_.pptx
What is Authentication Active Directory_.pptxHeenaMehta35
15 views7 slides
"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell by
"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell
"Node.js vs workers — A comparison of two JavaScript runtimes", James M SnellFwdays
14 views30 slides
Business Analyst Series 2023 - Week 4 Session 8 by
Business Analyst Series 2023 -  Week 4 Session 8Business Analyst Series 2023 -  Week 4 Session 8
Business Analyst Series 2023 - Week 4 Session 8DianaGray10
145 views13 slides
LLMs in Production: Tooling, Process, and Team Structure by
LLMs in Production: Tooling, Process, and Team StructureLLMs in Production: Tooling, Process, and Team Structure
LLMs in Production: Tooling, Process, and Team StructureAggregage
57 views77 slides
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023 by
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023Redefining the book supply chain: A glimpse into the future - Tech Forum 2023
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023BookNet Canada
44 views19 slides
KubeConNA23 Recap.pdf by
KubeConNA23 Recap.pdfKubeConNA23 Recap.pdf
KubeConNA23 Recap.pdfMichaelOLeary82
24 views27 slides

Recently uploaded(20)

What is Authentication Active Directory_.pptx by HeenaMehta35
What is Authentication Active Directory_.pptxWhat is Authentication Active Directory_.pptx
What is Authentication Active Directory_.pptx
HeenaMehta3515 views
"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell by Fwdays
"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell
"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell
Fwdays14 views
Business Analyst Series 2023 - Week 4 Session 8 by DianaGray10
Business Analyst Series 2023 -  Week 4 Session 8Business Analyst Series 2023 -  Week 4 Session 8
Business Analyst Series 2023 - Week 4 Session 8
DianaGray10145 views
LLMs in Production: Tooling, Process, and Team Structure by Aggregage
LLMs in Production: Tooling, Process, and Team StructureLLMs in Production: Tooling, Process, and Team Structure
LLMs in Production: Tooling, Process, and Team Structure
Aggregage57 views
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023 by BookNet Canada
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023Redefining the book supply chain: A glimpse into the future - Tech Forum 2023
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023
BookNet Canada44 views
"Surviving highload with Node.js", Andrii Shumada by Fwdays
"Surviving highload with Node.js", Andrii Shumada "Surviving highload with Node.js", Andrii Shumada
"Surviving highload with Node.js", Andrii Shumada
Fwdays58 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue199 views
Digital Personal Data Protection (DPDP) Practical Approach For CISOs by Priyanka Aash
Digital Personal Data Protection (DPDP) Practical Approach For CISOsDigital Personal Data Protection (DPDP) Practical Approach For CISOs
Digital Personal Data Protection (DPDP) Practical Approach For CISOs
Priyanka Aash162 views
Initiating and Advancing Your Strategic GIS Governance Strategy by Safe Software
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance Strategy
Safe Software184 views
"Package management in monorepos", Zoltan Kochan by Fwdays
"Package management in monorepos", Zoltan Kochan"Package management in monorepos", Zoltan Kochan
"Package management in monorepos", Zoltan Kochan
Fwdays34 views
Future of AR - Facebook Presentation by Rob McCarty
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook Presentation
Rob McCarty65 views
Deep Tech and the Amplified Organisation: Core Concepts by Holonomics
Deep Tech and the Amplified Organisation: Core ConceptsDeep Tech and the Amplified Organisation: Core Concepts
Deep Tech and the Amplified Organisation: Core Concepts
Holonomics17 views
Measurecamp Brussels - Synthetic data.pdf by Human37
Measurecamp Brussels - Synthetic data.pdfMeasurecamp Brussels - Synthetic data.pdf
Measurecamp Brussels - Synthetic data.pdf
Human37 26 views
Mobile Core Solutions & Successful Cases.pdf by IPLOOK Networks
Mobile Core Solutions & Successful Cases.pdfMobile Core Solutions & Successful Cases.pdf
Mobile Core Solutions & Successful Cases.pdf
IPLOOK Networks14 views

Context-aware preference modeling with factorization

  • 1. Context-aware Preference Modeling with Factorization Balázs Hidasi RecSys’15, Doctoral Symposium 19. September 2015
  • 2. Outline • I. Background a. Implicit feedback b. Context c. Factorization • II. Finished research a. Context-aware algorithms (iTALS, iTALSx) b. Speeding-up ALS c. General Factorization Framework • III. Future research a. Automatic preference model learning b. Context-related research
  • 4. Implicit feedback • + The practical scenario • + Collected by passive monitoring • + Available in large quantities • - Preferences are not explicit • - Noisy positive feedback • - No negative feedback • - Missing feedback needs to be handled
  • 5. Context • Context: Additional side information that can help refining the recommendations and tailoring them in order to fit the users’ actual needs better. • Context helps: Dealing with context related effects during training Adapting recommendation lists during recommendation time • Types User side information: user metadata, social networks, etc. Item side information: item metadata, etc. Context of transactions: time, location, device, etc.
  • 6. Factorization • Project entities into a low dimensional latent feature space • The interaction between the representations estimate the preferences
  • 8. Context-aware algorithms [1,2] • iTALS / iTALSx Pointwise preference estimation ALS learning Scales linearly with the number of transactions Different models • Models for different problems Low number of features, sparser data iTALSx Denser data, using higher number of features is possible iTALS User- item- context relation Item feature matrix Userfeature matrix ≈ Preference tensor (R) Item feature matrixUserfeature matrix ≈ Items Users User-item realtion Item-context relation User-context relation N-way model (iTALS) Pairwise interaction model (iTALSx)
  • 9. Speeding up ALS [3] • ALS scales cubically (quadratically in practice) with the number of features Bottleneck: solving a system of linear equations Highly impractical to use high factor models • Approximate solutions for speed-up ALS-CG: conjugate gradient based direct approximation of ALS o Efficiency depends on matrix-vector multiplication ALS-CD: optimize on a feature-by-feature basis (instead of computing whole feature vectors) o Implicit case: lots of negative examples compression
  • 10. Speed-up results • Accuracy similar to ALS • Significant speed-up Better trade-offs (accuracy vs. time) More efficient resource usage • Linear scaling with the number of features (in practice) High factor models are usable • CG or CD? 0 500 1000 1500 2000 20 40 60 80 100 120 140 160 180 200 Time(s) Number of features iTALS iTALS - CG iTALS - CD Method Similar Worse Better ALS-CG 62 of 75 (82.67%) 10 of 75 (13.33%) 3 of 75 (4%) ALS-CD 57 of 75 (76%) 16 of 75 (21.33%) 2 of 75 (2.67%)
  • 11. GFF: General Factorization Framework [4] • An algorithm that allows experimentation with novel models for the context- aware recommendation problem, that are not restricted to the two main model classes used by the state-of-the-art. • Motivation dimensions lots of different possible preference models Standard models not necessarily fit the problem (e.q. asymmetry) Lack of tool that has this flexibility • Features No restriction on the context Large preference model class Data type independence Flexibility Scalability
  • 12. Novel preference models with GFF (1) • Interactions with context User-item User-context-item (reweighting) User-context (bias) Item-context (bias) Context-context? • A 4D problem Users (U) Items (I) Seasonality (S) Sequentiality (Q) • Traditional models N-way (USQI) Pairwise (UI+US+IS+UQ+IQ+SQ) • Novel models Interaction (UI+USI+UQI) Context-interaction (USI+UQI) Reduced pairwise (UI+US+IS+UQ+IQ) User bias (UI+US+UQ) Item bias (UI+UQ+IQ) (Other interesting ones: UI+USQI; UI+USI+UQI+USQI; USI+UQI+USQI)
  • 13. Novel preference models with GFF (2)
  • 14. Performance of novel models Dataset Best model Improvement (over traditional) Novel better than traditional Grocery UI+USI+UQI +20.14% 3 of 5 TV1 USI+UQI +15.37% 2 of 5 TV2 UI+USI+UQI +30.30% 4 of 5 LastFM UI+USI+UQI +12.40% 3 of 5 VoD UI+USI+UQI +19.02% 2 of 5 Grocery VoD USQI UI+US+IS+UQ+IQ+SQ USI+UQI UI+USI+UQI UI+US+IS+UQ+IQ UI+US+UQ UI+IS+IQ USQI UI+US+IS+UQ+IQ+SQ USI+UQI UI+USI+UQI UI+US+IS+UQ+IQ UI+US+UQ UI+IS+IQ +8.23% +20.14% +16.50% +17.74% +19.02%
  • 16. Automatic model learning for GFF • Flexibility of GFF Useful for experimentation Finding the best (or fairly good) model requires lots of experiments for a new setup • Automatize model selection Which contexts should be used? Which interactions should be used?
  • 17. Model selection with LARS 4/22/2015 • Model: UI+US+IS+USI+UQ+IQ+UQI+USQI+USQ+ISQ+SQ • Each term contributes to the prediction of the preferences • Terms are the features • Inferred preferences (0/1) are the target For every possible (u,i,s,q) combination Weighting: multiply examples of positive feedback by the weight
  • 18. Efficiency of the model selection 4/22/2015 • Lot of examples efficiency? • Efficient LARS implementations require only the Covariance of features Correlation of features with the target • E.g.: ∑ , , , 1 ∘ 1 ∘ ∘, , , Sum has many members Can be computed efficiently Precomputed covariance matrices and sums of vectors required
  • 19. Interaction of dimensions 4/22/2015 • When to use the model selection? • Dimension interact One ALS epoch modifies a certain feature to be optimal with the current model Different terms optimize for different aspects (e.g. USI and IS) Shared features will be suboptimal to either but may lean to one side o Problems with unbiased selection • Handle terms or groups of terms separately Hard to integrate into solution Requires multiple instances of feature matrices Increases model complexity
  • 20. Selection strategies 4/22/2015 • Joint pretraining (few epochs), model selection, training selected model • Multiple iterations of pretraining and selecting • Joint training of a few terms, extend to full model using the trained features, (additional training), selection, train • Separate training, model selection, (merge separate feature matrices for the same dimension), (training) • Separate training, model selection, train non selected members on the residual
  • 21. Context-related research • Non-conventional context Standard context: entity based Other types o Hierarchical o Composite o Ordered o Continuous • Context quality General quality Suitability for a model or interaction type Improving quality by splitting/combining context-states
  • 22. Thank you! References (papers can be downloaded from http://hidasi.eu) [1] Balázs Hidasi and Domonkos Tikk: Fast ALS-based tensor factorization for context-aware recommendation from implicit feedback. ECML-PKDD (2012) [2] Balázs Hidasi: Factorization models for context-aware recommendations. Infocommunications Journal VI(4) (2014) [3] Balázs Hidasi and Domonkos Tikk: Speeding up ALS learning via approximate methods for context-aware recommendations. Knowledge and Information Systems (2015) [4] Balázs Hidasi and Domonkos Tikk: General factorization framework for context-aware recommendations. Data Mining and Knowledge Discovery (2015)