Approximate modeling of continuous context in factorization algorithms (CaRR14 presentation)

Balázs Hidasi
Balázs HidasiHead of Data Mining and Research at Gravity R&D
APPROXIMATE MODELING OF
CONTINUOUS CONTEXT IN
FACTORIZATION ALGORITHMS
Balázs Hidasi
Domonkos Tikk
CARR WORKSHOP, 13TH APRIL 2014, AMSTERDAM
CONTEXT – TO BE ON THE SAME PAGE
 Event context (transaction context)
 Associated with the transaction:
 Time
 Device
 External parameter matched to event by time (e.g. weather)
 Etc.
 Determined by both the item and user
 Previously bought item
 Atomic context
 Context value is atomic
FACTORIZATION

CONTEXT WITH FACTORIZATION
 Factorization: entity based
 Context must be:
 Atomic
 Nominal (categorical)
 „Entitization” of non-nominal context
 1. Discretization (if continuous)
 2. Dismissal of ordering
DIVERSITY IN CONTEXT
 Previously bought item (sequentiality): nominal
 Weather (e.g.: sunny, rainy, etc): nominal?
 Screen size (of device): ordinal
 Ordering is dismissed
 Temperature: continuous
 Discretization: from freezing to extremely hot
 Ordering is dismissed
 Time: continuous
SEASONALITY
(EXAMPLE CONTEXT DIMENSION)
 Periodic, time based, widely used
context
 Length of season (e.g. 1 day)
 Time MOD length of season
 Same context if time between events is
N*(length of season)
 „Entitized” seasonality
 1. Discretization: creation of time band
within the season
 E.g.: night (0-4), dawn (4-8), morning (8-12),
afternoon (12-16), late afternoon (16-20),
evening (20-24)
 2. Dismissal of ordering:
 Sequence and neighboring properties of time
bands does not matter
 Time bands fully independent from each
other
1 day
24/0
4
8
12
16
20
PROBLEMS OF „ENTITIZED” CONTINUOUS
CONTEXT
 1. Context-state rigidness
 Strict boundaries
 Events close to the boundary  fully associated with one
context-state
 In reality: events close to boundary belong to both context-
states to some extent
 2. Context-state ordinality
 Context-states treated independently
 In reality: behavior in neighboring context-states is more
similar
 Gradual change in behavior
 Order of context-states therefore matter
 3. Context-state continuity
 The representative of the context-state should be continuous
 Stricter version of (2)
APPROXIMATE MODELING OF CONTINUOUS
CONTEXT
 Modeling approaches
 How to handle these dimensions?
 Not a concrete algorithm
 Can be integrated into most factorization algorithm
 (One example with iTALS is shown)
 Why is approximate in the title?
 Not fully continuous modeling
 (3) is not addressed
 Some form of discretization is kept
APPROACH 1: FUZZY EVENT MODELING (1/2)
 Addresses the rigidness problem (1)
 Initial discretization
 Events near context-state boundaries
 Associated with both
 Event validity: instantaneous  interval
 Event associated with every context that intersects with its
interval
 Weights for the extent association
APPROACH 1: FUZZY EVENT MODELING (2/2)
 Integration to factorization algorithm:
 Duplication of certain events
 No modification in algorithm
 Ordinality problem addressed indirectly:
 Duplicate events are used to train neighboring context-
states  enforce similarity (to some extent)
APPROACH 2: FUZZY CONTEXT MODELING
 Addresses rigidness & ordinality problem (1 & 2)
 Overlapping of context-state intervals
 Initial discretization
 Overlapping around the boundary  solves (1)
 Context-state of an event
 Mixture of context-states (generally)  solves (2)
 Here: mixture of 2 if in overlapping zone; or 1 if not
 Weights of context-states in mixture
 Feature vector of an event for the context dimension
 Linear combination of 2 context-state feature vectors (if in the overlapping
zone)
 The feature vector of the current context-state (otherwise)
APPLYING CONTEXT MODELING APPROACH
 Base context dimension
 Discretization of the continuous context dimension with
higher resolution than before
 Needed, because factorization methods are entity
based
 Prediction should be given using the base context
dimension
 Changing prediction model
 In place of the context-state feature vector use the
mixture
 Loss should sum over the base context dimension
 For periodic context: last and first context-state are
neighbors
ITALS

ITALS WITH FUZZY CONTEXT MODELING

RESULTS
 5 implicit datasets
 Recommendation accuracy, measured by recall@20
 Context: seasonality (4 hours of a day)
+4.28%
+3.14%
+24.96%
+203.12% +3.48%
+39.40%
+103.32%
+43.89%
+496.88%
+219.80%
0.0000
0.0500
0.1000
0.1500
0.2000
0.2500
0.3000
0.3500
Grocery TV1 TV2 LastFM VoD
iTALS - original iTALS - event modeling iTALS - context modeling
SUMMARY & FUTURE WORK
 Problems with „entitization” of continuous context in
factorization algorithms
 (1) Rigidness problem
 (2) Ordinality problem
 Fuzzy event modeling (for (1), indirectly for (2))
 Events with validity interval
 Duplication of events
 Straightforward integration into algorithms
 Fuzzy context modeling (for (1) & (2))
 Context-states overlap
 Mixture context-state feature vectors
 Considerable improvement in recommendation accuracy
 Future work
 Applying modeling approaches to other algorithms
 Solving the continuity problem
THANKS FOR THE ATTENTION!
For more of my recommender systems related research visit my website:
http://www.hidasi.eu
1 of 17

Recommended

Forest fire management using gis by
Forest fire management using gisForest fire management using gis
Forest fire management using gisKajal Thakkar
5.3K views25 slides
Simulated Annealing by
Simulated AnnealingSimulated Annealing
Simulated Annealingguestead520
4.4K views7 slides
MS thesis by
MS thesisMS thesis
MS thesisBo Luan
153 views31 slides
One dimensional heat conduction equation by
One dimensional heat conduction equationOne dimensional heat conduction equation
One dimensional heat conduction equationমনোয়ারুল এহসান
5.3K views8 slides
BNL_Research_Poster by
BNL_Research_PosterBNL_Research_Poster
BNL_Research_PosterBrandon McKinzie
70 views1 slide
Öncel Akademi: İstatistiksel Sismoloji by
Öncel Akademi: İstatistiksel SismolojiÖncel Akademi: İstatistiksel Sismoloji
Öncel Akademi: İstatistiksel SismolojiAli Osman Öncel
44 views15 slides

More Related Content

What's hot

Multisensor data fusion in object tracking applications by
Multisensor data fusion in object tracking applicationsMultisensor data fusion in object tracking applications
Multisensor data fusion in object tracking applicationsSayed Abulhasan Quadri
4.6K views20 slides
Interpolation 2013 by
Interpolation 2013Interpolation 2013
Interpolation 2013Atiqa khan
870 views52 slides
Climate Extremes Workshop - The Dependence Between Extreme Precipitation and... by
Climate Extremes Workshop -  The Dependence Between Extreme Precipitation and...Climate Extremes Workshop -  The Dependence Between Extreme Precipitation and...
Climate Extremes Workshop - The Dependence Between Extreme Precipitation and...The Statistical and Applied Mathematical Sciences Institute
70 views65 slides
Stochastic modeling of Rainfall Disaggregation using ANN by
Stochastic modeling of Rainfall Disaggregation using ANNStochastic modeling of Rainfall Disaggregation using ANN
Stochastic modeling of Rainfall Disaggregation using ANNShashank Singh
890 views24 slides
Finite difference equation by
Finite difference equationFinite difference equation
Finite difference equationAchmad Rachmad Tullah
7.4K views19 slides

Similar to Approximate modeling of continuous context in factorization algorithms (CaRR14 presentation)

Climate downscaling by
Climate downscalingClimate downscaling
Climate downscalingIC3Climate
2.1K views20 slides
Program on Mathematical and Statistical Methods for Climate and the Earth Sys... by
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...The Statistical and Applied Mathematical Sciences Institute
419 views45 slides
Generating event storylines from microblogs by
Generating event storylines from microblogsGenerating event storylines from microblogs
Generating event storylines from microblogsmoresmile
481 views45 slides
Akselos solutions for oil & gas by
Akselos solutions for oil & gasAkselos solutions for oil & gas
Akselos solutions for oil & gasAlonso Giannoni
641 views24 slides
FEM Lecture.ppt by
FEM Lecture.pptFEM Lecture.ppt
FEM Lecture.pptjuzaila
138 views35 slides
Fem lecture by
Fem lectureFem lecture
Fem lectureM.Thirumalai Muthukumaran
174 views35 slides

Similar to Approximate modeling of continuous context in factorization algorithms (CaRR14 presentation)(20)

Climate downscaling by IC3Climate
Climate downscalingClimate downscaling
Climate downscaling
IC3Climate2.1K views
Generating event storylines from microblogs by moresmile
Generating event storylines from microblogsGenerating event storylines from microblogs
Generating event storylines from microblogs
moresmile481 views
FEM Lecture.ppt by juzaila
FEM Lecture.pptFEM Lecture.ppt
FEM Lecture.ppt
juzaila138 views
Artificial Intelligence by butest
Artificial Intelligence Artificial Intelligence
Artificial Intelligence
butest1.1K views
Quantum generalized linear models by Colleen Farrelly
Quantum generalized linear modelsQuantum generalized linear models
Quantum generalized linear models
Colleen Farrelly1.8K views
Vldb 2010 event processing tutorial by Opher Etzion
Vldb 2010 event processing tutorialVldb 2010 event processing tutorial
Vldb 2010 event processing tutorial
Opher Etzion1.5K views
Intelligent Wireless Sensor Network Simulation by catherine roussey
Intelligent Wireless Sensor Network SimulationIntelligent Wireless Sensor Network Simulation
Intelligent Wireless Sensor Network Simulation
catherine roussey219 views
Structural Dynamics by Abdul Majid
Structural DynamicsStructural Dynamics
Structural Dynamics
Abdul Majid197 views
Semantic Complex Event Processing by Adrian Paschke
Semantic Complex Event ProcessingSemantic Complex Event Processing
Semantic Complex Event Processing
Adrian Paschke10.5K views
Approaches to online quantile estimation by Data Con LA
Approaches to online quantile estimationApproaches to online quantile estimation
Approaches to online quantile estimation
Data Con LA41 views
Product failure analysis using Explicit dynamic by naga ram
Product failure analysis using Explicit dynamicProduct failure analysis using Explicit dynamic
Product failure analysis using Explicit dynamic
naga ram739 views
Preconditioning in Large-scale VDA by Joseph Parks
Preconditioning in Large-scale VDAPreconditioning in Large-scale VDA
Preconditioning in Large-scale VDA
Joseph Parks91 views

More from Balázs Hidasi

Egyedi termék kreatívok tömeges gyártása generatív AI segítségével by
Egyedi termék kreatívok tömeges gyártása generatív AI segítségévelEgyedi termék kreatívok tömeges gyártása generatív AI segítségével
Egyedi termék kreatívok tömeges gyártása generatív AI segítségévelBalázs Hidasi
45 views57 slides
The Effect of Third Party Implementations on Reproducibility by
The Effect of Third Party Implementations on ReproducibilityThe Effect of Third Party Implementations on Reproducibility
The Effect of Third Party Implementations on ReproducibilityBalázs Hidasi
116 views41 slides
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec... by
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...Balázs Hidasi
3.2K views30 slides
Deep Learning in Recommender Systems - RecSys Summer School 2017 by
Deep Learning in Recommender Systems - RecSys Summer School 2017Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017Balázs Hidasi
11.3K views77 slides
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base... by
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Balázs Hidasi
2.3K views15 slides
Context aware factorization methods for implicit feedback based recommendatio... by
Context aware factorization methods for implicit feedback based recommendatio...Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...Balázs Hidasi
524 views34 slides

More from Balázs Hidasi(17)

Egyedi termék kreatívok tömeges gyártása generatív AI segítségével by Balázs Hidasi
Egyedi termék kreatívok tömeges gyártása generatív AI segítségévelEgyedi termék kreatívok tömeges gyártása generatív AI segítségével
Egyedi termék kreatívok tömeges gyártása generatív AI segítségével
Balázs Hidasi45 views
The Effect of Third Party Implementations on Reproducibility by Balázs Hidasi
The Effect of Third Party Implementations on ReproducibilityThe Effect of Third Party Implementations on Reproducibility
The Effect of Third Party Implementations on Reproducibility
Balázs Hidasi116 views
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec... by Balázs Hidasi
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
Balázs Hidasi3.2K views
Deep Learning in Recommender Systems - RecSys Summer School 2017 by Balázs Hidasi
Deep Learning in Recommender Systems - RecSys Summer School 2017Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017
Balázs Hidasi11.3K views
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base... by Balázs Hidasi
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Balázs Hidasi2.3K views
Context aware factorization methods for implicit feedback based recommendatio... by Balázs Hidasi
Context aware factorization methods for implicit feedback based recommendatio...Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...
Balázs Hidasi524 views
Deep learning to the rescue - solving long standing problems of recommender ... by Balázs Hidasi
Deep learning to the rescue - solving long standing problems of recommender ...Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...
Balázs Hidasi13.6K views
Deep learning: the future of recommendations by Balázs Hidasi
Deep learning: the future of recommendationsDeep learning: the future of recommendations
Deep learning: the future of recommendations
Balázs Hidasi15.1K views
Context-aware preference modeling with factorization by Balázs Hidasi
Context-aware preference modeling with factorizationContext-aware preference modeling with factorization
Context-aware preference modeling with factorization
Balázs Hidasi1.7K views
Utilizing additional information in factorization methods (research overview,... by Balázs Hidasi
Utilizing additional information in factorization methods (research overview,...Utilizing additional information in factorization methods (research overview,...
Utilizing additional information in factorization methods (research overview,...
Balázs Hidasi628 views
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad... by Balázs Hidasi
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...
Balázs Hidasi383 views
Context-aware similarities within the factorization framework (CaRR 2013 pres... by Balázs Hidasi
Context-aware similarities within the factorization framework (CaRR 2013 pres...Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...
Balázs Hidasi1.2K views
iTALS: implicit tensor factorization for context-aware recommendations (ECML/... by Balázs Hidasi
iTALS: implicit tensor factorization for context-aware recommendations (ECML/...iTALS: implicit tensor factorization for context-aware recommendations (ECML/...
iTALS: implicit tensor factorization for context-aware recommendations (ECML/...
Balázs Hidasi637 views
Initialization of matrix factorization (CaRR 2012 presentation) by Balázs Hidasi
Initialization of matrix factorization (CaRR 2012 presentation)Initialization of matrix factorization (CaRR 2012 presentation)
Initialization of matrix factorization (CaRR 2012 presentation)
Balázs Hidasi737 views
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás) by Balázs Hidasi
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)
Balázs Hidasi327 views
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012) by Balázs Hidasi
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)
Balázs Hidasi357 views
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation) by Balázs Hidasi
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)
Balázs Hidasi726 views

Recently uploaded

Real Science Radio - Dr Paul Homan Climate Change.pptx by
Real Science Radio - Dr Paul Homan Climate Change.pptxReal Science Radio - Dr Paul Homan Climate Change.pptx
Real Science Radio - Dr Paul Homan Climate Change.pptxFred Williams
8 views43 slides
Presentation on experimental laboratory animal- Hamster by
Presentation on experimental laboratory animal- HamsterPresentation on experimental laboratory animal- Hamster
Presentation on experimental laboratory animal- HamsterKanika13641
6 views8 slides
Micelle Drug Delivery System (Nanotechnology).pptx by
Micelle Drug Delivery System (Nanotechnology).pptxMicelle Drug Delivery System (Nanotechnology).pptx
Micelle Drug Delivery System (Nanotechnology).pptxANANYA KUMAR
5 views27 slides
Non Aqueous titration: Definition, Principle and Application by
Non Aqueous titration: Definition, Principle and ApplicationNon Aqueous titration: Definition, Principle and Application
Non Aqueous titration: Definition, Principle and ApplicationPoonam Aher Patil
67 views22 slides
Factors affecting fluorescence and phosphorescence.pptx by
Factors affecting fluorescence and phosphorescence.pptxFactors affecting fluorescence and phosphorescence.pptx
Factors affecting fluorescence and phosphorescence.pptxSamarthGiri1
9 views11 slides
NUTRITION IN BACTERIA.pdf by
NUTRITION IN BACTERIA.pdfNUTRITION IN BACTERIA.pdf
NUTRITION IN BACTERIA.pdfNandadulalSannigrahi
40 views14 slides

Recently uploaded(20)

Real Science Radio - Dr Paul Homan Climate Change.pptx by Fred Williams
Real Science Radio - Dr Paul Homan Climate Change.pptxReal Science Radio - Dr Paul Homan Climate Change.pptx
Real Science Radio - Dr Paul Homan Climate Change.pptx
Fred Williams8 views
Presentation on experimental laboratory animal- Hamster by Kanika13641
Presentation on experimental laboratory animal- HamsterPresentation on experimental laboratory animal- Hamster
Presentation on experimental laboratory animal- Hamster
Kanika136416 views
Micelle Drug Delivery System (Nanotechnology).pptx by ANANYA KUMAR
Micelle Drug Delivery System (Nanotechnology).pptxMicelle Drug Delivery System (Nanotechnology).pptx
Micelle Drug Delivery System (Nanotechnology).pptx
ANANYA KUMAR5 views
Non Aqueous titration: Definition, Principle and Application by Poonam Aher Patil
Non Aqueous titration: Definition, Principle and ApplicationNon Aqueous titration: Definition, Principle and Application
Non Aqueous titration: Definition, Principle and Application
Factors affecting fluorescence and phosphorescence.pptx by SamarthGiri1
Factors affecting fluorescence and phosphorescence.pptxFactors affecting fluorescence and phosphorescence.pptx
Factors affecting fluorescence and phosphorescence.pptx
SamarthGiri19 views
Heavy Tails Workshop NeurIPS2023.pdf by Charles Martin
Heavy Tails Workshop NeurIPS2023.pdfHeavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdf
Charles Martin41 views
Worldviews and their (im)plausibility: Science and Holism by JohnWilkins48
Worldviews and their (im)plausibility: Science and HolismWorldviews and their (im)plausibility: Science and Holism
Worldviews and their (im)plausibility: Science and Holism
JohnWilkins4844 views
INTRODUCTION TO PLANT SYSTEMATICS.pptx by RASHMI M G
INTRODUCTION TO PLANT SYSTEMATICS.pptxINTRODUCTION TO PLANT SYSTEMATICS.pptx
INTRODUCTION TO PLANT SYSTEMATICS.pptx
RASHMI M G 5 views
Oral_Presentation_by_Fatma (2).pdf by fatmaalmrzqi
Oral_Presentation_by_Fatma (2).pdfOral_Presentation_by_Fatma (2).pdf
Oral_Presentation_by_Fatma (2).pdf
fatmaalmrzqi8 views
COMPLEXOMETRIC TITRATION OR CHEALATOMETRIC TITRATION by Poonam Aher Patil
COMPLEXOMETRIC TITRATION OR CHEALATOMETRIC TITRATIONCOMPLEXOMETRIC TITRATION OR CHEALATOMETRIC TITRATION
COMPLEXOMETRIC TITRATION OR CHEALATOMETRIC TITRATION
Poonam Aher Patil200 views
Effect of Integrated Nutrient Management on Growth and Yield of Solanaceous F... by SwagatBehera9
Effect of Integrated Nutrient Management on Growth and Yield of Solanaceous F...Effect of Integrated Nutrient Management on Growth and Yield of Solanaceous F...
Effect of Integrated Nutrient Management on Growth and Yield of Solanaceous F...
SwagatBehera96 views
Eukaryotic microbiology lab Dos and Donts.pptx by Prasanna Kumar
Eukaryotic microbiology lab Dos and Donts.pptxEukaryotic microbiology lab Dos and Donts.pptx
Eukaryotic microbiology lab Dos and Donts.pptx
Prasanna Kumar8 views
KeyAI. Solving a math problem to recover lost crypto assets. by RFID INC
KeyAI. Solving a math problem to recover lost crypto assets.KeyAI. Solving a math problem to recover lost crypto assets.
KeyAI. Solving a math problem to recover lost crypto assets.
RFID INC7 views
Exploring the nature and synchronicity of early cluster formation in the Larg... by Sérgio Sacani
Exploring the nature and synchronicity of early cluster formation in the Larg...Exploring the nature and synchronicity of early cluster formation in the Larg...
Exploring the nature and synchronicity of early cluster formation in the Larg...
Sérgio Sacani1.5K views
A giant thin stellar stream in the Coma Galaxy Cluster by Sérgio Sacani
A giant thin stellar stream in the Coma Galaxy ClusterA giant thin stellar stream in the Coma Galaxy Cluster
A giant thin stellar stream in the Coma Galaxy Cluster
Sérgio Sacani23 views
RADIATION PHYSICS.pptx by drpriyanka8
RADIATION PHYSICS.pptxRADIATION PHYSICS.pptx
RADIATION PHYSICS.pptx
drpriyanka815 views
Towards Error-Corrected Quantum Computing with Neutral Atoms by Yuval Boger
Towards Error-Corrected Quantum Computing with Neutral AtomsTowards Error-Corrected Quantum Computing with Neutral Atoms
Towards Error-Corrected Quantum Computing with Neutral Atoms
Yuval Boger5 views

Approximate modeling of continuous context in factorization algorithms (CaRR14 presentation)

  • 1. APPROXIMATE MODELING OF CONTINUOUS CONTEXT IN FACTORIZATION ALGORITHMS Balázs Hidasi Domonkos Tikk CARR WORKSHOP, 13TH APRIL 2014, AMSTERDAM
  • 2. CONTEXT – TO BE ON THE SAME PAGE  Event context (transaction context)  Associated with the transaction:  Time  Device  External parameter matched to event by time (e.g. weather)  Etc.  Determined by both the item and user  Previously bought item  Atomic context  Context value is atomic
  • 4. CONTEXT WITH FACTORIZATION  Factorization: entity based  Context must be:  Atomic  Nominal (categorical)  „Entitization” of non-nominal context  1. Discretization (if continuous)  2. Dismissal of ordering
  • 5. DIVERSITY IN CONTEXT  Previously bought item (sequentiality): nominal  Weather (e.g.: sunny, rainy, etc): nominal?  Screen size (of device): ordinal  Ordering is dismissed  Temperature: continuous  Discretization: from freezing to extremely hot  Ordering is dismissed  Time: continuous
  • 6. SEASONALITY (EXAMPLE CONTEXT DIMENSION)  Periodic, time based, widely used context  Length of season (e.g. 1 day)  Time MOD length of season  Same context if time between events is N*(length of season)  „Entitized” seasonality  1. Discretization: creation of time band within the season  E.g.: night (0-4), dawn (4-8), morning (8-12), afternoon (12-16), late afternoon (16-20), evening (20-24)  2. Dismissal of ordering:  Sequence and neighboring properties of time bands does not matter  Time bands fully independent from each other 1 day 24/0 4 8 12 16 20
  • 7. PROBLEMS OF „ENTITIZED” CONTINUOUS CONTEXT  1. Context-state rigidness  Strict boundaries  Events close to the boundary  fully associated with one context-state  In reality: events close to boundary belong to both context- states to some extent  2. Context-state ordinality  Context-states treated independently  In reality: behavior in neighboring context-states is more similar  Gradual change in behavior  Order of context-states therefore matter  3. Context-state continuity  The representative of the context-state should be continuous  Stricter version of (2)
  • 8. APPROXIMATE MODELING OF CONTINUOUS CONTEXT  Modeling approaches  How to handle these dimensions?  Not a concrete algorithm  Can be integrated into most factorization algorithm  (One example with iTALS is shown)  Why is approximate in the title?  Not fully continuous modeling  (3) is not addressed  Some form of discretization is kept
  • 9. APPROACH 1: FUZZY EVENT MODELING (1/2)  Addresses the rigidness problem (1)  Initial discretization  Events near context-state boundaries  Associated with both  Event validity: instantaneous  interval  Event associated with every context that intersects with its interval  Weights for the extent association
  • 10. APPROACH 1: FUZZY EVENT MODELING (2/2)  Integration to factorization algorithm:  Duplication of certain events  No modification in algorithm  Ordinality problem addressed indirectly:  Duplicate events are used to train neighboring context- states  enforce similarity (to some extent)
  • 11. APPROACH 2: FUZZY CONTEXT MODELING  Addresses rigidness & ordinality problem (1 & 2)  Overlapping of context-state intervals  Initial discretization  Overlapping around the boundary  solves (1)  Context-state of an event  Mixture of context-states (generally)  solves (2)  Here: mixture of 2 if in overlapping zone; or 1 if not  Weights of context-states in mixture  Feature vector of an event for the context dimension  Linear combination of 2 context-state feature vectors (if in the overlapping zone)  The feature vector of the current context-state (otherwise)
  • 12. APPLYING CONTEXT MODELING APPROACH  Base context dimension  Discretization of the continuous context dimension with higher resolution than before  Needed, because factorization methods are entity based  Prediction should be given using the base context dimension  Changing prediction model  In place of the context-state feature vector use the mixture  Loss should sum over the base context dimension  For periodic context: last and first context-state are neighbors
  • 14. ITALS WITH FUZZY CONTEXT MODELING 
  • 15. RESULTS  5 implicit datasets  Recommendation accuracy, measured by recall@20  Context: seasonality (4 hours of a day) +4.28% +3.14% +24.96% +203.12% +3.48% +39.40% +103.32% +43.89% +496.88% +219.80% 0.0000 0.0500 0.1000 0.1500 0.2000 0.2500 0.3000 0.3500 Grocery TV1 TV2 LastFM VoD iTALS - original iTALS - event modeling iTALS - context modeling
  • 16. SUMMARY & FUTURE WORK  Problems with „entitization” of continuous context in factorization algorithms  (1) Rigidness problem  (2) Ordinality problem  Fuzzy event modeling (for (1), indirectly for (2))  Events with validity interval  Duplication of events  Straightforward integration into algorithms  Fuzzy context modeling (for (1) & (2))  Context-states overlap  Mixture context-state feature vectors  Considerable improvement in recommendation accuracy  Future work  Applying modeling approaches to other algorithms  Solving the continuity problem
  • 17. THANKS FOR THE ATTENTION! For more of my recommender systems related research visit my website: http://www.hidasi.eu