SlideShare a Scribd company logo
iTALS
Fast ALS-based tensor factorization for context-aware
recommendation from implicit feedback
Balázs Hidasi – balazs.hidasi@gravityrd.com
Domonkos Tikk – domonkos.tikk@gravityrd.com




 ECML/PKDD, 25TH SEPTEMBER 2012, BRISTOL
Overview
• Implicit feedback problem
• Context-awareness
 ▫ Seasonality
 ▫ Sequentaility
• iTALS
 ▫ Model
 ▫ Learning
 ▫ Prediction
• Experiments
Feedback types
• Feedback: user-item interraction (events)
• Explicit:
  ▫ Preferences explicitely coded
  ▫ E.g.: Ratings
• Implicit:
  ▫ Preferences not coded explicitely
  ▫ E.g.: purchase history
Problems with implicit feedback
• Noisy positive preferences
 ▫ E.g.: bought & disappointed
• No negative feedback available
 ▫ E.g.: had no info on item

• Usually evaluated by ranking metrics
 ▫ Can not be directly optimized
Why to use implicit feedback?
• Every user provides
• Some magnitudes larger amount of information
  than explicit feedback
• More important in practice
 ▫ Explicit algorithms are for the biggest only
Context-awareness
• Context: any information associated with events
• Context state: a possible value of the context
  dimension
• Context-awareness
  ▫ Usage of context information
  ▫ Incorporating additional informations into the method
  ▫ Different predictions for same user given different
    context states
  ▫ Can greatly outperform context-unaware methods
     Context segmentates items/users well
Seasonality as context
• Season: a time period
  ▫ E.g.: a week
• Timeband: given interval in season
  ▫ Context-states
  ▫ E.g.: days
• Assumed:
  ▫ aggregated behaviour in a given
    timeband is similar inbetween
    seasons                            User   Item     Date       Context
  ▫ and different for different         1      A     12/07/2010      1
    timebands                           2      B     15/07/2010      3
  ▫ E.g.: daily/weekly routines         1      B     15/07/2010      3
                                        …      …         …
                                        1      A     19/07/2010      1
Sequentiality
• Bought A after B
  ▫ B is the context-state of the user’s
    event on A
• Usefullness
  ▫ Some items are bought together
  ▫ Some items bought repetetively
  ▫ Some are not
• Association rule like information
  incorporated into model as
  context
  ▫ Here: into factorization methods
• Can learn negated rules
  ▫ If C then not A
M3
iTALS - Model
• Binary tensor
  ▫ D dimensional




                                         User
  ▫ User – item – context(s)




                                    M1
                                                 T
• Importance weights
  ▫ Lower weights to zeroes (NIF)
  ▫ Higher weights to cells with                Item
    more events                                 M2
• Cells approximated by sum of
  the values in the elementwise
  product of D vectors
  ▫ Each for a dimension
  ▫ Low dimensional vectors
iTALS - Learning
•
iTALS - Prediction
• Sum of values in elementwise product of vectors
 ▫ User-item: scalar product of feature vectors
 ▫ User-item-context: weighted scalar product of
   feature vectors
• Context-state dependent reweighting of features
• E.g.:
 ▫ Third feature = horror movie
 ▫ Context state1 = Friday night  third feature high
 ▫ Context state2 = Sunday afternoon  third
   feature low
Experiments
• 5 databases
  ▫ 3 implicit
     Online grocery shopping
     VoD consumption
     Music listening habits
  ▫ 2 implicitizied explicit
     Netflix
     MovieLens 10M
• Recall@20 as primary evaluation metric
• Baseline: context-unaware method in every context-
  state
Scalability
                                      Running times on the Grocery dataset
                     350

                     300
                                                                                iALS epoch
Running time (sec)




                     250
                                                                                iTALS (time bands) epoch
                     200
                                                                                iTALS (sequence) epoch
                     150
                                                                                iALS calc. matrix
                     100
                                                                                iTALS (time bands) calc.
                                                                                matrix
                      50
                                                                                iTALS (sequence) calc. matrix
                      0
                           10   20   30     40   50   60   70   80   90   100
                                          Number of factors (K)
Results – Recall@20
0.1200

0.1000

0.0800

0.0600

0.0400

0.0200

0.0000
             VoD        Grocery        LastFM            Netflix           MovieLens
     iALS (MF method)   iCA baseline   iTALS (seasonality)         iTALS (sequence)
Results – Precision-recall curves
0.25        Recall-precision (Grocery)
0.20
0.15
0.10                                                            iALS

0.05
0.00                                                            iCA baseline
    0.00   0.05          0.10       0.15          0.20
                  Recall-precision (VoD)                        iTALS
0.08                                                            (season)

0.06                                                            iTALS
                                                                (sequence)
0.04

0.02

0.00
    0.00    0.05           0.10            0.15          0.20
Summary
• iTALS is a
  ▫   scalable
  ▫   context-aware
  ▫   factorization method
  ▫   on implicit feedback data
• The problem is modelled
  ▫ by approximating the values of a binary tensor
  ▫ with elementwise product of short vectors
  ▫ using importance weighting
• Learning can be efficiently done by using
  ▫ ALS
  ▫ and other computation time reducing tricks
• Recommendation accuracy is significantly better than iCA-baseline
• Introduced a new context type: sequentiality
  ▫ association rule like information in factorization framework
Thank you for your attention!



For more of my recommender systems related research visit my website: http://www.hidasi.eu




                                         T

More Related Content

Viewers also liked

Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...
Balázs Hidasi
 
Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...
Balázs Hidasi
 
Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...
Balázs Hidasi
 
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Balázs Hidasi
 
Deep learning: the future of recommendations
Deep learning: the future of recommendationsDeep learning: the future of recommendations
Deep learning: the future of recommendations
Balázs Hidasi
 
Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...
Balázs Hidasi
 

Viewers also liked (6)

Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...
 
Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...
 
Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...
 
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
 
Deep learning: the future of recommendations
Deep learning: the future of recommendationsDeep learning: the future of recommendations
Deep learning: the future of recommendations
 
Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...
 

Similar to iTALS: implicit tensor factorization for context-aware recommendations (ECML/PKDD 2012 presentation)

Optimising digital content delivery
Optimising digital content deliveryOptimising digital content delivery
Optimising digital content delivery
Tamas Jambor
 
H formal lab report
H formal lab reportH formal lab report
H formal lab report
stephm32
 
Evaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated DatabasesEvaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated Databases
Miguel Araújo
 
IoT with Azure Machine Learning and InfluxDB
IoT with Azure Machine Learning and InfluxDBIoT with Azure Machine Learning and InfluxDB
IoT with Azure Machine Learning and InfluxDB
Ivo Andreev
 
It's all about the timing
It's all about the timingIt's all about the timing
It's all about the timing
SensePost
 
Mining Big Data Streams with APACHE SAMOA
Mining Big Data Streams with APACHE SAMOAMining Big Data Streams with APACHE SAMOA
Mining Big Data Streams with APACHE SAMOA
Albert Bifet
 
Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)
Ivo Andreev
 
Mining big data streams with APACHE SAMOA by Albert Bifet
Mining big data streams with APACHE SAMOA by Albert BifetMining big data streams with APACHE SAMOA by Albert Bifet
Mining big data streams with APACHE SAMOA by Albert Bifet
J On The Beach
 
Effectiveness and code optimization in Java
Effectiveness and code optimization in JavaEffectiveness and code optimization in Java
Effectiveness and code optimization in Java
Strannik_2013
 
Scylla Summit 2018: OLAP or OLTP? Why Not Both?
Scylla Summit 2018: OLAP or OLTP? Why Not Both?Scylla Summit 2018: OLAP or OLTP? Why Not Both?
Scylla Summit 2018: OLAP or OLTP? Why Not Both?
ScyllaDB
 
ETALIS_RuleML_2011_Retractions
ETALIS_RuleML_2011_RetractionsETALIS_RuleML_2011_Retractions
ETALIS_RuleML_2011_Retractions
Darko Anicic
 
Modeling the IoT with TitanDB and Cassandra
Modeling the IoT with TitanDB and CassandraModeling the IoT with TitanDB and Cassandra
Modeling the IoT with TitanDB and Cassandra
twilmes
 
DL'12 mastro at work
DL'12 mastro at workDL'12 mastro at work
DL'12 mastro at work
Mariano Rodriguez-Muro
 
Data mining Basics and complete description
Data mining Basics and complete description Data mining Basics and complete description
Data mining Basics and complete description
Sulman Ahmed
 
Database Health Check
Database Health CheckDatabase Health Check
Database Health Check
PostgreSQL Experts, Inc.
 

Similar to iTALS: implicit tensor factorization for context-aware recommendations (ECML/PKDD 2012 presentation) (15)

Optimising digital content delivery
Optimising digital content deliveryOptimising digital content delivery
Optimising digital content delivery
 
H formal lab report
H formal lab reportH formal lab report
H formal lab report
 
Evaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated DatabasesEvaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated Databases
 
IoT with Azure Machine Learning and InfluxDB
IoT with Azure Machine Learning and InfluxDBIoT with Azure Machine Learning and InfluxDB
IoT with Azure Machine Learning and InfluxDB
 
It's all about the timing
It's all about the timingIt's all about the timing
It's all about the timing
 
Mining Big Data Streams with APACHE SAMOA
Mining Big Data Streams with APACHE SAMOAMining Big Data Streams with APACHE SAMOA
Mining Big Data Streams with APACHE SAMOA
 
Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)
 
Mining big data streams with APACHE SAMOA by Albert Bifet
Mining big data streams with APACHE SAMOA by Albert BifetMining big data streams with APACHE SAMOA by Albert Bifet
Mining big data streams with APACHE SAMOA by Albert Bifet
 
Effectiveness and code optimization in Java
Effectiveness and code optimization in JavaEffectiveness and code optimization in Java
Effectiveness and code optimization in Java
 
Scylla Summit 2018: OLAP or OLTP? Why Not Both?
Scylla Summit 2018: OLAP or OLTP? Why Not Both?Scylla Summit 2018: OLAP or OLTP? Why Not Both?
Scylla Summit 2018: OLAP or OLTP? Why Not Both?
 
ETALIS_RuleML_2011_Retractions
ETALIS_RuleML_2011_RetractionsETALIS_RuleML_2011_Retractions
ETALIS_RuleML_2011_Retractions
 
Modeling the IoT with TitanDB and Cassandra
Modeling the IoT with TitanDB and CassandraModeling the IoT with TitanDB and Cassandra
Modeling the IoT with TitanDB and Cassandra
 
DL'12 mastro at work
DL'12 mastro at workDL'12 mastro at work
DL'12 mastro at work
 
Data mining Basics and complete description
Data mining Basics and complete description Data mining Basics and complete description
Data mining Basics and complete description
 
Database Health Check
Database Health CheckDatabase Health Check
Database Health Check
 

Recently uploaded

Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Precisely
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
Ajin Abraham
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 

Recently uploaded (20)

Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 

iTALS: implicit tensor factorization for context-aware recommendations (ECML/PKDD 2012 presentation)

  • 1. iTALS Fast ALS-based tensor factorization for context-aware recommendation from implicit feedback Balázs Hidasi – balazs.hidasi@gravityrd.com Domonkos Tikk – domonkos.tikk@gravityrd.com ECML/PKDD, 25TH SEPTEMBER 2012, BRISTOL
  • 2. Overview • Implicit feedback problem • Context-awareness ▫ Seasonality ▫ Sequentaility • iTALS ▫ Model ▫ Learning ▫ Prediction • Experiments
  • 3. Feedback types • Feedback: user-item interraction (events) • Explicit: ▫ Preferences explicitely coded ▫ E.g.: Ratings • Implicit: ▫ Preferences not coded explicitely ▫ E.g.: purchase history
  • 4. Problems with implicit feedback • Noisy positive preferences ▫ E.g.: bought & disappointed • No negative feedback available ▫ E.g.: had no info on item • Usually evaluated by ranking metrics ▫ Can not be directly optimized
  • 5. Why to use implicit feedback? • Every user provides • Some magnitudes larger amount of information than explicit feedback • More important in practice ▫ Explicit algorithms are for the biggest only
  • 6. Context-awareness • Context: any information associated with events • Context state: a possible value of the context dimension • Context-awareness ▫ Usage of context information ▫ Incorporating additional informations into the method ▫ Different predictions for same user given different context states ▫ Can greatly outperform context-unaware methods  Context segmentates items/users well
  • 7. Seasonality as context • Season: a time period ▫ E.g.: a week • Timeband: given interval in season ▫ Context-states ▫ E.g.: days • Assumed: ▫ aggregated behaviour in a given timeband is similar inbetween seasons User Item Date Context ▫ and different for different 1 A 12/07/2010 1 timebands 2 B 15/07/2010 3 ▫ E.g.: daily/weekly routines 1 B 15/07/2010 3 … … … 1 A 19/07/2010 1
  • 8. Sequentiality • Bought A after B ▫ B is the context-state of the user’s event on A • Usefullness ▫ Some items are bought together ▫ Some items bought repetetively ▫ Some are not • Association rule like information incorporated into model as context ▫ Here: into factorization methods • Can learn negated rules ▫ If C then not A
  • 9. M3 iTALS - Model • Binary tensor ▫ D dimensional User ▫ User – item – context(s) M1 T • Importance weights ▫ Lower weights to zeroes (NIF) ▫ Higher weights to cells with Item more events M2 • Cells approximated by sum of the values in the elementwise product of D vectors ▫ Each for a dimension ▫ Low dimensional vectors
  • 11. iTALS - Prediction • Sum of values in elementwise product of vectors ▫ User-item: scalar product of feature vectors ▫ User-item-context: weighted scalar product of feature vectors • Context-state dependent reweighting of features • E.g.: ▫ Third feature = horror movie ▫ Context state1 = Friday night  third feature high ▫ Context state2 = Sunday afternoon  third feature low
  • 12. Experiments • 5 databases ▫ 3 implicit  Online grocery shopping  VoD consumption  Music listening habits ▫ 2 implicitizied explicit  Netflix  MovieLens 10M • Recall@20 as primary evaluation metric • Baseline: context-unaware method in every context- state
  • 13. Scalability Running times on the Grocery dataset 350 300 iALS epoch Running time (sec) 250 iTALS (time bands) epoch 200 iTALS (sequence) epoch 150 iALS calc. matrix 100 iTALS (time bands) calc. matrix 50 iTALS (sequence) calc. matrix 0 10 20 30 40 50 60 70 80 90 100 Number of factors (K)
  • 14. Results – Recall@20 0.1200 0.1000 0.0800 0.0600 0.0400 0.0200 0.0000 VoD Grocery LastFM Netflix MovieLens iALS (MF method) iCA baseline iTALS (seasonality) iTALS (sequence)
  • 15. Results – Precision-recall curves 0.25 Recall-precision (Grocery) 0.20 0.15 0.10 iALS 0.05 0.00 iCA baseline 0.00 0.05 0.10 0.15 0.20 Recall-precision (VoD) iTALS 0.08 (season) 0.06 iTALS (sequence) 0.04 0.02 0.00 0.00 0.05 0.10 0.15 0.20
  • 16. Summary • iTALS is a ▫ scalable ▫ context-aware ▫ factorization method ▫ on implicit feedback data • The problem is modelled ▫ by approximating the values of a binary tensor ▫ with elementwise product of short vectors ▫ using importance weighting • Learning can be efficiently done by using ▫ ALS ▫ and other computation time reducing tricks • Recommendation accuracy is significantly better than iCA-baseline • Introduced a new context type: sequentiality ▫ association rule like information in factorization framework
  • 17. Thank you for your attention! For more of my recommender systems related research visit my website: http://www.hidasi.eu T