SlideShare a Scribd company logo
Towards Better Online Personalization: A
Framework for Empirical Evaluation and
Real-Life Validation of Hybrid
Recommendation Systems
Stijn Geuens, Koen W. De Bock, Kristof Coussement
Recommendation Systems: Examples
207/20/2016 AMS World Marketing Congress 2016
How to Calculate Recommendations
[Bobadilla et al. 2013; Adomavicius et al. 2008]
 Classification based on calculation paradigm:
 Classification based on input data:
3AMS World Marketing Congress 201607/20/2016
How to Calculate Recommendations
[Bobadilla et al. 2013; Adomavicius et al. 2008]
 Classification based on calculation paradigm:
 Memory-based [Goldberg, 1992]
 Model-based [Koren, 2008]
 Classification based on input data:
 Socio-demographic information  Demographic RecSys [eg. Pazzani 1999; Porcel et al. 2012]
 Product characteristics  Content-based RecSys [eg. Lang 1995; Meteren and Someren 2000]
 Real-time navigation information  Knowledge-based RecSys [eg. Burke 2000]
 Behavioral history  Collaborative filtering RecSys [eg. Herlocker et al. 2004]
 Hybrid solutions [eg. Burke 2002; Preece and Sneiderman 2009]
3AMS World Marketing Congress 201607/20/2016
A Shift Towards Hybrid Algorithms
 Single data source systems: advantages and disadvantages [Bobadilla et al. 2013]
 Hybridization resolves these issues and leads to better performance [Bobadilla et al. 2013]
 Algorithm combination vs. data source combination [Bobadilla et al. 2013]
 Burke’s classification [Burke, 2002]:
 Weighting
 Feature combination
4AMS World Marketing Congress 201607/20/2016
Contributions
 Go beyond creation of a hybrid algorithm by:
 Creation of a decision framework for marketing academics and professionals
to guide them in their efforts to create recommendation systems
 Opening the black-box of recommendation systems by introducing the
concept of feature importance
5AMS World Marketing Congress 201607/20/2016
Research Questions
6AMS World Marketing Congress 2016
 Data:
 Recommendation Calculation:
 Feature Importance:
07/20/2016
Research Questions
6AMS World Marketing Congress 2016
 Data:
RQ1.a. Do Recommendation systems based on different single data sources differ in performance?
RQ1.b. Does combining different data sources add predictive performance?
 Recommendation Calculation:
RQ2. Which hybridization technique performs best for algorithms with the optimal number of data
sources?
 Feature Importance:
RQ3. Which are the most important predictors in the best performing algorithm?
07/20/2016
Framework
AMS World Marketing Congress 2016 707/20/2016
Framework
AMS World Marketing Congress 2016 8
[Song, 2000; Kohavi et al., 2004]
07/20/2016
Framework
AMS World Marketing Congress 2016 8
[Rendle, 2010]
[Burke, 2002; Adomavicius & Tuzhilin, 2005]
07/20/2016
Framework
AMS World Marketing Congress 2016 8
[Lipton, 2014]
[Herlocker et al., 2004]
[Breiman, 2003]
07/20/2016
Framework
AMS World Marketing Congress 2016 807/20/2016
Experimental Setup
 8 different company specific datasets
AMS World Marketing Congress 2016 9
Product Category Visitors Products
Shoes 31,536 11,712
Children's Clothing 16,752 3,956
Decoration 12,747 5,054
Lingerie 11,672 3,514
Furniture 20,507 6,481
Men's Clothing 8,412 4,737
Women's Clothing 50,336 12,979
Household linen 12,376 2,934
07/20/2016
Experimental Setup
 Evaluation metric: F1@5 [Lipton, 2015]
 Method of analysis:
AMS World Marketing Congress 2016 1007/20/2016
Experimental Setup
 Evaluation metric: F1@5 [Lipton, 2015]
 Method of analysis:
 Evaluation: Data and Recommendation Calculation
 Friedman aligned rank test with Li’s procedure for posthoc testing [Garçia, 2010]
AMS World Marketing Congress 2016 1007/20/2016
Experimental Setup
 Evaluation metric: F1@5 [Lipton, 2015]
 Method of analysis:
 Evaluation: Data and Recommendation Calculation
 Friedman aligned rank test with Li’s procedure for posthoc testing [Garçia, 2010]
 Interpretation: Variable importance
 Implementation of Breiman’s (2003) method developed for random forests
AMS World Marketing Congress 2016 10
𝐹𝑒𝑎𝑡𝐼𝑚𝑝 𝑖
=
𝐹1@5 𝐹𝑢𝑙𝑙 − 𝐹1@5 𝑅𝑎𝑛𝑑𝑜𝑚 𝑝𝑒𝑟𝑚𝑢𝑡𝑎𝑡𝑖𝑜𝑛
𝑖
𝐹1@5 𝐹𝑢𝑙𝑙
𝐹𝑒𝑎𝑡𝐼𝑚𝑝 𝑎𝑔𝑔𝑟
𝑖
=
1
𝑑
𝐹𝑒𝑎𝑡𝐼𝑚𝑝 𝑖
𝑑
07/20/2016
Results: Data
 RQ1.a. Do Recommendation systems based on different single data sources differ in
performance?
AMS World Marketing Congress 2016 11
---- indicate a non-significant difference @ 95% CI
07/20/2016
Results: Data
 RQ1.a. Do Recommendation systems based on different single data sources differ in
performance?
 Yes, there is a difference in performance of different single data source
recommendation sytems
AMS World Marketing Congress 2016 11
---- indicate a non-significant difference @ 95% CI
07/20/2016
Results: Data
 RQ1.a. Do Recommendation systems based on different single data sources differ in
performance?
 Yes, there is a difference in performance of different single data source
recommendation sytems
A company focusses best on a RBD (or PD) based recommendation sytem when
building a single data source recommender system
AMS World Marketing Congress 2016 11
---- indicate a non-significant difference @ 95% CI
07/20/2016
Results: Data
 RQ1.b. Does combining different data sources add predictive performance?
AMS World Marketing Congress 2016 12
…... indicate a marginally significant difference
07/20/2016
Results: Data
 RQ1.b. Does combining different data sources add predictive performance?
 Yes, performance increases when adding data sources
AMS World Marketing Congress 2016 12
…... indicate a marginally significant difference
07/20/2016
Results: Data
 RQ1.b. Does combining different data sources add predictive performance?
 Yes, performance increases when adding data sources
It is worthwhile for a company to investigate data source combination to improve
performance of recommendation systems
AMS World Marketing Congress 2016 12
…... indicate a marginally significant difference
07/20/2016
Results: Recommendation Calculation
 RQ2. Which hybridization technique performs best for algorithms with the optimal
number of data sources?
AMS World Marketing Congress 2016 1307/20/2016
Results: Recommendation Calculation
 RQ2. Which hybridization technique performs best for algorithms with the optimal
number of data sources?
 Factorization machines are out performing an a posteriori weighting of single data
source algorithms
AMS World Marketing Congress 2016 1307/20/2016
Results: Recommendation Calculation
 RQ2. Which hybridization technique performs best for algorithms with the optimal
number of data sources?
 Factorization machines are out performing an a posteriori weighting of single data
source algorithms
It is worthwhile for a company to investigate advanced hybridization techniques to
improve the performance of recommendation systems
AMS World Marketing Congress 2016 1307/20/2016
Results: Feature Importance
 RQ3. Which are the most important predictors in the best performing algorithm?
 Within the best performing algorithm (RQ1 and RQ2), distinction can be made
between data source importance scores. RBD > PD > CD > ABD
AMS World Marketing Congress 2016 14
0% 5% 10% 15% 20% 25% 30% 35% 40%
Aggregated Behavioral Data
Customer Data
Product Data
Raw Behavioral Data
07/20/2016
Results: Feature Importance
AMS World Marketing Congress 2016 15
0% 2% 4% 6% 8% 10% 12% 14%
Number of total purchases
Mean product rating
Total value of purchases
Length of relationship
Time since last purchase
Internal vs external
Value-based segmentation
Mean Product Rating
Explicit ratings
Number of children
Marital Status
Place of residence
Age of Children
Brand
Gender
Age
Internal search
Product Division 3
Product Division 2
Product Division 1
Purchases
Addition to cart
Views
RBD
PD
CD
ABD
07/20/2016
Conclusions
 A framework to guide marketing professionals and academics in their
efforts to create recommendation systems
 Empirical validation of the framework on 8 datasets:
AMS World Marketing Congress 2016 1607/20/2016
Conclusions
 A framework to guide marketing professionals and academics in their
efforts to create recommendation systems
 Empirical validation of the framework on 8 datasets:
 Single data sources recommendation systems differ in performance
AMS World Marketing Congress 2016 1607/20/2016
Conclusions
 A framework to guide marketing professionals and academics in their
efforts to create recommendation systems
 Empirical validation of the framework on 8 datasets:
 Single data sources recommendation systems differ in performance
 Combining data sources adds to the performance of recommendation systems
AMS World Marketing Congress 2016 1607/20/2016
Conclusions
 A framework to guide marketing professionals and academics in their
efforts to create recommendation systems
 Empirical validation of the framework on 8 datasets:
 Single data sources recommendation systems differ in performance
 Combining data sources adds to the performance of recommendation systems
 An advanced combination technique based on feature combination outperforms
a posteriori weighting of single data source algorithms
AMS World Marketing Congress 2016 1607/20/2016
Conclusions
 A framework to guide marketing professionals and academics in their
efforts to create recommendation systems
 Empirical validation of the framework on 8 datasets:
 Single data sources recommendation systems differ in performance
 Combining data sources adds to the performance of recommendation systems
 An advanced combination technique based on feature combination outperforms
a posteriori weighting of single data source algorithms
 RBD is the most important data source in the best performing model followed by
PD, CD, and finally ABD
AMS World Marketing Congress 2016 1607/20/2016
Future Work
 Incorporation of other evaluation metrics in the framework
 Field test  Evaluation of different recommendation strategies in terms
of business metrics
 Identification of the relationship between ‘academic’ metrics and
business metrics
AMS World Marketing Congress 2016 1707/20/2016
References
 J. Bobadilla, F. Ortega, A. Hernando, A. Gutierrez, Recommender systems survey, Knowl.-Based Syst.,
46 (2013) 109-132
 ] G. Adomavicius, A. Tuzhilin, Toward the next generation of recommender systems: A survey of the
state-of-the-art and possible extensions, IEEE Trans. Knowl. Data Eng., 17 (2005) 734-749
 Y. Koren, Factorization meets the neighborhood: A multifaceted collaborative filtering model, 14th
ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, Las Vegas,
NV, 2008, pp. 426-434
 M.J. Pazzani, A framework for collaborative, content-based and demographic filtering, Artif. Intell.
Rev., 13 (1999) 393-408
 C. Porcel, A. Tejeda-Lorente, M.A. Martinez, E. Herrera-Viedma, A hybrid recommender system for
the selective dissemination of research resources in a technology transfer office, Inform. Sciences,
184 (2012) 1-19
 R. Burke, Hybrid recommender systems: Survey and experiments, User Modeling and User-Adapted
Interaction, 12 (2002) 331-370
AMS World Marketing Congress 2016 1807/20/2016
References
 J.L. Herlocker, J.A. Konstan, L.G. Terveen, J.T. Riedl, Evaluating collaborative filtering recommender
systems, ACM Trans. Inf. Syst., 22 (2004) 5-53
 I.-Y. Song, Database Design for Real-World E-Commerce Systems, IEEE Data Engineering Bulletin, 23
(2000) 23-28.
 R. Kohavi, L. Mason, R. Parekh, Z. Zheng, Lessons and Challenges from Mining Retail E-Commerce
Data, Mach. Learn., 57 (2004) 83-113
 S. Rendle, Factorization Machines, IEEE International Conference on Data Mining, Sydney, Australia,
2010
 Z.C. Lipton, C. Elkan, B. Naryanaswamy, Optimal thresholding of classifiers to maximize F1 measure,
in: T. Calders, F. Esposito, E. Hüllermeier, R. Meo (Eds.) Machine Learning and Knowledge Discovery in
Databases, Springer Berlin Heidelberg 2014, pp. 225-239
 L. Breiman, Random forests, Mach. Learn., 45 (2001) 5-32
AMS World Marketing Congress 2016 1907/20/2016
Thank you for
your Attention
Contact:
Stijn Geuens (0)3.20.545.892
IESEG School of Management s.geuens@ieseg.fr
3 Rue de la Digue fr.linkedin.com/pub/stijn-geuens/
F-59000 Lille stijn.geuens
AMS World Marketing Congress 2016 2007/20/2016
Appendix 1: Advantages and disadvantages
of different systems
[Burke, 2002]
AMS World Marketing Congress 2016 21
Collaborative
Filtering
Content-based Knowledge-Based Demographic
Pros
No metadata
engineering needed
Comparison between
items possible
Deterministic
No metadata
engineering needed
Serendipity in results
No metadata
engineering needed
No cold-start Serendipity in results
Adaptive Adaptive
Cons
Scalability Overspecialization
Knowledge engineering
required
Long tail
Cold Start for new users
and items
Cold start for new users Subjective Cold start for new users
Long tail problem
Collection of product
information
Static Static
Stability
07/20/2016
Appendix 2: Experimental Framework
Data
22AMS World Marketing Congress 2016
Data
Product Data
Three main
product division
Brand
Mean product
rating
Internal vs.
external
Availability on
the web
Customer Data
Age
Gender
Marital status
Place of
residence
Number of
children
Age of children
Aggregated
Behavioral Data
RFM
Time since last
purchase
Number of total
purchases
Total value of
purchases
Relationship
features
Length of
Relationship
Value-based
segmentation
Mean product
rating
Raw Behavioral
Data
Explicit ratings
Purchases
Internal search
Addition to cart
Views
07/20/2016
Appendix 2: Experimental Framework
Data
AMS World Marketing Congress 2016 2307/20/2016
Appendix 2: Experimental Framework
Data
AMS World Marketing Congress 2016 24
Product Category Visitors Products
Shoes 31,536 11,712
Children's Clothing 16,752 3,956
Decoration 12,747 5,054
Lingerie 11,672 3,514
Furniture 20,507 6,481
Men's Clothing 8,412 4,737
Women's Clothing 50,336 12,979
Household linen 12,376 2,934
07/20/2016
Appendix 3: Experimental Framework:
Recommendation Calculation
25
 Factorization Machines
 Introduced by Rendle (2010)
 Based on Support Vector Machines (SVM) and factorization models and combines the advantages
of both.
 SVM: Works with any real valued feature vector, allowing to integrated different data sources
 Factorization Models: Variable interaction is calculated based on factorized parameters, allowing
to estimate interaction under huge sparsity, where SVM’s fail.
 General FM model equation of degree 2:
AMS World Marketing Congress 201607/20/2016

More Related Content

Viewers also liked

As media evaluation jake
As media evaluation jakeAs media evaluation jake
As media evaluation jake
J4K360
 
Teorías y estándares internacionales para la construcción de
Teorías y estándares internacionales para la construcción deTeorías y estándares internacionales para la construcción de
Teorías y estándares internacionales para la construcción de
ldu_galo
 
Hayatin sirri
Hayatin sirriHayatin sirri
Hayatin sirriendaze41
 
Film pitch for class on the 22nd of october
Film pitch for class on the 22nd of octoberFilm pitch for class on the 22nd of october
Film pitch for class on the 22nd of october
J4K360
 
My PhD trajectory
My PhD trajectoryMy PhD trajectory
My PhD trajectory
Stijn Geuens
 
As media evaluation jake
As media evaluation jakeAs media evaluation jake
As media evaluation jake
J4K360
 
Desart presentation irca1 apr15
Desart presentation irca1 apr15Desart presentation irca1 apr15
Desart presentation irca1 apr15Desart Inc
 
Ulasan jurnal
Ulasan jurnalUlasan jurnal
Ulasan jurnal
noursham
 
Cableado estructurado
Cableado estructuradoCableado estructurado
Cableado estructurado
ldu_galo
 
Escuela inteligente
Escuela inteligenteEscuela inteligente
Escuela inteligente
ldu_galo
 
게임 캐릭터의 실시간 렌더링을 위한 리깅 테크닉 공개용
게임 캐릭터의 실시간 렌더링을 위한 리깅 테크닉   공개용게임 캐릭터의 실시간 렌더링을 위한 리깅 테크닉   공개용
게임 캐릭터의 실시간 렌더링을 위한 리깅 테크닉 공개용
lswsox
 

Viewers also liked (11)

As media evaluation jake
As media evaluation jakeAs media evaluation jake
As media evaluation jake
 
Teorías y estándares internacionales para la construcción de
Teorías y estándares internacionales para la construcción deTeorías y estándares internacionales para la construcción de
Teorías y estándares internacionales para la construcción de
 
Hayatin sirri
Hayatin sirriHayatin sirri
Hayatin sirri
 
Film pitch for class on the 22nd of october
Film pitch for class on the 22nd of octoberFilm pitch for class on the 22nd of october
Film pitch for class on the 22nd of october
 
My PhD trajectory
My PhD trajectoryMy PhD trajectory
My PhD trajectory
 
As media evaluation jake
As media evaluation jakeAs media evaluation jake
As media evaluation jake
 
Desart presentation irca1 apr15
Desart presentation irca1 apr15Desart presentation irca1 apr15
Desart presentation irca1 apr15
 
Ulasan jurnal
Ulasan jurnalUlasan jurnal
Ulasan jurnal
 
Cableado estructurado
Cableado estructuradoCableado estructurado
Cableado estructurado
 
Escuela inteligente
Escuela inteligenteEscuela inteligente
Escuela inteligente
 
게임 캐릭터의 실시간 렌더링을 위한 리깅 테크닉 공개용
게임 캐릭터의 실시간 렌더링을 위한 리깅 테크닉   공개용게임 캐릭터의 실시간 렌더링을 위한 리깅 테크닉   공개용
게임 캐릭터의 실시간 렌더링을 위한 리깅 테크닉 공개용
 

Similar to Towards Better Online Personalization: A Framework for Empirical Evaluation and Real-Life Validation of Hybrid Recommendation Systems

The art of implementing data lineage
The art of implementing data lineageThe art of implementing data lineage
The art of implementing data lineage
Leigh Hill
 
Microsoft Advertising Growth Solutions Bootcamp - Afternoon Session
Microsoft Advertising Growth Solutions Bootcamp - Afternoon SessionMicrosoft Advertising Growth Solutions Bootcamp - Afternoon Session
Microsoft Advertising Growth Solutions Bootcamp - Afternoon Session
MSFTAdvertising
 
Corporate Sustainabiltiy Assessment and DJSI debrief at TSX
Corporate Sustainabiltiy Assessment and DJSI debrief at TSXCorporate Sustainabiltiy Assessment and DJSI debrief at TSX
Corporate Sustainabiltiy Assessment and DJSI debrief at TSX
Robert Dornau
 
How to leverage a market data inventory platform for enterprise-wide gains
How to leverage a market data inventory platform for enterprise-wide gainsHow to leverage a market data inventory platform for enterprise-wide gains
How to leverage a market data inventory platform for enterprise-wide gains
Leigh Hill
 
Tutorial on Bias in Rec Sys @ UMAP2020
Tutorial on Bias in Rec Sys @ UMAP2020Tutorial on Bias in Rec Sys @ UMAP2020
Tutorial on Bias in Rec Sys @ UMAP2020
Mirko Marras
 
Four categories of entity data quality management
Four categories of entity data quality managementFour categories of entity data quality management
Four categories of entity data quality management
Leigh Hill
 
Driving Results through Strategic Data Sourcing and Optimization: Life Line G...
Driving Results through Strategic Data Sourcing and Optimization: Life Line G...Driving Results through Strategic Data Sourcing and Optimization: Life Line G...
Driving Results through Strategic Data Sourcing and Optimization: Life Line G...Vivastream
 
Strategic value assessment (sva) web
Strategic value assessment (sva)   webStrategic value assessment (sva)   web
Strategic value assessment (sva) web
Charles Novak
 
SC Cost optimization
SC Cost optimizationSC Cost optimization
SC Cost optimizationJulia Runow
 
ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analyti...
ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analyti...ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analyti...
ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analyti...
BAINIDA
 
Benchmarking Your Online Impact: From Stats to Reputation Management
Benchmarking Your Online Impact: From Stats to Reputation ManagementBenchmarking Your Online Impact: From Stats to Reputation Management
Benchmarking Your Online Impact: From Stats to Reputation Management
NSI Partners, LLC
 
Predictive Analytics: From Insight to Action
Predictive Analytics: From Insight to ActionPredictive Analytics: From Insight to Action
Predictive Analytics: From Insight to Action
Philippe Nemery
 
Data Science Salon: Adopting Machine Learning to Drive Revenue and Market Share
Data Science Salon: Adopting Machine Learning to Drive Revenue and Market ShareData Science Salon: Adopting Machine Learning to Drive Revenue and Market Share
Data Science Salon: Adopting Machine Learning to Drive Revenue and Market Share
Formulatedby
 
Pw C Engage 10.14r.09
Pw C Engage 10.14r.09Pw C Engage 10.14r.09
Pw C Engage 10.14r.09
jdanko26
 
Ebook definitive guide to attribution final
Ebook definitive guide to attribution finalEbook definitive guide to attribution final
Ebook definitive guide to attribution final
Nicolas Valenzuela
 
How SASB Can Help Companies Manage the Sustainability Factors that Impact Value
How SASB Can Help Companies Manage the Sustainability Factors that Impact ValueHow SASB Can Help Companies Manage the Sustainability Factors that Impact Value
How SASB Can Help Companies Manage the Sustainability Factors that Impact Value
Sustainable Brands
 
Digital_Marketing_Environment.doc.docx
Digital_Marketing_Environment.doc.docxDigital_Marketing_Environment.doc.docx
Digital_Marketing_Environment.doc.docx
OmPrakash63906
 
Hub16: Why Bespoke Supply Chain Analytics?
Hub16: Why Bespoke Supply Chain Analytics?Hub16: Why Bespoke Supply Chain Analytics?
Hub16: Why Bespoke Supply Chain Analytics?
Anaplan
 
2016 Network Monitoring & Backup Brand Leader Survey Mini-Report
2016 Network Monitoring & Backup Brand Leader Survey Mini-Report2016 Network Monitoring & Backup Brand Leader Survey Mini-Report
2016 Network Monitoring & Backup Brand Leader Survey Mini-Report
IT Brand Pulse
 

Similar to Towards Better Online Personalization: A Framework for Empirical Evaluation and Real-Life Validation of Hybrid Recommendation Systems (20)

The art of implementing data lineage
The art of implementing data lineageThe art of implementing data lineage
The art of implementing data lineage
 
Microsoft Advertising Growth Solutions Bootcamp - Afternoon Session
Microsoft Advertising Growth Solutions Bootcamp - Afternoon SessionMicrosoft Advertising Growth Solutions Bootcamp - Afternoon Session
Microsoft Advertising Growth Solutions Bootcamp - Afternoon Session
 
Corporate Sustainabiltiy Assessment and DJSI debrief at TSX
Corporate Sustainabiltiy Assessment and DJSI debrief at TSXCorporate Sustainabiltiy Assessment and DJSI debrief at TSX
Corporate Sustainabiltiy Assessment and DJSI debrief at TSX
 
How to leverage a market data inventory platform for enterprise-wide gains
How to leverage a market data inventory platform for enterprise-wide gainsHow to leverage a market data inventory platform for enterprise-wide gains
How to leverage a market data inventory platform for enterprise-wide gains
 
Tutorial on Bias in Rec Sys @ UMAP2020
Tutorial on Bias in Rec Sys @ UMAP2020Tutorial on Bias in Rec Sys @ UMAP2020
Tutorial on Bias in Rec Sys @ UMAP2020
 
Big data research
Big data researchBig data research
Big data research
 
Four categories of entity data quality management
Four categories of entity data quality managementFour categories of entity data quality management
Four categories of entity data quality management
 
Driving Results through Strategic Data Sourcing and Optimization: Life Line G...
Driving Results through Strategic Data Sourcing and Optimization: Life Line G...Driving Results through Strategic Data Sourcing and Optimization: Life Line G...
Driving Results through Strategic Data Sourcing and Optimization: Life Line G...
 
Strategic value assessment (sva) web
Strategic value assessment (sva)   webStrategic value assessment (sva)   web
Strategic value assessment (sva) web
 
SC Cost optimization
SC Cost optimizationSC Cost optimization
SC Cost optimization
 
ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analyti...
ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analyti...ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analyti...
ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analyti...
 
Benchmarking Your Online Impact: From Stats to Reputation Management
Benchmarking Your Online Impact: From Stats to Reputation ManagementBenchmarking Your Online Impact: From Stats to Reputation Management
Benchmarking Your Online Impact: From Stats to Reputation Management
 
Predictive Analytics: From Insight to Action
Predictive Analytics: From Insight to ActionPredictive Analytics: From Insight to Action
Predictive Analytics: From Insight to Action
 
Data Science Salon: Adopting Machine Learning to Drive Revenue and Market Share
Data Science Salon: Adopting Machine Learning to Drive Revenue and Market ShareData Science Salon: Adopting Machine Learning to Drive Revenue and Market Share
Data Science Salon: Adopting Machine Learning to Drive Revenue and Market Share
 
Pw C Engage 10.14r.09
Pw C Engage 10.14r.09Pw C Engage 10.14r.09
Pw C Engage 10.14r.09
 
Ebook definitive guide to attribution final
Ebook definitive guide to attribution finalEbook definitive guide to attribution final
Ebook definitive guide to attribution final
 
How SASB Can Help Companies Manage the Sustainability Factors that Impact Value
How SASB Can Help Companies Manage the Sustainability Factors that Impact ValueHow SASB Can Help Companies Manage the Sustainability Factors that Impact Value
How SASB Can Help Companies Manage the Sustainability Factors that Impact Value
 
Digital_Marketing_Environment.doc.docx
Digital_Marketing_Environment.doc.docxDigital_Marketing_Environment.doc.docx
Digital_Marketing_Environment.doc.docx
 
Hub16: Why Bespoke Supply Chain Analytics?
Hub16: Why Bespoke Supply Chain Analytics?Hub16: Why Bespoke Supply Chain Analytics?
Hub16: Why Bespoke Supply Chain Analytics?
 
2016 Network Monitoring & Backup Brand Leader Survey Mini-Report
2016 Network Monitoring & Backup Brand Leader Survey Mini-Report2016 Network Monitoring & Backup Brand Leader Survey Mini-Report
2016 Network Monitoring & Backup Brand Leader Survey Mini-Report
 

Recently uploaded

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
alex933524
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
AlejandraGmez176757
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 

Recently uploaded (20)

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 

Towards Better Online Personalization: A Framework for Empirical Evaluation and Real-Life Validation of Hybrid Recommendation Systems

  • 1. Towards Better Online Personalization: A Framework for Empirical Evaluation and Real-Life Validation of Hybrid Recommendation Systems Stijn Geuens, Koen W. De Bock, Kristof Coussement
  • 2. Recommendation Systems: Examples 207/20/2016 AMS World Marketing Congress 2016
  • 3. How to Calculate Recommendations [Bobadilla et al. 2013; Adomavicius et al. 2008]  Classification based on calculation paradigm:  Classification based on input data: 3AMS World Marketing Congress 201607/20/2016
  • 4. How to Calculate Recommendations [Bobadilla et al. 2013; Adomavicius et al. 2008]  Classification based on calculation paradigm:  Memory-based [Goldberg, 1992]  Model-based [Koren, 2008]  Classification based on input data:  Socio-demographic information  Demographic RecSys [eg. Pazzani 1999; Porcel et al. 2012]  Product characteristics  Content-based RecSys [eg. Lang 1995; Meteren and Someren 2000]  Real-time navigation information  Knowledge-based RecSys [eg. Burke 2000]  Behavioral history  Collaborative filtering RecSys [eg. Herlocker et al. 2004]  Hybrid solutions [eg. Burke 2002; Preece and Sneiderman 2009] 3AMS World Marketing Congress 201607/20/2016
  • 5. A Shift Towards Hybrid Algorithms  Single data source systems: advantages and disadvantages [Bobadilla et al. 2013]  Hybridization resolves these issues and leads to better performance [Bobadilla et al. 2013]  Algorithm combination vs. data source combination [Bobadilla et al. 2013]  Burke’s classification [Burke, 2002]:  Weighting  Feature combination 4AMS World Marketing Congress 201607/20/2016
  • 6. Contributions  Go beyond creation of a hybrid algorithm by:  Creation of a decision framework for marketing academics and professionals to guide them in their efforts to create recommendation systems  Opening the black-box of recommendation systems by introducing the concept of feature importance 5AMS World Marketing Congress 201607/20/2016
  • 7. Research Questions 6AMS World Marketing Congress 2016  Data:  Recommendation Calculation:  Feature Importance: 07/20/2016
  • 8. Research Questions 6AMS World Marketing Congress 2016  Data: RQ1.a. Do Recommendation systems based on different single data sources differ in performance? RQ1.b. Does combining different data sources add predictive performance?  Recommendation Calculation: RQ2. Which hybridization technique performs best for algorithms with the optimal number of data sources?  Feature Importance: RQ3. Which are the most important predictors in the best performing algorithm? 07/20/2016
  • 9. Framework AMS World Marketing Congress 2016 707/20/2016
  • 10. Framework AMS World Marketing Congress 2016 8 [Song, 2000; Kohavi et al., 2004] 07/20/2016
  • 11. Framework AMS World Marketing Congress 2016 8 [Rendle, 2010] [Burke, 2002; Adomavicius & Tuzhilin, 2005] 07/20/2016
  • 12. Framework AMS World Marketing Congress 2016 8 [Lipton, 2014] [Herlocker et al., 2004] [Breiman, 2003] 07/20/2016
  • 13. Framework AMS World Marketing Congress 2016 807/20/2016
  • 14. Experimental Setup  8 different company specific datasets AMS World Marketing Congress 2016 9 Product Category Visitors Products Shoes 31,536 11,712 Children's Clothing 16,752 3,956 Decoration 12,747 5,054 Lingerie 11,672 3,514 Furniture 20,507 6,481 Men's Clothing 8,412 4,737 Women's Clothing 50,336 12,979 Household linen 12,376 2,934 07/20/2016
  • 15. Experimental Setup  Evaluation metric: F1@5 [Lipton, 2015]  Method of analysis: AMS World Marketing Congress 2016 1007/20/2016
  • 16. Experimental Setup  Evaluation metric: F1@5 [Lipton, 2015]  Method of analysis:  Evaluation: Data and Recommendation Calculation  Friedman aligned rank test with Li’s procedure for posthoc testing [Garçia, 2010] AMS World Marketing Congress 2016 1007/20/2016
  • 17. Experimental Setup  Evaluation metric: F1@5 [Lipton, 2015]  Method of analysis:  Evaluation: Data and Recommendation Calculation  Friedman aligned rank test with Li’s procedure for posthoc testing [Garçia, 2010]  Interpretation: Variable importance  Implementation of Breiman’s (2003) method developed for random forests AMS World Marketing Congress 2016 10 𝐹𝑒𝑎𝑡𝐼𝑚𝑝 𝑖 = 𝐹1@5 𝐹𝑢𝑙𝑙 − 𝐹1@5 𝑅𝑎𝑛𝑑𝑜𝑚 𝑝𝑒𝑟𝑚𝑢𝑡𝑎𝑡𝑖𝑜𝑛 𝑖 𝐹1@5 𝐹𝑢𝑙𝑙 𝐹𝑒𝑎𝑡𝐼𝑚𝑝 𝑎𝑔𝑔𝑟 𝑖 = 1 𝑑 𝐹𝑒𝑎𝑡𝐼𝑚𝑝 𝑖 𝑑 07/20/2016
  • 18. Results: Data  RQ1.a. Do Recommendation systems based on different single data sources differ in performance? AMS World Marketing Congress 2016 11 ---- indicate a non-significant difference @ 95% CI 07/20/2016
  • 19. Results: Data  RQ1.a. Do Recommendation systems based on different single data sources differ in performance?  Yes, there is a difference in performance of different single data source recommendation sytems AMS World Marketing Congress 2016 11 ---- indicate a non-significant difference @ 95% CI 07/20/2016
  • 20. Results: Data  RQ1.a. Do Recommendation systems based on different single data sources differ in performance?  Yes, there is a difference in performance of different single data source recommendation sytems A company focusses best on a RBD (or PD) based recommendation sytem when building a single data source recommender system AMS World Marketing Congress 2016 11 ---- indicate a non-significant difference @ 95% CI 07/20/2016
  • 21. Results: Data  RQ1.b. Does combining different data sources add predictive performance? AMS World Marketing Congress 2016 12 …... indicate a marginally significant difference 07/20/2016
  • 22. Results: Data  RQ1.b. Does combining different data sources add predictive performance?  Yes, performance increases when adding data sources AMS World Marketing Congress 2016 12 …... indicate a marginally significant difference 07/20/2016
  • 23. Results: Data  RQ1.b. Does combining different data sources add predictive performance?  Yes, performance increases when adding data sources It is worthwhile for a company to investigate data source combination to improve performance of recommendation systems AMS World Marketing Congress 2016 12 …... indicate a marginally significant difference 07/20/2016
  • 24. Results: Recommendation Calculation  RQ2. Which hybridization technique performs best for algorithms with the optimal number of data sources? AMS World Marketing Congress 2016 1307/20/2016
  • 25. Results: Recommendation Calculation  RQ2. Which hybridization technique performs best for algorithms with the optimal number of data sources?  Factorization machines are out performing an a posteriori weighting of single data source algorithms AMS World Marketing Congress 2016 1307/20/2016
  • 26. Results: Recommendation Calculation  RQ2. Which hybridization technique performs best for algorithms with the optimal number of data sources?  Factorization machines are out performing an a posteriori weighting of single data source algorithms It is worthwhile for a company to investigate advanced hybridization techniques to improve the performance of recommendation systems AMS World Marketing Congress 2016 1307/20/2016
  • 27. Results: Feature Importance  RQ3. Which are the most important predictors in the best performing algorithm?  Within the best performing algorithm (RQ1 and RQ2), distinction can be made between data source importance scores. RBD > PD > CD > ABD AMS World Marketing Congress 2016 14 0% 5% 10% 15% 20% 25% 30% 35% 40% Aggregated Behavioral Data Customer Data Product Data Raw Behavioral Data 07/20/2016
  • 28. Results: Feature Importance AMS World Marketing Congress 2016 15 0% 2% 4% 6% 8% 10% 12% 14% Number of total purchases Mean product rating Total value of purchases Length of relationship Time since last purchase Internal vs external Value-based segmentation Mean Product Rating Explicit ratings Number of children Marital Status Place of residence Age of Children Brand Gender Age Internal search Product Division 3 Product Division 2 Product Division 1 Purchases Addition to cart Views RBD PD CD ABD 07/20/2016
  • 29. Conclusions  A framework to guide marketing professionals and academics in their efforts to create recommendation systems  Empirical validation of the framework on 8 datasets: AMS World Marketing Congress 2016 1607/20/2016
  • 30. Conclusions  A framework to guide marketing professionals and academics in their efforts to create recommendation systems  Empirical validation of the framework on 8 datasets:  Single data sources recommendation systems differ in performance AMS World Marketing Congress 2016 1607/20/2016
  • 31. Conclusions  A framework to guide marketing professionals and academics in their efforts to create recommendation systems  Empirical validation of the framework on 8 datasets:  Single data sources recommendation systems differ in performance  Combining data sources adds to the performance of recommendation systems AMS World Marketing Congress 2016 1607/20/2016
  • 32. Conclusions  A framework to guide marketing professionals and academics in their efforts to create recommendation systems  Empirical validation of the framework on 8 datasets:  Single data sources recommendation systems differ in performance  Combining data sources adds to the performance of recommendation systems  An advanced combination technique based on feature combination outperforms a posteriori weighting of single data source algorithms AMS World Marketing Congress 2016 1607/20/2016
  • 33. Conclusions  A framework to guide marketing professionals and academics in their efforts to create recommendation systems  Empirical validation of the framework on 8 datasets:  Single data sources recommendation systems differ in performance  Combining data sources adds to the performance of recommendation systems  An advanced combination technique based on feature combination outperforms a posteriori weighting of single data source algorithms  RBD is the most important data source in the best performing model followed by PD, CD, and finally ABD AMS World Marketing Congress 2016 1607/20/2016
  • 34. Future Work  Incorporation of other evaluation metrics in the framework  Field test  Evaluation of different recommendation strategies in terms of business metrics  Identification of the relationship between ‘academic’ metrics and business metrics AMS World Marketing Congress 2016 1707/20/2016
  • 35. References  J. Bobadilla, F. Ortega, A. Hernando, A. Gutierrez, Recommender systems survey, Knowl.-Based Syst., 46 (2013) 109-132  ] G. Adomavicius, A. Tuzhilin, Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions, IEEE Trans. Knowl. Data Eng., 17 (2005) 734-749  Y. Koren, Factorization meets the neighborhood: A multifaceted collaborative filtering model, 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, Las Vegas, NV, 2008, pp. 426-434  M.J. Pazzani, A framework for collaborative, content-based and demographic filtering, Artif. Intell. Rev., 13 (1999) 393-408  C. Porcel, A. Tejeda-Lorente, M.A. Martinez, E. Herrera-Viedma, A hybrid recommender system for the selective dissemination of research resources in a technology transfer office, Inform. Sciences, 184 (2012) 1-19  R. Burke, Hybrid recommender systems: Survey and experiments, User Modeling and User-Adapted Interaction, 12 (2002) 331-370 AMS World Marketing Congress 2016 1807/20/2016
  • 36. References  J.L. Herlocker, J.A. Konstan, L.G. Terveen, J.T. Riedl, Evaluating collaborative filtering recommender systems, ACM Trans. Inf. Syst., 22 (2004) 5-53  I.-Y. Song, Database Design for Real-World E-Commerce Systems, IEEE Data Engineering Bulletin, 23 (2000) 23-28.  R. Kohavi, L. Mason, R. Parekh, Z. Zheng, Lessons and Challenges from Mining Retail E-Commerce Data, Mach. Learn., 57 (2004) 83-113  S. Rendle, Factorization Machines, IEEE International Conference on Data Mining, Sydney, Australia, 2010  Z.C. Lipton, C. Elkan, B. Naryanaswamy, Optimal thresholding of classifiers to maximize F1 measure, in: T. Calders, F. Esposito, E. Hüllermeier, R. Meo (Eds.) Machine Learning and Knowledge Discovery in Databases, Springer Berlin Heidelberg 2014, pp. 225-239  L. Breiman, Random forests, Mach. Learn., 45 (2001) 5-32 AMS World Marketing Congress 2016 1907/20/2016
  • 37. Thank you for your Attention Contact: Stijn Geuens (0)3.20.545.892 IESEG School of Management s.geuens@ieseg.fr 3 Rue de la Digue fr.linkedin.com/pub/stijn-geuens/ F-59000 Lille stijn.geuens AMS World Marketing Congress 2016 2007/20/2016
  • 38. Appendix 1: Advantages and disadvantages of different systems [Burke, 2002] AMS World Marketing Congress 2016 21 Collaborative Filtering Content-based Knowledge-Based Demographic Pros No metadata engineering needed Comparison between items possible Deterministic No metadata engineering needed Serendipity in results No metadata engineering needed No cold-start Serendipity in results Adaptive Adaptive Cons Scalability Overspecialization Knowledge engineering required Long tail Cold Start for new users and items Cold start for new users Subjective Cold start for new users Long tail problem Collection of product information Static Static Stability 07/20/2016
  • 39. Appendix 2: Experimental Framework Data 22AMS World Marketing Congress 2016 Data Product Data Three main product division Brand Mean product rating Internal vs. external Availability on the web Customer Data Age Gender Marital status Place of residence Number of children Age of children Aggregated Behavioral Data RFM Time since last purchase Number of total purchases Total value of purchases Relationship features Length of Relationship Value-based segmentation Mean product rating Raw Behavioral Data Explicit ratings Purchases Internal search Addition to cart Views 07/20/2016
  • 40. Appendix 2: Experimental Framework Data AMS World Marketing Congress 2016 2307/20/2016
  • 41. Appendix 2: Experimental Framework Data AMS World Marketing Congress 2016 24 Product Category Visitors Products Shoes 31,536 11,712 Children's Clothing 16,752 3,956 Decoration 12,747 5,054 Lingerie 11,672 3,514 Furniture 20,507 6,481 Men's Clothing 8,412 4,737 Women's Clothing 50,336 12,979 Household linen 12,376 2,934 07/20/2016
  • 42. Appendix 3: Experimental Framework: Recommendation Calculation 25  Factorization Machines  Introduced by Rendle (2010)  Based on Support Vector Machines (SVM) and factorization models and combines the advantages of both.  SVM: Works with any real valued feature vector, allowing to integrated different data sources  Factorization Models: Variable interaction is calculated based on factorized parameters, allowing to estimate interaction under huge sparsity, where SVM’s fail.  General FM model equation of degree 2: AMS World Marketing Congress 201607/20/2016