Presentation given at AMS 2016 discussing a framework to calculate and evaluate hybrid recommendation systems and a method to calculate feature importance.
Data and analytics allow organizations to use intelligence from feedback to tailor offerings that improve customer satisfaction.
B2B are gaining the most since they are able to share data that directly strengthens their relationship.
This presentation from Research Scientist Stacey Frederick provides an overview of a method to identify the firms and organizations involved in the nano economy throughout the value chain and introduces a new web-based platform to disseminate the information.
Nucleus Research found organizations can earn an incremental ROI of 241 percent by using Big Data capabilities to examine large and complex data sets. One driver of high returns was the ability to improve business processes and decisions by increasing the types of data that can be analyzed.
Getting IDMP Ready via Modern Product Data ManagementCognizant
For life sciences and pharma companies, compliance with the identification of medicinal products (IDMP) mandate is best achieved by building on an existing IDMP-compliant data management platform that can integrate all product master data.
Managing costs and delivering savings continues to be a focal
point of the procurement agenda. However, procurement also
has to focus on driving innovation to ultimately help drive
the growth agenda. This is against a backdrop of increasing
regulation that requires more transparency governance and
compliance in ever-increasing competitive markets.
An analysis of financial performance of manufacturers and retailers at the intersection of growth, operating margin, inventory turns and Return on Invested Capital (ROIC) for the period of 2010-2019. Congratulations to 22 winners including AbbVie Inc., Assa Abloy AB, BorgWarner Inc., Broadcom, Dollar Tree Stores, Ecolab Inc., iRobot Corporation, Lockheed Martin Corporation, Koninklijke Ahold N.V. (Ahold), L'Oréal S.A, Monster Beverage Company, PACCAR Inc, Reckitt Benckiser Group plc, ResMed, Rockwell Automation, Samsung, Sleep Number, Taiwan Semiconductor Manufacturing (TSMC) Company, The Toro Company, TJX Companies, Ubiquiti Networks, United Tractors, and VF Corporation.
Data and analytics allow organizations to use intelligence from feedback to tailor offerings that improve customer satisfaction.
B2B are gaining the most since they are able to share data that directly strengthens their relationship.
This presentation from Research Scientist Stacey Frederick provides an overview of a method to identify the firms and organizations involved in the nano economy throughout the value chain and introduces a new web-based platform to disseminate the information.
Nucleus Research found organizations can earn an incremental ROI of 241 percent by using Big Data capabilities to examine large and complex data sets. One driver of high returns was the ability to improve business processes and decisions by increasing the types of data that can be analyzed.
Getting IDMP Ready via Modern Product Data ManagementCognizant
For life sciences and pharma companies, compliance with the identification of medicinal products (IDMP) mandate is best achieved by building on an existing IDMP-compliant data management platform that can integrate all product master data.
Managing costs and delivering savings continues to be a focal
point of the procurement agenda. However, procurement also
has to focus on driving innovation to ultimately help drive
the growth agenda. This is against a backdrop of increasing
regulation that requires more transparency governance and
compliance in ever-increasing competitive markets.
An analysis of financial performance of manufacturers and retailers at the intersection of growth, operating margin, inventory turns and Return on Invested Capital (ROIC) for the period of 2010-2019. Congratulations to 22 winners including AbbVie Inc., Assa Abloy AB, BorgWarner Inc., Broadcom, Dollar Tree Stores, Ecolab Inc., iRobot Corporation, Lockheed Martin Corporation, Koninklijke Ahold N.V. (Ahold), L'Oréal S.A, Monster Beverage Company, PACCAR Inc, Reckitt Benckiser Group plc, ResMed, Rockwell Automation, Samsung, Sleep Number, Taiwan Semiconductor Manufacturing (TSMC) Company, The Toro Company, TJX Companies, Ubiquiti Networks, United Tractors, and VF Corporation.
Data lineage is a regulatory and internal requirement with potential to deliver significant operational and business benefits, but financial institutions can find it difficult to implement and complex to maintain as systems and regulatory requirements themselves, change quickly. The importance of understanding where the true source of the data is coming from, where the data flows to and what has changed cannot be overstated. The webinar defines data lineage and discuss implementation through the eyes of those that have implemented and sustained successful lineage solutions with significant benefits.
Listen to the webinar to find out about:
- Data management for data lineage
- Winning buy-in for projects
- Best practice implementation
- Operational and business benefits
- Expert practitioner advice
Corporate Sustainabiltiy Assessment and DJSI debrief at TSXRobert Dornau
Understand the participation process for RobecoSAM's Corporate Sustainability Assessment and some key developments on Materiality, Impact valuation and human rights in the 2016 assessment round for DJSI.
How to leverage a market data inventory platform for enterprise-wide gainsLeigh Hill
What do global heads of market data thinking about the best ways of managing costs?
What are the considerations, strengths and weaknesses of a market data inventory platform?
Join us on a webinar where we reveal the findings from our survey including:
-How to best manage market data costs
-The functions and capabilities to look for in market data spend management software
-The pitfalls of using legacy inventory platforms
Four categories of entity data quality managementLeigh Hill
We all know entity data is difficult. But there are four key categories of data quality management that you can apply to make measurable improvements to your entity data.
Traditional methods no longer work. You need methods to accurately measure the quality of your data and give you the ability to take meaningful action.
In this webinar we discuss:
-The biggest challenges with managing entity data quality
-The four categories of data quality management
-You can’t manage what you can’t measure. So how can you measure the quality of entity data?
-Traditional vs modern methods for measuring entity data quality
-Understanding the scale of ongoing entity data maintenance challenges
-The impact of utilities and industry initiatives such as LEI on data quality
ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analyti...BAINIDA
ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analytics and Data Sciences Contest
ผู้ที่ได้รางวัลชนะเลิศ
นายเธียรศักดิ์ พลาดิศัยเลิศ นักศึกษาคณะไอทีลาดกระบัง
นายก่อกฤษฎิ์ เอกพาณิชย์ถาวร จากคณะเศรษฐศาสตร์และวิทยาศาสตร์ข้อมูล WESLEYAN UNIVERSITY
นายณัฐพล รักษ์รัชตกุล จากคณะวิศวกรรมศาสตร์ จุฬาลงกรณ์มหาวิทยาลัย
Data lineage is a regulatory and internal requirement with potential to deliver significant operational and business benefits, but financial institutions can find it difficult to implement and complex to maintain as systems and regulatory requirements themselves, change quickly. The importance of understanding where the true source of the data is coming from, where the data flows to and what has changed cannot be overstated. The webinar defines data lineage and discuss implementation through the eyes of those that have implemented and sustained successful lineage solutions with significant benefits.
Listen to the webinar to find out about:
- Data management for data lineage
- Winning buy-in for projects
- Best practice implementation
- Operational and business benefits
- Expert practitioner advice
Corporate Sustainabiltiy Assessment and DJSI debrief at TSXRobert Dornau
Understand the participation process for RobecoSAM's Corporate Sustainability Assessment and some key developments on Materiality, Impact valuation and human rights in the 2016 assessment round for DJSI.
How to leverage a market data inventory platform for enterprise-wide gainsLeigh Hill
What do global heads of market data thinking about the best ways of managing costs?
What are the considerations, strengths and weaknesses of a market data inventory platform?
Join us on a webinar where we reveal the findings from our survey including:
-How to best manage market data costs
-The functions and capabilities to look for in market data spend management software
-The pitfalls of using legacy inventory platforms
Four categories of entity data quality managementLeigh Hill
We all know entity data is difficult. But there are four key categories of data quality management that you can apply to make measurable improvements to your entity data.
Traditional methods no longer work. You need methods to accurately measure the quality of your data and give you the ability to take meaningful action.
In this webinar we discuss:
-The biggest challenges with managing entity data quality
-The four categories of data quality management
-You can’t manage what you can’t measure. So how can you measure the quality of entity data?
-Traditional vs modern methods for measuring entity data quality
-Understanding the scale of ongoing entity data maintenance challenges
-The impact of utilities and industry initiatives such as LEI on data quality
ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analyti...BAINIDA
ผลการวิเคราะห์ข้อมูลของทีมที่ได้รางวัลชนะเลิศ The First NIDA Business Analytics and Data Sciences Contest
ผู้ที่ได้รางวัลชนะเลิศ
นายเธียรศักดิ์ พลาดิศัยเลิศ นักศึกษาคณะไอทีลาดกระบัง
นายก่อกฤษฎิ์ เอกพาณิชย์ถาวร จากคณะเศรษฐศาสตร์และวิทยาศาสตร์ข้อมูล WESLEYAN UNIVERSITY
นายณัฐพล รักษ์รัชตกุล จากคณะวิศวกรรมศาสตร์ จุฬาลงกรณ์มหาวิทยาลัย
Benchmarking Your Online Impact: From Stats to Reputation ManagementNSI Partners, LLC
"Benchmarking Your Online Impact: From Stats to Reputation Management" delivered live at UnTech10, 1:30pm Thursday Feb. 11, 2010; combined from 2 sessions originally scheduled for delivery at the ASAE Technology Conference (canceled due to snowfall in Washington, DC)
The ultimate value of historical data lays in addressing the questions "what will happen?" and "what is the best that could happen?". This session will detail how SAP Predictive Analytics empowers business decision makers by making accurate predictions in an agile and self-service manner.
Data Science Salon: Adopting Machine Learning to Drive Revenue and Market ShareFormulatedby
The race is on to gain strategic and proprietary insights into changes in customer preferences before your competitors. This workshop will cover how and why machine learning is the tool for marketers to drive revenue and increase market share. The adoption of machine learning does not happen overnight. We will discuss the Five Es of machine learning maturity – Educating, Exploring, Engaging, Executing and Expanding. Hear real-world examples of using machine learning to accelerate revenue, identify new customers and introduce new products based on machine learning capabilities.
Next DSS MIA Event - https://datascience.salon/miami/
How SASB Can Help Companies Manage the Sustainability Factors that Impact ValueSustainable Brands
How SASB Can Help Companies Manage the Sustainability Factors that Impact Value
How SASB Can Help Companies Manage the Sustainability Factors that Impact Value
Who wouldn’t prefer to wear a custom-tailored suit over something bought off the rack? Especially if it can be had for the same price, or even cheaper? In much the same way, we find that companies have a taste for supply chain analytics that are carefully tailored to their own business, quirks and all. In this talk we will discuss supply chain analytics broadly, provide some examples, and then address conditions when a custom approach to creating a supply chain decision support tool makes good sense.
This IT Brand Pulse mini-report includes only market leader data from the independent, non-sponsored survey covering six categories of brand leadership–Market, Price, Performance, Reliability, Service & Support and Innovation–for twelve Network Monitoring & Backup products.
Complete survey data for each product category is available. Please contact us at info@itbrandpulse.com for information and pricing.
Similar to Towards Better Online Personalization: A Framework for Empirical Evaluation and Real-Life Validation of Hybrid Recommendation Systems (20)
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Opendatabay - Open Data Marketplace.pptxOpendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay AI-driven features streamline the data workflow. Finding the data you need shouldn't be a complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with a dedicated, AI-generated, synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
StarCompliance is a leading firm specializing in the recovery of stolen cryptocurrency. Our comprehensive services are designed to assist individuals and organizations in navigating the complex process of fraud reporting, investigation, and fund recovery. We combine cutting-edge technology with expert legal support to provide a robust solution for victims of crypto theft.
Our Services Include:
Reporting to Tracking Authorities:
We immediately notify all relevant centralized exchanges (CEX), decentralized exchanges (DEX), and wallet providers about the stolen cryptocurrency. This ensures that the stolen assets are flagged as scam transactions, making it impossible for the thief to use them.
Assistance with Filing Police Reports:
We guide you through the process of filing a valid police report. Our support team provides detailed instructions on which police department to contact and helps you complete the necessary paperwork within the critical 72-hour window.
Launching the Refund Process:
Our team of experienced lawyers can initiate lawsuits on your behalf and represent you in various jurisdictions around the world. They work diligently to recover your stolen funds and ensure that justice is served.
At StarCompliance, we understand the urgency and stress involved in dealing with cryptocurrency theft. Our dedicated team works quickly and efficiently to provide you with the support and expertise needed to recover your assets. Trust us to be your partner in navigating the complexities of the crypto world and safeguarding your investments.
Towards Better Online Personalization: A Framework for Empirical Evaluation and Real-Life Validation of Hybrid Recommendation Systems
1. Towards Better Online Personalization: A
Framework for Empirical Evaluation and
Real-Life Validation of Hybrid
Recommendation Systems
Stijn Geuens, Koen W. De Bock, Kristof Coussement
3. How to Calculate Recommendations
[Bobadilla et al. 2013; Adomavicius et al. 2008]
Classification based on calculation paradigm:
Classification based on input data:
3AMS World Marketing Congress 201607/20/2016
4. How to Calculate Recommendations
[Bobadilla et al. 2013; Adomavicius et al. 2008]
Classification based on calculation paradigm:
Memory-based [Goldberg, 1992]
Model-based [Koren, 2008]
Classification based on input data:
Socio-demographic information Demographic RecSys [eg. Pazzani 1999; Porcel et al. 2012]
Product characteristics Content-based RecSys [eg. Lang 1995; Meteren and Someren 2000]
Real-time navigation information Knowledge-based RecSys [eg. Burke 2000]
Behavioral history Collaborative filtering RecSys [eg. Herlocker et al. 2004]
Hybrid solutions [eg. Burke 2002; Preece and Sneiderman 2009]
3AMS World Marketing Congress 201607/20/2016
5. A Shift Towards Hybrid Algorithms
Single data source systems: advantages and disadvantages [Bobadilla et al. 2013]
Hybridization resolves these issues and leads to better performance [Bobadilla et al. 2013]
Algorithm combination vs. data source combination [Bobadilla et al. 2013]
Burke’s classification [Burke, 2002]:
Weighting
Feature combination
4AMS World Marketing Congress 201607/20/2016
6. Contributions
Go beyond creation of a hybrid algorithm by:
Creation of a decision framework for marketing academics and professionals
to guide them in their efforts to create recommendation systems
Opening the black-box of recommendation systems by introducing the
concept of feature importance
5AMS World Marketing Congress 201607/20/2016
7. Research Questions
6AMS World Marketing Congress 2016
Data:
Recommendation Calculation:
Feature Importance:
07/20/2016
8. Research Questions
6AMS World Marketing Congress 2016
Data:
RQ1.a. Do Recommendation systems based on different single data sources differ in performance?
RQ1.b. Does combining different data sources add predictive performance?
Recommendation Calculation:
RQ2. Which hybridization technique performs best for algorithms with the optimal number of data
sources?
Feature Importance:
RQ3. Which are the most important predictors in the best performing algorithm?
07/20/2016
14. Experimental Setup
8 different company specific datasets
AMS World Marketing Congress 2016 9
Product Category Visitors Products
Shoes 31,536 11,712
Children's Clothing 16,752 3,956
Decoration 12,747 5,054
Lingerie 11,672 3,514
Furniture 20,507 6,481
Men's Clothing 8,412 4,737
Women's Clothing 50,336 12,979
Household linen 12,376 2,934
07/20/2016
15. Experimental Setup
Evaluation metric: F1@5 [Lipton, 2015]
Method of analysis:
AMS World Marketing Congress 2016 1007/20/2016
16. Experimental Setup
Evaluation metric: F1@5 [Lipton, 2015]
Method of analysis:
Evaluation: Data and Recommendation Calculation
Friedman aligned rank test with Li’s procedure for posthoc testing [Garçia, 2010]
AMS World Marketing Congress 2016 1007/20/2016
17. Experimental Setup
Evaluation metric: F1@5 [Lipton, 2015]
Method of analysis:
Evaluation: Data and Recommendation Calculation
Friedman aligned rank test with Li’s procedure for posthoc testing [Garçia, 2010]
Interpretation: Variable importance
Implementation of Breiman’s (2003) method developed for random forests
AMS World Marketing Congress 2016 10
𝐹𝑒𝑎𝑡𝐼𝑚𝑝 𝑖
=
𝐹1@5 𝐹𝑢𝑙𝑙 − 𝐹1@5 𝑅𝑎𝑛𝑑𝑜𝑚 𝑝𝑒𝑟𝑚𝑢𝑡𝑎𝑡𝑖𝑜𝑛
𝑖
𝐹1@5 𝐹𝑢𝑙𝑙
𝐹𝑒𝑎𝑡𝐼𝑚𝑝 𝑎𝑔𝑔𝑟
𝑖
=
1
𝑑
𝐹𝑒𝑎𝑡𝐼𝑚𝑝 𝑖
𝑑
07/20/2016
18. Results: Data
RQ1.a. Do Recommendation systems based on different single data sources differ in
performance?
AMS World Marketing Congress 2016 11
---- indicate a non-significant difference @ 95% CI
07/20/2016
19. Results: Data
RQ1.a. Do Recommendation systems based on different single data sources differ in
performance?
Yes, there is a difference in performance of different single data source
recommendation sytems
AMS World Marketing Congress 2016 11
---- indicate a non-significant difference @ 95% CI
07/20/2016
20. Results: Data
RQ1.a. Do Recommendation systems based on different single data sources differ in
performance?
Yes, there is a difference in performance of different single data source
recommendation sytems
A company focusses best on a RBD (or PD) based recommendation sytem when
building a single data source recommender system
AMS World Marketing Congress 2016 11
---- indicate a non-significant difference @ 95% CI
07/20/2016
21. Results: Data
RQ1.b. Does combining different data sources add predictive performance?
AMS World Marketing Congress 2016 12
…... indicate a marginally significant difference
07/20/2016
22. Results: Data
RQ1.b. Does combining different data sources add predictive performance?
Yes, performance increases when adding data sources
AMS World Marketing Congress 2016 12
…... indicate a marginally significant difference
07/20/2016
23. Results: Data
RQ1.b. Does combining different data sources add predictive performance?
Yes, performance increases when adding data sources
It is worthwhile for a company to investigate data source combination to improve
performance of recommendation systems
AMS World Marketing Congress 2016 12
…... indicate a marginally significant difference
07/20/2016
24. Results: Recommendation Calculation
RQ2. Which hybridization technique performs best for algorithms with the optimal
number of data sources?
AMS World Marketing Congress 2016 1307/20/2016
25. Results: Recommendation Calculation
RQ2. Which hybridization technique performs best for algorithms with the optimal
number of data sources?
Factorization machines are out performing an a posteriori weighting of single data
source algorithms
AMS World Marketing Congress 2016 1307/20/2016
26. Results: Recommendation Calculation
RQ2. Which hybridization technique performs best for algorithms with the optimal
number of data sources?
Factorization machines are out performing an a posteriori weighting of single data
source algorithms
It is worthwhile for a company to investigate advanced hybridization techniques to
improve the performance of recommendation systems
AMS World Marketing Congress 2016 1307/20/2016
27. Results: Feature Importance
RQ3. Which are the most important predictors in the best performing algorithm?
Within the best performing algorithm (RQ1 and RQ2), distinction can be made
between data source importance scores. RBD > PD > CD > ABD
AMS World Marketing Congress 2016 14
0% 5% 10% 15% 20% 25% 30% 35% 40%
Aggregated Behavioral Data
Customer Data
Product Data
Raw Behavioral Data
07/20/2016
28. Results: Feature Importance
AMS World Marketing Congress 2016 15
0% 2% 4% 6% 8% 10% 12% 14%
Number of total purchases
Mean product rating
Total value of purchases
Length of relationship
Time since last purchase
Internal vs external
Value-based segmentation
Mean Product Rating
Explicit ratings
Number of children
Marital Status
Place of residence
Age of Children
Brand
Gender
Age
Internal search
Product Division 3
Product Division 2
Product Division 1
Purchases
Addition to cart
Views
RBD
PD
CD
ABD
07/20/2016
29. Conclusions
A framework to guide marketing professionals and academics in their
efforts to create recommendation systems
Empirical validation of the framework on 8 datasets:
AMS World Marketing Congress 2016 1607/20/2016
30. Conclusions
A framework to guide marketing professionals and academics in their
efforts to create recommendation systems
Empirical validation of the framework on 8 datasets:
Single data sources recommendation systems differ in performance
AMS World Marketing Congress 2016 1607/20/2016
31. Conclusions
A framework to guide marketing professionals and academics in their
efforts to create recommendation systems
Empirical validation of the framework on 8 datasets:
Single data sources recommendation systems differ in performance
Combining data sources adds to the performance of recommendation systems
AMS World Marketing Congress 2016 1607/20/2016
32. Conclusions
A framework to guide marketing professionals and academics in their
efforts to create recommendation systems
Empirical validation of the framework on 8 datasets:
Single data sources recommendation systems differ in performance
Combining data sources adds to the performance of recommendation systems
An advanced combination technique based on feature combination outperforms
a posteriori weighting of single data source algorithms
AMS World Marketing Congress 2016 1607/20/2016
33. Conclusions
A framework to guide marketing professionals and academics in their
efforts to create recommendation systems
Empirical validation of the framework on 8 datasets:
Single data sources recommendation systems differ in performance
Combining data sources adds to the performance of recommendation systems
An advanced combination technique based on feature combination outperforms
a posteriori weighting of single data source algorithms
RBD is the most important data source in the best performing model followed by
PD, CD, and finally ABD
AMS World Marketing Congress 2016 1607/20/2016
34. Future Work
Incorporation of other evaluation metrics in the framework
Field test Evaluation of different recommendation strategies in terms
of business metrics
Identification of the relationship between ‘academic’ metrics and
business metrics
AMS World Marketing Congress 2016 1707/20/2016
35. References
J. Bobadilla, F. Ortega, A. Hernando, A. Gutierrez, Recommender systems survey, Knowl.-Based Syst.,
46 (2013) 109-132
] G. Adomavicius, A. Tuzhilin, Toward the next generation of recommender systems: A survey of the
state-of-the-art and possible extensions, IEEE Trans. Knowl. Data Eng., 17 (2005) 734-749
Y. Koren, Factorization meets the neighborhood: A multifaceted collaborative filtering model, 14th
ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, Las Vegas,
NV, 2008, pp. 426-434
M.J. Pazzani, A framework for collaborative, content-based and demographic filtering, Artif. Intell.
Rev., 13 (1999) 393-408
C. Porcel, A. Tejeda-Lorente, M.A. Martinez, E. Herrera-Viedma, A hybrid recommender system for
the selective dissemination of research resources in a technology transfer office, Inform. Sciences,
184 (2012) 1-19
R. Burke, Hybrid recommender systems: Survey and experiments, User Modeling and User-Adapted
Interaction, 12 (2002) 331-370
AMS World Marketing Congress 2016 1807/20/2016
36. References
J.L. Herlocker, J.A. Konstan, L.G. Terveen, J.T. Riedl, Evaluating collaborative filtering recommender
systems, ACM Trans. Inf. Syst., 22 (2004) 5-53
I.-Y. Song, Database Design for Real-World E-Commerce Systems, IEEE Data Engineering Bulletin, 23
(2000) 23-28.
R. Kohavi, L. Mason, R. Parekh, Z. Zheng, Lessons and Challenges from Mining Retail E-Commerce
Data, Mach. Learn., 57 (2004) 83-113
S. Rendle, Factorization Machines, IEEE International Conference on Data Mining, Sydney, Australia,
2010
Z.C. Lipton, C. Elkan, B. Naryanaswamy, Optimal thresholding of classifiers to maximize F1 measure,
in: T. Calders, F. Esposito, E. Hüllermeier, R. Meo (Eds.) Machine Learning and Knowledge Discovery in
Databases, Springer Berlin Heidelberg 2014, pp. 225-239
L. Breiman, Random forests, Mach. Learn., 45 (2001) 5-32
AMS World Marketing Congress 2016 1907/20/2016
37. Thank you for
your Attention
Contact:
Stijn Geuens (0)3.20.545.892
IESEG School of Management s.geuens@ieseg.fr
3 Rue de la Digue fr.linkedin.com/pub/stijn-geuens/
F-59000 Lille stijn.geuens
AMS World Marketing Congress 2016 2007/20/2016
38. Appendix 1: Advantages and disadvantages
of different systems
[Burke, 2002]
AMS World Marketing Congress 2016 21
Collaborative
Filtering
Content-based Knowledge-Based Demographic
Pros
No metadata
engineering needed
Comparison between
items possible
Deterministic
No metadata
engineering needed
Serendipity in results
No metadata
engineering needed
No cold-start Serendipity in results
Adaptive Adaptive
Cons
Scalability Overspecialization
Knowledge engineering
required
Long tail
Cold Start for new users
and items
Cold start for new users Subjective Cold start for new users
Long tail problem
Collection of product
information
Static Static
Stability
07/20/2016
39. Appendix 2: Experimental Framework
Data
22AMS World Marketing Congress 2016
Data
Product Data
Three main
product division
Brand
Mean product
rating
Internal vs.
external
Availability on
the web
Customer Data
Age
Gender
Marital status
Place of
residence
Number of
children
Age of children
Aggregated
Behavioral Data
RFM
Time since last
purchase
Number of total
purchases
Total value of
purchases
Relationship
features
Length of
Relationship
Value-based
segmentation
Mean product
rating
Raw Behavioral
Data
Explicit ratings
Purchases
Internal search
Addition to cart
Views
07/20/2016
42. Appendix 3: Experimental Framework:
Recommendation Calculation
25
Factorization Machines
Introduced by Rendle (2010)
Based on Support Vector Machines (SVM) and factorization models and combines the advantages
of both.
SVM: Works with any real valued feature vector, allowing to integrated different data sources
Factorization Models: Variable interaction is calculated based on factorized parameters, allowing
to estimate interaction under huge sparsity, where SVM’s fail.
General FM model equation of degree 2:
AMS World Marketing Congress 201607/20/2016