SlideShare a Scribd company logo
January 8, 2020 Page 1/27
Personalized Top-N Sequential Recommendation
via Convolutional Sequence Embedding (WSDM’18)
Jihoo Kim
datartist@hanyang.ac.kr
Dept. of Computer and Software, Hanyang University
Jiaxi Tang, Ke Wang
Simon Fraser University
January 8, 2020 Page 2/27
Jiaxi Tang
PhD Student
School of Computing Science
Simon Fraser University
Intern at Google AI
Research & Machine Intelligence Team
Ke Wang
Professor
School of Computing Science
Simon Fraser University
PhD, Georgia Institute of Technology
MS, Georgia Institute of Technology
Recent papers
Towards Neural Mixture Recommender for Long Range Dependent User Sequences (WWW’19)
Jiaxi Tang*, Francois Belletti*, Sagar Jain, Minmin Chen, Alex Beutel, Can Xu and Ed H. Chi
Ranking Distillation: Learning Compact Ranking Models With High Performance for Recommender System (KDD’18)
Jiaxi Tang, Ke Wang
Author
January 8, 2020 Page 3/27
Minimum qualifications:
• Currently enrolled in a Master’s or PhD degree in Computer Science or a related technical field.
• Experience (classroom/work) in Natural Language Understanding, Neural Networks, Computer Vision, Machine
Learning, Deep Learning, Algorithmic Foundations of Optimization, Data Science, Data Mining and/or Machine
Intelligence/Artificial Intelligence.
• Experience with one or more general purpose programming languages: Java, C++ or Python.
• Experience with research communities and/or efforts, including having published papers (being listed as author)
at conferences (e.g. NIPS, ICML, ACL, CVPR, etc).
About the job
Research and Machine Intelligence is a high impact team that’s building the next generation of intelligence and
language understanding for all Google products. To achieve this, we’re working on projects that utilize the latest
techniques in Artificial Intelligence, Machine Learning (including Deep Learning approaches like Google AI) and
Natural Language Understanding. We impact products across Google including Search, Maps and Google Now.
https://careers.google.com/jobs/results/136271419680924358-research-intern-2020/
Google AI Research Intern
January 8, 2020 Page 4/27
Contents
1. Introduction
1.1 Top-N Sequential Recommendation
1.2 Limitations of Previous Work
1.3 Contributions
2. Related Work
3. Proposed Methodology
3.1 Embedding Look-up
3.2 Convolutional Layers
3.3 Fully-connected Layers
3.4 Network Training
3.5 Recommendation
4. Experiments
4.1 Experimental Setup
4.2 Performance Comparison
4.3 Network Visualization
January 8, 2020 Page 5/27
User’s long term
and static behaviors
User’s short term
and dynamic behaviors
General
preferences
Sequential
patterns
<
always
After buying an iPhone, buy phone accessories
“I love Apple’s products”
vs
recent next
<Motivation>
1. Introduction
January 8, 2020 Page 6/27
1.1 Top-N Sequential Recommendation
Users
Items
Sequence
order
General preferences
Sequential patterns
Input Output
A list of items
for user u
<Top-N Sequential Recommendation>
<Notations>
1. Introduction
January 8, 2020 Page 7/27
1.2 Limitations of Previous Work
<Markov chain based model>
1) FPMC (Factorized Personalized Markov Chains) WWW’10
2) Fossil (Factorized Sequential Prediction with Item Similarity Model) ICDM’16
<Two major limitations>
1) Fail to model union-level* sequential patterns.
2) Fail to allow skip behaviors**.
milk flour
*Union-Level?
butter… …
**Skip behaviors?
… airport hotel
rest-
aurant
bar
attr-
action
not necessary
…
Figure 1
1. Introduction
January 8, 2020 Page 8/27
1.2 Limitations of Previous Work
To provide evidences of union-level influences and skip behaviors
minimum support count = 5
minimum confidence = 50%
X Y
sequence
Figure 2
Sequential Association
Rules
→
1. Introduction
January 8, 2020 Page 9/27
1.3 Contributions
Caser (ConvolutionAl Sequence Embedding Recommendation Model)
• Caser uses horizontal and vertical convolutional filters to capture sequential patterns
at point-level, union-level, and of skip behaviors.
• Caser models both users’ general preferences and sequential patterns, and
generalizes several existing state-of-the-art methods in a single unified framework.
• Caser outperforms state-of-the-art methods for top-N sequential recommendation on
real life data sets.
1. Introduction
January 8, 2020 Page 10/27
• Sequential pattern mining depends on the explicit representation of patterns, thus, could
miss patterns in unobserved states. (= could miss implicit patterns)
• CNN has been used to extract users’ preferences from their reviews. None of these works
is for sequential recommendation.
• RNN was used for session-based recommendation. It may not work well in sequential
recommendation, because not all adjacent actions have dependency relationships.
• Temporal recommendation is related but different problem. (Session-based is also different)
(ex. Recommend coffee in the morning, instead of evening.)
2. Related Work
January 8, 2020 Page 11/27
Figure 3
<Network Architecture of Caser>
3. Proposed Methodology
January 8, 2020 Page 12/27
The user 𝒖’s sequence
every 𝑳 successive
items
as input
their next 𝑻 items
as the targets
window of
size 𝑳 + 𝑻
The embedding for item 𝒊
d is the number of latent dimensions
𝑺 𝟏
𝒖
𝑺 𝟐
𝒖
𝑺 𝟑
𝒖
𝑺 𝟒
𝒖
𝑺 𝟓
𝒖
𝑬(𝒖,𝟑)
=
𝑸 𝑺 𝟏
𝒖
𝑸 𝑺 𝟐
𝒖
𝑬(𝒖,𝟒) =
𝑸 𝑺 𝟐
𝒖
𝑸 𝑺 𝟑
𝒖
𝑬(𝒖,𝟓) =
𝑸 𝑺 𝟑
𝒖
𝑸 𝑺 𝟒
𝒖
3.1 Embedding Look-up
3. Proposed Methodology
January 8, 2020 Page 13/27
image
local features
= 𝑳 × 𝒅 matrix 𝑬
= sequential pattern
Figure 4
Unlike image recognition,
“image” 𝑬 is not given…
and must be learnt
3.2 Convolutional Layers
3. Proposed Methodology
January 8, 2020 Page 14/27
𝑳 = 𝟒
𝒉 = 𝟐
𝒅 = 𝟑
𝑭 𝒌
∈ ℝ 𝟐×𝟑
𝒊 = 𝟏
𝒊 = 𝑳 − 𝒉 + 𝟏
= 𝟒 − 𝟐 + 𝟏
= 𝟑
𝑬 𝟏:𝟐
𝑬 𝟐:𝟑
𝑬 𝟑:𝟒
inner
product
activation
function
𝑖-th convolution value
<Max Pooling><Horizontal Filter>
𝑳 = 𝟒
𝒅 = 𝟑
෩𝑭 𝒌 ∈ ℝ 𝟒×𝟏
<Vertical Filter>
→ weighted sum
→ no max pooling
3. Proposed Methodology
𝑘-th filter
# of filter
height of filter
Convolution value (by 𝑭 𝒌
)
January 8, 2020 Page 15/27
activation function
convolutional
sequence embedding
3.3 Fully-connected Layers
the probability of
how likely user 𝒖 will interact
with item 𝒊
at time step 𝒕
3. Proposed Methodology
January 8, 2020 Page 16/27
union-level
sequential patterns
point-level
sequential patterns
short-term
sequential patterns
long-term
general preferences
3. Proposed Methodology
January 8, 2020 Page 17/27
3.4 Network Training
To train the network, we transform the values of the output layers to probabilities
sigmoid function
the collection of the time steps
for which we would like to make
predictions for user 𝒖
the likelihood of all sequences in the dataset
3. Proposed Methodology
January 8, 2020 Page 18/27
3.4 Network Training
To further capture skip behaviors, we could consider the next 𝑻 target items
Taking the negative logarithm of likelihood, we get the objective function “binary cross-entropy loss”
model parameters
hyper-parameters
are learned by minimizing the loss function (13)
are tuned on the validation set via grid search
3. Proposed Methodology
January 8, 2020 Page 19/27
3.5 Recommendation
After obtaining the trained neural network, to make recommendations for a user 𝒖 at time step 𝒕
We recommend 𝑵 items
that have the highest values
in the output layer 𝒖
𝒖’s last 𝑳 items’
embedding 𝑬(𝒖,𝒕)
𝒖’s latent
embedding 𝑷 𝒖
Input Output
3. Proposed Methodology
January 8, 2020 Page 20/27
4.1 Experimental Setup
<Datasets>
Amazon data was not used, due to its SI
0.0026 for ‘Office Products’
0.0019 for ‘Clothing’ / ‘Shoes’ / ‘Jewelry’ / ‘Video Games’
70% 10% 20%
validation testtraining
sequence
4. Experiments
January 8, 2020 Page 21/27
<Evaluation Metrics>
4.1 Experimental Setup
MAP(Mean Average Precision): the average of AP for all users
Precision, Recall
top 𝑵 predicted items
for a user
the last 20% of actions
in user’s sequence (= test set)
4. Experiments
January 8, 2020 Page 22/27
4.2 Performance Comparison
4. Experiments
January 8, 2020 Page 23/27
4.2 Performance Comparison
<Influence of hyper-parameter 𝒅, 𝑳, 𝑻,>
4. Experiments
January 8, 2020 Page 24/27
4.2 Performance Comparison
<Analysis of Caser Components>
𝒉 denotes horizontal convolutional layer
𝒗 denotes vertical convolutional layer
𝒑 denotes personalization
Any missing component is represented
by setting its corresponding 𝒐, ෥𝒐, 𝑷 𝒖 to zero.
4. Experiments
January 8, 2020 Page 25/27
4.3 Network Visualization
Caser puts more emphasis on recent actions,
demonstrating a major difference from the conventional top-N recommendation.
<Vertical convolutional filters>
4. Experiments
January 8, 2020 Page 26/27
4.3 Network Visualization
<Horizontal convolutional filters>
<Previous Sequence>
𝑺 𝟏 (13th Warrior) History
𝑺 𝟐 (American Beauty), Romance
𝑺 𝟑 (Star Trek), Action & SF
𝑺 𝟒 (Star Trek III)
𝑺 𝟓 (Star Trek IV)
<Predictions>
𝑹 𝟏 (Mad Max)
𝑹 𝟐 (Star War)
𝑹 𝟑 (Star Trek) >> Ground Truth
4. Experiments
January 8, 2020 Page 27/27
Thank you!
Q & A

More Related Content

What's hot

Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
Alexandros Karatzoglou
 
Label propagation - Semisupervised Learning with Applications to NLP
Label propagation - Semisupervised Learning with Applications to NLPLabel propagation - Semisupervised Learning with Applications to NLP
Label propagation - Semisupervised Learning with Applications to NLP
David Przybilla
 
Optimization/Gradient Descent
Optimization/Gradient DescentOptimization/Gradient Descent
Optimization/Gradient Descent
kandelin
 
Déjà Vu: The Importance of Time and Causality in Recommender Systems
Déjà Vu: The Importance of Time and Causality in Recommender SystemsDéjà Vu: The Importance of Time and Causality in Recommender Systems
Déjà Vu: The Importance of Time and Causality in Recommender Systems
Justin Basilico
 
Rethinking Attention with Performers
Rethinking Attention with PerformersRethinking Attention with Performers
Rethinking Attention with Performers
Joonhyung Lee
 
Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Anoop Deoras
 
Talk@rmit 09112017
Talk@rmit 09112017Talk@rmit 09112017
Talk@rmit 09112017
Shuai Zhang
 
Past, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry PerspectivePast, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry Perspective
Justin Basilico
 
Introduction to Random Forest
Introduction to Random Forest Introduction to Random Forest
Introduction to Random Forest
Rupak Roy
 
Generative Adversarial Networks
Generative Adversarial NetworksGenerative Adversarial Networks
Generative Adversarial Networks
Mark Chang
 
Artwork Personalization at Netflix
Artwork Personalization at NetflixArtwork Personalization at Netflix
Artwork Personalization at Netflix
Justin Basilico
 
A Multi-Armed Bandit Framework For Recommendations at Netflix
A Multi-Armed Bandit Framework For Recommendations at NetflixA Multi-Armed Bandit Framework For Recommendations at Netflix
A Multi-Armed Bandit Framework For Recommendations at Netflix
Jaya Kawale
 
Making Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms ReliableMaking Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms Reliable
Justin Basilico
 
Contextualization at Netflix
Contextualization at NetflixContextualization at Netflix
Contextualization at Netflix
Linas Baltrunas
 
오토인코더의 모든 것
오토인코더의 모든 것오토인코더의 모든 것
오토인코더의 모든 것
NAVER Engineering
 
Understanding Bagging and Boosting
Understanding Bagging and BoostingUnderstanding Bagging and Boosting
Understanding Bagging and Boosting
Mohit Rajput
 
Recommender Systems In Industry
Recommender Systems In IndustryRecommender Systems In Industry
Recommender Systems In Industry
Xavier Amatriain
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Ernesto Mislej
 
Introduction to Some Tree based Learning Method
Introduction to Some Tree based Learning MethodIntroduction to Some Tree based Learning Method
Introduction to Some Tree based Learning Method
Honglin Yu
 
Introduction to Diffusion Models
Introduction to Diffusion ModelsIntroduction to Diffusion Models
Introduction to Diffusion Models
Sangwoo Mo
 

What's hot (20)

Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
 
Label propagation - Semisupervised Learning with Applications to NLP
Label propagation - Semisupervised Learning with Applications to NLPLabel propagation - Semisupervised Learning with Applications to NLP
Label propagation - Semisupervised Learning with Applications to NLP
 
Optimization/Gradient Descent
Optimization/Gradient DescentOptimization/Gradient Descent
Optimization/Gradient Descent
 
Déjà Vu: The Importance of Time and Causality in Recommender Systems
Déjà Vu: The Importance of Time and Causality in Recommender SystemsDéjà Vu: The Importance of Time and Causality in Recommender Systems
Déjà Vu: The Importance of Time and Causality in Recommender Systems
 
Rethinking Attention with Performers
Rethinking Attention with PerformersRethinking Attention with Performers
Rethinking Attention with Performers
 
Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019
 
Talk@rmit 09112017
Talk@rmit 09112017Talk@rmit 09112017
Talk@rmit 09112017
 
Past, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry PerspectivePast, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry Perspective
 
Introduction to Random Forest
Introduction to Random Forest Introduction to Random Forest
Introduction to Random Forest
 
Generative Adversarial Networks
Generative Adversarial NetworksGenerative Adversarial Networks
Generative Adversarial Networks
 
Artwork Personalization at Netflix
Artwork Personalization at NetflixArtwork Personalization at Netflix
Artwork Personalization at Netflix
 
A Multi-Armed Bandit Framework For Recommendations at Netflix
A Multi-Armed Bandit Framework For Recommendations at NetflixA Multi-Armed Bandit Framework For Recommendations at Netflix
A Multi-Armed Bandit Framework For Recommendations at Netflix
 
Making Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms ReliableMaking Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms Reliable
 
Contextualization at Netflix
Contextualization at NetflixContextualization at Netflix
Contextualization at Netflix
 
오토인코더의 모든 것
오토인코더의 모든 것오토인코더의 모든 것
오토인코더의 모든 것
 
Understanding Bagging and Boosting
Understanding Bagging and BoostingUnderstanding Bagging and Boosting
Understanding Bagging and Boosting
 
Recommender Systems In Industry
Recommender Systems In IndustryRecommender Systems In Industry
Recommender Systems In Industry
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011
 
Introduction to Some Tree based Learning Method
Introduction to Some Tree based Learning MethodIntroduction to Some Tree based Learning Method
Introduction to Some Tree based Learning Method
 
Introduction to Diffusion Models
Introduction to Diffusion ModelsIntroduction to Diffusion Models
Introduction to Diffusion Models
 

Similar to [Paper Review] Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding (WSDM’18)

IRJET - Recommendations Engine with Multi-Objective Contextual Bandits (U...
IRJET -  	  Recommendations Engine with Multi-Objective Contextual Bandits (U...IRJET -  	  Recommendations Engine with Multi-Objective Contextual Bandits (U...
IRJET - Recommendations Engine with Multi-Objective Contextual Bandits (U...
IRJET Journal
 
IRJET- Automated Student’s Attendance Management using Convolutional Neural N...
IRJET- Automated Student’s Attendance Management using Convolutional Neural N...IRJET- Automated Student’s Attendance Management using Convolutional Neural N...
IRJET- Automated Student’s Attendance Management using Convolutional Neural N...
IRJET Journal
 
Recuriter Recommendation System
Recuriter Recommendation SystemRecuriter Recommendation System
Recuriter Recommendation System
IRJET Journal
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
IRJET Journal
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
IRJET Journal
 
Neural Net: Machine Learning Web Application
Neural Net: Machine Learning Web ApplicationNeural Net: Machine Learning Web Application
Neural Net: Machine Learning Web Application
IRJET Journal
 
Email Spam Detection Using Machine Learning
Email Spam Detection Using Machine LearningEmail Spam Detection Using Machine Learning
Email Spam Detection Using Machine Learning
IRJET Journal
 
IRJET - Automated Fraud Detection Framework in Examination Halls
 IRJET - Automated Fraud Detection Framework in Examination Halls IRJET - Automated Fraud Detection Framework in Examination Halls
IRJET - Automated Fraud Detection Framework in Examination Halls
IRJET Journal
 
IRJET- Deep Learning Model to Predict Hardware Performance
IRJET- Deep Learning Model to Predict Hardware PerformanceIRJET- Deep Learning Model to Predict Hardware Performance
IRJET- Deep Learning Model to Predict Hardware Performance
IRJET Journal
 
IRJET- Analysis of PV Fed Vector Controlled Induction Motor Drive
IRJET- Analysis of PV Fed Vector Controlled Induction Motor DriveIRJET- Analysis of PV Fed Vector Controlled Induction Motor Drive
IRJET- Analysis of PV Fed Vector Controlled Induction Motor Drive
IRJET Journal
 
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET Journal
 
IRJET-Scaling Distributed Associative Classifier using Big Data
IRJET-Scaling Distributed Associative Classifier using Big DataIRJET-Scaling Distributed Associative Classifier using Big Data
IRJET-Scaling Distributed Associative Classifier using Big Data
IRJET Journal
 
IRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
IRJET- Sentimental Analysis for Online Reviews using Machine Learning AlgorithmsIRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
IRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
IRJET Journal
 
IRJET - Smart Vet Locator for Hybrid Pets
IRJET -  	  Smart Vet Locator for Hybrid PetsIRJET -  	  Smart Vet Locator for Hybrid Pets
IRJET - Smart Vet Locator for Hybrid Pets
IRJET Journal
 
Study on Relavance Feature Selection Methods
Study on Relavance Feature Selection MethodsStudy on Relavance Feature Selection Methods
Study on Relavance Feature Selection Methods
IRJET Journal
 
IRJET- A Comparative Research of Rule based Classification on Dataset using W...
IRJET- A Comparative Research of Rule based Classification on Dataset using W...IRJET- A Comparative Research of Rule based Classification on Dataset using W...
IRJET- A Comparative Research of Rule based Classification on Dataset using W...
IRJET Journal
 
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
Water Quality Index Calculation of River Ganga using Decision Tree AlgorithmWater Quality Index Calculation of River Ganga using Decision Tree Algorithm
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
IRJET Journal
 
A Review Study OF Movie Recommendation Using Machine Learning
A Review Study OF Movie Recommendation Using Machine LearningA Review Study OF Movie Recommendation Using Machine Learning
A Review Study OF Movie Recommendation Using Machine Learning
IRJET Journal
 
Tourist Destination Recommendation System using Cosine Similarity
Tourist Destination Recommendation System using Cosine SimilarityTourist Destination Recommendation System using Cosine Similarity
Tourist Destination Recommendation System using Cosine Similarity
IRJET Journal
 
IRJET- Mango Classification using Convolutional Neural Networks
IRJET- Mango Classification using Convolutional Neural NetworksIRJET- Mango Classification using Convolutional Neural Networks
IRJET- Mango Classification using Convolutional Neural Networks
IRJET Journal
 

Similar to [Paper Review] Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding (WSDM’18) (20)

IRJET - Recommendations Engine with Multi-Objective Contextual Bandits (U...
IRJET -  	  Recommendations Engine with Multi-Objective Contextual Bandits (U...IRJET -  	  Recommendations Engine with Multi-Objective Contextual Bandits (U...
IRJET - Recommendations Engine with Multi-Objective Contextual Bandits (U...
 
IRJET- Automated Student’s Attendance Management using Convolutional Neural N...
IRJET- Automated Student’s Attendance Management using Convolutional Neural N...IRJET- Automated Student’s Attendance Management using Convolutional Neural N...
IRJET- Automated Student’s Attendance Management using Convolutional Neural N...
 
Recuriter Recommendation System
Recuriter Recommendation SystemRecuriter Recommendation System
Recuriter Recommendation System
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
 
Neural Net: Machine Learning Web Application
Neural Net: Machine Learning Web ApplicationNeural Net: Machine Learning Web Application
Neural Net: Machine Learning Web Application
 
Email Spam Detection Using Machine Learning
Email Spam Detection Using Machine LearningEmail Spam Detection Using Machine Learning
Email Spam Detection Using Machine Learning
 
IRJET - Automated Fraud Detection Framework in Examination Halls
 IRJET - Automated Fraud Detection Framework in Examination Halls IRJET - Automated Fraud Detection Framework in Examination Halls
IRJET - Automated Fraud Detection Framework in Examination Halls
 
IRJET- Deep Learning Model to Predict Hardware Performance
IRJET- Deep Learning Model to Predict Hardware PerformanceIRJET- Deep Learning Model to Predict Hardware Performance
IRJET- Deep Learning Model to Predict Hardware Performance
 
IRJET- Analysis of PV Fed Vector Controlled Induction Motor Drive
IRJET- Analysis of PV Fed Vector Controlled Induction Motor DriveIRJET- Analysis of PV Fed Vector Controlled Induction Motor Drive
IRJET- Analysis of PV Fed Vector Controlled Induction Motor Drive
 
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
 
IRJET-Scaling Distributed Associative Classifier using Big Data
IRJET-Scaling Distributed Associative Classifier using Big DataIRJET-Scaling Distributed Associative Classifier using Big Data
IRJET-Scaling Distributed Associative Classifier using Big Data
 
IRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
IRJET- Sentimental Analysis for Online Reviews using Machine Learning AlgorithmsIRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
IRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
 
IRJET - Smart Vet Locator for Hybrid Pets
IRJET -  	  Smart Vet Locator for Hybrid PetsIRJET -  	  Smart Vet Locator for Hybrid Pets
IRJET - Smart Vet Locator for Hybrid Pets
 
Study on Relavance Feature Selection Methods
Study on Relavance Feature Selection MethodsStudy on Relavance Feature Selection Methods
Study on Relavance Feature Selection Methods
 
IRJET- A Comparative Research of Rule based Classification on Dataset using W...
IRJET- A Comparative Research of Rule based Classification on Dataset using W...IRJET- A Comparative Research of Rule based Classification on Dataset using W...
IRJET- A Comparative Research of Rule based Classification on Dataset using W...
 
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
Water Quality Index Calculation of River Ganga using Decision Tree AlgorithmWater Quality Index Calculation of River Ganga using Decision Tree Algorithm
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
 
A Review Study OF Movie Recommendation Using Machine Learning
A Review Study OF Movie Recommendation Using Machine LearningA Review Study OF Movie Recommendation Using Machine Learning
A Review Study OF Movie Recommendation Using Machine Learning
 
Tourist Destination Recommendation System using Cosine Similarity
Tourist Destination Recommendation System using Cosine SimilarityTourist Destination Recommendation System using Cosine Similarity
Tourist Destination Recommendation System using Cosine Similarity
 
IRJET- Mango Classification using Convolutional Neural Networks
IRJET- Mango Classification using Convolutional Neural NetworksIRJET- Mango Classification using Convolutional Neural Networks
IRJET- Mango Classification using Convolutional Neural Networks
 

Recently uploaded

一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 

Recently uploaded (20)

一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 

[Paper Review] Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding (WSDM’18)

  • 1. January 8, 2020 Page 1/27 Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding (WSDM’18) Jihoo Kim datartist@hanyang.ac.kr Dept. of Computer and Software, Hanyang University Jiaxi Tang, Ke Wang Simon Fraser University
  • 2. January 8, 2020 Page 2/27 Jiaxi Tang PhD Student School of Computing Science Simon Fraser University Intern at Google AI Research & Machine Intelligence Team Ke Wang Professor School of Computing Science Simon Fraser University PhD, Georgia Institute of Technology MS, Georgia Institute of Technology Recent papers Towards Neural Mixture Recommender for Long Range Dependent User Sequences (WWW’19) Jiaxi Tang*, Francois Belletti*, Sagar Jain, Minmin Chen, Alex Beutel, Can Xu and Ed H. Chi Ranking Distillation: Learning Compact Ranking Models With High Performance for Recommender System (KDD’18) Jiaxi Tang, Ke Wang Author
  • 3. January 8, 2020 Page 3/27 Minimum qualifications: • Currently enrolled in a Master’s or PhD degree in Computer Science or a related technical field. • Experience (classroom/work) in Natural Language Understanding, Neural Networks, Computer Vision, Machine Learning, Deep Learning, Algorithmic Foundations of Optimization, Data Science, Data Mining and/or Machine Intelligence/Artificial Intelligence. • Experience with one or more general purpose programming languages: Java, C++ or Python. • Experience with research communities and/or efforts, including having published papers (being listed as author) at conferences (e.g. NIPS, ICML, ACL, CVPR, etc). About the job Research and Machine Intelligence is a high impact team that’s building the next generation of intelligence and language understanding for all Google products. To achieve this, we’re working on projects that utilize the latest techniques in Artificial Intelligence, Machine Learning (including Deep Learning approaches like Google AI) and Natural Language Understanding. We impact products across Google including Search, Maps and Google Now. https://careers.google.com/jobs/results/136271419680924358-research-intern-2020/ Google AI Research Intern
  • 4. January 8, 2020 Page 4/27 Contents 1. Introduction 1.1 Top-N Sequential Recommendation 1.2 Limitations of Previous Work 1.3 Contributions 2. Related Work 3. Proposed Methodology 3.1 Embedding Look-up 3.2 Convolutional Layers 3.3 Fully-connected Layers 3.4 Network Training 3.5 Recommendation 4. Experiments 4.1 Experimental Setup 4.2 Performance Comparison 4.3 Network Visualization
  • 5. January 8, 2020 Page 5/27 User’s long term and static behaviors User’s short term and dynamic behaviors General preferences Sequential patterns < always After buying an iPhone, buy phone accessories “I love Apple’s products” vs recent next <Motivation> 1. Introduction
  • 6. January 8, 2020 Page 6/27 1.1 Top-N Sequential Recommendation Users Items Sequence order General preferences Sequential patterns Input Output A list of items for user u <Top-N Sequential Recommendation> <Notations> 1. Introduction
  • 7. January 8, 2020 Page 7/27 1.2 Limitations of Previous Work <Markov chain based model> 1) FPMC (Factorized Personalized Markov Chains) WWW’10 2) Fossil (Factorized Sequential Prediction with Item Similarity Model) ICDM’16 <Two major limitations> 1) Fail to model union-level* sequential patterns. 2) Fail to allow skip behaviors**. milk flour *Union-Level? butter… … **Skip behaviors? … airport hotel rest- aurant bar attr- action not necessary … Figure 1 1. Introduction
  • 8. January 8, 2020 Page 8/27 1.2 Limitations of Previous Work To provide evidences of union-level influences and skip behaviors minimum support count = 5 minimum confidence = 50% X Y sequence Figure 2 Sequential Association Rules → 1. Introduction
  • 9. January 8, 2020 Page 9/27 1.3 Contributions Caser (ConvolutionAl Sequence Embedding Recommendation Model) • Caser uses horizontal and vertical convolutional filters to capture sequential patterns at point-level, union-level, and of skip behaviors. • Caser models both users’ general preferences and sequential patterns, and generalizes several existing state-of-the-art methods in a single unified framework. • Caser outperforms state-of-the-art methods for top-N sequential recommendation on real life data sets. 1. Introduction
  • 10. January 8, 2020 Page 10/27 • Sequential pattern mining depends on the explicit representation of patterns, thus, could miss patterns in unobserved states. (= could miss implicit patterns) • CNN has been used to extract users’ preferences from their reviews. None of these works is for sequential recommendation. • RNN was used for session-based recommendation. It may not work well in sequential recommendation, because not all adjacent actions have dependency relationships. • Temporal recommendation is related but different problem. (Session-based is also different) (ex. Recommend coffee in the morning, instead of evening.) 2. Related Work
  • 11. January 8, 2020 Page 11/27 Figure 3 <Network Architecture of Caser> 3. Proposed Methodology
  • 12. January 8, 2020 Page 12/27 The user 𝒖’s sequence every 𝑳 successive items as input their next 𝑻 items as the targets window of size 𝑳 + 𝑻 The embedding for item 𝒊 d is the number of latent dimensions 𝑺 𝟏 𝒖 𝑺 𝟐 𝒖 𝑺 𝟑 𝒖 𝑺 𝟒 𝒖 𝑺 𝟓 𝒖 𝑬(𝒖,𝟑) = 𝑸 𝑺 𝟏 𝒖 𝑸 𝑺 𝟐 𝒖 𝑬(𝒖,𝟒) = 𝑸 𝑺 𝟐 𝒖 𝑸 𝑺 𝟑 𝒖 𝑬(𝒖,𝟓) = 𝑸 𝑺 𝟑 𝒖 𝑸 𝑺 𝟒 𝒖 3.1 Embedding Look-up 3. Proposed Methodology
  • 13. January 8, 2020 Page 13/27 image local features = 𝑳 × 𝒅 matrix 𝑬 = sequential pattern Figure 4 Unlike image recognition, “image” 𝑬 is not given… and must be learnt 3.2 Convolutional Layers 3. Proposed Methodology
  • 14. January 8, 2020 Page 14/27 𝑳 = 𝟒 𝒉 = 𝟐 𝒅 = 𝟑 𝑭 𝒌 ∈ ℝ 𝟐×𝟑 𝒊 = 𝟏 𝒊 = 𝑳 − 𝒉 + 𝟏 = 𝟒 − 𝟐 + 𝟏 = 𝟑 𝑬 𝟏:𝟐 𝑬 𝟐:𝟑 𝑬 𝟑:𝟒 inner product activation function 𝑖-th convolution value <Max Pooling><Horizontal Filter> 𝑳 = 𝟒 𝒅 = 𝟑 ෩𝑭 𝒌 ∈ ℝ 𝟒×𝟏 <Vertical Filter> → weighted sum → no max pooling 3. Proposed Methodology 𝑘-th filter # of filter height of filter Convolution value (by 𝑭 𝒌 )
  • 15. January 8, 2020 Page 15/27 activation function convolutional sequence embedding 3.3 Fully-connected Layers the probability of how likely user 𝒖 will interact with item 𝒊 at time step 𝒕 3. Proposed Methodology
  • 16. January 8, 2020 Page 16/27 union-level sequential patterns point-level sequential patterns short-term sequential patterns long-term general preferences 3. Proposed Methodology
  • 17. January 8, 2020 Page 17/27 3.4 Network Training To train the network, we transform the values of the output layers to probabilities sigmoid function the collection of the time steps for which we would like to make predictions for user 𝒖 the likelihood of all sequences in the dataset 3. Proposed Methodology
  • 18. January 8, 2020 Page 18/27 3.4 Network Training To further capture skip behaviors, we could consider the next 𝑻 target items Taking the negative logarithm of likelihood, we get the objective function “binary cross-entropy loss” model parameters hyper-parameters are learned by minimizing the loss function (13) are tuned on the validation set via grid search 3. Proposed Methodology
  • 19. January 8, 2020 Page 19/27 3.5 Recommendation After obtaining the trained neural network, to make recommendations for a user 𝒖 at time step 𝒕 We recommend 𝑵 items that have the highest values in the output layer 𝒖 𝒖’s last 𝑳 items’ embedding 𝑬(𝒖,𝒕) 𝒖’s latent embedding 𝑷 𝒖 Input Output 3. Proposed Methodology
  • 20. January 8, 2020 Page 20/27 4.1 Experimental Setup <Datasets> Amazon data was not used, due to its SI 0.0026 for ‘Office Products’ 0.0019 for ‘Clothing’ / ‘Shoes’ / ‘Jewelry’ / ‘Video Games’ 70% 10% 20% validation testtraining sequence 4. Experiments
  • 21. January 8, 2020 Page 21/27 <Evaluation Metrics> 4.1 Experimental Setup MAP(Mean Average Precision): the average of AP for all users Precision, Recall top 𝑵 predicted items for a user the last 20% of actions in user’s sequence (= test set) 4. Experiments
  • 22. January 8, 2020 Page 22/27 4.2 Performance Comparison 4. Experiments
  • 23. January 8, 2020 Page 23/27 4.2 Performance Comparison <Influence of hyper-parameter 𝒅, 𝑳, 𝑻,> 4. Experiments
  • 24. January 8, 2020 Page 24/27 4.2 Performance Comparison <Analysis of Caser Components> 𝒉 denotes horizontal convolutional layer 𝒗 denotes vertical convolutional layer 𝒑 denotes personalization Any missing component is represented by setting its corresponding 𝒐, ෥𝒐, 𝑷 𝒖 to zero. 4. Experiments
  • 25. January 8, 2020 Page 25/27 4.3 Network Visualization Caser puts more emphasis on recent actions, demonstrating a major difference from the conventional top-N recommendation. <Vertical convolutional filters> 4. Experiments
  • 26. January 8, 2020 Page 26/27 4.3 Network Visualization <Horizontal convolutional filters> <Previous Sequence> 𝑺 𝟏 (13th Warrior) History 𝑺 𝟐 (American Beauty), Romance 𝑺 𝟑 (Star Trek), Action & SF 𝑺 𝟒 (Star Trek III) 𝑺 𝟓 (Star Trek IV) <Predictions> 𝑹 𝟏 (Mad Max) 𝑹 𝟐 (Star War) 𝑹 𝟑 (Star Trek) >> Ground Truth 4. Experiments
  • 27. January 8, 2020 Page 27/27 Thank you! Q & A