Modeling Implicit Signals and Dynamic Preferences for Sequential Recommendation
1. Ho-Beom Kim
Network Science Lab
Dept. of Mathematics
The Catholic University of Korea
E-mail: hobeom2001@catholic.ac.kr
2023 / 07 / 05
WANG, Xiang, et al.
2019, ACM SIGIR
2. 2
Introduction
Problem Statements
•Existing works
• adopt human-designed rules or attention mechanisms to assign time-decaying weights to
historically interacted items.
• leverage recurrent neural networks to summarize the behavioral sequences, but they suffer from
the short-term bottleneck in capturing users' dynamic interests due to the difficulty of modeling
long-range dependencies.
•Recent Solutions
• jointly model long-term and short-term interests to avoid forgetting long-term interests
•Two major challenges in sequential recommendation that have not been well addressed
• User behaviors in long sequences reflect implicit and noisy preference signals
• Implicit feedback records can serve as noise in the user's behavior history, worsening the
modeling of their real interests.
• User preferences are always drifting over time due to their diversity
• It is still challenging to model how they change over the history and to estimate the activated
preferences at the current time, which is the core of recommendation models.
3. 3
Introduction
Contributions
•They approach sequential recommendation from a new perspective by taking into consideration the
implicit-signal behaviors and fast-changing preferences.
•They propose to aggregate implicit signals into explicit ones from user behaviors by designing graph
neural network-based models on constructed item-item interest graphs.
•They design dynamic pooling for filtering and reserving activated core preferences for recommendation.
•They conduct extensive experiments on two large-scale datasets collected from real-world applications.
•The experimental results show significant performance improvements compared with the state-of-the-art
methods of sequential recommendation.
•Studies also verify that their method can model long behavioral sequences effectively and efficiently.
5. 5
Method
Interest Graph Construction
•They convert loose item sequences into tight item-item interest graphs.
•Raw graph construction
• This model attempts to construct an undirected graph for each interaction sequence
• $\mathcal{G} = \{\mathcal{V}, \mathcal{E}, A\}$
• $\mathcal{V}$ : the set of nodes, each an interacted item
• $\mathcal{E}$ : the set of graph edges to learn
• $A$ : the corresponding adjacency matrix
•They aim to learn the adjacency matrix A
• The core interest node has a higher degree than the peripheral interest nodes because it connects more
similar interests, and a higher frequency of similar interests results in a denser and larger subgraph.
• A priori, neighbor nodes are similar, and dense subgraphs are the core interests of users.
6. 6
Method
Node similarity metric learning
•$M_{ij} = \cos(\vec{w} \odot \vec{h}_i, \vec{w} \odot \vec{h}_j)$
•$\vec{w}$ : trainable weight vector
•$\vec{h}_i$ : item embeddings
•$\phi$ weight vectors to compute $\phi$ independent similarity matrices
•$M_{ij}^{\delta} = \cos(\vec{w}_{\delta} \odot \vec{h}_i, \vec{w}_{\delta} \odot \vec{h}_j), \quad M_{ij} = \frac{1}{\phi} \sum_{\delta=1}^{\phi} M_{ij}^{\delta}$
•$M_{ij}^{\delta}$ : the similarity metric for the $\delta$-th head
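To make this concrete, below is a minimal PyTorch sketch of the multi-head weighted cosine similarity; the class name MetricLearning and the num_heads default are illustrative assumptions, not names from the paper.
```python
# Sketch of node similarity metric learning, assuming PyTorch.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MetricLearning(nn.Module):
    """Multi-head weighted cosine similarity between item embeddings."""
    def __init__(self, dim: int, num_heads: int = 4):
        super().__init__()
        # One trainable weight vector w_delta per head (phi heads in total).
        self.weights = nn.Parameter(torch.ones(num_heads, dim))

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        # h: (n, dim) embeddings of the items in one interaction sequence.
        hw = h.unsqueeze(0) * self.weights.unsqueeze(1)  # (heads, n, dim)
        hw = F.normalize(hw, dim=-1)            # cosine = dot of unit vectors
        m = torch.bmm(hw, hw.transpose(1, 2))   # (heads, n, n) similarities
        return m.mean(dim=0)                    # average heads -> M, (n, n)
```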
7. 7
Method
Graph sparsification via ε-sparseness
•They extract a symmetric sparse non-negative adjacency matrix A from M by considering only the
node pairs with the most vital connections.
•They adopt a relative ranking strategy over the entire graph:
•$A_{ij} = \begin{cases} 1, & M_{ij} \ge \mathrm{Rank}_{\varepsilon n^2}(M) \\ 0, & \text{otherwise} \end{cases}$
•$\mathrm{Rank}_{\varepsilon n^2}(M)$ : returns the value of the $\varepsilon n^2$-th largest entry in the metric matrix M
•$n$ : the number of nodes
•$\varepsilon$ : controls the overall sparsity of the generated graph
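A small sketch of the ε-sparsification step follows; the function name sparsify is illustrative, and ties at the threshold are assumed to be kept.
```python
# Sketch of graph sparsification via epsilon-sparseness.
import torch

def sparsify(m: torch.Tensor, eps: float = 0.5) -> torch.Tensor:
    # m: (n, n) metric matrix M; eps controls the overall sparsity.
    n = m.size(0)
    k = max(1, int(eps * n * n))      # only the top eps * n^2 entries survive
    threshold = torch.topk(m.flatten(), k).values[-1]   # Rank_{eps n^2}(M)
    return (m >= threshold).float()   # binary adjacency matrix A
```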
8. 8
Method
Interest-fusion Graph Convolutional Layer
They have learnable interest graphs that separate diverse interests.
The core interests and peripheral interests form large clusters and small clusters respectively, and different
types of interests form different clusters. To fuse weak signals into strong ones that accurately reflect
user preferences, they need to aggregate information in the constructed graph.
9. 9
Method
Interest fusion via graph attentive convolution
• $\vec{h}_i$ : the input node embedding ($I$ is the number of nodes)
• $\vec{h}_i'$ : the new node embedding after the layer
• $E_{ij}$ : an alignment score mapping the importance of target node $i$ onto its neighbor node $j$
• $\vec{h}_i' = \sigma(W_a \cdot \mathrm{Aggregate}(E_{ij} * \vec{h}_j \mid j \in \mathcal{N}_i) + \vec{h}_i)$
• Multi-head form: $\vec{h}_i' = \sigma(W_a^{\delta} \cdot \mathrm{Aggregate}(E_{ij}^{\delta} * \vec{h}_j \mid j \in \mathcal{N}_i) + \vec{h}_i)$
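Below is a minimal single-head sketch of the fusion equation above; using a neighbor sum as Aggregate and a sigmoid as σ are assumptions made for illustration.
```python
# Sketch of interest fusion via graph attentive convolution (one head).
import torch
import torch.nn as nn

class InterestFusion(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.w_a = nn.Linear(dim, dim, bias=False)  # W_a

    def forward(self, h: torch.Tensor, e: torch.Tensor) -> torch.Tensor:
        # h: (n, dim) node embeddings; e: (n, n) attention weights E_ij,
        # assumed already masked by A and softmax-normalized per row.
        agg = e @ h                               # sum_j E_ij * h_j over N_i
        return torch.sigmoid(self.w_a(agg) + h)   # residual connection + sigma
```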
10. 10
Method
Cluster- and query-aware attention
• $\alpha_i = \mathrm{Attention}_c(W_c \vec{h}_i \parallel \vec{h}_{i_c} \parallel W_c \vec{h}_i \odot \vec{h}_{i_c})$
• $\mathrm{Attention}_c$ : a two-layer feedforward neural network with LeakyReLU as the activation function
• $\beta_j = \mathrm{Attention}_q(W_q \vec{h}_j \parallel \vec{h}_t \parallel W_q \vec{h}_j \odot \vec{h}_t)$
• $\mathrm{Attention}_q$ : a two-layer feedforward neural network applying the LeakyReLU nonlinearity
• $E_{ij} = \mathrm{softmax}_j(\alpha_i + \beta_j) = \frac{\exp(\alpha_i + \beta_j)}{\sum_{k \in \mathcal{N}_i} \exp(\alpha_i + \beta_k)}$
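The two scores can be wired together as in the sketch below; how the cluster embedding $\vec{h}_{i_c}$ is built (e.g. averaging a node's neighborhood) is left outside the sketch, and all layer names are assumptions.
```python
# Sketch of cluster- and query-aware attention producing E_ij.
import torch
import torch.nn as nn

class ClusterQueryAttention(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.w_c = nn.Linear(dim, dim, bias=False)  # W_c
        self.w_q = nn.Linear(dim, dim, bias=False)  # W_q
        # Two-layer feedforward nets with LeakyReLU, yielding scalar scores.
        self.att_c = nn.Sequential(nn.Linear(3 * dim, dim), nn.LeakyReLU(),
                                   nn.Linear(dim, 1))
        self.att_q = nn.Sequential(nn.Linear(3 * dim, dim), nn.LeakyReLU(),
                                   nn.Linear(dim, 1))

    def forward(self, h, h_c, h_t, adj):
        # h: (n, d) nodes; h_c: (n, d) cluster embedding per node;
        # h_t: (d,) target/query item; adj: (n, n) binary adjacency
        # (assumed to include self-loops so every row has a neighbor).
        wh = self.w_c(h)
        alpha = self.att_c(torch.cat([wh, h_c, wh * h_c], dim=-1))  # (n, 1)
        wq = self.w_q(h)
        ht = h_t.expand_as(wq)
        beta = self.att_q(torch.cat([wq, ht, wq * ht], dim=-1))     # (n, 1)
        scores = alpha + beta.transpose(0, 1)     # [i, j] = alpha_i + beta_j
        scores = scores.masked_fill(adj == 0, float('-inf'))
        return torch.softmax(scores, dim=-1)      # E_ij over j in N_i
```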
11. 11
Method
Interest-extraction Graph Pooling Layer
The fusion of implicit interest signals into explicit interest signals is completed by performing information
aggregation on the interest graph.
They use graph pooling to further extract the fused information.
The aim is to downsize the graph: loose interests are transformed into tight interests while their
distribution is maintained.
12. 12
Method
Interest extraction via graph pooling
• By applying a soft cluster assignment matrix S, node information can be pooled into cluster information
• $\{\vec{h}_1^*, \vec{h}_2^*, \ldots, \vec{h}_m^*\} = S^{\top} \{\vec{h}_1', \vec{h}_2', \ldots, \vec{h}_n'\}$
• $\{\gamma_1^*, \gamma_2^*, \ldots, \gamma_m^*\} = S^{\top} \{\gamma_1, \gamma_2, \ldots, \gamma_n\}$
• $\gamma_i$ : obtained by applying softmax on $\beta_i$
• $S_{i:} = \mathrm{softmax}(W_p \cdot \mathrm{Aggregate}(A_{ij} * \vec{h}_j' \mid j \in \mathcal{N}_i))$
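A sketch of how the assignment matrix and the two pooling equations fit together; the module name and the reuse of the $\beta_i$ scores from the attention layer are assumptions for illustration.
```python
# Sketch of interest extraction via soft graph pooling.
import torch
import torch.nn as nn

class InterestPooling(nn.Module):
    def __init__(self, dim: int, num_clusters: int):
        super().__init__()
        self.w_p = nn.Linear(dim, num_clusters, bias=False)  # W_p

    def forward(self, h, a, beta):
        # h: (n, d) fused node embeddings h'; a: (n, n) adjacency matrix A;
        # beta: (n, 1) query-aware scores beta_i from the attention layer.
        agg = a @ h                               # Aggregate(A_ij * h_j')
        s = torch.softmax(self.w_p(agg), dim=-1)  # S: (n, m) soft assignment
        gamma = torch.softmax(beta, dim=0)        # gamma_i = softmax(beta_i)
        h_star = s.transpose(0, 1) @ h            # (m, d) cluster embeddings
        gamma_star = s.transpose(0, 1) @ gamma    # (m, 1) cluster scores
        return h_star, gamma_star
```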