SlideShare a Scribd company logo
1 of 152
Download to read offline
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
SMORe :
Modularize Graph Embedding for Recommendation
1
13TH ACM CONFERENCE ON RECOMMENDER SYSTEMS
COPENHAGEN, DENMARK. 16-20 SEPTEMBER 2019

TUTORIAL: GRAPH EMBEDDING (11:00-12:30)
Chih-Ming Chen

(CM)
Ting-Hsiang Wang

(Sean)
Chuan-Ju Wang

(Jennifer)
Ming-Feng Tsai

(Victor)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Lecture (Sean & CM, 65 minutes)
Hands-on (CM, 15 minutes)
Q&A (Sean & CM, 10 minutes)
2
QR to Slides, Codes, Abstract
Tutorial Agenda
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
3
DeepWalk: Online Learning of Social Representations
Bryan Perozzi
Stony Brook University
Department of Computer
Science
Rami Al-Rfou
Stony Brook University
Department of Computer
Science
Steven Skiena
Stony Brook University
Department of Computer
Science
{bperozzi, ralrfou, skiena}@cs.stonybrook.edu
ABSTRACT
We present DeepWalk, a novel approach for learning la-
tent representations of vertices in a network. These latent
representations encode social relations in a continuous vector
ace, which is easily exploited by statistical models. Deep-
eralizes recent advancements in language mod-
ised feature learning (or deep learning)
graphs.
obtained from trun-
tions by treat-
trate
(a) Input: Karate Graph
(b) Output: Representation
Figure 1: Our proposed method learns a latent space rep-
resentation of social interactions in R
d . The learned rep-
resentation encodes community structure so it can be eas-
oited by standard classification methods. Here, our
on Zachary’s Karate network [44] to gen-
tion in R
2 . Note the correspon-
re in the input graph and
odularity-based
1
?
?
?
?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Lecture Agenda
4
Q0. Recommendation (REC) and challenges
Q1. Why graph embedding (GE) for REC
Q2. SMORe modularization of GE and benefits
Q3. Exemplar structural modeling for REC
Q4. REC using SMORe
(Sean)
(Sean)
(Sean)
(CM)
(CM)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Lecture Agenda
5
Q0. Recommendation (REC) and challenges
Q1. Why graph embedding (GE) for REC
Q2. SMORe modularization of GE and benefits
Q3. Exemplar structural modeling for REC
Q4. REC using SMORe
(Sean)
(Sean)
(Sean)
(CM)
(CM)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
6
REC systems are everywhere !
Movie
Music
Book, Toy, Electronics, etc. Friend, Interest Group
Recruiter, Job Seeker
Local Business
Trend, Following
News, Ad, Media, etc.
And many more …
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
• REC center around users to provide customized results
• CF assumes people agree on things are likely to agree on other things
7
Let’s start with Collaborative Filtering (CF)
User Item
Sam
Liz
Derek
5
4
3
5
4
Song A
Song B
Song A Song B Song C
Sam 5 4 4
Derek 4 5 ?
Liz - 3 ?
Roger - - ?
☞
→ Sam and Derek have similar tastes; therefore,
if Sam likes Song C, it is likely Derek does, too
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
8
Let’s start with Collaborative Filtering (CF)
User Item
Sam
Liz
Derek
5
4
3
5
4
Song A
Song B
• REC center around users to provide customized results
• CF assumes people agree on things are likely to agree on other things
Song A Song B Song C
Sam 5 4 4
Derek 4 5 ?
Liz - 3 ?
Roger - - ?
☞
→ Sam and Derek have similar tastes; therefore,
if Sam likes Song C, it is likely Derek does, too
Collaborate
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
9
Let’s start with Collaborative Filtering (CF)
User Item
Sam
Liz
Derek
5
4
3
5
4
Song A
Song B
4
Song C
• REC center around users to provide customized results
• CF assumes people agree on things are likely to agree on other things
Song A Song B Song C
Sam 5 4 4
Derek 4 5 ?
Liz - 3 ?
Roger - - ?
☞
→ Sam and Derek have similar tastes; therefore,
if Sam likes Song C, it is likely Derek does, too
Collaborate
Filter
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
10
Let’s start with Collaborative Filtering (CF)
User Item
Sam
Liz
Derek
5
4
3
5
4
Song A
Song B
Song A Song B Song C
Sam 5 4 4
Derek 4 5 ?
Liz - 3 ?
Roger - - ?
☞
→ Sam and Derek have similar tastes; therefore,
if Sam likes Song C, it is likely Derek does, too
4
Song C
Collaborate
Filter
Recommend
• REC center around users to provide customized results
• CF assumes people agree on things are likely to agree on other things
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
11
Challenge (1) : Data Sparsity
User Item
Sam
Liz
Derek
5
4
3
5
4
Song A
Song B
• When data sparsity occurs, it is difficult to make accurate REC with CF
Song A Song B Song C
Sam 5 4 4
Derek 4 5 ?
Liz - 3 ?
Roger - - ?
☞
→ Data sparsity occurs to Liz as there is insufficient
ratings from Liz to compare her taste with others’
Collaborate
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
12
Challenge (1) : Data Sparsity
User Item
Sam
Liz
Derek
5
4
3
5
4
Song A
Song B
• When data sparsity occurs, it is difficult to make accurate REC with CF
4
Song C
Song A Song B Song C
Sam 5 4 4
Derek 4 5 ?
Liz - 3 ?
Roger - - ?
☞
→ Data sparsity occurs to Liz as there is insufficient
ratings from Liz to compare her taste with others’
Collaborate
Filter
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
13
Challenge (1) : Data Sparsity
User Item
Sam
Liz
Derek
5
4
3
5
4
Song A
Song B
• When data sparsity occurs, it is difficult to make accurate REC with CF
4
Song C
Song A Song B Song C
Sam 5 4 4
Derek 4 5 ?
Liz - 3 ?
Roger - - ?
☞
→ Data sparsity occurs to Liz as there is insufficient
ratings from Liz to compare her taste with others’
Collaborate
Filter
Recommend
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
14
Challenge (2) : Cold Start
User Item
Sam
Liz
Derek
5
4
3
5
4
Song A
Song B
• When cold start occurs, it is impossible to conduct CF since there is no context

for comparison
Song A Song B Song C
Sam 5 4 4
Derek 4 5 ?
Liz - 3 ?
Roger - - ?☞
→ Cold Start occurs to Roger as there is NO song
rating from Roger to compare his taste with others’
Roger
?
?
?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
15
Challenge (3) : Constant Cold Start
• Some Cold Start situations are constant, i.e., never warm up
• E.g., event tickets are sold before user-event interactions can occur
November 2, 2019 November 16, 2019 October 16, 2019
→ Today is September 19, 2019
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
16
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
17
Common Solution : Add Attributes
→ REC systems often mitigate Complete
Cold Start problems by requiring new
users to specify interests and items
be labeled using tags
→ By requiring user interests and item tags
to come from predefined label set, users
and items share context for comparison
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
18
To enrich context for comparison …
Derek
User Item
Sam
Liz
5
4
3
5
4
Song A
Song B
• For additional context for comparison, REC systems often borrow auxiliary

information, such as item content and user and item attributes
• This forms a Heterogeneous Information Network (HIN)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
19
Derek
User Item Attribute
Sam
Liz Singer A
5
4
3
5
4
Song A
Song B
Metal
Rock
Genre, Artist
• For additional context for comparison, REC systems often borrow auxiliary

information, such as item content and user and item attributes
• This forms a Heterogeneous Information Network (HIN)
Add auxiliary information …
→ Item attributes are added to improve
item comparison, e.g., music genre,

artist, metadata, etc.
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
20
Metal
User Item AttributeAttribute
Sam
Liz
Rock
Working
Student
Singer A
5
4
3
5
4
Song A
Song B
Derek
Social NetworkOccupation
• For additional context for comparison, REC systems often borrow auxiliary

information, such as item content and user and item attributes
• This forms a Heterogeneous Information Network (HIN)
Genre, Artist
And more auxiliary information !
→ Item attributes are added to improve
item comparison, e.g., music genre,

artist, metadata, etc.

→ User attributes are added to improve
user comparison, e.g., social network,

occupation, favorite artist & genre, etc.
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
21
Metal
• For additional context for comparison, REC systems often borrow auxiliary

information, such as item content and user and item attributes
• This forms a Heterogeneous Information Network (HIN)
User Item AttributeAttribute
Sam
Liz
Rock
Working
Student
Singer A
5
4
3
5
4
Song A
Song B
→ Item attributes are added to improve
item comparison, e.g., music genre,

artist, metadata, etc.

→ User attributes are added to improve
user comparison, e.g., social network,

occupation, favorite artist & genre, etc.
→ HIN gives holistic view of complex systems
Derek
Graph coordinates all relations !
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Lecture Agenda
22
Q0. Recommendation (REC) and challenges
Q1. Why graph embedding (GE) for REC
Q2. SMORe modularization of GE and benefits
Q3. Exemplar structural modeling for REC
Q4. REC using SMORe
(Sean)
(Sean)
(Sean)
(CM)
(CM)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
23
Why Graph ?
1. Universal language for describing complex data [1]
• Many fields have all chosen graph to depict entity interactions
Twitter Ego-Network
Delta Airline Route Map NetworkCore Sound Food Web
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
24
1. Universal language for describing complex data [1]
• Many fields have all chosen graph to depict entity interactions
2. Shared vocabulary (therefore ideas) between fields [1]
• E.g., distributional hypothesis in linguistics vs. CF in REC systems
User Item
Why Graph ?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
25
1. Universal language for describing complex data [1]
• Many fields have all chosen graph to depict entity interactions
2. Shared vocabulary (therefore ideas) between fields [1]
• E.g., distributional hypothesis in linguistics vs. CF in REC systems
3. Holistic view of complex systems of interactions
• Different domains can easily connect and be jointly mined
User Item Attribute
→ HINs easily integrates information
from different domains
Why Graph ?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
26
1. Universal language for describing complex data [1]
• Many fields have all chosen graph to depict entity interactions
2. Shared vocabulary (therefore ideas) between fields [1]
• E.g., distributional hypothesis in linguistics vs. CF in REC systems
3. Holistic view of complex systems of interactions
• Different domains can easily connect and be jointly mined
4. General definition of contexts used for entity comparison
• Model graph structure instead specific types of relations
User Item Attribute
→ Different REC approaches, similar
structure
CF CBF
Why Graph ?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
1. Universal language for describing complex data [1]
• Many fields have all chosen graph to depict entity interactions
2. Shared vocabulary (therefore ideas) between fields [1]
• E.g., distributional hypothesis in linguistics vs. CF in REC systems
3. Holistic view of complex systems of interactions
• Different domains can easily connect and be jointly mined
4. General definition of contexts used for entity comparison
• Model graph structure instead specific types of relations
5. Sophisticated models available from network-related researches
• Network schema, meta-path, subgraph matching, information propagation, etc.
27
(a) ACM data (b) DBLP data
Figure 3: Examples of bibliographic network
schema.
distinct semantics under different paths will lead to different
relatedness. The relatedness under APVC path emphasizes
on the conferences that authors participated, while the relat-
edness under APSPVC path emphasizes on conferences pub-
lishing the papers that have the same subjects with authors’
papers. For example, assume most papers of an author are
published in the KDD, SIGMOD, and VLDB. However, the
papers having the same subjects with the author’s papers
may be published in more wide conferences, such as ICDM,
Figu
In m
jects
lar p
As a
ence
gene
to th
resea
man
mor
Star-schema [2]
Meta-path [3]
Why Graph ?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
1. Universal language for describing complex data [1]
• Many fields have all chosen graph to depict entity interactions
2. Shared vocabulary (therefore ideas) between fields [1]
• E.g., distributional hypothesis in linguistics vs. CF in REC systems
3. Holistic view of complex systems of interactions
• Different domains can easily connect and be jointly mined
4. General definition of contexts used for entity comparison
• Model graph structure instead specific types of relations
5. Sophisticated models available from network-related researches
• Network schema, meta-path, subgraph matching, information propagation, etc.
28
Why Graph ?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
29
REC as Link Prediction on Graphs
?
?
?
?
✘
✔
✔
✘
Machine Learning
Algorithm
• Many ways to compare node similarity: entity types, shared neighbors, distance, etc.
• Early REC models, e.g., CF and CBF, define intuitive relations for similarity measurement
• As models mature and data diversity skyrockets nowadays, designing features

for REC becomes increasingly challenging
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
30
Graph Embedding Pipeline
Hidden layer
Input layer Output layer
Supply Graph Return Embeddings (from hidden Layer)Train Neural Network
eepWalk: Online Learning of Social Representations
Bryan Perozzi
Stony Brook University
Department of Computer
Science
Rami Al-Rfou
Stony Brook University
Department of Computer
Science
Steven Skiena
Stony Brook University
Department of Computer
Science
{bperozzi, ralrfou, skiena}@cs.stonybrook.edu
ABSTRACT
We present DeepWalk, a novel approach for learning la-
tent representations of vertices in a network. These latent
representations encode social relations in a continuous vector
space, which is easily exploited by statistical models. Deep-
k generalizes recent advancements in language mod-
supervised feature learning (or deep learning)
ds to graphs.
mation obtained from trun-
esentations by treat-
emonstrate
el
(a) Input: Karate Graph
(b) Output: Representation
Figure 1: Our proposed method learns a latent space rep-
resentation of social interactions in R
d . The learned rep-
resentation encodes community structure so it can be eas-
ly exploited by standard classification methods. Here, our
used on Zachary’s Karate network [44] to gen-
resentation in R
2 . Note the correspon-
tructure in the input graph and
ent a modularity-based
Rami Al-Rfou
y Brook University
tment of Computer
Science
Steven Skiena
Stony Brook University
Department of Computer
Science
u, skiena}@cs.stonybrook.edu
ing la-
latent
vector
Deep-
e mod-
rning)
m trun-
y treat-
nstrate
ti-label
s Blog-
Deep-
allowed
ence of
1
2
3
4
5
6
7
8
9
11
12
13
14
18
20
22
32
31
10
28
29
33
17
34
15
16
19
21
23
24
26
30
25
27
1
2
3
4
5
6
7
8
9
11
12
13
14
18
20
22
32
31
10
28
29
33
17
34
15
16
19
21
23
24
26
30
25
27
(a) Input: Karate Graph (b) Output: Representation
Figure 1: Our proposed method learns a latent space rep-
resentation of social interactions in Rd
. The learned rep-
resentation encodes community structure so it can be eas-
ily exploited by standard classification methods. Here, our
method is used on Zachary’s Karate network [44] to gen-
erate a latent representation in R2
. Note the correspon-
dence between community structure in the input graph and
• GE for REC exploits observed links as graph structures to predict unobserved links
• Entities are converted into spatial features (embeddings) in the hidden layer and

iteratively updated such that their interactions approximate values in the output layer
• Entity relatedness is preserved as spatial properties, e.g., distance and angle
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
31
Why Embedding ?
1. Efficient retrieval from approximate nearest neighbor (ANN) search methods
• E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining

a binary tree of subdivisions such that good enough results can be found in [4]
❌Query
O(log n)<latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit>
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
32
1. Efficient retrieval from approximate nearest neighbor (ANN) search methods
• E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining

a binary tree of subdivisions such that good enough results can be found in [4]
→ Brute force exhaustive search finds the closest nearest neighbors in
❌Query
O(log n)<latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit>
Why Embedding ?
O(log n!)<latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit>
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
33
1. Efficient retrieval from approximate nearest neighbor (ANN) search methods
• E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining

a binary tree of subdivisions such that good enough results can be found in [4]
❌Query
O(log n)<latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit>
Why Embedding ?
→ Brute force exhaustive search finds the closest nearest neighbors in O(log n!)<latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit>
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
34
1. Efficient retrieval from approximate nearest neighbor (ANN) search methods
• E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining

a binary tree of subdivisions such that good enough results can be found in [4]
❌Query
→ Spotify ANNOY builds binary tree of subdivisions to quickly find closest neighbors
O(log n)<latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit>
Why Embedding ?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
35
1. Efficient retrieval from approximate nearest neighbor (ANN) search methods
• E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining

a binary tree of subdivisions such that good enough results can be found in [4]
❌Query
O(log n)<latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit>
Why Embedding ?
→ Spotify ANNOY builds binary tree of subdivisions to quickly find closest neighbors
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
36
1. Efficient retrieval from approximate nearest neighbor (ANN) search methods
• E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining

a binary tree of subdivisions such that good enough results can be found in [4]
2. Efficient pairwise comparison due to dimensionality reduction (DR)
• Lower dimension costs less to calculate feature similarity
O(log n)<latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit>
Why Embedding ?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
37
1. Efficient retrieval from approximate nearest neighbor (ANN) search methods
• E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining

a binary tree of subdivisions such that good enough results can be found in [4]
2. Efficient pairwise comparison due to dimensionality reduction (DR)
• Lower dimension costs less to calculate feature similarity
3. Reduced space complexity due to DR
• Lower dimension costs less storage space
O(log n)<latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit>
Why Embedding ?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
38
1. Efficient retrieval from approximate nearest neighbor (ANN) search methods
• E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining

a binary tree of subdivisions such that good enough results can be found in [4]
2. Efficient pairwise comparison due to dimensionality reduction (DR)
• Lower dimension costs less to calculate feature similarity
3. Reduced space complexity due to DR
• Lower dimension costs less storage space
4. Transfer learning with pertained embeddings
• Pretrained embeddings are better guesses than randomly initialized vectors
O(log n)<latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit>
Why Embedding ?
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
39
GE for REC : Challenges
• So graph embedding is GREAT for recommendation :
• Reduces data sparsity and cold start via integrating auxiliary information
• Provides holistic view of REC problem and jointly mines different relations in
terms of graph structures
• Trains fast, compares fast, and retrieves fast while taking less space
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
40
• So graph embedding is GREAT for recommendation :
• Reduces data sparsity and cold start via integrating auxiliary information
• Provides holistic view of REC problem and jointly mines different relations in
terms of graph structures
• Trains fast, compares fast, and retrieves fast while taking less space

• What’s the catch ?
GE for REC : Challenges
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Lecture Agenda
41
Q0. Recommendation (REC) and challenges
Q1. Why graph embedding (GE) for REC
Q2. SMORe modularization of GE and benefits
Q3. Exemplar structural modeling for REC
Q4. REC using SMORe
(Sean)
(Sean)
(Sean)
(CM)
(CM)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
42
Embedding space
Let’s look at GE process …
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Let’s look at GE process …
43
…
Graph structures
Embedding space
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Let’s look at GE process …
…
44
Samples relation
A
B
BA
Graph structures
A B
Embedding space
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Let’s look at GE process …
…
B
A
45
Samples relation
A
B
BA
Graph structures
Maps entities to space
Embedding space
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Let’s look at GE process …
…
46
Samples relation
A
B
BA
Embedding space
Graph structures
A
B
Maps entities to space
Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Challenge (1) : Select structure is HARD
A
B
47
Embedding space
Graph structures
Maps entities to spaceB
Samples relation
A
BA
…
→ Unsure which graph structures
best model current REC task
Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Challenge (1) : Select structure is HARD
A
B
48
Embedding space
Graph structures
Maps entities to spaceB
Samples relation
A
BA
…
5
8 7
6
2
3
1
4
In bipartite, model & by
neighborhood is intuitive; but in
planar, & are also shown
similar in betweenness
g j
5 7
→ Unsure which graph structures
best model current REC task
Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Challenge (1) : Select structure is HARD
A
B
49
Embedding space
Graph structures
Maps entities to spaceB
Samples relation
A
BA
…
5
8 7
6
2
3
1
4
In bipartite, model & by
neighborhood is intuitive; but in
planar, & are also shown
similar in betweenness
g j
5 7
→ Unsure which graph structures
best model current REC task
Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Challenge (2) : Customize GE model
A
B
50
Embedding space
Graph structures
Maps entities to spaceB
Samples relation
A
BA
…
→ Wanna customize GE methods
during each stage of training
→ Unsure which graph structures
best model current REC task
Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Challenge (3) : Make fair comparison
A
B
51
Embedding space
Graph structures
Maps entities to spaceB
Samples relation
A
BA
…
→ Wanna customize GE methods
during each stage of training
→ Difficult to compare models as
their implementations often vary
→ Unsure which graph structures
best model current REC task
Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Solution : Modularize GE for adaptability !
52
…
Samples relation
BA
Embedding space
Graph structures
A
B
Maps entities to space
Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
…
Samples relation
BA
Embedding space
Graph structures
A
B
Maps entities to space
Optimizes distance
53
Sampler
Mapper
Optimizer
Solution : Modularize GE for adaptability !
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Solution : Modularize GE for adaptability !
A
B
54
…
Samples relation
BA
Embedding space
Graph structures
Maps entities to space
Optimizes distance
Sampler
Mapper
Optimizer
•Extracts graph structures from dataset
while remains type-agnostic to sampled
entities, i.e., nodes & edges
E : {(v1, v2)|(v1, v2) 2 V ⇥ V }<latexit sha1_base64="Tguco8yNKzLzF1xI2NN30+8zm+U=">AAACIHicbZBNS8NAEIY39avWr6hHL4utUEFK0kvFU1EEjxXsBzQhbLabdulmE3Y3hRLzU7z4V7x4UERv+mvctjlodWDZh3dmmJnXjxmVyrI+jcLK6tr6RnGztLW9s7tn7h90ZJQITNo4YpHo+UgSRjlpK6oY6cWCoNBnpOuPr2b57oQISSN+p6YxcUM05DSgGCkteWajcn3hpNWJl9rZGdRfPTuF93BJcCiHHegoGhI5g6zimWWrZs0D/gU7hzLIo+WZH84gwklIuMIMSdm3rVi5KRKKYkaykpNIEiM8RkPS18iRHuWm8wMzeKKVAQwioR9XcK7+7EhRKOU09HVliNRILudm4n+5fqKCczelPE4U4XgxKEgYVBGcuQUHVBCs2FQDwoLqXSEeIYGw0p6WtAn28sl/oVOv2Zpv6+XmZW5HERyBY1AFNmiAJrgBLdAGGDyAJ/ACXo1H49l4M94XpQUj7zkEv8L4+gaxJqAz</latexit><latexit sha1_base64="Tguco8yNKzLzF1xI2NN30+8zm+U=">AAACIHicbZBNS8NAEIY39avWr6hHL4utUEFK0kvFU1EEjxXsBzQhbLabdulmE3Y3hRLzU7z4V7x4UERv+mvctjlodWDZh3dmmJnXjxmVyrI+jcLK6tr6RnGztLW9s7tn7h90ZJQITNo4YpHo+UgSRjlpK6oY6cWCoNBnpOuPr2b57oQISSN+p6YxcUM05DSgGCkteWajcn3hpNWJl9rZGdRfPTuF93BJcCiHHegoGhI5g6zimWWrZs0D/gU7hzLIo+WZH84gwklIuMIMSdm3rVi5KRKKYkaykpNIEiM8RkPS18iRHuWm8wMzeKKVAQwioR9XcK7+7EhRKOU09HVliNRILudm4n+5fqKCczelPE4U4XgxKEgYVBGcuQUHVBCs2FQDwoLqXSEeIYGw0p6WtAn28sl/oVOv2Zpv6+XmZW5HERyBY1AFNmiAJrgBLdAGGDyAJ/ACXo1H49l4M94XpQUj7zkEv8L4+gaxJqAz</latexit><latexit sha1_base64="Tguco8yNKzLzF1xI2NN30+8zm+U=">AAACIHicbZBNS8NAEIY39avWr6hHL4utUEFK0kvFU1EEjxXsBzQhbLabdulmE3Y3hRLzU7z4V7x4UERv+mvctjlodWDZh3dmmJnXjxmVyrI+jcLK6tr6RnGztLW9s7tn7h90ZJQITNo4YpHo+UgSRjlpK6oY6cWCoNBnpOuPr2b57oQISSN+p6YxcUM05DSgGCkteWajcn3hpNWJl9rZGdRfPTuF93BJcCiHHegoGhI5g6zimWWrZs0D/gU7hzLIo+WZH84gwklIuMIMSdm3rVi5KRKKYkaykpNIEiM8RkPS18iRHuWm8wMzeKKVAQwioR9XcK7+7EhRKOU09HVliNRILudm4n+5fqKCczelPE4U4XgxKEgYVBGcuQUHVBCs2FQDwoLqXSEeIYGw0p6WtAn28sl/oVOv2Zpv6+XmZW5HERyBY1AFNmiAJrgBLdAGGDyAJ/ACXo1H49l4M94XpQUj7zkEv8L4+gaxJqAz</latexit><latexit sha1_base64="Tguco8yNKzLzF1xI2NN30+8zm+U=">AAACIHicbZBNS8NAEIY39avWr6hHL4utUEFK0kvFU1EEjxXsBzQhbLabdulmE3Y3hRLzU7z4V7x4UERv+mvctjlodWDZh3dmmJnXjxmVyrI+jcLK6tr6RnGztLW9s7tn7h90ZJQITNo4YpHo+UgSRjlpK6oY6cWCoNBnpOuPr2b57oQISSN+p6YxcUM05DSgGCkteWajcn3hpNWJl9rZGdRfPTuF93BJcCiHHegoGhI5g6zimWWrZs0D/gU7hzLIo+WZH84gwklIuMIMSdm3rVi5KRKKYkaykpNIEiM8RkPS18iRHuWm8wMzeKKVAQwioR9XcK7+7EhRKOU09HVliNRILudm4n+5fqKCczelPE4U4XgxKEgYVBGcuQUHVBCs2FQDwoLqXSEeIYGw0p6WtAn28sl/oVOv2Zpv6+XmZW5HERyBY1AFNmiAJrgBLdAGGDyAJ/ACXo1H49l4M94XpQUj7zkEv8L4+gaxJqAz</latexit>
: E ! R<latexit sha1_base64="j+ipPQXWS0Htqsw+lExijt/CRa0=">AAACAXicbZDLSgMxFIYz9VbrbdSN4CbYCq7KTDeKq6IILqvYC3SGkkkzndBMMiQZpQx146u4caGIW9/CnW9j2s5CW38IfPznHE7OHySMKu0431ZhaXllda24XtrY3NresXf3WkqkEpMmFkzIToAUYZSTpqaakU4iCYoDRtrB8HJSb98Tqajgd3qUED9GA05DipE2Vs8+qHhJROE5vIKepINIIynFA7yt9OyyU3Wmgovg5lAGuRo9+8vrC5zGhGvMkFJd10m0nyGpKWZkXPJSRRKEh2hAugY5ionys+kFY3hsnD4MhTSPazh1f09kKFZqFAemM0Y6UvO1iflfrZvq8MzPKE9STTieLQpTBrWAkzhgn0qCNRsZQFhS81eIIyQR1ia0kgnBnT95EVq1qmv4plauX+RxFMEhOAInwAWnoA6uQQM0AQaP4Bm8gjfryXqx3q2PWWvBymf2wR9Znz87EpV0</latexit><latexit sha1_base64="j+ipPQXWS0Htqsw+lExijt/CRa0=">AAACAXicbZDLSgMxFIYz9VbrbdSN4CbYCq7KTDeKq6IILqvYC3SGkkkzndBMMiQZpQx146u4caGIW9/CnW9j2s5CW38IfPznHE7OHySMKu0431ZhaXllda24XtrY3NresXf3WkqkEpMmFkzIToAUYZSTpqaakU4iCYoDRtrB8HJSb98Tqajgd3qUED9GA05DipE2Vs8+qHhJROE5vIKepINIIynFA7yt9OyyU3Wmgovg5lAGuRo9+8vrC5zGhGvMkFJd10m0nyGpKWZkXPJSRRKEh2hAugY5ionys+kFY3hsnD4MhTSPazh1f09kKFZqFAemM0Y6UvO1iflfrZvq8MzPKE9STTieLQpTBrWAkzhgn0qCNRsZQFhS81eIIyQR1ia0kgnBnT95EVq1qmv4plauX+RxFMEhOAInwAWnoA6uQQM0AQaP4Bm8gjfryXqx3q2PWWvBymf2wR9Znz87EpV0</latexit><latexit sha1_base64="j+ipPQXWS0Htqsw+lExijt/CRa0=">AAACAXicbZDLSgMxFIYz9VbrbdSN4CbYCq7KTDeKq6IILqvYC3SGkkkzndBMMiQZpQx146u4caGIW9/CnW9j2s5CW38IfPznHE7OHySMKu0431ZhaXllda24XtrY3NresXf3WkqkEpMmFkzIToAUYZSTpqaakU4iCYoDRtrB8HJSb98Tqajgd3qUED9GA05DipE2Vs8+qHhJROE5vIKepINIIynFA7yt9OyyU3Wmgovg5lAGuRo9+8vrC5zGhGvMkFJd10m0nyGpKWZkXPJSRRKEh2hAugY5ionys+kFY3hsnD4MhTSPazh1f09kKFZqFAemM0Y6UvO1iflfrZvq8MzPKE9STTieLQpTBrWAkzhgn0qCNRsZQFhS81eIIyQR1ia0kgnBnT95EVq1qmv4plauX+RxFMEhOAInwAWnoA6uQQM0AQaP4Bm8gjfryXqx3q2PWWvBymf2wR9Znz87EpV0</latexit><latexit sha1_base64="j+ipPQXWS0Htqsw+lExijt/CRa0=">AAACAXicbZDLSgMxFIYz9VbrbdSN4CbYCq7KTDeKq6IILqvYC3SGkkkzndBMMiQZpQx146u4caGIW9/CnW9j2s5CW38IfPznHE7OHySMKu0431ZhaXllda24XtrY3NresXf3WkqkEpMmFkzIToAUYZSTpqaakU4iCYoDRtrB8HJSb98Tqajgd3qUED9GA05DipE2Vs8+qHhJROE5vIKepINIIynFA7yt9OyyU3Wmgovg5lAGuRo9+8vrC5zGhGvMkFJd10m0nyGpKWZkXPJSRRKEh2hAugY5ionys+kFY3hsnD4MhTSPazh1f09kKFZqFAemM0Y6UvO1iflfrZvq8MzPKE9STTieLQpTBrWAkzhgn0qCNRsZQFhS81eIIyQR1ia0kgnBnT95EVq1qmv4plauX+RxFMEhOAInwAWnoA6uQQM0AQaP4Bm8gjfryXqx3q2PWWvBymf2wR9Znz87EpV0</latexit>
G = (V, E, R, )<latexit sha1_base64="hfH2qiOkdhHaLO2qsmwczf37cp4=">AAAB/nicbZDLSgMxFIbPeK31Niqu3ARboUIpM93oRiiK6LKKvUA7lEyatqGZzJBkhDIUfBU3LhRx63O4821M21lo6w+Bj/+cwzn5/YgzpR3n21paXlldW89sZDe3tnd27b39ugpjSWiNhDyUTR8rypmgNc00p81IUhz4nDb84dWk3nikUrFQPOhRRL0A9wXrMYK1sTr2Yf4GXaBCvYiui+i+iNrRgJ3mO3bOKTlToUVwU8hBqmrH/mp3QxIHVGjCsVIt14m0l2CpGeF0nG3HikaYDHGftgwKHFDlJdPzx+jEOF3UC6V5QqOp+3siwYFSo8A3nQHWAzVfm5j/1Vqx7p17CRNRrKkgs0W9mCMdokkWqMskJZqPDGAimbkVkQGWmGiTWNaE4M5/eRHq5ZJr+K6cq1ymcWTgCI6hAC6cQQVuoQo1IJDAM7zCm/VkvVjv1sesdclKZw7gj6zPH/r0kkY=</latexit><latexit sha1_base64="hfH2qiOkdhHaLO2qsmwczf37cp4=">AAAB/nicbZDLSgMxFIbPeK31Niqu3ARboUIpM93oRiiK6LKKvUA7lEyatqGZzJBkhDIUfBU3LhRx63O4821M21lo6w+Bj/+cwzn5/YgzpR3n21paXlldW89sZDe3tnd27b39ugpjSWiNhDyUTR8rypmgNc00p81IUhz4nDb84dWk3nikUrFQPOhRRL0A9wXrMYK1sTr2Yf4GXaBCvYiui+i+iNrRgJ3mO3bOKTlToUVwU8hBqmrH/mp3QxIHVGjCsVIt14m0l2CpGeF0nG3HikaYDHGftgwKHFDlJdPzx+jEOF3UC6V5QqOp+3siwYFSo8A3nQHWAzVfm5j/1Vqx7p17CRNRrKkgs0W9mCMdokkWqMskJZqPDGAimbkVkQGWmGiTWNaE4M5/eRHq5ZJr+K6cq1ymcWTgCI6hAC6cQQVuoQo1IJDAM7zCm/VkvVjv1sesdclKZw7gj6zPH/r0kkY=</latexit><latexit sha1_base64="hfH2qiOkdhHaLO2qsmwczf37cp4=">AAAB/nicbZDLSgMxFIbPeK31Niqu3ARboUIpM93oRiiK6LKKvUA7lEyatqGZzJBkhDIUfBU3LhRx63O4821M21lo6w+Bj/+cwzn5/YgzpR3n21paXlldW89sZDe3tnd27b39ugpjSWiNhDyUTR8rypmgNc00p81IUhz4nDb84dWk3nikUrFQPOhRRL0A9wXrMYK1sTr2Yf4GXaBCvYiui+i+iNrRgJ3mO3bOKTlToUVwU8hBqmrH/mp3QxIHVGjCsVIt14m0l2CpGeF0nG3HikaYDHGftgwKHFDlJdPzx+jEOF3UC6V5QqOp+3siwYFSo8A3nQHWAzVfm5j/1Vqx7p17CRNRrKkgs0W9mCMdokkWqMskJZqPDGAimbkVkQGWmGiTWNaE4M5/eRHq5ZJr+K6cq1ymcWTgCI6hAC6cQQVuoQo1IJDAM7zCm/VkvVjv1sesdclKZw7gj6zPH/r0kkY=</latexit><latexit sha1_base64="hfH2qiOkdhHaLO2qsmwczf37cp4=">AAAB/nicbZDLSgMxFIbPeK31Niqu3ARboUIpM93oRiiK6LKKvUA7lEyatqGZzJBkhDIUfBU3LhRx63O4821M21lo6w+Bj/+cwzn5/YgzpR3n21paXlldW89sZDe3tnd27b39ugpjSWiNhDyUTR8rypmgNc00p81IUhz4nDb84dWk3nikUrFQPOhRRL0A9wXrMYK1sTr2Yf4GXaBCvYiui+i+iNrRgJ3mO3bOKTlToUVwU8hBqmrH/mp3QxIHVGjCsVIt14m0l2CpGeF0nG3HikaYDHGftgwKHFDlJdPzx+jEOF3UC6V5QqOp+3siwYFSo8A3nQHWAzVfm5j/1Vqx7p17CRNRrKkgs0W9mCMdokkWqMskJZqPDGAimbkVkQGWmGiTWNaE4M5/eRHq5ZJr+K6cq1ymcWTgCI6hAC6cQQVuoQo1IJDAM7zCm/VkvVjv1sesdclKZw7gj6zPH/r0kkY=</latexit>
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Solution : Modularize GE for adaptability !
55
…
Samples relation
BA
Embedding space
Graph structures
A
B
Maps entities to space
Sampler
Mapper
Optimizer
•Converts entities into spatial features
via embedding stacking operations,

e.g, lookup, pooling (average, etc.)
f : (p) ! Rd
<latexit sha1_base64="VmJxJ04sPmQqHhZa1Uz3GKOc0Ps=">AAACEnicbZC7TsMwFIadcivlFmBksWiR2qVKuoCYKlgYC6IXqQmV4zqtVSeObAdURXkGFl6FhQGEWJnYeBucNgO0/JKlT/85Rz7n9yJGpbKsb6Owsrq2vlHcLG1t7+zumfsHHcljgUkbc8ZFz0OSMBqStqKKkV4kCAo8Rrre5DKrd++JkJSHt2oaETdAo5D6FCOlrYFZczw/qfjnsBrVoCPoaKyQEPwBOgFSY89LbtK7ZJjCSjowy1bdmgkug51DGeRqDcwvZ8hxHJBQYYak7NtWpNwECUUxI2nJiSWJEJ6gEelrDFFApJvMTkrhiXaG0OdCv1DBmft7IkGBlNPA053ZonKxlpn/1fqx8s/chIZRrEiI5x/5MYOKwywfOKSCYMWmGhAWVO8K8RgJhJVOsaRDsBdPXoZOo25rvm6Umxd5HEVwBI5BFdjgFDTBFWiBNsDgETyDV/BmPBkvxrvxMW8tGPnMIfgj4/MH38+c+w==</latexit><latexit sha1_base64="VmJxJ04sPmQqHhZa1Uz3GKOc0Ps=">AAACEnicbZC7TsMwFIadcivlFmBksWiR2qVKuoCYKlgYC6IXqQmV4zqtVSeObAdURXkGFl6FhQGEWJnYeBucNgO0/JKlT/85Rz7n9yJGpbKsb6Owsrq2vlHcLG1t7+zumfsHHcljgUkbc8ZFz0OSMBqStqKKkV4kCAo8Rrre5DKrd++JkJSHt2oaETdAo5D6FCOlrYFZczw/qfjnsBrVoCPoaKyQEPwBOgFSY89LbtK7ZJjCSjowy1bdmgkug51DGeRqDcwvZ8hxHJBQYYak7NtWpNwECUUxI2nJiSWJEJ6gEelrDFFApJvMTkrhiXaG0OdCv1DBmft7IkGBlNPA053ZonKxlpn/1fqx8s/chIZRrEiI5x/5MYOKwywfOKSCYMWmGhAWVO8K8RgJhJVOsaRDsBdPXoZOo25rvm6Umxd5HEVwBI5BFdjgFDTBFWiBNsDgETyDV/BmPBkvxrvxMW8tGPnMIfgj4/MH38+c+w==</latexit><latexit sha1_base64="VmJxJ04sPmQqHhZa1Uz3GKOc0Ps=">AAACEnicbZC7TsMwFIadcivlFmBksWiR2qVKuoCYKlgYC6IXqQmV4zqtVSeObAdURXkGFl6FhQGEWJnYeBucNgO0/JKlT/85Rz7n9yJGpbKsb6Owsrq2vlHcLG1t7+zumfsHHcljgUkbc8ZFz0OSMBqStqKKkV4kCAo8Rrre5DKrd++JkJSHt2oaETdAo5D6FCOlrYFZczw/qfjnsBrVoCPoaKyQEPwBOgFSY89LbtK7ZJjCSjowy1bdmgkug51DGeRqDcwvZ8hxHJBQYYak7NtWpNwECUUxI2nJiSWJEJ6gEelrDFFApJvMTkrhiXaG0OdCv1DBmft7IkGBlNPA053ZonKxlpn/1fqx8s/chIZRrEiI5x/5MYOKwywfOKSCYMWmGhAWVO8K8RgJhJVOsaRDsBdPXoZOo25rvm6Umxd5HEVwBI5BFdjgFDTBFWiBNsDgETyDV/BmPBkvxrvxMW8tGPnMIfgj4/MH38+c+w==</latexit><latexit sha1_base64="VmJxJ04sPmQqHhZa1Uz3GKOc0Ps=">AAACEnicbZC7TsMwFIadcivlFmBksWiR2qVKuoCYKlgYC6IXqQmV4zqtVSeObAdURXkGFl6FhQGEWJnYeBucNgO0/JKlT/85Rz7n9yJGpbKsb6Owsrq2vlHcLG1t7+zumfsHHcljgUkbc8ZFz0OSMBqStqKKkV4kCAo8Rrre5DKrd++JkJSHt2oaETdAo5D6FCOlrYFZczw/qfjnsBrVoCPoaKyQEPwBOgFSY89LbtK7ZJjCSjowy1bdmgkug51DGeRqDcwvZ8hxHJBQYYak7NtWpNwECUUxI2nJiSWJEJ6gEelrDFFApJvMTkrhiXaG0OdCv1DBmft7IkGBlNPA053ZonKxlpn/1fqx8s/chIZRrEiI5x/5MYOKwywfOKSCYMWmGhAWVO8K8RgJhJVOsaRDsBdPXoZOo25rvm6Umxd5HEVwBI5BFdjgFDTBFWiBNsDgETyDV/BmPBkvxrvxMW8tGPnMIfgj4/MH38+c+w==</latexit>
f : (p, q) ! Rd
<latexit sha1_base64="w+fc6rCobrgHcpaEWRxxo2tpDbs=">AAACFHicbZC7TsMwFIadcivlFmBksWiRikBV0gXEVMHCWBC9SE2oHMdprToXbAdURXkIFl6FhQGEWBnYeBucNgO0/JKlT/85Rz7ndyJGhTSMb62wsLi0vFJcLa2tb2xu6ds7bRHGHJMWDlnIuw4ShNGAtCSVjHQjTpDvMNJxRhdZvXNPuKBhcCPHEbF9NAioRzGSyurrR5bjJRXvDFaj47tDaHE6GErEefgALR/JoeMk1+lt4qawkvb1slEzJoLzYOZQBrmaff3LckMc+ySQmCEheqYRSTtBXFLMSFqyYkEihEdoQHoKA+QTYSeTo1J4oBwXeiFXL5Bw4v6eSJAvxNh3VGe2qJitZeZ/tV4svVM7oUEUSxLg6UdezKAMYZYQdCknWLKxAoQ5VbtCPEQcYalyLKkQzNmT56Fdr5mKr+rlxnkeRxHsgX1QBSY4AQ1wCZqgBTB4BM/gFbxpT9qL9q59TFsLWj6zC/5I+/wBMe6drA==</latexit><latexit sha1_base64="w+fc6rCobrgHcpaEWRxxo2tpDbs=">AAACFHicbZC7TsMwFIadcivlFmBksWiRikBV0gXEVMHCWBC9SE2oHMdprToXbAdURXkIFl6FhQGEWBnYeBucNgO0/JKlT/85Rz7ndyJGhTSMb62wsLi0vFJcLa2tb2xu6ds7bRHGHJMWDlnIuw4ShNGAtCSVjHQjTpDvMNJxRhdZvXNPuKBhcCPHEbF9NAioRzGSyurrR5bjJRXvDFaj47tDaHE6GErEefgALR/JoeMk1+lt4qawkvb1slEzJoLzYOZQBrmaff3LckMc+ySQmCEheqYRSTtBXFLMSFqyYkEihEdoQHoKA+QTYSeTo1J4oBwXeiFXL5Bw4v6eSJAvxNh3VGe2qJitZeZ/tV4svVM7oUEUSxLg6UdezKAMYZYQdCknWLKxAoQ5VbtCPEQcYalyLKkQzNmT56Fdr5mKr+rlxnkeRxHsgX1QBSY4AQ1wCZqgBTB4BM/gFbxpT9qL9q59TFsLWj6zC/5I+/wBMe6drA==</latexit><latexit sha1_base64="w+fc6rCobrgHcpaEWRxxo2tpDbs=">AAACFHicbZC7TsMwFIadcivlFmBksWiRikBV0gXEVMHCWBC9SE2oHMdprToXbAdURXkIFl6FhQGEWBnYeBucNgO0/JKlT/85Rz7ndyJGhTSMb62wsLi0vFJcLa2tb2xu6ds7bRHGHJMWDlnIuw4ShNGAtCSVjHQjTpDvMNJxRhdZvXNPuKBhcCPHEbF9NAioRzGSyurrR5bjJRXvDFaj47tDaHE6GErEefgALR/JoeMk1+lt4qawkvb1slEzJoLzYOZQBrmaff3LckMc+ySQmCEheqYRSTtBXFLMSFqyYkEihEdoQHoKA+QTYSeTo1J4oBwXeiFXL5Bw4v6eSJAvxNh3VGe2qJitZeZ/tV4svVM7oUEUSxLg6UdezKAMYZYQdCknWLKxAoQ5VbtCPEQcYalyLKkQzNmT56Fdr5mKr+rlxnkeRxHsgX1QBSY4AQ1wCZqgBTB4BM/gFbxpT9qL9q59TFsLWj6zC/5I+/wBMe6drA==</latexit><latexit sha1_base64="w+fc6rCobrgHcpaEWRxxo2tpDbs=">AAACFHicbZC7TsMwFIadcivlFmBksWiRikBV0gXEVMHCWBC9SE2oHMdprToXbAdURXkIFl6FhQGEWBnYeBucNgO0/JKlT/85Rz7ndyJGhTSMb62wsLi0vFJcLa2tb2xu6ds7bRHGHJMWDlnIuw4ShNGAtCSVjHQjTpDvMNJxRhdZvXNPuKBhcCPHEbF9NAioRzGSyurrR5bjJRXvDFaj47tDaHE6GErEefgALR/JoeMk1+lt4qawkvb1slEzJoLzYOZQBrmaff3LckMc+ySQmCEheqYRSTtBXFLMSFqyYkEihEdoQHoKA+QTYSeTo1J4oBwXeiFXL5Bw4v6eSJAvxNh3VGe2qJitZeZ/tV4svVM7oUEUSxLg6UdezKAMYZYQdCknWLKxAoQ5VbtCPEQcYalyLKkQzNmT56Fdr5mKr+rlxnkeRxHsgX1QBSY4AQ1wCZqgBTB4BM/gFbxpT9qL9q59TFsLWj6zC/5I+/wBMe6drA==</latexit>
Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Solution : Modularize GE for adaptability !
BA
56
…
Samples relation
Embedding space
Graph structures
A
B
Maps entities to space
Sampler
Mapper
Optimizer
•Preserves entity relatedness as spatial
properties with customizable similarity
metrics and loss functions
pture item j’s
entual location
ay from f(xj).
the function f,
2
.
Gaussian prior
n we have more
e that the func-
we simultane-
ibed earlier to
h other. Specif-
med by v⇤ and
levant to users’
that the items
d together and
ss-rated items.
) with dropout
!"
!#
1 2
1
2
!"
!#
1 2
1
2
Matrix	Factorization Collaborative	Metric	Learning
User Item
$#
$%
$"
$#
$%
$"
&# &"
$#
$"
$%
Figure 3: Example latent vector assignments for ma-
trix factorization and CML. The table on the right
shows user/item’s preference relationships.
3.5 Training Procedure
The complete objective function of the proposed model is
g
&# &"
$#
$"
$%
s for ma-
Dot Product Euclidean Distance
→ Euclidean distance keeps triangular
inequality; dot product does not [5] Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Solution : Modularize GE for adaptability !
B
A
57
…
Samples relation
BA
Embedding space
Graph structures
Maps entities to space
Sampler
Mapper
for
Recommendation
Optimizer
Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
SMORe : Modular GE toolkit for REC
B
A
58
…
Samples relation
BA
Embedding space
Graph structures
Maps entities to space
Sampler
Mapper
for
Recommendation
Optimizer
Optimizes distance
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
59
SMORe is a modular GE toolkit for REC S’more is a campfire treat with layers
SMORe : Modular GE toolkit for REC
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
60
Benefits of SMORe
Performance levelSampler
Mapper
for
Recommendation
Optimizer
Modules contribute different
levels of performance during
different REC tasks
Model A
Model B
Model C
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
61
Benefits of SMORe
Sampler
Mapper
for
Recommendation
Optimizer
SMORe enables combining
the most suitable modules
for given REC task
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
62
Benefits of SMORe
Sampler
Mapper
for
Recommendation
Optimizer
SMORe enables combining
the most suitable modules
for given REC task
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
63
Benefits of SMORe
Best THEN!
Best NOW!
As toolkit for research:
1. Baseline comparison
2. Ballpark approaches
3. One module at a time
As framework for development:
1. Reduces development time
2. Raises performance limit
3. Adapts to new tasks
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
References
64
1. Hamilton, William L., et al. “Representation on Networks”, WWW 2018 Tutorial
2. Shi, Chuan, et al. “Relevance Search in Heterogeneous Networks”, ACM EDBT 2012
3. Shi, Chuan, et al. “A Survey of Heterogeneous Information Network Analysis”,

IEEE TKDE 2016
4. Bernhardsson, Erik “ANNOY library”, https://github.com/spotify/annoy
5. Hsieh, Cheng-Kang, et al. “Collaborative Metric Learning”, WWW 2017
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
Ting-HsiangWang,TexasA&MUniversity
Lecture Agenda
65
Q0. Recommendation (REC) and challenges
Q1. Why graph embedding (GE) for REC
Q2. SMORe modularization of GE and benefits
Q3. Exemplar structural modeling for REC
Q4. REC using SMORe
(Sean)
(Sean)
(Sean)
(CM)
(CM)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica66
Example Graph
vertex
edge
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica67
Vertex Structure
• Adjacency

- vertices which share the same edge
• Neighborhood

- vertices which share similar connections
• Community

- vertices which share similar communities
• Centrality

- vertices which share similar properties
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica68
1. Adjacency
DB
A EC
F
EDGE EDGE
EDGE
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica69
Adjacency
user item
user-item	preference
vertices share the same edge are treated as similar pairs
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica70
Matrix Factorization
r = ? ? ? ?
?
?
?
?
x
p1 p2 p3
U1
U2
U3
I1 I2 I3 I4
IEEE Computer’09
Yehuda Koren, Robert M. Bell, Chris Volinsky: Matrix Factorization
Techniques for Recommender Systems. IEEE Computer 42(8): 30-37 (2009)
might distort the data considerably. Hence, more r
works3-6
suggested modeling directly the observe
ings only, while avoiding overfitting through a regula
model. To learn the factor vectors (pu
and qi
), the s
minimizes the regularized squared error on the
known ratings:
min
* *,q p
( , )u i
(rui
qi
T
pu
)2
+ (|| qi
||2
+ || pu
||2
)
Here, is the set of the (u,i) pairs for which rui
is k
q1
q2
q3
(rating - item*user)
p4
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica71
Graph
user-item	graph
Matrix Factorization in SMORe
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
(			,			)
(user,	item)
72
SamplerGraph
user-item	graph
Matrix Factorization in SMORe
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
(			,			)
(user,	item)
73
SamplerGraph
Mapper
user-item	graph
Matrix Factorization in SMORe
~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit>
~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit>
Embedding	Lookup
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
(			,			)
(user,	item)
74
OptimizerSamplerGraph
Mapper
user-item	graph
Matrix Factorization in SMORe
Square	Error
(~v · ~v y)2
<latexit sha1_base64="uQdhOq3LVxgk6wWspOVfIsBCS58=">AAACBnicbVDLSgMxFM34rPU16lKEYBHqwjJTBV0W3LisYB/QGUsmk2lDM8mQZApl6MqNv+LGhSJu/QZ3/o1pO4K2Hrhwcs695N4TJIwq7Thf1tLyyuraemGjuLm1vbNr7+03lUglJg0smJDtACnCKCcNTTUj7UQSFAeMtILB9cRvDYlUVPA7PUqIH6MepxHFSBupax+VvSHB2XAMPRwKDX9eZ3B0el/t2iWn4kwBF4mbkxLIUe/an14ocBoTrjFDSnVcJ9F+hqSmmJFx0UsVSRAeoB7pGMpRTJSfTc8YwxOjhDAS0hTXcKr+nshQrNQoDkxnjHRfzXsT8T+vk+roys8oT1JNOJ59FKUMagEnmcCQSoI1GxmCsKRmV4j7SCKsTXJFE4I7f/IiaVYr7nmlentRqtXzOArgEByDMnDBJaiBG1AHDYDBA3gCL+DVerSerTfrfda6ZOUzB+APrI9vpWyX/A==</latexit>
~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit>
~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit>
Embedding	Lookup
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica75
Sparse LInear Method (SLIM) ICDM’11, RecSys’16
= 1 0 1 0
0 ?
0
? 0
- 0
x
I1 I2 I3
I1
I2
I3
I4
I4I1 I2 I3 I4
U1
U2
U3
1. Xia Ning, George Karypis: SLIM: Sparse Linear Methods for Top-N Recommender Systems. ICDM 2011: 497-506
2. Evangelia Christakopoulou, George Karypis: Local Item-Item Models For Top-N Recommendation. RecSys 2016: 67-74
nd MF. In
y with a
methods.
ed as a
Bayesian
maximum
sures the
ems and
n method
objective
o denote
nd items
in ˜aT
i in decreasing order and recommending the top-N items.
B. Learning W for SLIM
We view the purchase/rating activity of user ui on item tj in
A (i.e., aij) as the ground-truth item recommendation score.
Given a user-item purchase/rating matrix A of size m⇥n, we
learn the sparse n⇥n matrix W in Equation 2 as the minimizer
for the following regularized optimization problem:
minimize
W
1
2
kA AWk2
F +
2
kWk2
F + kWk1
subject to W 0
diag(W) = 0,
(3)
where kWk1 =
Pn
i=1
Pn
j=1 |wij| is the entry-wise `1-norm
of W, and k · kF is the matrix Frobenius norm. In Equa-
1 - 1 -
I1 I2 I3 I4
U1
U2
U3
Transition Matrix W
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
(			,			)
(user,	item)
76
SamplerGraph
user-item	graph
SLIM in SMORe
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
(			,			)
(user,	item)
77
SamplerGraph
Mapper
user-item	graph
SLIM in SMORe
~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit>
~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit>
space such that the co-occurrence of users and items (i.e.,
a certain user has purchased a certain item) can be rendered
conditionally independent. Koren [10] proposed an intersect-
ing approach between neighborhood-based method and MF. In
his approach, item similarity is learned simultaneously with a
matrix factorization so as to take advantages of both methods.
Top-N recommendation has also been formulated as a
ranking problem. Rendle et al [11] proposed a Bayesian
Personalized Ranking (BPR) criterion, which is the maximum
posterior estimator from a Bayesian analysis and measures the
difference between the rankings of user-purchased items and
the rest items. BPR can be well adopted for item knn method
(BPRkNN) and MF methods (BPRMF) as a general objective
function.
III. DEFINITIONS AND NOTATIONS
In this paper, the symbols u and t will be used to denote
the users and items, respectively. Individual users and items
will be denoted using different subscripts (i.e., ui, tj). The
set of all users and items in the system will be denoted by U
represents the recommendation s
ui. Top-N recommendation for ui
purchased/-rated items based on t
in ˜aT
i in decreasing order and rec
B. Learning W for SLIM
We view the purchase/rating act
A (i.e., aij) as the ground-truth i
Given a user-item purchase/rating
learn the sparse n⇥n matrix W in
for the following regularized opti
minimize
W
1
2
kA AWk2
F +
subject to W 0
diag(W) = 0,
where kWk1 =
Pn
i=1
Pn
j=1 |wij
of W, and k · kF is the matrix
tion 3, AW is the estimated matri
(i.e., ˜A) by the sparse linear mo
1 2
A

(fixed)
W	
(trainable)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
(			,			)
(user,	item)
78
OptimizerSamplerGraph
Mapper
user-item	graph
SLIM in SMORe
Square	Error
(~v · ~v y)2
<latexit sha1_base64="uQdhOq3LVxgk6wWspOVfIsBCS58=">AAACBnicbVDLSgMxFM34rPU16lKEYBHqwjJTBV0W3LisYB/QGUsmk2lDM8mQZApl6MqNv+LGhSJu/QZ3/o1pO4K2Hrhwcs695N4TJIwq7Thf1tLyyuraemGjuLm1vbNr7+03lUglJg0smJDtACnCKCcNTTUj7UQSFAeMtILB9cRvDYlUVPA7PUqIH6MepxHFSBupax+VvSHB2XAMPRwKDX9eZ3B0el/t2iWn4kwBF4mbkxLIUe/an14ocBoTrjFDSnVcJ9F+hqSmmJFx0UsVSRAeoB7pGMpRTJSfTc8YwxOjhDAS0hTXcKr+nshQrNQoDkxnjHRfzXsT8T+vk+roys8oT1JNOJ59FKUMagEnmcCQSoI1GxmCsKRmV4j7SCKsTXJFE4I7f/IiaVYr7nmlentRqtXzOArgEByDMnDBJaiBG1AHDYDBA3gCL+DVerSerTfrfda6ZOUzB+APrI9vpWyX/A==</latexit>
~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit>
~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit>
A

(fixed)
W	
(trainable)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica79
2. Neighborhood
A
B
NEIGHBORS
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica80
Neighborhood
vertices share similar connections are treated as similar pairs
commonly	purchasedcommon	interests
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica81
item2vec RecSys’16, MLSP’16
Oren Barkan, Noam Koenigstein: Item2vec: Neural Item Embedding for
Collaborative Filtering. RecSys Posters 2016

Oren Barkan, Noam Koenigstein: ITEM2VEC: Neural item embedding for
collaborative filtering. MLSP 2016: 1-6
ter that
s to be
d iw is
d to the
und to
ibution,
rare and
edure is
each pair of items that share the same set as
example. This implies a window size that is d
from the set size. Specifically, for a given se
the objective from Eq. (1) is modified as follo
1
1
log ( | )
K K
j i
i j i
p w w
K = ≠
∑∑
Another option is to keep the objective in Eq
and shuffle each set of items during runtim
experiments we observed that both options p
the given K items
purchasing	logs
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica82
To Model Neighborhood
context

space
vertex

space
purchasing	logs
shared

neighbor
userA
userB
space for shared neighbor
w2v
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica83
SamplerGraph
user-item	graph
item2vec in SMORe
(			,			)
(itemA,	itemB)
A B
A
B
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica84
SamplerGraph
Mapper
user-item	graph
item2vec in SMORe
vertex

embedding	lookup
A
~vv
<latexit sha1_base64="n/DRd03R+fmi3AovMrJcYZKlFBM=">AAAB+3icbZBNS8NAEIY39avWr1iPXoJF8FSSKuix4MVjBdsKTSyb7aRduvlgd1JaQv6KFw+KePWPePPfuG1z0NYXFh7emWFmXz8RXKFtfxuljc2t7Z3ybmVv/+DwyDyudlScSgZtFotYPvpUgeARtJGjgMdEAg19AV1/fDuvdycgFY+jB5wl4IV0GPGAM4ra6ptVdwIsm+RPmYswRU1536zZdXshax2cAmqkUKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9jRENQXnZ4vbcOtfOwApiqV+E1sL9PZHRUKlZ6OvOkOJIrdbm5n+1XorBjZfxKEkRIrZcFKTCwtiaB2ENuASGYqaBMsn1rRYbUUkZ6rgqOgRn9cvr0GnUnct64/6q1mwVcZTJKTkjF8Qh16RJ7kiLtAkjU/JMXsmbkRsvxrvxsWwtGcXMCfkj4/MHZ5qVXg==</latexit>
context

embedding	lookup
B ~vc
<latexit sha1_base64="HJbdihIJmLvMuY2eQ/2t7MTPmYg=">AAAB+3icbVBNS8NAEN3Ur1q/Yj16CRbBU0mqoMeCF48VbCs0sWy2k3bp5oPdSWkJ+StePCji1T/izX/jts1BWx8MPN6bYWaenwiu0La/jdLG5tb2Tnm3srd/cHhkHlc7Kk4lgzaLRSwffapA8AjayFHAYyKBhr6Arj++nfvdCUjF4+gBZwl4IR1GPOCMopb6ZtWdAMsm+VPmIkwxY3neN2t23V7AWidOQWqkQKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9TSMagvKyxe25da6VgRXEUleE1kL9PZHRUKlZ6OvOkOJIrXpz8T+vl2Jw42U8SlKEiC0XBamwMLbmQVgDLoGhmGlCmeT6VouNqKQMdVwVHYKz+vI66TTqzmW9cX9Va7aKOMrklJyRC+KQa9Ikd6RF2oSRKXkmr+TNyI0X4934WLaWjGLmhPyB8fkDSqiVSw==</latexit>
(			,			)
A B
A
B
(itemA,	itemB)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica85
OptimizerSamplerGraph
Mapper
user-item	graph
item2vec in SMORe
vertex

embedding	lookup
A
~vv
<latexit sha1_base64="n/DRd03R+fmi3AovMrJcYZKlFBM=">AAAB+3icbZBNS8NAEIY39avWr1iPXoJF8FSSKuix4MVjBdsKTSyb7aRduvlgd1JaQv6KFw+KePWPePPfuG1z0NYXFh7emWFmXz8RXKFtfxuljc2t7Z3ybmVv/+DwyDyudlScSgZtFotYPvpUgeARtJGjgMdEAg19AV1/fDuvdycgFY+jB5wl4IV0GPGAM4ra6ptVdwIsm+RPmYswRU1536zZdXshax2cAmqkUKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9jRENQXnZ4vbcOtfOwApiqV+E1sL9PZHRUKlZ6OvOkOJIrdbm5n+1XorBjZfxKEkRIrZcFKTCwtiaB2ENuASGYqaBMsn1rRYbUUkZ6rgqOgRn9cvr0GnUnct64/6q1mwVcZTJKTkjF8Qh16RJ7kiLtAkjU/JMXsmbkRsvxrvxsWwtGcXMCfkj4/MHZ5qVXg==</latexit>
context

embedding	lookup
B ~vc
<latexit sha1_base64="HJbdihIJmLvMuY2eQ/2t7MTPmYg=">AAAB+3icbVBNS8NAEN3Ur1q/Yj16CRbBU0mqoMeCF48VbCs0sWy2k3bp5oPdSWkJ+StePCji1T/izX/jts1BWx8MPN6bYWaenwiu0La/jdLG5tb2Tnm3srd/cHhkHlc7Kk4lgzaLRSwffapA8AjayFHAYyKBhr6Arj++nfvdCUjF4+gBZwl4IR1GPOCMopb6ZtWdAMsm+VPmIkwxY3neN2t23V7AWidOQWqkQKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9TSMagvKyxe25da6VgRXEUleE1kL9PZHRUKlZ6OvOkOJIrXpz8T+vl2Jw42U8SlKEiC0XBamwMLbmQVgDLoGhmGlCmeT6VouNqKQMdVwVHYKz+vI66TTqzmW9cX9Va7aKOMrklJyRC+KQa9Ikd6RF2oSRKXkmr+TNyI0X4934WLaWjGLmhPyB8fkDSqiVSw==</latexit>
(			,			)
A B
Log	Likelihood
A B
log (~vv
· ~vc
)<latexit sha1_base64="+ijKtwEXuygdqvbeIqq2UeNudQM=">AAACIHicbZBNS8NAEIY3flu/oh69LBZBLyVRoR4FLx4r2FZoatlspu3STTbsTool9Kd48a948aCI3vTXuK05aPWFhZdnZpidN0ylMOh5H87c/MLi0vLKamltfWNzy93eaRiVaQ51rqTSNyEzIEUCdRQo4SbVwOJQQjMcXEzqzSFoI1RyjaMU2jHrJaIrOEOLOm41kKoXGNGL2WEwBJ4Px7d5gHCH1o0DHimcwdxietRxy17Fm4r+NX5hyqRQreO+B5HiWQwJcsmMafleiu2caRRcwrgUZAZSxgesBy1rExaDaefTA8f0wJKIdpW2L0E6pT8nchYbM4pD2xkz7JvZ2gT+V2tl2D1r5yJJM4SEfy/qZpKiopO0aCQ0cJQjaxjXwv6V8j7TjKPNtGRD8GdP/msaxxX/pHJ8dVo+rxVxrJA9sk8OiU+q5JxckhqpE07uySN5Ji/Og/PkvDpv361zTjGzS37J+fwCQLeliA==</latexit>
A
B
(itemA,	itemB)
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica86
3. Community
A
B
COMMUNITY
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica87
Community
vertices share similar communities are treated as similar pairs
connected	to	the	same	group
not	connected
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica88
Heterogeneous Preference Embedding (HPE) RecSys’16
Chih-Ming Chen, Ming-Feng Tsai, Yu-Ching Lin, Yi-Hsuan Yang: Query-based
Music Recommendations via Preference Embedding. RecSys 2016: 79-82
aper includes:
dation is proposed
arch work of infor-
mender system.
ommendation as a
propose a method
between users and
ms.
onducted to verify
ethod.
ENDATIONS
the task of query-
User
Track
Album
Artist
U1
T1
T2
T3
T4
T5
Ar1
U2
U3
Ar2
Ar3
Al1
Al2
Al3
Al4T6
U4
Figure 1: The user preference network. Suppose there are
totally four kinds of data attributes in the music listening
dataset. In the figure, the user U4 may frequently listen to
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica89
Heterogeneous Preference Embedding (HPE) RecSys’16
Chih-Ming Chen, Ming-Feng Tsai, Yu-Ching Lin, Yi-Hsuan Yang: Query-based
Music Recommendations via Preference Embedding. RecSys 2016: 79-82
T5 Al4U3 U4
target

vertexT5 U3
U3 U4 Al4U3
Edge Sampling from
whole network
Weighted Random Walks from

preceding vertex
Walk Direction
T5U3
…
( , )
( , )
( , ) ( , )
context

vertex
( , )
the training pairs
ure 2: Edge sampling via weighted random walks for
ning pairs. In our sampling scheme, the first sampling
is derived from the edge sampling over the whole network.
Algorithm 1: Heterogeneous Prefer
Input: User Preference Network:
Walk Steps: w, Sampling Times: n
1 for v 2 V do
2 Initialize the representation: (
3 for i 2 {1, ..., n} do
4 (v1, v2) = EdgeSampling(G)
5 Update (v1), context v2 by mi
6 Update (v2), context v1 by mi
7 for v0
2 RandomWalk(v2, w
8 Update (v1), context v0
by
Output: Vertex representations
Table 1: Dataset
Pr(vj| (vi)) =
0 otherwise
Maximizing the above posterior probabi
to minimizing the negative log likelihood,
function of the second-order proximity can
follows:
O =
X
(i,j)2S
wi,j log p(vj| (vi)) +
X
i
where S is a set of sampling pairs and w ind
of the edge. In practice, the observed ed
tive information and thus falls into the one
problem. We adopt the widely-used solutio
sampling to sample the additional user-to-o
weight embedding
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica90
SamplerGraph
user-item	graph
HPE in SMORe
(		,		)V C (		,		)V C (		,		)V C
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica91
OptimizerSamplerGraph
Mapper
user-item	graph
HPE in SMORe
(		,		)V C (		,		)V C (		,		)V C
vertex

embedding	lookup
~vv
<latexit sha1_base64="n/DRd03R+fmi3AovMrJcYZKlFBM=">AAAB+3icbZBNS8NAEIY39avWr1iPXoJF8FSSKuix4MVjBdsKTSyb7aRduvlgd1JaQv6KFw+KePWPePPfuG1z0NYXFh7emWFmXz8RXKFtfxuljc2t7Z3ybmVv/+DwyDyudlScSgZtFotYPvpUgeARtJGjgMdEAg19AV1/fDuvdycgFY+jB5wl4IV0GPGAM4ra6ptVdwIsm+RPmYswRU1536zZdXshax2cAmqkUKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9jRENQXnZ4vbcOtfOwApiqV+E1sL9PZHRUKlZ6OvOkOJIrdbm5n+1XorBjZfxKEkRIrZcFKTCwtiaB2ENuASGYqaBMsn1rRYbUUkZ6rgqOgRn9cvr0GnUnct64/6q1mwVcZTJKTkjF8Qh16RJ7kiLtAkjU/JMXsmbkRsvxrvxsWwtGcXMCfkj4/MHZ5qVXg==</latexit>
V
V
context

embedding	lookup
~vc
<latexit sha1_base64="HJbdihIJmLvMuY2eQ/2t7MTPmYg=">AAAB+3icbVBNS8NAEN3Ur1q/Yj16CRbBU0mqoMeCF48VbCs0sWy2k3bp5oPdSWkJ+StePCji1T/izX/jts1BWx8MPN6bYWaenwiu0La/jdLG5tb2Tnm3srd/cHhkHlc7Kk4lgzaLRSwffapA8AjayFHAYyKBhr6Arj++nfvdCUjF4+gBZwl4IR1GPOCMopb6ZtWdAMsm+VPmIkwxY3neN2t23V7AWidOQWqkQKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9TSMagvKyxe25da6VgRXEUleE1kL9PZHRUKlZ6OvOkOJIrXpz8T+vl2Jw42U8SlKEiC0XBamwMLbmQVgDLoGhmGlCmeT6VouNqKQMdVwVHYKz+vI66TTqzmW9cX9Va7aKOMrklJyRC+KQa9Ikd6RF2oSRKXkmr+TNyI0X4934WLaWjGLmhPyB8fkDSqiVSw==</latexit>
C
C
Log	Likelihood
V C
log (~vv
· ~vc
)<latexit sha1_base64="+ijKtwEXuygdqvbeIqq2UeNudQM=">AAACIHicbZBNS8NAEIY3flu/oh69LBZBLyVRoR4FLx4r2FZoatlspu3STTbsTool9Kd48a948aCI3vTXuK05aPWFhZdnZpidN0ylMOh5H87c/MLi0vLKamltfWNzy93eaRiVaQ51rqTSNyEzIEUCdRQo4SbVwOJQQjMcXEzqzSFoI1RyjaMU2jHrJaIrOEOLOm41kKoXGNGL2WEwBJ4Px7d5gHCH1o0DHimcwdxietRxy17Fm4r+NX5hyqRQreO+B5HiWQwJcsmMafleiu2caRRcwrgUZAZSxgesBy1rExaDaefTA8f0wJKIdpW2L0E6pT8nchYbM4pD2xkz7JvZ2gT+V2tl2D1r5yJJM4SEfy/qZpKiopO0aCQ0cJQjaxjXwv6V8j7TjKPNtGRD8GdP/msaxxX/pHJ8dVo+rxVxrJA9sk8OiU+q5JxckhqpE07uySN5Ji/Og/PkvDpv361zTjGzS37J+fwCQLeliA==</latexit>
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica
4. Centrality (Degree)
92
A
B
DEGREE 4
DEGREE 4
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica93
Centrality
vertices share similar properties are treated as similar pairs
hub	item hub	item
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica94
struc2vec KDD’17
Leonardo Filipe Rodrigues Ribeiro, Pedro H. P. Saverese, Daniel R.
Figueiredo: struc2vec: Learning Node Representations from Structural
Identity. KDD 2017: 385-394
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica95
struc2vec KDD’17
Leonardo Filipe Rodrigues Ribeiro, Pedro H. P. Saverese, Daniel R.
Figueiredo: struc2vec: Learning Node Representations from Structural
Identity. KDD 2017: 385-394
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica96
struc2vec KDD’17
Leonardo Filipe Rodrigues Ribeiro, Pedro H. P. Saverese, Daniel R.
Figueiredo: struc2vec: Learning Node Representations from Structural
Identity. KDD 2017: 385-394
user-user	graph
fn1(					)	=	structure1_A
fn1(					)	=	structure1_B
A
B
A
B
fn2(					)	=	structure2_C
C
C
1-step:
2-steps:
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica97
SamplerGraph
user-user	graph
struc2vec in SMORe
(		,		)V
fn1()
S (		,		)V
fn2()
S
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica98
OptimizerSamplerGraph
Mapper
user-user	graph
struc2vec in SMORe
~vv
<latexit sha1_base64="n/DRd03R+fmi3AovMrJcYZKlFBM=">AAAB+3icbZBNS8NAEIY39avWr1iPXoJF8FSSKuix4MVjBdsKTSyb7aRduvlgd1JaQv6KFw+KePWPePPfuG1z0NYXFh7emWFmXz8RXKFtfxuljc2t7Z3ybmVv/+DwyDyudlScSgZtFotYPvpUgeARtJGjgMdEAg19AV1/fDuvdycgFY+jB5wl4IV0GPGAM4ra6ptVdwIsm+RPmYswRU1536zZdXshax2cAmqkUKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9jRENQXnZ4vbcOtfOwApiqV+E1sL9PZHRUKlZ6OvOkOJIrdbm5n+1XorBjZfxKEkRIrZcFKTCwtiaB2ENuASGYqaBMsn1rRYbUUkZ6rgqOgRn9cvr0GnUnct64/6q1mwVcZTJKTkjF8Qh16RJ7kiLtAkjU/JMXsmbkRsvxrvxsWwtGcXMCfkj4/MHZ5qVXg==</latexit>
V
V
~vc
<latexit sha1_base64="HJbdihIJmLvMuY2eQ/2t7MTPmYg=">AAAB+3icbVBNS8NAEN3Ur1q/Yj16CRbBU0mqoMeCF48VbCs0sWy2k3bp5oPdSWkJ+StePCji1T/izX/jts1BWx8MPN6bYWaenwiu0La/jdLG5tb2Tnm3srd/cHhkHlc7Kk4lgzaLRSwffapA8AjayFHAYyKBhr6Arj++nfvdCUjF4+gBZwl4IR1GPOCMopb6ZtWdAMsm+VPmIkwxY3neN2t23V7AWidOQWqkQKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9TSMagvKyxe25da6VgRXEUleE1kL9PZHRUKlZ6OvOkOJIrXpz8T+vl2Jw42U8SlKEiC0XBamwMLbmQVgDLoGhmGlCmeT6VouNqKQMdVwVHYKz+vI66TTqzmW9cX9Va7aKOMrklJyRC+KQa9Ikd6RF2oSRKXkmr+TNyI0X4934WLaWjGLmhPyB8fkDSqiVSw==</latexit>
S
S
Log	Likelihood
V S
log (~vv
· ~vc
)<latexit sha1_base64="+ijKtwEXuygdqvbeIqq2UeNudQM=">AAACIHicbZBNS8NAEIY3flu/oh69LBZBLyVRoR4FLx4r2FZoatlspu3STTbsTool9Kd48a948aCI3vTXuK05aPWFhZdnZpidN0ylMOh5H87c/MLi0vLKamltfWNzy93eaRiVaQ51rqTSNyEzIEUCdRQo4SbVwOJQQjMcXEzqzSFoI1RyjaMU2jHrJaIrOEOLOm41kKoXGNGL2WEwBJ4Px7d5gHCH1o0DHimcwdxietRxy17Fm4r+NX5hyqRQreO+B5HiWQwJcsmMafleiu2caRRcwrgUZAZSxgesBy1rExaDaefTA8f0wJKIdpW2L0E6pT8nchYbM4pD2xkz7JvZ2gT+V2tl2D1r5yJJM4SEfy/qZpKiopO0aCQ0cJQjaxjXwv6V8j7TjKPNtGRD8GdP/msaxxX/pHJ8dVo+rxVxrJA9sk8OiU+q5JxckhqpE07uySN5Ji/Og/PkvDpv361zTjGzS37J+fwCQLeliA==</latexit>
(		,		)V
fn1()
S (		,		)V
fn2()
S
note. original work adopts hierarchal softmax for learning
vertex

embedding	lookup
context

embedding	lookup
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica99
Recap
Sampler Mapper Optimizer
MF Adjacency Vertex Embedding
Dot product
RMSE
SLIM Adjacency Vertex Embedding
Dot product
RMSE
item2vec Neighborhood
Vertex Embedding
Context Embedding
Sigmoid Dot Product
Log Likelihood
HPE Community
Vertex Embedding
Context Embedding
Sigmoid Dot Product
Log Likelihood
struc2vec Centrality
Vertex Embedding
Context Embedding
Sigmoid Dot Product
Log Likelihood
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica100
Edge Information
• Weight

- reflects strength of interaction
• Closeness

- reflects order of interaction
• Semantics

- sample with grammar
• Inductiveness

- information diffuses through edges
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica101
1. Weight
B
C
A
12
1
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica102
node2vec
edge weight guides the walk
BFS Walk DFS Walk
Aditya Grover, Jure Leskovec: node2vec: Scalable Feature Learning for Networks.
KDD 2016: 855-864
KDD’16
v
α=1
α=1/q
α=1/q
α=1/p
x2
x3t
x1
2: Illustration of the random walk procedure in node2vec.
alk just transitioned from t to v and is now evaluating its next
t of node v. Edge labels indicate search biases ↵.
x denotes the shortest path distance between nodes t and x.
hat dtx must be one of {0, 1, 2}, and hence, the two parame-
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica103
node2vec in SMORe
v
α=1
α=1/q
α=1/q
α=1/p
x2
x3t
x1
2: Illustration of the random walk procedure in node2vec.
alk just transitioned from t to v and is now evaluating its next
t of node v. Edge labels indicate search biases ↵.
x denotes the shortest path distance between nodes t and x.
hat dtx must be one of {0, 1, 2}, and hence, the two parame-
Sampler
customized

random	walk
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica104
2. Closeness
A
strong weak
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica105
High-order Proximity for Recommendations (HOP-Rec)
RecSys’18
w
Chih-Ming Chen
National Chengchi University
Taipei, Taiwan
104761501@nccu.edu.tw
w
Ming-Feng Tsai
National Chengchi University
Taipei, Taiwan
mftsai@nccu.edu.tw
many e-commerce
pular approaches
els; the former ap-
he observed direct
r extracts indirect
item interactions.
ecient method
posed method in-
order information
d of factorizing a
dence weighting
n simultaneously,
action matrix and
lks. Experimental
performs the state
tasets.
andom walks; bi-
back
and Ming-Feng Tsai.
ecommendation. In
i1
i2
i3
i4
u1
1st 1st
u2
1st 1st
u3
1st 1st
i1 i2 i3 i4
u1
1st 1st 2nd 3rd
u2
2nd 1st 1st 2nd
u3
3rd 2nd 1st 1st
(c) (d)
(a) (b)
i1
u1
i2 i3 i4 i1
u1 u2
i2 i3
u3
i4
u2 u3
Figure 1: High-order proximity between users and items
within observed interactions
Collaborative ltering (CF) is commonly used to leverage this
Jheng-Hong Yang, Chih-Ming Chen, Chuan-Ju Wang, Ming-Feng Tsai: HOP-rec: high-
order proximity for implicit recommendation. RecSys 2018: 140-144
these factorization and graph models, we build HOP-Re
framework that (1) captures high-order preference info
a given user-item interaction matrix, and (2) scales to l
real-world datasets by using random surng on the corr
interaction graph. The objective of HOP-Rec is dened
LHOP =
X
1k K
u,(i,i0)
graph model
z }| {
C(k) E i⇠Pk
u
i0⇠PN
factorization model
z }| {f
F
⇣ |
u i0,
|
u i
⌘g
+ k
where Pk
u (·) denotes the k-order probability distribut
item sampled from the walk sequence Su (see Denit
denotes a uniform distribution by which an item is sam
the set of all items, and K denotes the maximum orde
in our method. The main idea behind the proposed met
approximation of high-order probabilistic matrix factor
conducting random walk (RW) with a decay factor for c
cit Recommendation RecSys ’18, October 2–7, 2018, Vancouver, BC, Cana
uild HOP-Rec, a united
eference information in
2) scales to large-scale
g on the corresponding
ec is dened as
model
{
|
u i
⌘g
+ k k2
2 , (3)
Finally, the ranking objective function is composed with an in
cator function and a pairwise logistic loss, and we thus dene F
Eq. (3) as
F (
|
u i0,
|
u i ) = 1{
|
u i0
|
u i  k } log
f ⇣ |
u i0
|
u i
⌘g
,
where 1B denotes the indicator function for condition B and k
an order-aware margin set to /k. Note that item i0 is uniform
sampled from the set of all items as we assume a simplied u
form distribution PN for less preferred items; this assumption
order-aware interaction
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica106
HOP-Rec in SMORe
SamplerGraph
user-item	graph
(		,		)
1(~v · ~v ~v · ~v  margin)latexit sha1_base64=iBNmKDj8/YuJu40Ok7W0yOl30QA=AAACRXicbVA9SwNBEN3z2/gVtbRZDEIsDHcqaGEh2lgqGBVyIeztTZIle3vH7lwwHPeX/Bv+ARsLLe3sxFb3YgSjPhh4vDfDzLwgkcKg6z46E5NT0zOzc/OlhcWl5ZXy6tqViVPNoc5jGeubgBmQQkEdBUq4STSwKJBwHfROC/+6D9qIWF3iIIFmxDpKtAVnaKVW+cyPGHZDk3l51e8Dz/q5z8MYvzndof/KRz71EW4xi5juCJVvt8oVt+YOQf8Sb0QqZITzVvnFD2OeRqCQS2ZMw3MTbGZMo+AS8pKfGkgY77EONCxVLALTzIYf53TLKiFtx9qWQjpUf05kLDJmEAW2s/jP/PYK8T+vkWL7sJkJlaQIin8taqeSYkyL+GgoNHCUA0sY18LeSnmXacbRhjy2JTTFaXnJBuP9juEvudqteXu13Yv9yvHJKKI5skE2SZV45IAckzNyTuqEkzvyQJ7Is3PvvDpvzvtX64QzmlknY3A+PgHSQLQT/latexit
R
greater

margin
smaller

margin
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica107
HOP-Rec in SMORe
OptimizerSamplerGraph
Mapper
user-item	graph
embedding	lookup
BPRLoss
(		,		)
~vlatexit sha1_base64=m1L29Cxyzv4kq0gD0isZHiqvweE=AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a/latexit
~vlatexit sha1_base64=m1L29Cxyzv4kq0gD0isZHiqvweE=AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a/latexit
log (~v · ~v ~v · ~v )latexit sha1_base64=O9uo0NROxizRILxvgLgJbQT9HME=AAACIXicbZDLSsNAFIYn9VbrLerSzWAR6sKSVMEui25cVrAXaEKZTCbp0EkmzEwKJfRV3Pgqblwo0p34Mk7bCNr2h4GP/5zDmfN7CaNSWdaXUdjY3NreKe6W9vYPDo/M45O25KnApIU546LrIUkYjUlLUcVINxEERR4jHW94P6t3RkRIyuMnNU6IG6EwpgHFSGmrb9YdxkNH0jBCFWdEcDaaONjn6pfhFVxrX/bNslW15oKrYOdQBrmafXPq+BynEYkVZkjKnm0lys2QUBQzMik5qSQJwkMUkp7GGEVEutn8wgm80I4PAy70ixWcu38nMhRJOY483RkhNZDLtZm5rtZLVVB3MxonqSIxXiwKUgYVh7O4oE8FwYqNNSAsqP4rxAMkEFY61JIOwV4+eRXatap9Xa093pQbd3kcRXAGzkEF2OAWNMADaIIWwOAZvIJ38GG8GG/GpzFdtBaMfOYU/JPx/QMUnaSs/latexit
1. Steffen Rendle, Christoph
Freudenthaler, Zeno Gantner, Lars
Schmidt-Thieme: BPR: Bayesian
Personalized Ranking from Implicit
Feedback. UAI 2009: 452-461
1(~v · ~v ~v · ~v  margin)latexit sha1_base64=iBNmKDj8/YuJu40Ok7W0yOl30QA=AAACRXicbVA9SwNBEN3z2/gVtbRZDEIsDHcqaGEh2lgqGBVyIeztTZIle3vH7lwwHPeX/Bv+ARsLLe3sxFb3YgSjPhh4vDfDzLwgkcKg6z46E5NT0zOzc/OlhcWl5ZXy6tqViVPNoc5jGeubgBmQQkEdBUq4STSwKJBwHfROC/+6D9qIWF3iIIFmxDpKtAVnaKVW+cyPGHZDk3l51e8Dz/q5z8MYvzndof/KRz71EW4xi5juCJVvt8oVt+YOQf8Sb0QqZITzVvnFD2OeRqCQS2ZMw3MTbGZMo+AS8pKfGkgY77EONCxVLALTzIYf53TLKiFtx9qWQjpUf05kLDJmEAW2s/jP/PYK8T+vkWL7sJkJlaQIin8taqeSYkyL+GgoNHCUA0sY18LeSnmXacbRhjy2JTTFaXnJBuP9juEvudqteXu13Yv9yvHJKKI5skE2SZV45IAckzNyTuqEkzvyQJ7Is3PvvDpvzvtX64QzmlknY3A+PgHSQLQT/latexit
R
R
greater

margin
smaller

margin
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica108
3. Semantics
A
USER
VENUE
CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica109
metapath2vec KDD’17
Org Author Paper Venue
a1
a2
a3
a4
a5
MIT
CMU
ACL
KDD
p1
p2
p3
APVPA
OAPVPAO
APA
meta paths
(a) An academic network
KDD 0
0
0
0
0
1
0
0
0
0
0
0
ACL
MIT
CMU
a1
a2
a3
a4
a5
p1
p2
p3
input layer hidden
layer
|V|-dim
(b) Skip-gram in metapa
a heteroge-
erogeneous
ion Learn-
learn the d-
V | that are
ong them.
trix X, with
ding to the
re dierent
ed into the
the heterogeneous network structures into skip-gram, we propose
meta-path-based random walks in heterogeneous networks.
Heterogeneous Skip-Gram. In metapath2vec, we enable skip-
gram to learn eective node representations for a heterogeneous
network G = (V, E,T ) with |TV |  1 by maximizing the probability
of having the heterogeneous context Nt ( ),t 2 TV given a node :
arg max
X
2V
X
t 2TV
X
ct 2Nt ( )
logp(ct | ; ) (2)
where Nt ( ) denotes ’s neighborhood with the tth type of nodes
and p(ct | ; ) is commonly dened as a somax function [3, 7, 18,
KDD’17, August 13–17, 2017, Halifax, NS, Canada
different types neighbors in given typeYuxiao Dong, Nitesh V. Chawla, Ananthram Swami:
metapath2vec: Scalable Representation Learning for
Heterogeneous Networks. KDD 2017: 135-144
metapath carries semantics
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe
RecSys'19 SMORe

More Related Content

What's hot

単語・句の分散表現の学習
単語・句の分散表現の学習単語・句の分散表現の学習
単語・句の分散表現の学習Naoaki Okazaki
 
深層学習の不確実性 - Uncertainty in Deep Neural Networks -
深層学習の不確実性 - Uncertainty in Deep Neural Networks -深層学習の不確実性 - Uncertainty in Deep Neural Networks -
深層学習の不確実性 - Uncertainty in Deep Neural Networks -tmtm otm
 
協調フィルタリング入門
協調フィルタリング入門協調フィルタリング入門
協調フィルタリング入門hoxo_m
 
情報検索とゼロショット学習
情報検索とゼロショット学習情報検索とゼロショット学習
情報検索とゼロショット学習kt.mako
 
第52回SWO研究会チュートリアル資料
第52回SWO研究会チュートリアル資料第52回SWO研究会チュートリアル資料
第52回SWO研究会チュートリアル資料Takanori Ugai
 
差分プライバシーとは何か? (定義 & 解釈編)
差分プライバシーとは何か? (定義 & 解釈編)差分プライバシーとは何か? (定義 & 解釈編)
差分プライバシーとは何か? (定義 & 解釈編)Kentaro Minami
 
Linked Open Data(LOD)の基本的な使い方
Linked Open Data(LOD)の基本的な使い方Linked Open Data(LOD)の基本的な使い方
Linked Open Data(LOD)の基本的な使い方Kouji Kozaki
 
軽くRDB再入門とGraph DB 入門
軽くRDB再入門とGraph DB 入門軽くRDB再入門とGraph DB 入門
軽くRDB再入門とGraph DB 入門Kentaro Masumori
 
Variational Autoencoder (VAE) 解説
Variational Autoencoder (VAE) 解説Variational Autoencoder (VAE) 解説
Variational Autoencoder (VAE) 解説Akihiro Nitta
 
2019年度チュートリアルBPE
2019年度チュートリアルBPE2019年度チュートリアルBPE
2019年度チュートリアルBPE広樹 本間
 
GAN(と強化学習との関係)
GAN(と強化学習との関係)GAN(と強化学習との関係)
GAN(と強化学習との関係)Masahiro Suzuki
 
yans2022_hackathon.pdf
yans2022_hackathon.pdfyans2022_hackathon.pdf
yans2022_hackathon.pdfKosuke Yamada
 
強化学習 DQNからPPOまで
強化学習 DQNからPPOまで強化学習 DQNからPPOまで
強化学習 DQNからPPOまでharmonylab
 
DSIRNLP#1 ランキング学習ことはじめ
DSIRNLP#1 ランキング学習ことはじめDSIRNLP#1 ランキング学習ことはじめ
DSIRNLP#1 ランキング学習ことはじめsleepy_yoshi
 
4 データ間の距離と類似度
4 データ間の距離と類似度4 データ間の距離と類似度
4 データ間の距離と類似度Seiichi Uchida
 
オントロジーとは?
オントロジーとは?オントロジーとは?
オントロジーとは?Kouji Kozaki
 
【DL輪読会】AUTOGT: AUTOMATED GRAPH TRANSFORMER ARCHITECTURE SEARCH
【DL輪読会】AUTOGT: AUTOMATED GRAPH TRANSFORMER ARCHITECTURE SEARCH【DL輪読会】AUTOGT: AUTOMATED GRAPH TRANSFORMER ARCHITECTURE SEARCH
【DL輪読会】AUTOGT: AUTOMATED GRAPH TRANSFORMER ARCHITECTURE SEARCHDeep Learning JP
 
データサイエンス概論第一 5 時系列データの解析
データサイエンス概論第一 5 時系列データの解析データサイエンス概論第一 5 時系列データの解析
データサイエンス概論第一 5 時系列データの解析Seiichi Uchida
 
クラシックな機械学習の入門  8. クラスタリング
クラシックな機械学習の入門  8. クラスタリングクラシックな機械学習の入門  8. クラスタリング
クラシックな機械学習の入門  8. クラスタリングHiroshi Nakagawa
 
生成モデルの Deep Learning
生成モデルの Deep Learning生成モデルの Deep Learning
生成モデルの Deep LearningSeiya Tokui
 

What's hot (20)

単語・句の分散表現の学習
単語・句の分散表現の学習単語・句の分散表現の学習
単語・句の分散表現の学習
 
深層学習の不確実性 - Uncertainty in Deep Neural Networks -
深層学習の不確実性 - Uncertainty in Deep Neural Networks -深層学習の不確実性 - Uncertainty in Deep Neural Networks -
深層学習の不確実性 - Uncertainty in Deep Neural Networks -
 
協調フィルタリング入門
協調フィルタリング入門協調フィルタリング入門
協調フィルタリング入門
 
情報検索とゼロショット学習
情報検索とゼロショット学習情報検索とゼロショット学習
情報検索とゼロショット学習
 
第52回SWO研究会チュートリアル資料
第52回SWO研究会チュートリアル資料第52回SWO研究会チュートリアル資料
第52回SWO研究会チュートリアル資料
 
差分プライバシーとは何か? (定義 & 解釈編)
差分プライバシーとは何か? (定義 & 解釈編)差分プライバシーとは何か? (定義 & 解釈編)
差分プライバシーとは何か? (定義 & 解釈編)
 
Linked Open Data(LOD)の基本的な使い方
Linked Open Data(LOD)の基本的な使い方Linked Open Data(LOD)の基本的な使い方
Linked Open Data(LOD)の基本的な使い方
 
軽くRDB再入門とGraph DB 入門
軽くRDB再入門とGraph DB 入門軽くRDB再入門とGraph DB 入門
軽くRDB再入門とGraph DB 入門
 
Variational Autoencoder (VAE) 解説
Variational Autoencoder (VAE) 解説Variational Autoencoder (VAE) 解説
Variational Autoencoder (VAE) 解説
 
2019年度チュートリアルBPE
2019年度チュートリアルBPE2019年度チュートリアルBPE
2019年度チュートリアルBPE
 
GAN(と強化学習との関係)
GAN(と強化学習との関係)GAN(と強化学習との関係)
GAN(と強化学習との関係)
 
yans2022_hackathon.pdf
yans2022_hackathon.pdfyans2022_hackathon.pdf
yans2022_hackathon.pdf
 
強化学習 DQNからPPOまで
強化学習 DQNからPPOまで強化学習 DQNからPPOまで
強化学習 DQNからPPOまで
 
DSIRNLP#1 ランキング学習ことはじめ
DSIRNLP#1 ランキング学習ことはじめDSIRNLP#1 ランキング学習ことはじめ
DSIRNLP#1 ランキング学習ことはじめ
 
4 データ間の距離と類似度
4 データ間の距離と類似度4 データ間の距離と類似度
4 データ間の距離と類似度
 
オントロジーとは?
オントロジーとは?オントロジーとは?
オントロジーとは?
 
【DL輪読会】AUTOGT: AUTOMATED GRAPH TRANSFORMER ARCHITECTURE SEARCH
【DL輪読会】AUTOGT: AUTOMATED GRAPH TRANSFORMER ARCHITECTURE SEARCH【DL輪読会】AUTOGT: AUTOMATED GRAPH TRANSFORMER ARCHITECTURE SEARCH
【DL輪読会】AUTOGT: AUTOMATED GRAPH TRANSFORMER ARCHITECTURE SEARCH
 
データサイエンス概論第一 5 時系列データの解析
データサイエンス概論第一 5 時系列データの解析データサイエンス概論第一 5 時系列データの解析
データサイエンス概論第一 5 時系列データの解析
 
クラシックな機械学習の入門  8. クラスタリング
クラシックな機械学習の入門  8. クラスタリングクラシックな機械学習の入門  8. クラスタリング
クラシックな機械学習の入門  8. クラスタリング
 
生成モデルの Deep Learning
生成モデルの Deep Learning生成モデルの Deep Learning
生成モデルの Deep Learning
 

Similar to RecSys'19 SMORe

Bootstrapping Recommendations with Neo4j
Bootstrapping Recommendations with Neo4jBootstrapping Recommendations with Neo4j
Bootstrapping Recommendations with Neo4jMax De Marzi
 
Testing Storage Systems: Methodology and Common Pitfalls
Testing Storage Systems: Methodology and Common PitfallsTesting Storage Systems: Methodology and Common Pitfalls
Testing Storage Systems: Methodology and Common PitfallsJohn Constable
 
Bootstrapping Recommendations OSCON 2015
Bootstrapping Recommendations OSCON 2015Bootstrapping Recommendations OSCON 2015
Bootstrapping Recommendations OSCON 2015Max De Marzi
 
Data-mining the Semantic Web @TCD
Data-mining the Semantic Web @TCDData-mining the Semantic Web @TCD
Data-mining the Semantic Web @TCDFrank Lynam
 
Big Data LDN 2017: Machine Learning on Structured Data. Why Is Learning Rules...
Big Data LDN 2017: Machine Learning on Structured Data. Why Is Learning Rules...Big Data LDN 2017: Machine Learning on Structured Data. Why Is Learning Rules...
Big Data LDN 2017: Machine Learning on Structured Data. Why Is Learning Rules...Matt Stubbs
 
TC16_Fostering_a_Successful_Tableau_Deployment
TC16_Fostering_a_Successful_Tableau_DeploymentTC16_Fostering_a_Successful_Tableau_Deployment
TC16_Fostering_a_Successful_Tableau_DeploymentErin Gengo
 
Linking inside a video collection
Linking inside a video collectionLinking inside a video collection
Linking inside a video collectionroelandordelman.nl
 
Feeding Many Birds with One Piece of Bread
Feeding Many Birds with One Piece of BreadFeeding Many Birds with One Piece of Bread
Feeding Many Birds with One Piece of BreadElizabeth Tu
 
Public speaking - FDP tech leads summit - 2018-04-30
Public speaking - FDP tech leads summit - 2018-04-30Public speaking - FDP tech leads summit - 2018-04-30
Public speaking - FDP tech leads summit - 2018-04-30Frédéric Harper
 
Illinois Library Association Presentation: 2010
Illinois Library Association Presentation: 2010Illinois Library Association Presentation: 2010
Illinois Library Association Presentation: 2010Darren Thompson
 
[DSC Europe 23] Vladislav Belov - ChatBot Learning Assistant with Large Langu...
[DSC Europe 23] Vladislav Belov - ChatBot Learning Assistant with Large Langu...[DSC Europe 23] Vladislav Belov - ChatBot Learning Assistant with Large Langu...
[DSC Europe 23] Vladislav Belov - ChatBot Learning Assistant with Large Langu...DataScienceConferenc1
 
Integrate Metadata Creation into Oral History Production Process, Xiaoli Ma
Integrate Metadata Creation into Oral History Production Process, Xiaoli MaIntegrate Metadata Creation into Oral History Production Process, Xiaoli Ma
Integrate Metadata Creation into Oral History Production Process, Xiaoli MaVisual Resources Association
 
Top 5 algorithms used in Data Science
Top 5 algorithms used in Data ScienceTop 5 algorithms used in Data Science
Top 5 algorithms used in Data ScienceEdureka!
 
[WI 2017] Context Suggestion: Empirical Evaluations vs User Studies
[WI 2017] Context Suggestion: Empirical Evaluations vs User Studies[WI 2017] Context Suggestion: Empirical Evaluations vs User Studies
[WI 2017] Context Suggestion: Empirical Evaluations vs User StudiesYONG ZHENG
 
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...multimediaeval
 

Similar to RecSys'19 SMORe (19)

Bootstrapping Recommendations with Neo4j
Bootstrapping Recommendations with Neo4jBootstrapping Recommendations with Neo4j
Bootstrapping Recommendations with Neo4j
 
Testing Storage Systems: Methodology and Common Pitfalls
Testing Storage Systems: Methodology and Common PitfallsTesting Storage Systems: Methodology and Common Pitfalls
Testing Storage Systems: Methodology and Common Pitfalls
 
Bootstrapping Recommendations OSCON 2015
Bootstrapping Recommendations OSCON 2015Bootstrapping Recommendations OSCON 2015
Bootstrapping Recommendations OSCON 2015
 
Spell Checking in Deezer Search Engine
Spell Checking in Deezer Search EngineSpell Checking in Deezer Search Engine
Spell Checking in Deezer Search Engine
 
Data-mining the Semantic Web @TCD
Data-mining the Semantic Web @TCDData-mining the Semantic Web @TCD
Data-mining the Semantic Web @TCD
 
Big Data LDN 2017: Machine Learning on Structured Data. Why Is Learning Rules...
Big Data LDN 2017: Machine Learning on Structured Data. Why Is Learning Rules...Big Data LDN 2017: Machine Learning on Structured Data. Why Is Learning Rules...
Big Data LDN 2017: Machine Learning on Structured Data. Why Is Learning Rules...
 
TC16_Fostering_a_Successful_Tableau_Deployment
TC16_Fostering_a_Successful_Tableau_DeploymentTC16_Fostering_a_Successful_Tableau_Deployment
TC16_Fostering_a_Successful_Tableau_Deployment
 
Intro to Neo4j 2.0
Intro to Neo4j 2.0Intro to Neo4j 2.0
Intro to Neo4j 2.0
 
Linking inside a video collection
Linking inside a video collectionLinking inside a video collection
Linking inside a video collection
 
Feeding Many Birds with One Piece of Bread
Feeding Many Birds with One Piece of BreadFeeding Many Birds with One Piece of Bread
Feeding Many Birds with One Piece of Bread
 
Public speaking - FDP tech leads summit - 2018-04-30
Public speaking - FDP tech leads summit - 2018-04-30Public speaking - FDP tech leads summit - 2018-04-30
Public speaking - FDP tech leads summit - 2018-04-30
 
Illinois Library Association Presentation: 2010
Illinois Library Association Presentation: 2010Illinois Library Association Presentation: 2010
Illinois Library Association Presentation: 2010
 
[DSC Europe 23] Vladislav Belov - ChatBot Learning Assistant with Large Langu...
[DSC Europe 23] Vladislav Belov - ChatBot Learning Assistant with Large Langu...[DSC Europe 23] Vladislav Belov - ChatBot Learning Assistant with Large Langu...
[DSC Europe 23] Vladislav Belov - ChatBot Learning Assistant with Large Langu...
 
Integrate Metadata Creation into Oral History Production Process, Xiaoli Ma
Integrate Metadata Creation into Oral History Production Process, Xiaoli MaIntegrate Metadata Creation into Oral History Production Process, Xiaoli Ma
Integrate Metadata Creation into Oral History Production Process, Xiaoli Ma
 
JFfm Jessica Greschner
JFfm Jessica GreschnerJFfm Jessica Greschner
JFfm Jessica Greschner
 
Top 5 algorithms used in Data Science
Top 5 algorithms used in Data ScienceTop 5 algorithms used in Data Science
Top 5 algorithms used in Data Science
 
[WI 2017] Context Suggestion: Empirical Evaluations vs User Studies
[WI 2017] Context Suggestion: Empirical Evaluations vs User Studies[WI 2017] Context Suggestion: Empirical Evaluations vs User Studies
[WI 2017] Context Suggestion: Empirical Evaluations vs User Studies
 
Diversity (in Media)
Diversity (in Media)Diversity (in Media)
Diversity (in Media)
 
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
 

More from 志明 陳

ML Toolkit Share
ML Toolkit ShareML Toolkit Share
ML Toolkit Share志明 陳
 
CM NCCU Class2
CM NCCU Class2CM NCCU Class2
CM NCCU Class2志明 陳
 
CM NCCU Class1
CM NCCU Class1CM NCCU Class1
CM NCCU Class1志明 陳
 
CM UTaipei Kaggle Share
CM UTaipei Kaggle ShareCM UTaipei Kaggle Share
CM UTaipei Kaggle Share志明 陳
 
MLDM CM Kaggle Tips
MLDM CM Kaggle TipsMLDM CM Kaggle Tips
MLDM CM Kaggle Tips志明 陳
 
CM KaggleTW Share
CM KaggleTW ShareCM KaggleTW Share
CM KaggleTW Share志明 陳
 
KAMERA 2nd solution
KAMERA 2nd solutionKAMERA 2nd solution
KAMERA 2nd solution志明 陳
 
PIXNET Page Views 1st solution
PIXNET Page Views 1st solutionPIXNET Page Views 1st solution
PIXNET Page Views 1st solution志明 陳
 
KDD CUP 2015 - 9th solution
KDD CUP 2015 - 9th solutionKDD CUP 2015 - 9th solution
KDD CUP 2015 - 9th solution志明 陳
 

More from 志明 陳 (10)

Oop
OopOop
Oop
 
ML Toolkit Share
ML Toolkit ShareML Toolkit Share
ML Toolkit Share
 
CM NCCU Class2
CM NCCU Class2CM NCCU Class2
CM NCCU Class2
 
CM NCCU Class1
CM NCCU Class1CM NCCU Class1
CM NCCU Class1
 
CM UTaipei Kaggle Share
CM UTaipei Kaggle ShareCM UTaipei Kaggle Share
CM UTaipei Kaggle Share
 
MLDM CM Kaggle Tips
MLDM CM Kaggle TipsMLDM CM Kaggle Tips
MLDM CM Kaggle Tips
 
CM KaggleTW Share
CM KaggleTW ShareCM KaggleTW Share
CM KaggleTW Share
 
KAMERA 2nd solution
KAMERA 2nd solutionKAMERA 2nd solution
KAMERA 2nd solution
 
PIXNET Page Views 1st solution
PIXNET Page Views 1st solutionPIXNET Page Views 1st solution
PIXNET Page Views 1st solution
 
KDD CUP 2015 - 9th solution
KDD CUP 2015 - 9th solutionKDD CUP 2015 - 9th solution
KDD CUP 2015 - 9th solution
 

Recently uploaded

Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 

Recently uploaded (20)

Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 

RecSys'19 SMORe

  • 1. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity SMORe : Modularize Graph Embedding for Recommendation 1 13TH ACM CONFERENCE ON RECOMMENDER SYSTEMS COPENHAGEN, DENMARK. 16-20 SEPTEMBER 2019 TUTORIAL: GRAPH EMBEDDING (11:00-12:30) Chih-Ming Chen (CM) Ting-Hsiang Wang (Sean) Chuan-Ju Wang (Jennifer) Ming-Feng Tsai (Victor)
  • 2. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Lecture (Sean & CM, 65 minutes) Hands-on (CM, 15 minutes) Q&A (Sean & CM, 10 minutes) 2 QR to Slides, Codes, Abstract Tutorial Agenda
  • 3. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 3 DeepWalk: Online Learning of Social Representations Bryan Perozzi Stony Brook University Department of Computer Science Rami Al-Rfou Stony Brook University Department of Computer Science Steven Skiena Stony Brook University Department of Computer Science {bperozzi, ralrfou, skiena}@cs.stonybrook.edu ABSTRACT We present DeepWalk, a novel approach for learning la- tent representations of vertices in a network. These latent representations encode social relations in a continuous vector ace, which is easily exploited by statistical models. Deep- eralizes recent advancements in language mod- ised feature learning (or deep learning) graphs. obtained from trun- tions by treat- trate (a) Input: Karate Graph (b) Output: Representation Figure 1: Our proposed method learns a latent space rep- resentation of social interactions in R d . The learned rep- resentation encodes community structure so it can be eas- oited by standard classification methods. Here, our on Zachary’s Karate network [44] to gen- tion in R 2 . Note the correspon- re in the input graph and odularity-based 1 ? ? ? ?
  • 4. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Lecture Agenda 4 Q0. Recommendation (REC) and challenges Q1. Why graph embedding (GE) for REC Q2. SMORe modularization of GE and benefits Q3. Exemplar structural modeling for REC Q4. REC using SMORe (Sean) (Sean) (Sean) (CM) (CM)
  • 5. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Lecture Agenda 5 Q0. Recommendation (REC) and challenges Q1. Why graph embedding (GE) for REC Q2. SMORe modularization of GE and benefits Q3. Exemplar structural modeling for REC Q4. REC using SMORe (Sean) (Sean) (Sean) (CM) (CM)
  • 6. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 6 REC systems are everywhere ! Movie Music Book, Toy, Electronics, etc. Friend, Interest Group Recruiter, Job Seeker Local Business Trend, Following News, Ad, Media, etc. And many more …
  • 7. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity • REC center around users to provide customized results • CF assumes people agree on things are likely to agree on other things 7 Let’s start with Collaborative Filtering (CF) User Item Sam Liz Derek 5 4 3 5 4 Song A Song B Song A Song B Song C Sam 5 4 4 Derek 4 5 ? Liz - 3 ? Roger - - ? ☞ → Sam and Derek have similar tastes; therefore, if Sam likes Song C, it is likely Derek does, too
  • 8. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 8 Let’s start with Collaborative Filtering (CF) User Item Sam Liz Derek 5 4 3 5 4 Song A Song B • REC center around users to provide customized results • CF assumes people agree on things are likely to agree on other things Song A Song B Song C Sam 5 4 4 Derek 4 5 ? Liz - 3 ? Roger - - ? ☞ → Sam and Derek have similar tastes; therefore, if Sam likes Song C, it is likely Derek does, too Collaborate
  • 9. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 9 Let’s start with Collaborative Filtering (CF) User Item Sam Liz Derek 5 4 3 5 4 Song A Song B 4 Song C • REC center around users to provide customized results • CF assumes people agree on things are likely to agree on other things Song A Song B Song C Sam 5 4 4 Derek 4 5 ? Liz - 3 ? Roger - - ? ☞ → Sam and Derek have similar tastes; therefore, if Sam likes Song C, it is likely Derek does, too Collaborate Filter
  • 10. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 10 Let’s start with Collaborative Filtering (CF) User Item Sam Liz Derek 5 4 3 5 4 Song A Song B Song A Song B Song C Sam 5 4 4 Derek 4 5 ? Liz - 3 ? Roger - - ? ☞ → Sam and Derek have similar tastes; therefore, if Sam likes Song C, it is likely Derek does, too 4 Song C Collaborate Filter Recommend • REC center around users to provide customized results • CF assumes people agree on things are likely to agree on other things
  • 11. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 11 Challenge (1) : Data Sparsity User Item Sam Liz Derek 5 4 3 5 4 Song A Song B • When data sparsity occurs, it is difficult to make accurate REC with CF Song A Song B Song C Sam 5 4 4 Derek 4 5 ? Liz - 3 ? Roger - - ? ☞ → Data sparsity occurs to Liz as there is insufficient ratings from Liz to compare her taste with others’ Collaborate
  • 12. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 12 Challenge (1) : Data Sparsity User Item Sam Liz Derek 5 4 3 5 4 Song A Song B • When data sparsity occurs, it is difficult to make accurate REC with CF 4 Song C Song A Song B Song C Sam 5 4 4 Derek 4 5 ? Liz - 3 ? Roger - - ? ☞ → Data sparsity occurs to Liz as there is insufficient ratings from Liz to compare her taste with others’ Collaborate Filter
  • 13. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 13 Challenge (1) : Data Sparsity User Item Sam Liz Derek 5 4 3 5 4 Song A Song B • When data sparsity occurs, it is difficult to make accurate REC with CF 4 Song C Song A Song B Song C Sam 5 4 4 Derek 4 5 ? Liz - 3 ? Roger - - ? ☞ → Data sparsity occurs to Liz as there is insufficient ratings from Liz to compare her taste with others’ Collaborate Filter Recommend
  • 14. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 14 Challenge (2) : Cold Start User Item Sam Liz Derek 5 4 3 5 4 Song A Song B • When cold start occurs, it is impossible to conduct CF since there is no context
 for comparison Song A Song B Song C Sam 5 4 4 Derek 4 5 ? Liz - 3 ? Roger - - ?☞ → Cold Start occurs to Roger as there is NO song rating from Roger to compare his taste with others’ Roger ? ? ?
  • 15. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 15 Challenge (3) : Constant Cold Start • Some Cold Start situations are constant, i.e., never warm up • E.g., event tickets are sold before user-event interactions can occur November 2, 2019 November 16, 2019 October 16, 2019 → Today is September 19, 2019
  • 16. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 16
  • 17. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 17 Common Solution : Add Attributes → REC systems often mitigate Complete Cold Start problems by requiring new users to specify interests and items be labeled using tags → By requiring user interests and item tags to come from predefined label set, users and items share context for comparison
  • 18. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 18 To enrich context for comparison … Derek User Item Sam Liz 5 4 3 5 4 Song A Song B • For additional context for comparison, REC systems often borrow auxiliary
 information, such as item content and user and item attributes • This forms a Heterogeneous Information Network (HIN)
  • 19. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 19 Derek User Item Attribute Sam Liz Singer A 5 4 3 5 4 Song A Song B Metal Rock Genre, Artist • For additional context for comparison, REC systems often borrow auxiliary
 information, such as item content and user and item attributes • This forms a Heterogeneous Information Network (HIN) Add auxiliary information … → Item attributes are added to improve item comparison, e.g., music genre,
 artist, metadata, etc.
  • 20. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 20 Metal User Item AttributeAttribute Sam Liz Rock Working Student Singer A 5 4 3 5 4 Song A Song B Derek Social NetworkOccupation • For additional context for comparison, REC systems often borrow auxiliary
 information, such as item content and user and item attributes • This forms a Heterogeneous Information Network (HIN) Genre, Artist And more auxiliary information ! → Item attributes are added to improve item comparison, e.g., music genre,
 artist, metadata, etc.
 → User attributes are added to improve user comparison, e.g., social network,
 occupation, favorite artist & genre, etc.
  • 21. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 21 Metal • For additional context for comparison, REC systems often borrow auxiliary
 information, such as item content and user and item attributes • This forms a Heterogeneous Information Network (HIN) User Item AttributeAttribute Sam Liz Rock Working Student Singer A 5 4 3 5 4 Song A Song B → Item attributes are added to improve item comparison, e.g., music genre,
 artist, metadata, etc.
 → User attributes are added to improve user comparison, e.g., social network,
 occupation, favorite artist & genre, etc. → HIN gives holistic view of complex systems Derek Graph coordinates all relations !
  • 22. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Lecture Agenda 22 Q0. Recommendation (REC) and challenges Q1. Why graph embedding (GE) for REC Q2. SMORe modularization of GE and benefits Q3. Exemplar structural modeling for REC Q4. REC using SMORe (Sean) (Sean) (Sean) (CM) (CM)
  • 23. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 23 Why Graph ? 1. Universal language for describing complex data [1] • Many fields have all chosen graph to depict entity interactions Twitter Ego-Network Delta Airline Route Map NetworkCore Sound Food Web
  • 24. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 24 1. Universal language for describing complex data [1] • Many fields have all chosen graph to depict entity interactions 2. Shared vocabulary (therefore ideas) between fields [1] • E.g., distributional hypothesis in linguistics vs. CF in REC systems User Item Why Graph ?
  • 25. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 25 1. Universal language for describing complex data [1] • Many fields have all chosen graph to depict entity interactions 2. Shared vocabulary (therefore ideas) between fields [1] • E.g., distributional hypothesis in linguistics vs. CF in REC systems 3. Holistic view of complex systems of interactions • Different domains can easily connect and be jointly mined User Item Attribute → HINs easily integrates information from different domains Why Graph ?
  • 26. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 26 1. Universal language for describing complex data [1] • Many fields have all chosen graph to depict entity interactions 2. Shared vocabulary (therefore ideas) between fields [1] • E.g., distributional hypothesis in linguistics vs. CF in REC systems 3. Holistic view of complex systems of interactions • Different domains can easily connect and be jointly mined 4. General definition of contexts used for entity comparison • Model graph structure instead specific types of relations User Item Attribute → Different REC approaches, similar structure CF CBF Why Graph ?
  • 27. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 1. Universal language for describing complex data [1] • Many fields have all chosen graph to depict entity interactions 2. Shared vocabulary (therefore ideas) between fields [1] • E.g., distributional hypothesis in linguistics vs. CF in REC systems 3. Holistic view of complex systems of interactions • Different domains can easily connect and be jointly mined 4. General definition of contexts used for entity comparison • Model graph structure instead specific types of relations 5. Sophisticated models available from network-related researches • Network schema, meta-path, subgraph matching, information propagation, etc. 27 (a) ACM data (b) DBLP data Figure 3: Examples of bibliographic network schema. distinct semantics under different paths will lead to different relatedness. The relatedness under APVC path emphasizes on the conferences that authors participated, while the relat- edness under APSPVC path emphasizes on conferences pub- lishing the papers that have the same subjects with authors’ papers. For example, assume most papers of an author are published in the KDD, SIGMOD, and VLDB. However, the papers having the same subjects with the author’s papers may be published in more wide conferences, such as ICDM, Figu In m jects lar p As a ence gene to th resea man mor Star-schema [2] Meta-path [3] Why Graph ?
  • 28. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 1. Universal language for describing complex data [1] • Many fields have all chosen graph to depict entity interactions 2. Shared vocabulary (therefore ideas) between fields [1] • E.g., distributional hypothesis in linguistics vs. CF in REC systems 3. Holistic view of complex systems of interactions • Different domains can easily connect and be jointly mined 4. General definition of contexts used for entity comparison • Model graph structure instead specific types of relations 5. Sophisticated models available from network-related researches • Network schema, meta-path, subgraph matching, information propagation, etc. 28 Why Graph ?
  • 29. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 29 REC as Link Prediction on Graphs ? ? ? ? ✘ ✔ ✔ ✘ Machine Learning Algorithm • Many ways to compare node similarity: entity types, shared neighbors, distance, etc. • Early REC models, e.g., CF and CBF, define intuitive relations for similarity measurement • As models mature and data diversity skyrockets nowadays, designing features
 for REC becomes increasingly challenging
  • 30. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 30 Graph Embedding Pipeline Hidden layer Input layer Output layer Supply Graph Return Embeddings (from hidden Layer)Train Neural Network eepWalk: Online Learning of Social Representations Bryan Perozzi Stony Brook University Department of Computer Science Rami Al-Rfou Stony Brook University Department of Computer Science Steven Skiena Stony Brook University Department of Computer Science {bperozzi, ralrfou, skiena}@cs.stonybrook.edu ABSTRACT We present DeepWalk, a novel approach for learning la- tent representations of vertices in a network. These latent representations encode social relations in a continuous vector space, which is easily exploited by statistical models. Deep- k generalizes recent advancements in language mod- supervised feature learning (or deep learning) ds to graphs. mation obtained from trun- esentations by treat- emonstrate el (a) Input: Karate Graph (b) Output: Representation Figure 1: Our proposed method learns a latent space rep- resentation of social interactions in R d . The learned rep- resentation encodes community structure so it can be eas- ly exploited by standard classification methods. Here, our used on Zachary’s Karate network [44] to gen- resentation in R 2 . Note the correspon- tructure in the input graph and ent a modularity-based Rami Al-Rfou y Brook University tment of Computer Science Steven Skiena Stony Brook University Department of Computer Science u, skiena}@cs.stonybrook.edu ing la- latent vector Deep- e mod- rning) m trun- y treat- nstrate ti-label s Blog- Deep- allowed ence of 1 2 3 4 5 6 7 8 9 11 12 13 14 18 20 22 32 31 10 28 29 33 17 34 15 16 19 21 23 24 26 30 25 27 1 2 3 4 5 6 7 8 9 11 12 13 14 18 20 22 32 31 10 28 29 33 17 34 15 16 19 21 23 24 26 30 25 27 (a) Input: Karate Graph (b) Output: Representation Figure 1: Our proposed method learns a latent space rep- resentation of social interactions in Rd . The learned rep- resentation encodes community structure so it can be eas- ily exploited by standard classification methods. Here, our method is used on Zachary’s Karate network [44] to gen- erate a latent representation in R2 . Note the correspon- dence between community structure in the input graph and • GE for REC exploits observed links as graph structures to predict unobserved links • Entities are converted into spatial features (embeddings) in the hidden layer and
 iteratively updated such that their interactions approximate values in the output layer • Entity relatedness is preserved as spatial properties, e.g., distance and angle
  • 31. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 31 Why Embedding ? 1. Efficient retrieval from approximate nearest neighbor (ANN) search methods • E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining
 a binary tree of subdivisions such that good enough results can be found in [4] ❌Query O(log n)<latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit>
  • 32. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 32 1. Efficient retrieval from approximate nearest neighbor (ANN) search methods • E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining
 a binary tree of subdivisions such that good enough results can be found in [4] → Brute force exhaustive search finds the closest nearest neighbors in ❌Query O(log n)<latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit> Why Embedding ? O(log n!)<latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit>
  • 33. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 33 1. Efficient retrieval from approximate nearest neighbor (ANN) search methods • E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining
 a binary tree of subdivisions such that good enough results can be found in [4] ❌Query O(log n)<latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit> Why Embedding ? → Brute force exhaustive search finds the closest nearest neighbors in O(log n!)<latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit><latexit sha1_base64="ZUjF3udMHzVYQ9q9OFGt0N8yXiQ=">AAAB+3icbZC7TsMwFIZPyq2UWygji6FBKkuVdIGxEgsbRaIXqY0qx3Vaq44T2Q6iivoqLAwgxMqLsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLKPy20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJObeb3zSKVisXjQ04T6ER4JFjKCtbEGdrkfhJlz51R5PEKOOHMuZwO74tbchdA6eDlUIFdzYH/1hzFJIyo04Vipnucm2s+w1IxwOiv1U0UTTCZ4RHsGBY6o8rPF7TN0YZwhCmNpntBo4f6eyHCk1DQKTGeE9Vit1ubmf7VeqsNrP2MiSTUVZLkoTDnSMZoHgYZMUqL51AAmkplbERljiYk2cZVMCN7ql9ehXa95hu/rlUYjj6MIp3AOVfDgChpwC01oAYEneIZXeLNm1ov1bn0sWwtWPnMCf2R9/gA8KZKY</latexit>
  • 34. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 34 1. Efficient retrieval from approximate nearest neighbor (ANN) search methods • E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining
 a binary tree of subdivisions such that good enough results can be found in [4] ❌Query → Spotify ANNOY builds binary tree of subdivisions to quickly find closest neighbors O(log n)<latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit> Why Embedding ?
  • 35. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 35 1. Efficient retrieval from approximate nearest neighbor (ANN) search methods • E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining
 a binary tree of subdivisions such that good enough results can be found in [4] ❌Query O(log n)<latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit><latexit sha1_base64="XMGJBZt/opmMHuBWeobJ4tunTBM=">AAAB+nicbZC7TsMwFIZPyq2UWwoji0WDVJYq6QJjJRY2ikQvUhtVjuu0Vh0nsh1QFfooLAwgxMqTsPE2uG0GaPklS5/+c47O8R8knCntut9WYWNza3unuFva2z84PLLLx20Vp5LQFol5LLsBVpQzQVuaaU67iaQ4CjjtBJPreb3zQKVisbjX04T6ER4JFjKCtbEGdrkfhJlz61R5PEKOcC5mA7vi1tyF0Dp4OVQgV3Ngf/WHMUkjKjThWKme5ybaz7DUjHA6K/VTRRNMJnhEewYFjqjys8XpM3RunCEKY2me0Gjh/p7IcKTUNApMZ4T1WK3W5uZ/tV6qwys/YyJJNRVkuShMOdIxmueAhkxSovnUACaSmVsRGWOJiTZplUwI3uqX16Fdr3mG7+qVRiOPowincAZV8OASGnADTWgBgUd4hld4s56sF+vd+li2Fqx85gT+yPr8AeA+km0=</latexit> Why Embedding ? → Spotify ANNOY builds binary tree of subdivisions to quickly find closest neighbors
  • 36. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 36 1. Efficient retrieval from approximate nearest neighbor (ANN) search methods • E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining
 a binary tree of subdivisions such that good enough results can be found in [4] 2. Efficient pairwise comparison due to dimensionality reduction (DR) • Lower dimension costs less to calculate feature similarity O(log n)<latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit> Why Embedding ?
  • 37. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 37 1. Efficient retrieval from approximate nearest neighbor (ANN) search methods • E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining
 a binary tree of subdivisions such that good enough results can be found in [4] 2. Efficient pairwise comparison due to dimensionality reduction (DR) • Lower dimension costs less to calculate feature similarity 3. Reduced space complexity due to DR • Lower dimension costs less storage space O(log n)<latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit> Why Embedding ?
  • 38. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 38 1. Efficient retrieval from approximate nearest neighbor (ANN) search methods • E.g., Spotify ANNOY reduces curse of dimensionality during online search by maintaining
 a binary tree of subdivisions such that good enough results can be found in [4] 2. Efficient pairwise comparison due to dimensionality reduction (DR) • Lower dimension costs less to calculate feature similarity 3. Reduced space complexity due to DR • Lower dimension costs less storage space 4. Transfer learning with pertained embeddings • Pretrained embeddings are better guesses than randomly initialized vectors O(log n)<latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit><latexit sha1_base64="CpYTxtDU/JE+emzgwPLx6uI+9ck=">AAAB+nicdVDLTsJAFL3FF+Kr6NLNRGqCG9KyUJckbtyJiYAJNGQ6TGHCdNrMTDWk8iluXGiMW7/EnX/jFDDxeZKbe3LOvZk7J0g4U9p1363C0vLK6lpxvbSxubW9Y5d32ypOJaEtEvNYXgdYUc4EbWmmOb1OJMVRwGknGJ/lfueGSsVicaUnCfUjPBQsZARrI/Xtci8IM+fCqfJ4iBzhHE37dsWtHbs50G/i1WbdrcACzb791hvEJI2o0IRjpbqem2g/w1Izwum01EsVTTAZ4yHtGipwRJWfzU6fokOjDFAYS1NCo5n6dSPDkVKTKDCTEdYj9dPLxb+8bqrDUz9jIkk1FWT+UJhypGOU54AGTFKi+cQQTCQztyIywhITbdIqmRA+f4r+J+16zTP8sl5pNBZxFGEfDqAKHpxAA86hCS0gcAv38AhP1p31YD1bL/PRgrXY2YNvsF4/APp4kn8=</latexit> Why Embedding ?
  • 39. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 39 GE for REC : Challenges • So graph embedding is GREAT for recommendation : • Reduces data sparsity and cold start via integrating auxiliary information • Provides holistic view of REC problem and jointly mines different relations in terms of graph structures • Trains fast, compares fast, and retrieves fast while taking less space
  • 40. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 40 • So graph embedding is GREAT for recommendation : • Reduces data sparsity and cold start via integrating auxiliary information • Provides holistic view of REC problem and jointly mines different relations in terms of graph structures • Trains fast, compares fast, and retrieves fast while taking less space
 • What’s the catch ? GE for REC : Challenges
  • 41. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Lecture Agenda 41 Q0. Recommendation (REC) and challenges Q1. Why graph embedding (GE) for REC Q2. SMORe modularization of GE and benefits Q3. Exemplar structural modeling for REC Q4. REC using SMORe (Sean) (Sean) (Sean) (CM) (CM)
  • 42. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 42 Embedding space Let’s look at GE process …
  • 43. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Let’s look at GE process … 43 … Graph structures Embedding space
  • 44. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Let’s look at GE process … … 44 Samples relation A B BA Graph structures A B Embedding space
  • 45. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Let’s look at GE process … … B A 45 Samples relation A B BA Graph structures Maps entities to space Embedding space
  • 46. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Let’s look at GE process … … 46 Samples relation A B BA Embedding space Graph structures A B Maps entities to space Optimizes distance
  • 47. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Challenge (1) : Select structure is HARD A B 47 Embedding space Graph structures Maps entities to spaceB Samples relation A BA … → Unsure which graph structures best model current REC task Optimizes distance
  • 48. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Challenge (1) : Select structure is HARD A B 48 Embedding space Graph structures Maps entities to spaceB Samples relation A BA … 5 8 7 6 2 3 1 4 In bipartite, model & by neighborhood is intuitive; but in planar, & are also shown similar in betweenness g j 5 7 → Unsure which graph structures best model current REC task Optimizes distance
  • 49. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Challenge (1) : Select structure is HARD A B 49 Embedding space Graph structures Maps entities to spaceB Samples relation A BA … 5 8 7 6 2 3 1 4 In bipartite, model & by neighborhood is intuitive; but in planar, & are also shown similar in betweenness g j 5 7 → Unsure which graph structures best model current REC task Optimizes distance
  • 50. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Challenge (2) : Customize GE model A B 50 Embedding space Graph structures Maps entities to spaceB Samples relation A BA … → Wanna customize GE methods during each stage of training → Unsure which graph structures best model current REC task Optimizes distance
  • 51. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Challenge (3) : Make fair comparison A B 51 Embedding space Graph structures Maps entities to spaceB Samples relation A BA … → Wanna customize GE methods during each stage of training → Difficult to compare models as their implementations often vary → Unsure which graph structures best model current REC task Optimizes distance
  • 52. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Solution : Modularize GE for adaptability ! 52 … Samples relation BA Embedding space Graph structures A B Maps entities to space Optimizes distance
  • 53. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity … Samples relation BA Embedding space Graph structures A B Maps entities to space Optimizes distance 53 Sampler Mapper Optimizer Solution : Modularize GE for adaptability !
  • 54. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Solution : Modularize GE for adaptability ! A B 54 … Samples relation BA Embedding space Graph structures Maps entities to space Optimizes distance Sampler Mapper Optimizer •Extracts graph structures from dataset while remains type-agnostic to sampled entities, i.e., nodes & edges E : {(v1, v2)|(v1, v2) 2 V ⇥ V }<latexit sha1_base64="Tguco8yNKzLzF1xI2NN30+8zm+U=">AAACIHicbZBNS8NAEIY39avWr6hHL4utUEFK0kvFU1EEjxXsBzQhbLabdulmE3Y3hRLzU7z4V7x4UERv+mvctjlodWDZh3dmmJnXjxmVyrI+jcLK6tr6RnGztLW9s7tn7h90ZJQITNo4YpHo+UgSRjlpK6oY6cWCoNBnpOuPr2b57oQISSN+p6YxcUM05DSgGCkteWajcn3hpNWJl9rZGdRfPTuF93BJcCiHHegoGhI5g6zimWWrZs0D/gU7hzLIo+WZH84gwklIuMIMSdm3rVi5KRKKYkaykpNIEiM8RkPS18iRHuWm8wMzeKKVAQwioR9XcK7+7EhRKOU09HVliNRILudm4n+5fqKCczelPE4U4XgxKEgYVBGcuQUHVBCs2FQDwoLqXSEeIYGw0p6WtAn28sl/oVOv2Zpv6+XmZW5HERyBY1AFNmiAJrgBLdAGGDyAJ/ACXo1H49l4M94XpQUj7zkEv8L4+gaxJqAz</latexit><latexit sha1_base64="Tguco8yNKzLzF1xI2NN30+8zm+U=">AAACIHicbZBNS8NAEIY39avWr6hHL4utUEFK0kvFU1EEjxXsBzQhbLabdulmE3Y3hRLzU7z4V7x4UERv+mvctjlodWDZh3dmmJnXjxmVyrI+jcLK6tr6RnGztLW9s7tn7h90ZJQITNo4YpHo+UgSRjlpK6oY6cWCoNBnpOuPr2b57oQISSN+p6YxcUM05DSgGCkteWajcn3hpNWJl9rZGdRfPTuF93BJcCiHHegoGhI5g6zimWWrZs0D/gU7hzLIo+WZH84gwklIuMIMSdm3rVi5KRKKYkaykpNIEiM8RkPS18iRHuWm8wMzeKKVAQwioR9XcK7+7EhRKOU09HVliNRILudm4n+5fqKCczelPE4U4XgxKEgYVBGcuQUHVBCs2FQDwoLqXSEeIYGw0p6WtAn28sl/oVOv2Zpv6+XmZW5HERyBY1AFNmiAJrgBLdAGGDyAJ/ACXo1H49l4M94XpQUj7zkEv8L4+gaxJqAz</latexit><latexit sha1_base64="Tguco8yNKzLzF1xI2NN30+8zm+U=">AAACIHicbZBNS8NAEIY39avWr6hHL4utUEFK0kvFU1EEjxXsBzQhbLabdulmE3Y3hRLzU7z4V7x4UERv+mvctjlodWDZh3dmmJnXjxmVyrI+jcLK6tr6RnGztLW9s7tn7h90ZJQITNo4YpHo+UgSRjlpK6oY6cWCoNBnpOuPr2b57oQISSN+p6YxcUM05DSgGCkteWajcn3hpNWJl9rZGdRfPTuF93BJcCiHHegoGhI5g6zimWWrZs0D/gU7hzLIo+WZH84gwklIuMIMSdm3rVi5KRKKYkaykpNIEiM8RkPS18iRHuWm8wMzeKKVAQwioR9XcK7+7EhRKOU09HVliNRILudm4n+5fqKCczelPE4U4XgxKEgYVBGcuQUHVBCs2FQDwoLqXSEeIYGw0p6WtAn28sl/oVOv2Zpv6+XmZW5HERyBY1AFNmiAJrgBLdAGGDyAJ/ACXo1H49l4M94XpQUj7zkEv8L4+gaxJqAz</latexit><latexit sha1_base64="Tguco8yNKzLzF1xI2NN30+8zm+U=">AAACIHicbZBNS8NAEIY39avWr6hHL4utUEFK0kvFU1EEjxXsBzQhbLabdulmE3Y3hRLzU7z4V7x4UERv+mvctjlodWDZh3dmmJnXjxmVyrI+jcLK6tr6RnGztLW9s7tn7h90ZJQITNo4YpHo+UgSRjlpK6oY6cWCoNBnpOuPr2b57oQISSN+p6YxcUM05DSgGCkteWajcn3hpNWJl9rZGdRfPTuF93BJcCiHHegoGhI5g6zimWWrZs0D/gU7hzLIo+WZH84gwklIuMIMSdm3rVi5KRKKYkaykpNIEiM8RkPS18iRHuWm8wMzeKKVAQwioR9XcK7+7EhRKOU09HVliNRILudm4n+5fqKCczelPE4U4XgxKEgYVBGcuQUHVBCs2FQDwoLqXSEeIYGw0p6WtAn28sl/oVOv2Zpv6+XmZW5HERyBY1AFNmiAJrgBLdAGGDyAJ/ACXo1H49l4M94XpQUj7zkEv8L4+gaxJqAz</latexit> : E ! R<latexit sha1_base64="j+ipPQXWS0Htqsw+lExijt/CRa0=">AAACAXicbZDLSgMxFIYz9VbrbdSN4CbYCq7KTDeKq6IILqvYC3SGkkkzndBMMiQZpQx146u4caGIW9/CnW9j2s5CW38IfPznHE7OHySMKu0431ZhaXllda24XtrY3NresXf3WkqkEpMmFkzIToAUYZSTpqaakU4iCYoDRtrB8HJSb98Tqajgd3qUED9GA05DipE2Vs8+qHhJROE5vIKepINIIynFA7yt9OyyU3Wmgovg5lAGuRo9+8vrC5zGhGvMkFJd10m0nyGpKWZkXPJSRRKEh2hAugY5ionys+kFY3hsnD4MhTSPazh1f09kKFZqFAemM0Y6UvO1iflfrZvq8MzPKE9STTieLQpTBrWAkzhgn0qCNRsZQFhS81eIIyQR1ia0kgnBnT95EVq1qmv4plauX+RxFMEhOAInwAWnoA6uQQM0AQaP4Bm8gjfryXqx3q2PWWvBymf2wR9Znz87EpV0</latexit><latexit sha1_base64="j+ipPQXWS0Htqsw+lExijt/CRa0=">AAACAXicbZDLSgMxFIYz9VbrbdSN4CbYCq7KTDeKq6IILqvYC3SGkkkzndBMMiQZpQx146u4caGIW9/CnW9j2s5CW38IfPznHE7OHySMKu0431ZhaXllda24XtrY3NresXf3WkqkEpMmFkzIToAUYZSTpqaakU4iCYoDRtrB8HJSb98Tqajgd3qUED9GA05DipE2Vs8+qHhJROE5vIKepINIIynFA7yt9OyyU3Wmgovg5lAGuRo9+8vrC5zGhGvMkFJd10m0nyGpKWZkXPJSRRKEh2hAugY5ionys+kFY3hsnD4MhTSPazh1f09kKFZqFAemM0Y6UvO1iflfrZvq8MzPKE9STTieLQpTBrWAkzhgn0qCNRsZQFhS81eIIyQR1ia0kgnBnT95EVq1qmv4plauX+RxFMEhOAInwAWnoA6uQQM0AQaP4Bm8gjfryXqx3q2PWWvBymf2wR9Znz87EpV0</latexit><latexit sha1_base64="j+ipPQXWS0Htqsw+lExijt/CRa0=">AAACAXicbZDLSgMxFIYz9VbrbdSN4CbYCq7KTDeKq6IILqvYC3SGkkkzndBMMiQZpQx146u4caGIW9/CnW9j2s5CW38IfPznHE7OHySMKu0431ZhaXllda24XtrY3NresXf3WkqkEpMmFkzIToAUYZSTpqaakU4iCYoDRtrB8HJSb98Tqajgd3qUED9GA05DipE2Vs8+qHhJROE5vIKepINIIynFA7yt9OyyU3Wmgovg5lAGuRo9+8vrC5zGhGvMkFJd10m0nyGpKWZkXPJSRRKEh2hAugY5ionys+kFY3hsnD4MhTSPazh1f09kKFZqFAemM0Y6UvO1iflfrZvq8MzPKE9STTieLQpTBrWAkzhgn0qCNRsZQFhS81eIIyQR1ia0kgnBnT95EVq1qmv4plauX+RxFMEhOAInwAWnoA6uQQM0AQaP4Bm8gjfryXqx3q2PWWvBymf2wR9Znz87EpV0</latexit><latexit sha1_base64="j+ipPQXWS0Htqsw+lExijt/CRa0=">AAACAXicbZDLSgMxFIYz9VbrbdSN4CbYCq7KTDeKq6IILqvYC3SGkkkzndBMMiQZpQx146u4caGIW9/CnW9j2s5CW38IfPznHE7OHySMKu0431ZhaXllda24XtrY3NresXf3WkqkEpMmFkzIToAUYZSTpqaakU4iCYoDRtrB8HJSb98Tqajgd3qUED9GA05DipE2Vs8+qHhJROE5vIKepINIIynFA7yt9OyyU3Wmgovg5lAGuRo9+8vrC5zGhGvMkFJd10m0nyGpKWZkXPJSRRKEh2hAugY5ionys+kFY3hsnD4MhTSPazh1f09kKFZqFAemM0Y6UvO1iflfrZvq8MzPKE9STTieLQpTBrWAkzhgn0qCNRsZQFhS81eIIyQR1ia0kgnBnT95EVq1qmv4plauX+RxFMEhOAInwAWnoA6uQQM0AQaP4Bm8gjfryXqx3q2PWWvBymf2wR9Znz87EpV0</latexit> G = (V, E, R, )<latexit sha1_base64="hfH2qiOkdhHaLO2qsmwczf37cp4=">AAAB/nicbZDLSgMxFIbPeK31Niqu3ARboUIpM93oRiiK6LKKvUA7lEyatqGZzJBkhDIUfBU3LhRx63O4821M21lo6w+Bj/+cwzn5/YgzpR3n21paXlldW89sZDe3tnd27b39ugpjSWiNhDyUTR8rypmgNc00p81IUhz4nDb84dWk3nikUrFQPOhRRL0A9wXrMYK1sTr2Yf4GXaBCvYiui+i+iNrRgJ3mO3bOKTlToUVwU8hBqmrH/mp3QxIHVGjCsVIt14m0l2CpGeF0nG3HikaYDHGftgwKHFDlJdPzx+jEOF3UC6V5QqOp+3siwYFSo8A3nQHWAzVfm5j/1Vqx7p17CRNRrKkgs0W9mCMdokkWqMskJZqPDGAimbkVkQGWmGiTWNaE4M5/eRHq5ZJr+K6cq1ymcWTgCI6hAC6cQQVuoQo1IJDAM7zCm/VkvVjv1sesdclKZw7gj6zPH/r0kkY=</latexit><latexit sha1_base64="hfH2qiOkdhHaLO2qsmwczf37cp4=">AAAB/nicbZDLSgMxFIbPeK31Niqu3ARboUIpM93oRiiK6LKKvUA7lEyatqGZzJBkhDIUfBU3LhRx63O4821M21lo6w+Bj/+cwzn5/YgzpR3n21paXlldW89sZDe3tnd27b39ugpjSWiNhDyUTR8rypmgNc00p81IUhz4nDb84dWk3nikUrFQPOhRRL0A9wXrMYK1sTr2Yf4GXaBCvYiui+i+iNrRgJ3mO3bOKTlToUVwU8hBqmrH/mp3QxIHVGjCsVIt14m0l2CpGeF0nG3HikaYDHGftgwKHFDlJdPzx+jEOF3UC6V5QqOp+3siwYFSo8A3nQHWAzVfm5j/1Vqx7p17CRNRrKkgs0W9mCMdokkWqMskJZqPDGAimbkVkQGWmGiTWNaE4M5/eRHq5ZJr+K6cq1ymcWTgCI6hAC6cQQVuoQo1IJDAM7zCm/VkvVjv1sesdclKZw7gj6zPH/r0kkY=</latexit><latexit sha1_base64="hfH2qiOkdhHaLO2qsmwczf37cp4=">AAAB/nicbZDLSgMxFIbPeK31Niqu3ARboUIpM93oRiiK6LKKvUA7lEyatqGZzJBkhDIUfBU3LhRx63O4821M21lo6w+Bj/+cwzn5/YgzpR3n21paXlldW89sZDe3tnd27b39ugpjSWiNhDyUTR8rypmgNc00p81IUhz4nDb84dWk3nikUrFQPOhRRL0A9wXrMYK1sTr2Yf4GXaBCvYiui+i+iNrRgJ3mO3bOKTlToUVwU8hBqmrH/mp3QxIHVGjCsVIt14m0l2CpGeF0nG3HikaYDHGftgwKHFDlJdPzx+jEOF3UC6V5QqOp+3siwYFSo8A3nQHWAzVfm5j/1Vqx7p17CRNRrKkgs0W9mCMdokkWqMskJZqPDGAimbkVkQGWmGiTWNaE4M5/eRHq5ZJr+K6cq1ymcWTgCI6hAC6cQQVuoQo1IJDAM7zCm/VkvVjv1sesdclKZw7gj6zPH/r0kkY=</latexit><latexit sha1_base64="hfH2qiOkdhHaLO2qsmwczf37cp4=">AAAB/nicbZDLSgMxFIbPeK31Niqu3ARboUIpM93oRiiK6LKKvUA7lEyatqGZzJBkhDIUfBU3LhRx63O4821M21lo6w+Bj/+cwzn5/YgzpR3n21paXlldW89sZDe3tnd27b39ugpjSWiNhDyUTR8rypmgNc00p81IUhz4nDb84dWk3nikUrFQPOhRRL0A9wXrMYK1sTr2Yf4GXaBCvYiui+i+iNrRgJ3mO3bOKTlToUVwU8hBqmrH/mp3QxIHVGjCsVIt14m0l2CpGeF0nG3HikaYDHGftgwKHFDlJdPzx+jEOF3UC6V5QqOp+3siwYFSo8A3nQHWAzVfm5j/1Vqx7p17CRNRrKkgs0W9mCMdokkWqMskJZqPDGAimbkVkQGWmGiTWNaE4M5/eRHq5ZJr+K6cq1ymcWTgCI6hAC6cQQVuoQo1IJDAM7zCm/VkvVjv1sesdclKZw7gj6zPH/r0kkY=</latexit>
  • 55. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Solution : Modularize GE for adaptability ! 55 … Samples relation BA Embedding space Graph structures A B Maps entities to space Sampler Mapper Optimizer •Converts entities into spatial features via embedding stacking operations,
 e.g, lookup, pooling (average, etc.) f : (p) ! Rd <latexit sha1_base64="VmJxJ04sPmQqHhZa1Uz3GKOc0Ps=">AAACEnicbZC7TsMwFIadcivlFmBksWiR2qVKuoCYKlgYC6IXqQmV4zqtVSeObAdURXkGFl6FhQGEWJnYeBucNgO0/JKlT/85Rz7n9yJGpbKsb6Owsrq2vlHcLG1t7+zumfsHHcljgUkbc8ZFz0OSMBqStqKKkV4kCAo8Rrre5DKrd++JkJSHt2oaETdAo5D6FCOlrYFZczw/qfjnsBrVoCPoaKyQEPwBOgFSY89LbtK7ZJjCSjowy1bdmgkug51DGeRqDcwvZ8hxHJBQYYak7NtWpNwECUUxI2nJiSWJEJ6gEelrDFFApJvMTkrhiXaG0OdCv1DBmft7IkGBlNPA053ZonKxlpn/1fqx8s/chIZRrEiI5x/5MYOKwywfOKSCYMWmGhAWVO8K8RgJhJVOsaRDsBdPXoZOo25rvm6Umxd5HEVwBI5BFdjgFDTBFWiBNsDgETyDV/BmPBkvxrvxMW8tGPnMIfgj4/MH38+c+w==</latexit><latexit sha1_base64="VmJxJ04sPmQqHhZa1Uz3GKOc0Ps=">AAACEnicbZC7TsMwFIadcivlFmBksWiR2qVKuoCYKlgYC6IXqQmV4zqtVSeObAdURXkGFl6FhQGEWJnYeBucNgO0/JKlT/85Rz7n9yJGpbKsb6Owsrq2vlHcLG1t7+zumfsHHcljgUkbc8ZFz0OSMBqStqKKkV4kCAo8Rrre5DKrd++JkJSHt2oaETdAo5D6FCOlrYFZczw/qfjnsBrVoCPoaKyQEPwBOgFSY89LbtK7ZJjCSjowy1bdmgkug51DGeRqDcwvZ8hxHJBQYYak7NtWpNwECUUxI2nJiSWJEJ6gEelrDFFApJvMTkrhiXaG0OdCv1DBmft7IkGBlNPA053ZonKxlpn/1fqx8s/chIZRrEiI5x/5MYOKwywfOKSCYMWmGhAWVO8K8RgJhJVOsaRDsBdPXoZOo25rvm6Umxd5HEVwBI5BFdjgFDTBFWiBNsDgETyDV/BmPBkvxrvxMW8tGPnMIfgj4/MH38+c+w==</latexit><latexit sha1_base64="VmJxJ04sPmQqHhZa1Uz3GKOc0Ps=">AAACEnicbZC7TsMwFIadcivlFmBksWiR2qVKuoCYKlgYC6IXqQmV4zqtVSeObAdURXkGFl6FhQGEWJnYeBucNgO0/JKlT/85Rz7n9yJGpbKsb6Owsrq2vlHcLG1t7+zumfsHHcljgUkbc8ZFz0OSMBqStqKKkV4kCAo8Rrre5DKrd++JkJSHt2oaETdAo5D6FCOlrYFZczw/qfjnsBrVoCPoaKyQEPwBOgFSY89LbtK7ZJjCSjowy1bdmgkug51DGeRqDcwvZ8hxHJBQYYak7NtWpNwECUUxI2nJiSWJEJ6gEelrDFFApJvMTkrhiXaG0OdCv1DBmft7IkGBlNPA053ZonKxlpn/1fqx8s/chIZRrEiI5x/5MYOKwywfOKSCYMWmGhAWVO8K8RgJhJVOsaRDsBdPXoZOo25rvm6Umxd5HEVwBI5BFdjgFDTBFWiBNsDgETyDV/BmPBkvxrvxMW8tGPnMIfgj4/MH38+c+w==</latexit><latexit sha1_base64="VmJxJ04sPmQqHhZa1Uz3GKOc0Ps=">AAACEnicbZC7TsMwFIadcivlFmBksWiR2qVKuoCYKlgYC6IXqQmV4zqtVSeObAdURXkGFl6FhQGEWJnYeBucNgO0/JKlT/85Rz7n9yJGpbKsb6Owsrq2vlHcLG1t7+zumfsHHcljgUkbc8ZFz0OSMBqStqKKkV4kCAo8Rrre5DKrd++JkJSHt2oaETdAo5D6FCOlrYFZczw/qfjnsBrVoCPoaKyQEPwBOgFSY89LbtK7ZJjCSjowy1bdmgkug51DGeRqDcwvZ8hxHJBQYYak7NtWpNwECUUxI2nJiSWJEJ6gEelrDFFApJvMTkrhiXaG0OdCv1DBmft7IkGBlNPA053ZonKxlpn/1fqx8s/chIZRrEiI5x/5MYOKwywfOKSCYMWmGhAWVO8K8RgJhJVOsaRDsBdPXoZOo25rvm6Umxd5HEVwBI5BFdjgFDTBFWiBNsDgETyDV/BmPBkvxrvxMW8tGPnMIfgj4/MH38+c+w==</latexit> f : (p, q) ! Rd <latexit sha1_base64="w+fc6rCobrgHcpaEWRxxo2tpDbs=">AAACFHicbZC7TsMwFIadcivlFmBksWiRikBV0gXEVMHCWBC9SE2oHMdprToXbAdURXkIFl6FhQGEWBnYeBucNgO0/JKlT/85Rz7ndyJGhTSMb62wsLi0vFJcLa2tb2xu6ds7bRHGHJMWDlnIuw4ShNGAtCSVjHQjTpDvMNJxRhdZvXNPuKBhcCPHEbF9NAioRzGSyurrR5bjJRXvDFaj47tDaHE6GErEefgALR/JoeMk1+lt4qawkvb1slEzJoLzYOZQBrmaff3LckMc+ySQmCEheqYRSTtBXFLMSFqyYkEihEdoQHoKA+QTYSeTo1J4oBwXeiFXL5Bw4v6eSJAvxNh3VGe2qJitZeZ/tV4svVM7oUEUSxLg6UdezKAMYZYQdCknWLKxAoQ5VbtCPEQcYalyLKkQzNmT56Fdr5mKr+rlxnkeRxHsgX1QBSY4AQ1wCZqgBTB4BM/gFbxpT9qL9q59TFsLWj6zC/5I+/wBMe6drA==</latexit><latexit sha1_base64="w+fc6rCobrgHcpaEWRxxo2tpDbs=">AAACFHicbZC7TsMwFIadcivlFmBksWiRikBV0gXEVMHCWBC9SE2oHMdprToXbAdURXkIFl6FhQGEWBnYeBucNgO0/JKlT/85Rz7ndyJGhTSMb62wsLi0vFJcLa2tb2xu6ds7bRHGHJMWDlnIuw4ShNGAtCSVjHQjTpDvMNJxRhdZvXNPuKBhcCPHEbF9NAioRzGSyurrR5bjJRXvDFaj47tDaHE6GErEefgALR/JoeMk1+lt4qawkvb1slEzJoLzYOZQBrmaff3LckMc+ySQmCEheqYRSTtBXFLMSFqyYkEihEdoQHoKA+QTYSeTo1J4oBwXeiFXL5Bw4v6eSJAvxNh3VGe2qJitZeZ/tV4svVM7oUEUSxLg6UdezKAMYZYQdCknWLKxAoQ5VbtCPEQcYalyLKkQzNmT56Fdr5mKr+rlxnkeRxHsgX1QBSY4AQ1wCZqgBTB4BM/gFbxpT9qL9q59TFsLWj6zC/5I+/wBMe6drA==</latexit><latexit sha1_base64="w+fc6rCobrgHcpaEWRxxo2tpDbs=">AAACFHicbZC7TsMwFIadcivlFmBksWiRikBV0gXEVMHCWBC9SE2oHMdprToXbAdURXkIFl6FhQGEWBnYeBucNgO0/JKlT/85Rz7ndyJGhTSMb62wsLi0vFJcLa2tb2xu6ds7bRHGHJMWDlnIuw4ShNGAtCSVjHQjTpDvMNJxRhdZvXNPuKBhcCPHEbF9NAioRzGSyurrR5bjJRXvDFaj47tDaHE6GErEefgALR/JoeMk1+lt4qawkvb1slEzJoLzYOZQBrmaff3LckMc+ySQmCEheqYRSTtBXFLMSFqyYkEihEdoQHoKA+QTYSeTo1J4oBwXeiFXL5Bw4v6eSJAvxNh3VGe2qJitZeZ/tV4svVM7oUEUSxLg6UdezKAMYZYQdCknWLKxAoQ5VbtCPEQcYalyLKkQzNmT56Fdr5mKr+rlxnkeRxHsgX1QBSY4AQ1wCZqgBTB4BM/gFbxpT9qL9q59TFsLWj6zC/5I+/wBMe6drA==</latexit><latexit sha1_base64="w+fc6rCobrgHcpaEWRxxo2tpDbs=">AAACFHicbZC7TsMwFIadcivlFmBksWiRikBV0gXEVMHCWBC9SE2oHMdprToXbAdURXkIFl6FhQGEWBnYeBucNgO0/JKlT/85Rz7ndyJGhTSMb62wsLi0vFJcLa2tb2xu6ds7bRHGHJMWDlnIuw4ShNGAtCSVjHQjTpDvMNJxRhdZvXNPuKBhcCPHEbF9NAioRzGSyurrR5bjJRXvDFaj47tDaHE6GErEefgALR/JoeMk1+lt4qawkvb1slEzJoLzYOZQBrmaff3LckMc+ySQmCEheqYRSTtBXFLMSFqyYkEihEdoQHoKA+QTYSeTo1J4oBwXeiFXL5Bw4v6eSJAvxNh3VGe2qJitZeZ/tV4svVM7oUEUSxLg6UdezKAMYZYQdCknWLKxAoQ5VbtCPEQcYalyLKkQzNmT56Fdr5mKr+rlxnkeRxHsgX1QBSY4AQ1wCZqgBTB4BM/gFbxpT9qL9q59TFsLWj6zC/5I+/wBMe6drA==</latexit> Optimizes distance
  • 56. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Solution : Modularize GE for adaptability ! BA 56 … Samples relation Embedding space Graph structures A B Maps entities to space Sampler Mapper Optimizer •Preserves entity relatedness as spatial properties with customizable similarity metrics and loss functions pture item j’s entual location ay from f(xj). the function f, 2 . Gaussian prior n we have more e that the func- we simultane- ibed earlier to h other. Specif- med by v⇤ and levant to users’ that the items d together and ss-rated items. ) with dropout !" !# 1 2 1 2 !" !# 1 2 1 2 Matrix Factorization Collaborative Metric Learning User Item $# $% $" $# $% $" &# &" $# $" $% Figure 3: Example latent vector assignments for ma- trix factorization and CML. The table on the right shows user/item’s preference relationships. 3.5 Training Procedure The complete objective function of the proposed model is g &# &" $# $" $% s for ma- Dot Product Euclidean Distance → Euclidean distance keeps triangular inequality; dot product does not [5] Optimizes distance
  • 57. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Solution : Modularize GE for adaptability ! B A 57 … Samples relation BA Embedding space Graph structures Maps entities to space Sampler Mapper for Recommendation Optimizer Optimizes distance
  • 58. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity SMORe : Modular GE toolkit for REC B A 58 … Samples relation BA Embedding space Graph structures Maps entities to space Sampler Mapper for Recommendation Optimizer Optimizes distance
  • 59. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 59 SMORe is a modular GE toolkit for REC S’more is a campfire treat with layers SMORe : Modular GE toolkit for REC
  • 60. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 60 Benefits of SMORe Performance levelSampler Mapper for Recommendation Optimizer Modules contribute different levels of performance during different REC tasks Model A Model B Model C
  • 61. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 61 Benefits of SMORe Sampler Mapper for Recommendation Optimizer SMORe enables combining the most suitable modules for given REC task
  • 62. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 62 Benefits of SMORe Sampler Mapper for Recommendation Optimizer SMORe enables combining the most suitable modules for given REC task
  • 63. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity 63 Benefits of SMORe Best THEN! Best NOW! As toolkit for research: 1. Baseline comparison 2. Ballpark approaches 3. One module at a time As framework for development: 1. Reduces development time 2. Raises performance limit 3. Adapts to new tasks
  • 64. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity References 64 1. Hamilton, William L., et al. “Representation on Networks”, WWW 2018 Tutorial 2. Shi, Chuan, et al. “Relevance Search in Heterogeneous Networks”, ACM EDBT 2012 3. Shi, Chuan, et al. “A Survey of Heterogeneous Information Network Analysis”,
 IEEE TKDE 2016 4. Bernhardsson, Erik “ANNOY library”, https://github.com/spotify/annoy 5. Hsieh, Cheng-Kang, et al. “Collaborative Metric Learning”, WWW 2017
  • 65. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica Ting-HsiangWang,TexasA&MUniversity Lecture Agenda 65 Q0. Recommendation (REC) and challenges Q1. Why graph embedding (GE) for REC Q2. SMORe modularization of GE and benefits Q3. Exemplar structural modeling for REC Q4. REC using SMORe (Sean) (Sean) (Sean) (CM) (CM)
  • 66. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica66 Example Graph vertex edge
  • 67. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica67 Vertex Structure • Adjacency
 - vertices which share the same edge • Neighborhood
 - vertices which share similar connections • Community
 - vertices which share similar communities • Centrality
 - vertices which share similar properties
  • 68. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica68 1. Adjacency DB A EC F EDGE EDGE EDGE
  • 69. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica69 Adjacency user item user-item preference vertices share the same edge are treated as similar pairs
  • 70. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica70 Matrix Factorization r = ? ? ? ? ? ? ? ? x p1 p2 p3 U1 U2 U3 I1 I2 I3 I4 IEEE Computer’09 Yehuda Koren, Robert M. Bell, Chris Volinsky: Matrix Factorization Techniques for Recommender Systems. IEEE Computer 42(8): 30-37 (2009) might distort the data considerably. Hence, more r works3-6 suggested modeling directly the observe ings only, while avoiding overfitting through a regula model. To learn the factor vectors (pu and qi ), the s minimizes the regularized squared error on the known ratings: min * *,q p ( , )u i (rui qi T pu )2 + (|| qi ||2 + || pu ||2 ) Here, is the set of the (u,i) pairs for which rui is k q1 q2 q3 (rating - item*user) p4
  • 71. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica71 Graph user-item graph Matrix Factorization in SMORe
  • 72. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica ( , ) (user, item) 72 SamplerGraph user-item graph Matrix Factorization in SMORe
  • 73. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica ( , ) (user, item) 73 SamplerGraph Mapper user-item graph Matrix Factorization in SMORe ~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit> ~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit> Embedding Lookup
  • 74. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica ( , ) (user, item) 74 OptimizerSamplerGraph Mapper user-item graph Matrix Factorization in SMORe Square Error (~v · ~v y)2 <latexit sha1_base64="uQdhOq3LVxgk6wWspOVfIsBCS58=">AAACBnicbVDLSgMxFM34rPU16lKEYBHqwjJTBV0W3LisYB/QGUsmk2lDM8mQZApl6MqNv+LGhSJu/QZ3/o1pO4K2Hrhwcs695N4TJIwq7Thf1tLyyuraemGjuLm1vbNr7+03lUglJg0smJDtACnCKCcNTTUj7UQSFAeMtILB9cRvDYlUVPA7PUqIH6MepxHFSBupax+VvSHB2XAMPRwKDX9eZ3B0el/t2iWn4kwBF4mbkxLIUe/an14ocBoTrjFDSnVcJ9F+hqSmmJFx0UsVSRAeoB7pGMpRTJSfTc8YwxOjhDAS0hTXcKr+nshQrNQoDkxnjHRfzXsT8T+vk+roys8oT1JNOJ59FKUMagEnmcCQSoI1GxmCsKRmV4j7SCKsTXJFE4I7f/IiaVYr7nmlentRqtXzOArgEByDMnDBJaiBG1AHDYDBA3gCL+DVerSerTfrfda6ZOUzB+APrI9vpWyX/A==</latexit> ~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit> ~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit> Embedding Lookup
  • 75. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica75 Sparse LInear Method (SLIM) ICDM’11, RecSys’16 = 1 0 1 0 0 ? 0 ? 0 - 0 x I1 I2 I3 I1 I2 I3 I4 I4I1 I2 I3 I4 U1 U2 U3 1. Xia Ning, George Karypis: SLIM: Sparse Linear Methods for Top-N Recommender Systems. ICDM 2011: 497-506 2. Evangelia Christakopoulou, George Karypis: Local Item-Item Models For Top-N Recommendation. RecSys 2016: 67-74 nd MF. In y with a methods. ed as a Bayesian maximum sures the ems and n method objective o denote nd items in ˜aT i in decreasing order and recommending the top-N items. B. Learning W for SLIM We view the purchase/rating activity of user ui on item tj in A (i.e., aij) as the ground-truth item recommendation score. Given a user-item purchase/rating matrix A of size m⇥n, we learn the sparse n⇥n matrix W in Equation 2 as the minimizer for the following regularized optimization problem: minimize W 1 2 kA AWk2 F + 2 kWk2 F + kWk1 subject to W 0 diag(W) = 0, (3) where kWk1 = Pn i=1 Pn j=1 |wij| is the entry-wise `1-norm of W, and k · kF is the matrix Frobenius norm. In Equa- 1 - 1 - I1 I2 I3 I4 U1 U2 U3 Transition Matrix W
  • 76. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica ( , ) (user, item) 76 SamplerGraph user-item graph SLIM in SMORe
  • 77. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica ( , ) (user, item) 77 SamplerGraph Mapper user-item graph SLIM in SMORe ~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit> ~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit> space such that the co-occurrence of users and items (i.e., a certain user has purchased a certain item) can be rendered conditionally independent. Koren [10] proposed an intersect- ing approach between neighborhood-based method and MF. In his approach, item similarity is learned simultaneously with a matrix factorization so as to take advantages of both methods. Top-N recommendation has also been formulated as a ranking problem. Rendle et al [11] proposed a Bayesian Personalized Ranking (BPR) criterion, which is the maximum posterior estimator from a Bayesian analysis and measures the difference between the rankings of user-purchased items and the rest items. BPR can be well adopted for item knn method (BPRkNN) and MF methods (BPRMF) as a general objective function. III. DEFINITIONS AND NOTATIONS In this paper, the symbols u and t will be used to denote the users and items, respectively. Individual users and items will be denoted using different subscripts (i.e., ui, tj). The set of all users and items in the system will be denoted by U represents the recommendation s ui. Top-N recommendation for ui purchased/-rated items based on t in ˜aT i in decreasing order and rec B. Learning W for SLIM We view the purchase/rating act A (i.e., aij) as the ground-truth i Given a user-item purchase/rating learn the sparse n⇥n matrix W in for the following regularized opti minimize W 1 2 kA AWk2 F + subject to W 0 diag(W) = 0, where kWk1 = Pn i=1 Pn j=1 |wij of W, and k · kF is the matrix tion 3, AW is the estimated matri (i.e., ˜A) by the sparse linear mo 1 2 A
 (fixed) W (trainable)
  • 78. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica ( , ) (user, item) 78 OptimizerSamplerGraph Mapper user-item graph SLIM in SMORe Square Error (~v · ~v y)2 <latexit sha1_base64="uQdhOq3LVxgk6wWspOVfIsBCS58=">AAACBnicbVDLSgMxFM34rPU16lKEYBHqwjJTBV0W3LisYB/QGUsmk2lDM8mQZApl6MqNv+LGhSJu/QZ3/o1pO4K2Hrhwcs695N4TJIwq7Thf1tLyyuraemGjuLm1vbNr7+03lUglJg0smJDtACnCKCcNTTUj7UQSFAeMtILB9cRvDYlUVPA7PUqIH6MepxHFSBupax+VvSHB2XAMPRwKDX9eZ3B0el/t2iWn4kwBF4mbkxLIUe/an14ocBoTrjFDSnVcJ9F+hqSmmJFx0UsVSRAeoB7pGMpRTJSfTc8YwxOjhDAS0hTXcKr+nshQrNQoDkxnjHRfzXsT8T+vk+roys8oT1JNOJ59FKUMagEnmcCQSoI1GxmCsKRmV4j7SCKsTXJFE4I7f/IiaVYr7nmlentRqtXzOArgEByDMnDBJaiBG1AHDYDBA3gCL+DVerSerTfrfda6ZOUzB+APrI9vpWyX/A==</latexit> ~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit> ~v<latexit sha1_base64="m1L29Cxyzv4kq0gD0isZHiqvweE=">AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a</latexit> A
 (fixed) W (trainable)
  • 79. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica79 2. Neighborhood A B NEIGHBORS
  • 80. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica80 Neighborhood vertices share similar connections are treated as similar pairs commonly purchasedcommon interests
  • 81. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica81 item2vec RecSys’16, MLSP’16 Oren Barkan, Noam Koenigstein: Item2vec: Neural Item Embedding for Collaborative Filtering. RecSys Posters 2016
 Oren Barkan, Noam Koenigstein: ITEM2VEC: Neural item embedding for collaborative filtering. MLSP 2016: 1-6 ter that s to be d iw is d to the und to ibution, rare and edure is each pair of items that share the same set as example. This implies a window size that is d from the set size. Specifically, for a given se the objective from Eq. (1) is modified as follo 1 1 log ( | ) K K j i i j i p w w K = ≠ ∑∑ Another option is to keep the objective in Eq and shuffle each set of items during runtim experiments we observed that both options p the given K items purchasing logs
  • 82. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica82 To Model Neighborhood context
 space vertex
 space purchasing logs shared
 neighbor userA userB space for shared neighbor w2v
  • 83. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica83 SamplerGraph user-item graph item2vec in SMORe ( , ) (itemA, itemB) A B A B
  • 84. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica84 SamplerGraph Mapper user-item graph item2vec in SMORe vertex
 embedding lookup A ~vv <latexit sha1_base64="n/DRd03R+fmi3AovMrJcYZKlFBM=">AAAB+3icbZBNS8NAEIY39avWr1iPXoJF8FSSKuix4MVjBdsKTSyb7aRduvlgd1JaQv6KFw+KePWPePPfuG1z0NYXFh7emWFmXz8RXKFtfxuljc2t7Z3ybmVv/+DwyDyudlScSgZtFotYPvpUgeARtJGjgMdEAg19AV1/fDuvdycgFY+jB5wl4IV0GPGAM4ra6ptVdwIsm+RPmYswRU1536zZdXshax2cAmqkUKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9jRENQXnZ4vbcOtfOwApiqV+E1sL9PZHRUKlZ6OvOkOJIrdbm5n+1XorBjZfxKEkRIrZcFKTCwtiaB2ENuASGYqaBMsn1rRYbUUkZ6rgqOgRn9cvr0GnUnct64/6q1mwVcZTJKTkjF8Qh16RJ7kiLtAkjU/JMXsmbkRsvxrvxsWwtGcXMCfkj4/MHZ5qVXg==</latexit> context
 embedding lookup B ~vc <latexit sha1_base64="HJbdihIJmLvMuY2eQ/2t7MTPmYg=">AAAB+3icbVBNS8NAEN3Ur1q/Yj16CRbBU0mqoMeCF48VbCs0sWy2k3bp5oPdSWkJ+StePCji1T/izX/jts1BWx8MPN6bYWaenwiu0La/jdLG5tb2Tnm3srd/cHhkHlc7Kk4lgzaLRSwffapA8AjayFHAYyKBhr6Arj++nfvdCUjF4+gBZwl4IR1GPOCMopb6ZtWdAMsm+VPmIkwxY3neN2t23V7AWidOQWqkQKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9TSMagvKyxe25da6VgRXEUleE1kL9PZHRUKlZ6OvOkOJIrXpz8T+vl2Jw42U8SlKEiC0XBamwMLbmQVgDLoGhmGlCmeT6VouNqKQMdVwVHYKz+vI66TTqzmW9cX9Va7aKOMrklJyRC+KQa9Ikd6RF2oSRKXkmr+TNyI0X4934WLaWjGLmhPyB8fkDSqiVSw==</latexit> ( , ) A B A B (itemA, itemB)
  • 85. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica85 OptimizerSamplerGraph Mapper user-item graph item2vec in SMORe vertex
 embedding lookup A ~vv <latexit sha1_base64="n/DRd03R+fmi3AovMrJcYZKlFBM=">AAAB+3icbZBNS8NAEIY39avWr1iPXoJF8FSSKuix4MVjBdsKTSyb7aRduvlgd1JaQv6KFw+KePWPePPfuG1z0NYXFh7emWFmXz8RXKFtfxuljc2t7Z3ybmVv/+DwyDyudlScSgZtFotYPvpUgeARtJGjgMdEAg19AV1/fDuvdycgFY+jB5wl4IV0GPGAM4ra6ptVdwIsm+RPmYswRU1536zZdXshax2cAmqkUKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9jRENQXnZ4vbcOtfOwApiqV+E1sL9PZHRUKlZ6OvOkOJIrdbm5n+1XorBjZfxKEkRIrZcFKTCwtiaB2ENuASGYqaBMsn1rRYbUUkZ6rgqOgRn9cvr0GnUnct64/6q1mwVcZTJKTkjF8Qh16RJ7kiLtAkjU/JMXsmbkRsvxrvxsWwtGcXMCfkj4/MHZ5qVXg==</latexit> context
 embedding lookup B ~vc <latexit sha1_base64="HJbdihIJmLvMuY2eQ/2t7MTPmYg=">AAAB+3icbVBNS8NAEN3Ur1q/Yj16CRbBU0mqoMeCF48VbCs0sWy2k3bp5oPdSWkJ+StePCji1T/izX/jts1BWx8MPN6bYWaenwiu0La/jdLG5tb2Tnm3srd/cHhkHlc7Kk4lgzaLRSwffapA8AjayFHAYyKBhr6Arj++nfvdCUjF4+gBZwl4IR1GPOCMopb6ZtWdAMsm+VPmIkwxY3neN2t23V7AWidOQWqkQKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9TSMagvKyxe25da6VgRXEUleE1kL9PZHRUKlZ6OvOkOJIrXpz8T+vl2Jw42U8SlKEiC0XBamwMLbmQVgDLoGhmGlCmeT6VouNqKQMdVwVHYKz+vI66TTqzmW9cX9Va7aKOMrklJyRC+KQa9Ikd6RF2oSRKXkmr+TNyI0X4934WLaWjGLmhPyB8fkDSqiVSw==</latexit> ( , ) A B Log Likelihood A B log (~vv · ~vc )<latexit sha1_base64="+ijKtwEXuygdqvbeIqq2UeNudQM=">AAACIHicbZBNS8NAEIY3flu/oh69LBZBLyVRoR4FLx4r2FZoatlspu3STTbsTool9Kd48a948aCI3vTXuK05aPWFhZdnZpidN0ylMOh5H87c/MLi0vLKamltfWNzy93eaRiVaQ51rqTSNyEzIEUCdRQo4SbVwOJQQjMcXEzqzSFoI1RyjaMU2jHrJaIrOEOLOm41kKoXGNGL2WEwBJ4Px7d5gHCH1o0DHimcwdxietRxy17Fm4r+NX5hyqRQreO+B5HiWQwJcsmMafleiu2caRRcwrgUZAZSxgesBy1rExaDaefTA8f0wJKIdpW2L0E6pT8nchYbM4pD2xkz7JvZ2gT+V2tl2D1r5yJJM4SEfy/qZpKiopO0aCQ0cJQjaxjXwv6V8j7TjKPNtGRD8GdP/msaxxX/pHJ8dVo+rxVxrJA9sk8OiU+q5JxckhqpE07uySN5Ji/Og/PkvDpv361zTjGzS37J+fwCQLeliA==</latexit> A B (itemA, itemB)
  • 86. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica86 3. Community A B COMMUNITY
  • 87. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica87 Community vertices share similar communities are treated as similar pairs connected to the same group not connected
  • 88. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica88 Heterogeneous Preference Embedding (HPE) RecSys’16 Chih-Ming Chen, Ming-Feng Tsai, Yu-Ching Lin, Yi-Hsuan Yang: Query-based Music Recommendations via Preference Embedding. RecSys 2016: 79-82 aper includes: dation is proposed arch work of infor- mender system. ommendation as a propose a method between users and ms. onducted to verify ethod. ENDATIONS the task of query- User Track Album Artist U1 T1 T2 T3 T4 T5 Ar1 U2 U3 Ar2 Ar3 Al1 Al2 Al3 Al4T6 U4 Figure 1: The user preference network. Suppose there are totally four kinds of data attributes in the music listening dataset. In the figure, the user U4 may frequently listen to
  • 89. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica89 Heterogeneous Preference Embedding (HPE) RecSys’16 Chih-Ming Chen, Ming-Feng Tsai, Yu-Ching Lin, Yi-Hsuan Yang: Query-based Music Recommendations via Preference Embedding. RecSys 2016: 79-82 T5 Al4U3 U4 target
 vertexT5 U3 U3 U4 Al4U3 Edge Sampling from whole network Weighted Random Walks from
 preceding vertex Walk Direction T5U3 … ( , ) ( , ) ( , ) ( , ) context
 vertex ( , ) the training pairs ure 2: Edge sampling via weighted random walks for ning pairs. In our sampling scheme, the first sampling is derived from the edge sampling over the whole network. Algorithm 1: Heterogeneous Prefer Input: User Preference Network: Walk Steps: w, Sampling Times: n 1 for v 2 V do 2 Initialize the representation: ( 3 for i 2 {1, ..., n} do 4 (v1, v2) = EdgeSampling(G) 5 Update (v1), context v2 by mi 6 Update (v2), context v1 by mi 7 for v0 2 RandomWalk(v2, w 8 Update (v1), context v0 by Output: Vertex representations Table 1: Dataset Pr(vj| (vi)) = 0 otherwise Maximizing the above posterior probabi to minimizing the negative log likelihood, function of the second-order proximity can follows: O = X (i,j)2S wi,j log p(vj| (vi)) + X i where S is a set of sampling pairs and w ind of the edge. In practice, the observed ed tive information and thus falls into the one problem. We adopt the widely-used solutio sampling to sample the additional user-to-o weight embedding
  • 90. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica90 SamplerGraph user-item graph HPE in SMORe ( , )V C ( , )V C ( , )V C
  • 91. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica91 OptimizerSamplerGraph Mapper user-item graph HPE in SMORe ( , )V C ( , )V C ( , )V C vertex
 embedding lookup ~vv <latexit sha1_base64="n/DRd03R+fmi3AovMrJcYZKlFBM=">AAAB+3icbZBNS8NAEIY39avWr1iPXoJF8FSSKuix4MVjBdsKTSyb7aRduvlgd1JaQv6KFw+KePWPePPfuG1z0NYXFh7emWFmXz8RXKFtfxuljc2t7Z3ybmVv/+DwyDyudlScSgZtFotYPvpUgeARtJGjgMdEAg19AV1/fDuvdycgFY+jB5wl4IV0GPGAM4ra6ptVdwIsm+RPmYswRU1536zZdXshax2cAmqkUKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9jRENQXnZ4vbcOtfOwApiqV+E1sL9PZHRUKlZ6OvOkOJIrdbm5n+1XorBjZfxKEkRIrZcFKTCwtiaB2ENuASGYqaBMsn1rRYbUUkZ6rgqOgRn9cvr0GnUnct64/6q1mwVcZTJKTkjF8Qh16RJ7kiLtAkjU/JMXsmbkRsvxrvxsWwtGcXMCfkj4/MHZ5qVXg==</latexit> V V context
 embedding lookup ~vc <latexit sha1_base64="HJbdihIJmLvMuY2eQ/2t7MTPmYg=">AAAB+3icbVBNS8NAEN3Ur1q/Yj16CRbBU0mqoMeCF48VbCs0sWy2k3bp5oPdSWkJ+StePCji1T/izX/jts1BWx8MPN6bYWaenwiu0La/jdLG5tb2Tnm3srd/cHhkHlc7Kk4lgzaLRSwffapA8AjayFHAYyKBhr6Arj++nfvdCUjF4+gBZwl4IR1GPOCMopb6ZtWdAMsm+VPmIkwxY3neN2t23V7AWidOQWqkQKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9TSMagvKyxe25da6VgRXEUleE1kL9PZHRUKlZ6OvOkOJIrXpz8T+vl2Jw42U8SlKEiC0XBamwMLbmQVgDLoGhmGlCmeT6VouNqKQMdVwVHYKz+vI66TTqzmW9cX9Va7aKOMrklJyRC+KQa9Ikd6RF2oSRKXkmr+TNyI0X4934WLaWjGLmhPyB8fkDSqiVSw==</latexit> C C Log Likelihood V C log (~vv · ~vc )<latexit sha1_base64="+ijKtwEXuygdqvbeIqq2UeNudQM=">AAACIHicbZBNS8NAEIY3flu/oh69LBZBLyVRoR4FLx4r2FZoatlspu3STTbsTool9Kd48a948aCI3vTXuK05aPWFhZdnZpidN0ylMOh5H87c/MLi0vLKamltfWNzy93eaRiVaQ51rqTSNyEzIEUCdRQo4SbVwOJQQjMcXEzqzSFoI1RyjaMU2jHrJaIrOEOLOm41kKoXGNGL2WEwBJ4Px7d5gHCH1o0DHimcwdxietRxy17Fm4r+NX5hyqRQreO+B5HiWQwJcsmMafleiu2caRRcwrgUZAZSxgesBy1rExaDaefTA8f0wJKIdpW2L0E6pT8nchYbM4pD2xkz7JvZ2gT+V2tl2D1r5yJJM4SEfy/qZpKiopO0aCQ0cJQjaxjXwv6V8j7TjKPNtGRD8GdP/msaxxX/pHJ8dVo+rxVxrJA9sk8OiU+q5JxckhqpE07uySN5Ji/Og/PkvDpv361zTjGzS37J+fwCQLeliA==</latexit>
  • 92. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica 4. Centrality (Degree) 92 A B DEGREE 4 DEGREE 4
  • 93. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica93 Centrality vertices share similar properties are treated as similar pairs hub item hub item
  • 94. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica94 struc2vec KDD’17 Leonardo Filipe Rodrigues Ribeiro, Pedro H. P. Saverese, Daniel R. Figueiredo: struc2vec: Learning Node Representations from Structural Identity. KDD 2017: 385-394
  • 95. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica95 struc2vec KDD’17 Leonardo Filipe Rodrigues Ribeiro, Pedro H. P. Saverese, Daniel R. Figueiredo: struc2vec: Learning Node Representations from Structural Identity. KDD 2017: 385-394
  • 96. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica96 struc2vec KDD’17 Leonardo Filipe Rodrigues Ribeiro, Pedro H. P. Saverese, Daniel R. Figueiredo: struc2vec: Learning Node Representations from Structural Identity. KDD 2017: 385-394 user-user graph fn1( ) = structure1_A fn1( ) = structure1_B A B A B fn2( ) = structure2_C C C 1-step: 2-steps:
  • 97. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica97 SamplerGraph user-user graph struc2vec in SMORe ( , )V fn1() S ( , )V fn2() S
  • 98. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica98 OptimizerSamplerGraph Mapper user-user graph struc2vec in SMORe ~vv <latexit sha1_base64="n/DRd03R+fmi3AovMrJcYZKlFBM=">AAAB+3icbZBNS8NAEIY39avWr1iPXoJF8FSSKuix4MVjBdsKTSyb7aRduvlgd1JaQv6KFw+KePWPePPfuG1z0NYXFh7emWFmXz8RXKFtfxuljc2t7Z3ybmVv/+DwyDyudlScSgZtFotYPvpUgeARtJGjgMdEAg19AV1/fDuvdycgFY+jB5wl4IV0GPGAM4ra6ptVdwIsm+RPmYswRU1536zZdXshax2cAmqkUKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9jRENQXnZ4vbcOtfOwApiqV+E1sL9PZHRUKlZ6OvOkOJIrdbm5n+1XorBjZfxKEkRIrZcFKTCwtiaB2ENuASGYqaBMsn1rRYbUUkZ6rgqOgRn9cvr0GnUnct64/6q1mwVcZTJKTkjF8Qh16RJ7kiLtAkjU/JMXsmbkRsvxrvxsWwtGcXMCfkj4/MHZ5qVXg==</latexit> V V ~vc <latexit sha1_base64="HJbdihIJmLvMuY2eQ/2t7MTPmYg=">AAAB+3icbVBNS8NAEN3Ur1q/Yj16CRbBU0mqoMeCF48VbCs0sWy2k3bp5oPdSWkJ+StePCji1T/izX/jts1BWx8MPN6bYWaenwiu0La/jdLG5tb2Tnm3srd/cHhkHlc7Kk4lgzaLRSwffapA8AjayFHAYyKBhr6Arj++nfvdCUjF4+gBZwl4IR1GPOCMopb6ZtWdAMsm+VPmIkwxY3neN2t23V7AWidOQWqkQKtvfrmDmKUhRMgEVarn2Al6GZXImYC84qYKEsrGdAg9TSMagvKyxe25da6VgRXEUleE1kL9PZHRUKlZ6OvOkOJIrXpz8T+vl2Jw42U8SlKEiC0XBamwMLbmQVgDLoGhmGlCmeT6VouNqKQMdVwVHYKz+vI66TTqzmW9cX9Va7aKOMrklJyRC+KQa9Ikd6RF2oSRKXkmr+TNyI0X4934WLaWjGLmhPyB8fkDSqiVSw==</latexit> S S Log Likelihood V S log (~vv · ~vc )<latexit sha1_base64="+ijKtwEXuygdqvbeIqq2UeNudQM=">AAACIHicbZBNS8NAEIY3flu/oh69LBZBLyVRoR4FLx4r2FZoatlspu3STTbsTool9Kd48a948aCI3vTXuK05aPWFhZdnZpidN0ylMOh5H87c/MLi0vLKamltfWNzy93eaRiVaQ51rqTSNyEzIEUCdRQo4SbVwOJQQjMcXEzqzSFoI1RyjaMU2jHrJaIrOEOLOm41kKoXGNGL2WEwBJ4Px7d5gHCH1o0DHimcwdxietRxy17Fm4r+NX5hyqRQreO+B5HiWQwJcsmMafleiu2caRRcwrgUZAZSxgesBy1rExaDaefTA8f0wJKIdpW2L0E6pT8nchYbM4pD2xkz7JvZ2gT+V2tl2D1r5yJJM4SEfy/qZpKiopO0aCQ0cJQjaxjXwv6V8j7TjKPNtGRD8GdP/msaxxX/pHJ8dVo+rxVxrJA9sk8OiU+q5JxckhqpE07uySN5Ji/Og/PkvDpv361zTjGzS37J+fwCQLeliA==</latexit> ( , )V fn1() S ( , )V fn2() S note. original work adopts hierarchal softmax for learning vertex
 embedding lookup context
 embedding lookup
  • 99. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica99 Recap Sampler Mapper Optimizer MF Adjacency Vertex Embedding Dot product RMSE SLIM Adjacency Vertex Embedding Dot product RMSE item2vec Neighborhood Vertex Embedding Context Embedding Sigmoid Dot Product Log Likelihood HPE Community Vertex Embedding Context Embedding Sigmoid Dot Product Log Likelihood struc2vec Centrality Vertex Embedding Context Embedding Sigmoid Dot Product Log Likelihood
  • 100. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica100 Edge Information • Weight
 - reflects strength of interaction • Closeness
 - reflects order of interaction • Semantics
 - sample with grammar • Inductiveness
 - information diffuses through edges
  • 101. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica101 1. Weight B C A 12 1
  • 102. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica102 node2vec edge weight guides the walk BFS Walk DFS Walk Aditya Grover, Jure Leskovec: node2vec: Scalable Feature Learning for Networks. KDD 2016: 855-864 KDD’16 v α=1 α=1/q α=1/q α=1/p x2 x3t x1 2: Illustration of the random walk procedure in node2vec. alk just transitioned from t to v and is now evaluating its next t of node v. Edge labels indicate search biases ↵. x denotes the shortest path distance between nodes t and x. hat dtx must be one of {0, 1, 2}, and hence, the two parame-
  • 103. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica103 node2vec in SMORe v α=1 α=1/q α=1/q α=1/p x2 x3t x1 2: Illustration of the random walk procedure in node2vec. alk just transitioned from t to v and is now evaluating its next t of node v. Edge labels indicate search biases ↵. x denotes the shortest path distance between nodes t and x. hat dtx must be one of {0, 1, 2}, and hence, the two parame- Sampler customized
 random walk
  • 104. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica104 2. Closeness A strong weak
  • 105. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica105 High-order Proximity for Recommendations (HOP-Rec) RecSys’18 w Chih-Ming Chen National Chengchi University Taipei, Taiwan 104761501@nccu.edu.tw w Ming-Feng Tsai National Chengchi University Taipei, Taiwan mftsai@nccu.edu.tw many e-commerce pular approaches els; the former ap- he observed direct r extracts indirect item interactions. ecient method posed method in- order information d of factorizing a dence weighting n simultaneously, action matrix and lks. Experimental performs the state tasets. andom walks; bi- back and Ming-Feng Tsai. ecommendation. In i1 i2 i3 i4 u1 1st 1st u2 1st 1st u3 1st 1st i1 i2 i3 i4 u1 1st 1st 2nd 3rd u2 2nd 1st 1st 2nd u3 3rd 2nd 1st 1st (c) (d) (a) (b) i1 u1 i2 i3 i4 i1 u1 u2 i2 i3 u3 i4 u2 u3 Figure 1: High-order proximity between users and items within observed interactions Collaborative ltering (CF) is commonly used to leverage this Jheng-Hong Yang, Chih-Ming Chen, Chuan-Ju Wang, Ming-Feng Tsai: HOP-rec: high- order proximity for implicit recommendation. RecSys 2018: 140-144 these factorization and graph models, we build HOP-Re framework that (1) captures high-order preference info a given user-item interaction matrix, and (2) scales to l real-world datasets by using random surng on the corr interaction graph. The objective of HOP-Rec is dened LHOP = X 1k K u,(i,i0) graph model z }| { C(k) E i⇠Pk u i0⇠PN factorization model z }| {f F ⇣ | u i0, | u i ⌘g + k where Pk u (·) denotes the k-order probability distribut item sampled from the walk sequence Su (see Denit denotes a uniform distribution by which an item is sam the set of all items, and K denotes the maximum orde in our method. The main idea behind the proposed met approximation of high-order probabilistic matrix factor conducting random walk (RW) with a decay factor for c cit Recommendation RecSys ’18, October 2–7, 2018, Vancouver, BC, Cana uild HOP-Rec, a united eference information in 2) scales to large-scale g on the corresponding ec is dened as model { | u i ⌘g + k k2 2 , (3) Finally, the ranking objective function is composed with an in cator function and a pairwise logistic loss, and we thus dene F Eq. (3) as F ( | u i0, | u i ) = 1{ | u i0 | u i k } log f ⇣ | u i0 | u i ⌘g , where 1B denotes the indicator function for condition B and k an order-aware margin set to /k. Note that item i0 is uniform sampled from the set of all items as we assume a simplied u form distribution PN for less preferred items; this assumption order-aware interaction
  • 106. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica106 HOP-Rec in SMORe SamplerGraph user-item graph ( , ) 1(~v · ~v ~v · ~v margin)latexit sha1_base64=iBNmKDj8/YuJu40Ok7W0yOl30QA=AAACRXicbVA9SwNBEN3z2/gVtbRZDEIsDHcqaGEh2lgqGBVyIeztTZIle3vH7lwwHPeX/Bv+ARsLLe3sxFb3YgSjPhh4vDfDzLwgkcKg6z46E5NT0zOzc/OlhcWl5ZXy6tqViVPNoc5jGeubgBmQQkEdBUq4STSwKJBwHfROC/+6D9qIWF3iIIFmxDpKtAVnaKVW+cyPGHZDk3l51e8Dz/q5z8MYvzndof/KRz71EW4xi5juCJVvt8oVt+YOQf8Sb0QqZITzVvnFD2OeRqCQS2ZMw3MTbGZMo+AS8pKfGkgY77EONCxVLALTzIYf53TLKiFtx9qWQjpUf05kLDJmEAW2s/jP/PYK8T+vkWL7sJkJlaQIin8taqeSYkyL+GgoNHCUA0sY18LeSnmXacbRhjy2JTTFaXnJBuP9juEvudqteXu13Yv9yvHJKKI5skE2SZV45IAckzNyTuqEkzvyQJ7Is3PvvDpvzvtX64QzmlknY3A+PgHSQLQT/latexit R greater
 margin smaller
 margin
  • 107. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica107 HOP-Rec in SMORe OptimizerSamplerGraph Mapper user-item graph embedding lookup BPRLoss ( , ) ~vlatexit sha1_base64=m1L29Cxyzv4kq0gD0isZHiqvweE=AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a/latexit ~vlatexit sha1_base64=m1L29Cxyzv4kq0gD0isZHiqvweE=AAAB7nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lqQY8FLx4r2A9oQ9lsJ+3SzSbsbgol9Ed48aCIV3+PN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RS2tnd294r7pYPDo+OT8ulZW8epYthisYhVN6AaBZfYMtwI7CYKaRQI7AST+4XfmaLSPJZPZpagH9GR5CFn1Fip058iy6bzQbniVt0lyCbxclKBHM1B+as/jFkaoTRMUK17npsYP6PKcCZwXuqnGhPKJnSEPUsljVD72fLcObmyypCEsbIlDVmqvycyGmk9iwLbGVEz1uveQvzP66UmvPMzLpPUoGSrRWEqiInJ4ncy5AqZETNLKFPc3krYmCrKjE2oZEPw1l/eJO1a1bup1h7rlUYzj6MIF3AJ1+DBLTTgAZrQAgYTeIZXeHMS58V5dz5WrQUnnzmHP3A+fwC0XY/a/latexit log (~v · ~v ~v · ~v )latexit sha1_base64=O9uo0NROxizRILxvgLgJbQT9HME=AAACIXicbZDLSsNAFIYn9VbrLerSzWAR6sKSVMEui25cVrAXaEKZTCbp0EkmzEwKJfRV3Pgqblwo0p34Mk7bCNr2h4GP/5zDmfN7CaNSWdaXUdjY3NreKe6W9vYPDo/M45O25KnApIU546LrIUkYjUlLUcVINxEERR4jHW94P6t3RkRIyuMnNU6IG6EwpgHFSGmrb9YdxkNH0jBCFWdEcDaaONjn6pfhFVxrX/bNslW15oKrYOdQBrmafXPq+BynEYkVZkjKnm0lys2QUBQzMik5qSQJwkMUkp7GGEVEutn8wgm80I4PAy70ixWcu38nMhRJOY483RkhNZDLtZm5rtZLVVB3MxonqSIxXiwKUgYVh7O4oE8FwYqNNSAsqP4rxAMkEFY61JIOwV4+eRXatap9Xa093pQbd3kcRXAGzkEF2OAWNMADaIIWwOAZvIJ38GG8GG/GpzFdtBaMfOYU/JPx/QMUnaSs/latexit 1. Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, Lars Schmidt-Thieme: BPR: Bayesian Personalized Ranking from Implicit Feedback. UAI 2009: 452-461 1(~v · ~v ~v · ~v margin)latexit sha1_base64=iBNmKDj8/YuJu40Ok7W0yOl30QA=AAACRXicbVA9SwNBEN3z2/gVtbRZDEIsDHcqaGEh2lgqGBVyIeztTZIle3vH7lwwHPeX/Bv+ARsLLe3sxFb3YgSjPhh4vDfDzLwgkcKg6z46E5NT0zOzc/OlhcWl5ZXy6tqViVPNoc5jGeubgBmQQkEdBUq4STSwKJBwHfROC/+6D9qIWF3iIIFmxDpKtAVnaKVW+cyPGHZDk3l51e8Dz/q5z8MYvzndof/KRz71EW4xi5juCJVvt8oVt+YOQf8Sb0QqZITzVvnFD2OeRqCQS2ZMw3MTbGZMo+AS8pKfGkgY77EONCxVLALTzIYf53TLKiFtx9qWQjpUf05kLDJmEAW2s/jP/PYK8T+vkWL7sJkJlaQIin8taqeSYkyL+GgoNHCUA0sY18LeSnmXacbRhjy2JTTFaXnJBuP9juEvudqteXu13Yv9yvHJKKI5skE2SZV45IAckzNyTuqEkzvyQJ7Is3PvvDpvzvtX64QzmlknY3A+PgHSQLQT/latexit R R greater
 margin smaller
 margin
  • 108. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica108 3. Semantics A USER VENUE
  • 109. CLIP Lab, National Chengchi University CFDA Lab, Academia Sinica109 metapath2vec KDD’17 Org Author Paper Venue a1 a2 a3 a4 a5 MIT CMU ACL KDD p1 p2 p3 APVPA OAPVPAO APA meta paths (a) An academic network KDD 0 0 0 0 0 1 0 0 0 0 0 0 ACL MIT CMU a1 a2 a3 a4 a5 p1 p2 p3 input layer hidden layer |V|-dim (b) Skip-gram in metapa a heteroge- erogeneous ion Learn- learn the d- V | that are ong them. trix X, with ding to the re dierent ed into the the heterogeneous network structures into skip-gram, we propose meta-path-based random walks in heterogeneous networks. Heterogeneous Skip-Gram. In metapath2vec, we enable skip- gram to learn eective node representations for a heterogeneous network G = (V, E,T ) with |TV | 1 by maximizing the probability of having the heterogeneous context Nt ( ),t 2 TV given a node : arg max X 2V X t 2TV X ct 2Nt ( ) logp(ct | ; ) (2) where Nt ( ) denotes ’s neighborhood with the tth type of nodes and p(ct | ; ) is commonly dened as a somax function [3, 7, 18, KDD’17, August 13–17, 2017, Halifax, NS, Canada different types neighbors in given typeYuxiao Dong, Nitesh V. Chawla, Ananthram Swami: metapath2vec: Scalable Representation Learning for Heterogeneous Networks. KDD 2017: 135-144 metapath carries semantics