SlideShare a Scribd company logo
Hypergraph Motifs:
Concepts, Algorithms, and Discoveries
Kijung ShinJihoon KoGeon Lee
Hypergraphs are Everywhere
Icon made by Freepik from www.flaticon.com
Collaborations of Researchers Co-purchases of Items Joint Interactions of Proteins
• Hypergraphs consist of nodes and hyperedges.
• Each hyperedge is a subset of any number of nodes.
Our Questions
Q1 What are structural design principles of real-world hypergraphs?
Real-world Hypergraph Randomized Hypergraph
VS
Our Questions (cont.)
Q2 How can we compare local structures of hypergraphs of different sizes?
VS
Small Hypergraph Large Hypergraph
Our Questions (cont.)
Q3 How can we identify domains which hypergraphs are from?
Roadmap
• Hypergraph Motif
• Proposed Method: MoCHy
• Experimental Results
• Conclusions
Hypergraph Motifs: Definition
• Hypergraph motifs (h-motifs) describe connectivity patterns of three
connected hyperedges.
• H-motifs describe the connectivity pattern of hyperedges 𝑒", 𝑒#, and 𝑒$
by the emptiness of seven subsets.
1
2 3
4 6
5
7
𝑒"
𝑒# 𝑒$
(1) 𝑒"𝑒#𝑒$
(2) 𝑒#𝑒$𝑒"
(3) 𝑒$𝑒"𝑒#
(4) 𝑒" ∩ 𝑒#𝑒$
(5) 𝑒# ∩ 𝑒$𝑒"
(6) 𝑒$ ∩ 𝑒"𝑒#
(7) 𝑒" ∩ 𝑒# ∩ 𝑒$
Hypergraph Motifs: Definition (cont.)
• While there can exist 2(
h-motifs, 26 h-motifs remain once we exclude:
1. symmetric ones
2. those with duplicated hyperedges
3. those cannot be obtained from connected hyperedges.
h-motif 1 h-motif 2 h-motif 3 h-motif 4 h-motif 5 h-motif 6 h-motif 7 h-motif 8 h-motif 9 h-motif 10 h-motif 11 h-motif 12 h-motif 13
h-motif 14 h-motif 15 h-motif 16 h-motif 17 h-motif 18 h-motif 19 h-motif 20 h-motif 21 h-motif 22 h-motif 23 h-motif 24 h-motif 25 h-motif 26
Open h-motifs contain two non-adjacent hyperedges. Others are closed h-motifs.
Hypergraph Motifs: Example
1
2
54
36
7
1
2
54
36
7
1
2
54
36
7
𝒆 𝟑
𝒆 𝟐 𝒆 𝟏
𝒆 𝟐
𝒆 𝟏
𝒆 𝟒 𝒆 𝟑 𝒆 𝟒
𝒆 𝟏
FH
L
K
B G
S
R
𝒆 𝟏
𝒆 𝟐
𝒆 𝟑
𝒆 𝟒
• Hypergraph motifs (h-motifs) describe connectivity patterns of three
connected hyperedges.
Hypergraph Motifs: Properties
Exhaustive
H-motifs capture connectivity patterns of all possible three connected
hyperedges.
Unique
Connectivity pattern of any three connected hyperedges is captured
by at most one h-motif.
Size Independent
H-motifs capture connectivity patterns independently of the sizes of
hyperedges.
Hypergraph Motifs: Properties
Question:
Why are non-pairwise relations considered?
Answer:
Non-pairwise relations play a key role in capturing the local
structural patterns of real-world hypergraphs.
𝒆 𝟐
𝒆 𝟏 𝒆 𝟒 𝒆 𝟑
𝒆 𝟏 𝒆 𝟒
𝑒), 𝑒*, 𝑒+ and 𝑒), 𝑒,, 𝑒+ have same pairwise
relations, while their connectivity patterns are
distinguished by h-motifs.
Characteristic Profiles
Question:
How can we summarize structural properties of hypergraphs?
Answer:
We compute a compact vector of normalized significance of
every h-motif.
Characteristic Profiles (cont.)
• Significance of h-motifs
∆0≔
𝑀 𝑡 − 𝑀5678 𝑡
𝑀 𝑡 + 𝑀5678 𝑡 + 𝜖
• Characteristic Profiles (CPs)
𝐶𝑃0 ≔
∆0
∑0>?
@A
∆0
@
# of instances of h-motif 𝑡
in the given hypergraph
# of instances of h-motif 𝑡
in randomized hypergraphs
Roadmap
• Hypergraph Motif
• Proposed Method: MoCHy
• Experimental Results
• Conclusions
MoCHy: Motif Counting in Hypergraphs
• Given a hypergraph, how can we count the instances of each h-motif?
• We present MoCHy (Motif Counting in Hypergraphs), a family of parallel
algorithms for counting the instances of h-motifs.
Hypergraph Projection
(Preprocessing Step)
MoCHy-A
MoCHy-E
MoCHy-A+
Exact Counting
Approximate Counting
MoCHy: Hypergraph Projection
• Every version of MoCHy builds the projected graph ̅𝐺 = 𝐸,∧, 𝜔 of
the input hypergraph 𝐺 = 𝑉, 𝐸 .
• To find the neighbors of each hyperedge 𝑒", find hyperedge 𝑒# that
contains any node 𝑣 ∈ 𝑒".
FH
L
K
B G
S
R
𝒆 𝟏
𝒆 𝟐
𝒆 𝟑
𝒆 𝟒
𝒆 𝟐 𝒆 𝟑
𝒆 𝟏 𝒆 𝟒
hyperwedge ∧*,
MoCHy-E: Exact Counting
𝒆 𝟐 𝒆 𝟒
𝒆 𝟑 𝒆 𝟓
𝒆 𝟏
• Enumerate all three connected hyperedges in the hypergraph.
• Increment the count of h-motif corresponding to each instance.
𝑒?, 𝑒@, 𝑒L
𝑒?, 𝑒@, 𝑒M
𝑒@, 𝑒L, 𝑒M
𝑒@, 𝑒L, 𝑒N
𝑒L, 𝑒M, 𝑒N
Definition
𝒉 𝒆𝒊, 𝒆𝒋, 𝒆 𝒌 : h-motif corresponding to an instance 𝑒", 𝑒#, 𝑒$
𝑴 𝒕 : count of h-motif 𝑡’s instances
𝑀 ℎ 𝑒?, 𝑒@, 𝑒L ↑
𝑀 ℎ 𝑒?, 𝑒@, 𝑒M ↑
𝑀 ℎ 𝑒@, 𝑒L, 𝑒M ↑
𝑀 ℎ 𝑒@, 𝑒L, 𝑒N ↑
𝑀 ℎ 𝑒L, 𝑒M, 𝑒N ↑
MoCHy-E: Exact Counting
Question:
How to compute ℎ 𝑒", 𝑒#, 𝑒$ ?
Answer:
Check the emptiness of seven sets by computing the
cardinalities of each set ( 𝑂 min 𝑒! , 𝑒" , 𝑒# ).
(1) 𝑒"𝑒#𝑒$ = 𝑒" − 𝑒" ∩ 𝑒# − 𝑒$ ∩ 𝑒" + 𝑒" ∩ 𝑒# ∩ 𝑒#
(2) 𝑒#𝑒$𝑒" = 𝑒# − 𝑒" ∩ 𝑒# − 𝑒# ∩ 𝑒$ + 𝑒" ∩ 𝑒# ∩ 𝑒#
(3) 𝑒$𝑒"𝑒# = 𝑒$ − 𝑒$ ∩ 𝑒" − 𝑒# ∩ 𝑒$ + 𝑒" ∩ 𝑒# ∩ 𝑒#
(4) 𝑒" ∩ 𝑒#𝑒$ = 𝑒" ∩ 𝑒# − 𝑒" ∩ 𝑒# ∩ 𝑒#
(5) 𝑒# ∩ 𝑒$𝑒" = 𝑒# ∩ 𝑒$ − 𝑒" ∩ 𝑒# ∩ 𝑒#
(6) 𝑒$ ∩ 𝑒"𝑒# = 𝑒$ ∩ 𝑒" − 𝑒" ∩ 𝑒# ∩ 𝑒#
(7) 𝑒"𝑒#𝑒$
1
2 3
4 6
5
7
𝑒# 𝑒$
𝑒"
MoCHy-A: Hyperedge Sampling
𝒆 𝒔 𝒆 𝟑
𝒆 𝟐 𝒆 𝟒
𝒆 𝟏
Sampled hyperedge
𝑒U, 𝑒?, 𝑒@
𝑒U, 𝑒?, 𝑒L
𝑒U, 𝑒@, 𝑒L
𝑒U, 𝑒@, 𝑒M
• Sample 𝑠 hyperedges from the hyperedge set 𝐸 uniformly at
random.
• For each sampled hyperedge 𝑒U, count the number of instances
of each h-motif 𝑡 that contains 𝑒U.
• Rescale the total approximate counts based on the sample size.
𝒔
Rescale
MoCHy-A+: Hyperwedge Sampling
𝒆𝒊 𝒆 𝟐
𝒆𝒋 𝒆 𝟑
𝒆 𝟏
Sampled hyperwedge
𝑒", 𝑒#, 𝑒?
𝑒", 𝑒#, 𝑒@
𝑒", 𝑒#, 𝑒L
• Sample 𝑟 hyperwedges from the hyperwedge set ∧ uniformly at
random.
• For each sampled hyperwedge ∧"#= 𝑒", 𝑒# , count the number
of instances of each h-motif 𝑡 that contains ∧"#.
• Rescale the total approximate counts based on the sample size.
𝒓
Rescale
Roadmap
• Hypergraph Motif
• Proposed Method: MoCHy
• Experimental Results
• Conclusions
Experimental Settings
• 11 real-world hypergraphs from 5 different domains
• All versions of MoCHy implemented using C++ and OpenMP.
Co-authorship E-mail Contact Tags Threads
&
for parallelization
EXP1. Comparison with Random
• Real-world and random hypergraphs have distinct distributions of h-
motif instances.
real random real random real random real random real random
1 9.6E07 (7) 1.3E09 (4) 3 -0.86 4.8E04 (16) 2.8E07 (5) 11 -1.00 7.5E06 (13) 1.7E08 (7) 6 -0.91 9.0E08 (13) 2.2E11 (6) 7 -0.99 6.4E08 (7) 2.4E11 (4) 3 -0.99
2 7.0E09 (2) 7.2E09 (2) 0 -0.01 1.1E08 (3) 8.6E07 (3) 0 0.12 6.3E08 (2) 8.2E08 (3) 1 -0.13 1.6E12 (2) 1.6E12 (2) 0 0.02 1.1E12 (2) 7.7E11 (2) 0 0.16
3 2.2E06 (17) 6.1E03 (14) 3 0.99 2.8E03 (21) 1.7E05 (16) 5 -0.97 1.6E06 (21) 7.8E05 (17) 4 0.34 3.0E06 (20) 1.1E09 (15) 5 -0.99 1.7E05 (20) 1.7E08 (14) 6 -1.00
4 9.6E06 (11) 1.1E05 (12) 1 0.98 8.4E02 (24) 9.2E05 (12) 12 -1.00 4.3E06 (16) 1.5E07 (12) 4 -0.55 1.5E08 (17) 1.6E10 (12) 5 -0.98 3.1E06 (13) 1.2E09 (11) 2 -0.99
5 1.5E08 (6) 1.2E05 (11) 5 1.00 4.6E06 (5) 1.6E06 (11) 6 0.49 7.5E07 (7) 1.1E07 (13) 6 0.74 7.4E09 (8) 2.5E10 (8) 0 -0.54 4.1E08 (8) 1.7E09 (10) 2 -0.61
6 9.9E08 (3) 1.8E06 (9) 6 1.00 1.3E07 (4) 8.2E06 (7) 3 0.24 3.9E08 (4) 1.9E08 (6) 2 0.34 6.8E11 (3) 3.3E11 (4) 1 0.35 1.4E10 (4) 1.1E10 (8) 4 0.11
7 1.9E05 (23) 0.0E00 (20) 3 1.00 1.6E04 (17) 2.0E02 (24) 7 0.98 7.5E04 (24) 1.2E02 (25) 1 1.00 8.3E05 (25) 9.1E05 (25) 0 -0.05 8.8E03 (24) 1.7E04 (24) 0 -0.32
8 3.9E05 (22) 0.0E00 (20) 2 1.00 4.6E03 (20) 2.6E03 (22) 2 0.27 4.2E06 (17) 2.5E04 (21) 4 0.99 2.0E06 (23) 3.4E07 (22) 1 -0.89 2.2E04 (23) 3.5E05 (21) 2 -0.88
9 2.4E06 (16) 0.0E00 (20) 4 1.00 1.7E05 (12) 4.6E03 (20) 8 0.95 1.8E06 (20) 1.1E04 (22) 2 0.99 1.4E08 (18) 5.4E07 (21) 3 0.45 5.1E05 (17) 4.5E05 (20) 3 0.06
10 7.6E06 (13) 7.5E00 (18) 5 1.00 5.7E04 (15) 5.5E04 (17) 2 0.03 2.8E07 (10) 1.7E06 (14) 4 0.88 7.1E08 (14) 1.9E09 (14) 0 -0.45 2.3E06 (15) 9.4E06 (17) 2 -0.61
11 8.6E06 (12) 0.9E00 (19) 7 1.00 4.1E05 (11) 2.4E04 (18) 7 0.89 9.0E06 (11) 1.9E05 (19) 8 0.96 3.5E09 (10) 7.4E08 (16) 6 0.65 2.8E06 (14) 3.1E06 (18) 4 -0.05
12 6.4E07 (8) 1.9E02 (16) 8 1.00 1.7E05 (13) 2.7E05 (14) 1 -0.24 8.2E07 (6) 2.4E07 (10) 4 0.55 6.9E10 (6) 2.4E10 (10) 4 0.49 8.2E07 (10) 6.2E07 (15) 5 0.14
13 1.6E04 (26) 0.0E00 (20) 6 1.00 5.5E03 (19) 1.6E00 (26) 7 1.00 2.7E04 (26) 0.4E00 (26) 0 1.00 1.1E06 (24) 1.7E04 (26) 2 0.97 1.5E02 (26) 8.6E00 (26) 0 0.89
14 1.4E05 (24) 0.0E00 (20) 4 1.00 6.0E03 (18) 7.1E01 (25) 7 0.98 7.2E05 (22) 3.7E02 (24) 2 1.00 2.8E07 (19) 1.8E06 (24) 5 0.88 3.9E03 (25) 9.3E02 (25) 0 0.61
15 6.5E05 (19) 0.0E00 (20) 1 1.00 1.7E03 (22) 8.6E02 (23) 1 0.34 3.6E06 (19) 5.0E04 (20) 1 0.97 2.9E08 (15) 5.7E07 (20) 5 0.67 2.7E04 (22) 2.0E04 (23) 1 0.16
16 2.0E06 (18) 0.0E00 (20) 2 1.00 1.4E02 (25) 3.2E03 (21) 4 -0.92 6.7E06 (14) 1.7E06 (15) 1 0.60 1.9E09 (11) 5.8E08 (18) 7 0.53 2.4E05 (18) 1.3E05 (22) 4 0.29
17 4.2E05 (21) 2.0E06 (8) 13 -0.65 1.0E03 (23) 6.3E05 (13) 10 -1.00 3.8E04 (25) 8.7E05 (16) 9 -0.92 5.1E05 (26) 5.0E08 (19) 7 -1.00 2.3E05 (19) 9.2E08 (12) 7 -1.00
18 2.6E06 (15) 6.4E07 (7) 8 -0.92 1.2E02 (26) 7.0E06 (8) 18 -1.00 6.0E06 (15) 4.0E07 (8) 7 -0.74 2.5E06 (22) 1.6E10 (13) 9 -1.00 8.3E05 (16) 1.3E10 (7) 9 -1.00
19 3.6E07 (9) 6.7E07 (6) 3 -0.30 2.0E06 (6) 1.2E07 (6) 0 -0.72 8.7E06 (12) 2.9E07 (9) 3 -0.54 9.4E08 (12) 2.4E10 (9) 3 -0.93 3.5E08 (9) 1.8E10 (6) 3 -0.96
20 3.4E08 (5) 2.2E09 (3) 2 -0.73 6.0E05 (10) 1.3E08 (2) 8 -0.99 2.2E08 (5) 1.2E09 (2) 3 -0.69 9.2E09 (7) 7.2E11 (3) 4 -0.97 1.9E09 (5) 2.4E11 (3) 2 -0.98
21 7.9E08 (4) 5.6E08 (5) 1 0.17 1.7E08 (2) 5.7E07 (4) 2 0.50 5.3E08 (3) 2.3E08 (4) 1 0.39 1.2E11 (5) 2.8E11 (5) 0 -0.40 2.8E10 (3) 8.6E10 (5) 2 -0.51
22 1.7E10 (1) 1.8E10 (1) 0 -0.03 3.1E08 (1) 5.8E08 (1) 0 -0.30 4.9E09 (1) 8.5E09 (1) 0 -0.27 6.6E12 (1) 7.6E12 (1) 0 -0.07 1.1E12 (1) 1.2E12 (1) 0 -0.02
23 2.4E04 (25) 1.5E01 (17) 8 1.00 1.2E05 (14) 5.4E03 (19) 5 0.91 8.8E04 (23) 4.0E03 (23) 0 0.91 2.6E06 (21) 7.9E06 (23) 2 -0.51 1.4E05 (21) 7.8E05 (19) 2 -0.70
24 4.4E05 (20) 1.4E03 (15) 5 0.99 7.7E05 (9) 1.8E05 (15) 6 0.63 4.2E06 (18) 5.4E05 (18) 0 0.77 2.2E08 (16) 7.2E08 (17) 1 -0.53 7.5E06 (12) 3.1E07 (16) 4 -0.61
25 3.8E06 (14) 4.6E04 (13) 1 0.98 1.7E06 (8) 1.8E06 (10) 2 -0.03 3.2E07 (9) 2.0E07 (11) 2 0.23 6.0E09 (9) 2.0E10 (11) 2 -0.54 8.0E07 (11) 4.2E08 (13) 2 -0.68
26 2.3E07 (10) 4.9E05 (10) 0 0.96 1.8E06 (7) 6.14E06 (9) 2 -0.54 7.5E07 (8) 2.1E08 (5) 3 -0.48 1.3E11 (4) 1.8E11 (7) 3 -0.14 1.2E09 (6) 1.9E09 (9) 3 -0.21
h-motif
contact-primary email-EU tags-math threads-math
count (rank) count (rank)
coauth-DBLP
RD RC RD RC RD RC
count (rank) count (rank) count (rank)
RD RC RD RC
0
5
10
15
20
1 5 10 15 20 25
Hypergraph Motif Index
AbsoluteCounts
(logscaled)
0
5
10
15
20
1 5 10 15 20 25
Hypergraph Motif Index
AbsoluteCounts
(logscaled)
0
5
10
15
20
1 5 10 15 20 25
Hypergraph Motif Index
AbsoluteCounts
(logscaled)
0
10
20
30
1 5 10 15 20 25
Hypergraph Motif Index
AbsoluteCounts
(logscaled)
0
10
20
1 5 10 15 20 25
Hypergraph Motif Index
AbsoluteCounts
(logscaled)
EXP1. Comparison with Random
• Real-world and random hypergraphs have distinct distributions of h-
motif instances.
coauth-DBLP
email-Eu
contact-primary
tags-math
threads-math
EXP2. Comparison across Domains
• The CPs are similar within domains but different across domains.
-0.2
0.0
0.2
1 5 10 15 20 25
Hypergraph Motif Index
Normalized
Significance
coauth-DBLP
coauth-geology
coauth-history
Co-authorship
-0.2
0.0
0.2
1 5 10 15 20 25
Hypergraph Motif Index
Normalized
Significance
contact-primary
contact-high
Contact
-0.2
0.0
0.2
1 5 10 15 20 25
Hypergraph Motif Index
Normalized
Significance
email-Enron
email-EU
Email
-0.2
0.0
0.2
1 5 10 15 20 25
Hypergraph Motif Index
Normalized
Significance
threads-ubuntu
threads-math
Threads
-0.2
0.0
0.2
1 5 10 15 20 25
Hypergraph Motif Index
Normalized
Significance
tags-ubuntu
tags-math
Tags
EXP2. Comparison across Domains (cont.)
• Characteristic profiles (CPs) based on hypergraph motifs (h-motifs)
capture local structural patterns more accurately than CPs based on
network motifs.
1.0
0.9
0.8
0.7
0.6
0.5
0.4
coauth-DBLP
coauth-geology
coauth-history
contact-primary
contact-high
email-Enron
email-EU
tags-ubuntu
tags-math
threads-ubuntu
threads-math
1.0
0.9
0.8
0.7
CorrelationCoefficient
EXP3. Observations and Applications
• Trends in the formation of collaborations are captured by h-motifs.
(a) The fractions of the instances of h-motifs 2 and 22 have increased rapidly.
(b) The fraction of the instances of open h-motifs increased steadily since 2001.
0.3
0.4
0.5
0.6
0.7
1984 1992 2000 2008 2016
Year
Proportion
Closed h-motifs
Open h-motifs
0.00
0.25
0.50
0.75
1.00
1984 1992 2000 2008 2016
Year
Ratio
h-motif 1
h-motif 2
h-motif 3
h-motif 4
h-motif 5
h-motif 6
h-motif 7
h-motif 8
h-motif 9
h-motif 10
h-motif 11
h-motif 12
h-motif 13
h-motif 14
h-motif 15
h-motif 16
h-motif 23
h-motif 24
h-motif 25
h-motif 26
h-motif 17
h-motif 18
h-motif 19
h-motif 20
h-motif 21
h-motif 22
0.00
0.25
0.50
0.75
1.00
1984 1992 2000 2008 2016
Year
h-motif 1
h-motif 2
h-motif 3
h-motif 4
h-motif 5
h-motif 6
h-motif 7
h-motif 8
h-motif 9
h-motif 10
h-motif 11
h-motif 12
h-motif 13
h-motif 14
h-motif 15
h-motif 16
h-motif 23
h-motif 24
h-motif 25
h-motif 26
h-motif 17
h-motif 18
h-motif 19
h-motif 20
h-motif 21
h-motif 22
.25
.50
.75
.00
h-motif 1
h-motif 2
h-motif 3
h-motif 4
h-motif 5
h-motif 6
h-motif 7
h-motif 8
h-motif 9
h-motif 10
h-motif 11
h-motif 12
h-motif 13
h-motif 14
h-motif 15
h-motif 16
h-motif 23
h-motif 24
h-motif 25
h-motif 26
h-motif 17
h-motif 18
h-motif 19
h-motif 20
h-motif 21
h-motif 22
0.00
0.25
0.50
0.75
1.00
1984 1992 2000 2008 2016
Year
Ratio
h-motif 1
h-motif 2
h-motif 3
h-motif 4
h-motif 5
h-motif 6
h-motif 7
h-motif 8
h-motif 9
h-motif 10
h-motif 11
h-motif 12
h-motif 13
h-motif 14
h-motif 15
h-motif 16
h-motif 23
h-motif 24
h-motif 25
h-motif 26
h-motif 17
h-motif 18
h-motif 19
h-motif 20
h-motif 21
h-motif 22
.00
.25
.50
.75
.00
1984 1992 2000 2008 2016
Year
h-motif 1
h-motif 2
h-motif 3
h-motif 4
h-motif 5
h-motif 6
h-motif 7
h-motif 8
h-motif 9
h-motif 10
h-motif 11
h-motif 12
h-motif 13
h-motif 14
h-motif 15
h-motif 16
h-motif 23
h-motif 24
h-motif 25
h-motif 26
h-motif 17
h-motif 18
h-motif 19
h-motif 20
h-motif 21
h-motif 22
0.00
0.25
0.50
0.75
1.00
1984 1992 2000 2008 2016
Year
Ratio
h-motif 1
h-motif 2
h-motif 3
h-motif 4
h-motif 5
h-motif 6
h-motif 7
h-motif 8
h-motif 9
h-motif 10
h-motif 11
h-motif 12
h-motif 13
h-motif 14
h-motif 15
h-motif 16
h-motif 23
h-motif 24
h-motif 25
h-motif 26
h-motif 17
h-motif 18
h-motif 19
h-motif 20
h-motif 21
h-motif 22
0.00
0.25
0.50
0.75
1.00
1984 1992 2000 2008 2016
Year
Ratio
h-motif 1
h-motif 2
h-motif 3
h-motif 4
h-motif 5
h-motif 6
h-motif 7
h-motif 8
h-motif 9
h-motif 10
h-motif 11
h-motif 12
h-motif 13
h-motif 14
h-motif 15
h-motif 16
h-motif 23
h-motif 24
h-motif 25
h-motif 26
h-motif 17
h-motif 18
h-motif 19
h-motif 20
h-motif 21
h-motif 22
(a) Fraction of the instances of each h-motif in the coauth-DBLP over time. (b) Open and closed h-motifs.
EXP3. Observations and Applications (cont.)
• Hyperedge prediction: classify real hyperedges and fake ones.
• H-motifs give informative features.
HM26 HM7 HC
Logistic Regression
ACC 0.754 0.656 0.636
AUC 0.813 0.693 0.691
Random Forest
ACC 0.768 0.741 0.639
AUC 0.852 0.779 0.692
Decision Tree
ACC 0.731 0.684 0.613
AUC 0.732 0.685 0.616
K-Nearest Neighbors
ACC 0.694 0.689 0.640
AUC 0.750 0.743 0.684
MLP Classifier
ACC 0.795 0.762 0.646
AUC 0.875 0.841 0.701
ACC: accuracy, AUC: area under the ROC curve
The number of each h-motif’s instances that
contain each hyperedge.
HM26 (∈ ℝ&'
)
The seven features with the largest variance
among those in HM26.
HM7 (∈ ℝ(
)
The mean, maximum, and minimum degree
and the mean, maximum, and minimum
number of neighbors of the nodes in each
hyperedge and its size.
HC (∈ ℝ()
EXP4. Performance of Counting Algorithms
• MoCHy-A+ is up to 25X more accurate than MoCHy-A.
• MoCHy-A+ is up to 32.5X faster than MoCHy-E.
26.3X
5.9X
0.00
0.01
0.02
0.03
0.04
0 2 4 6
Elapsed Time (sec)
RelativeError
28.2X
10.2X
0.00
0.01
0.02
0.03
0 20 40 60
Elapsed Time (sec)
RelativeError
11.1X
8.4X
0.00
0.02
0.04
0.06
0.08
0 2 4 6 8 10
Elapsed Time (sec)
RelativeError
32.5X
24.6X
0.00
0.01
0.02
0.03
0.04
0 1000 2000
Elapsed Time (sec)
RelativeError
24.2X
25X
0.00
0.02
0.04
0.06
0 200 400
Elapsed Time (sec)
RelativeError
7.6X
7.6X
0.00
0.05
0.10
0.15
0.0 0.1 0.2
Elapsed Time (sec)
RelativeError
Definition
Relative Error:
∑'()
*+
* + , -* +
∑'()
*+ * +
threads-ubuntu
contact-high email-Enron coauth-history
contact-primary email-Eu
EXP4. Performance of Counting Algorithms (cont.)
• CPs obtained by MoCHy-A+ are estimated near perfectly even with a
smaller number of samples.
-0.2
0.0
0.2
1 6 11 16 21 26
Hypergraph Motif Index
Normalized
Significance
-0.2
0.0
0.2
1 6 11 16 21 26
Hypergraph Motif Index
Normalized
Significance
-0.2
0.0
0.2
1 6 11 16 21 26
Hypergraph Motif Index
Normalized
Significance
MoCHy-A+ (! = ∧ ⋅ 0.01)
MoCHy-A+ (! = ∧ ⋅ 0.05)
MoCHy-A+ (! = ∧ ⋅ 0.001)
MoCHy-A+ (! = ∧ ⋅ 0.005)
MoCHy-E
coauth-history
contact-primary
email-Eu
EXP4. Performance of Counting Algorithms (cont.)
• Both MoCHy-E and MoCHy-A+ achieve significant speedups with
multiple threads.
0
1000
2000
1 2 3 4 5 6 7 8
Number of Threads
ElapsedTime(sec)
1
2
3
4
5
6
7
8
1 2 3 4 5 6 7 8
Number of Threads
Speedup
MoCHy-A+ (! = 1M)
MoCHy-A+ (! = 2M)
MoCHy-A+ (! = 4M)
MoCHy-A+ (! = 8M)
MoCHy-E
MoCHy-A+ (! = 1M)
MoCHy-A+ (! = 2M)
MoCHy-A+ (! = 4M)
MoCHy-A+ (! = 8M)
0
500
1000
1500
2000
0 0.1 1 10 100
Memory (% of Edges)
ElapsedTime(sec)
1
2
3
4
5
0 0.1 1 10 100
Memory (% of Edges)
Speedup
EXP4. Performance of Counting Algorithms (cont.)
• Memoizing a small fraction of projected graphs leads to significant
speedups of MoCHy-A+.
Roadmap
• Hypergraph Motif
• Proposed Method: MoCHy
• Experimental Results
• Conclusions
Conclusions
• We propose hypergraph motifs (h-motifs) for describing the connectivity
patterns of hypergraphs.
Novel Concepts
Fast and Provable Algorithms
Discoveries in 11 Real-world Hypergraphs
-0.2
0.0
0.2
1 5 10 15 20 25
Hypergraph Motif Index
Normalized
Significance
coauth-DBLP
coauth-geology
coauth-history
Co-authorship
-0.2
0.0
0.2
1 5 10 15 20 25
Hypergraph Motif IndexNormalized
Significance
tags-ubuntu
tags-math
Tags
Hypergraph Motifs:
Concepts, Algorithms, and Discoveries
Kijung ShinJihoon KoGeon Lee

More Related Content

What's hot

Trial terengganu 2014 spm add math k1 skema [scan]
Trial terengganu 2014 spm add math k1 skema [scan]Trial terengganu 2014 spm add math k1 skema [scan]
Trial terengganu 2014 spm add math k1 skema [scan]
Cikgu Pejal
 
Oct8 - 131 slid
Oct8 - 131 slidOct8 - 131 slid
Oct8 - 131 slid
Tak Lee
 
Application of parallel hierarchical matrices and low-rank tensors in spatial...
Application of parallel hierarchical matrices and low-rank tensors in spatial...Application of parallel hierarchical matrices and low-rank tensors in spatial...
Application of parallel hierarchical matrices and low-rank tensors in spatial...
Alexander Litvinenko
 
Igraph
IgraphIgraph
Igraph
Anu Radha
 
Deep genenergyprobdoc
Deep genenergyprobdocDeep genenergyprobdoc
Deep genenergyprobdoc
Masato Nakai
 
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
PyData
 
Fixed point and common fixed point theorems in complete metric spaces
Fixed point and common fixed point theorems in complete metric spacesFixed point and common fixed point theorems in complete metric spaces
Fixed point and common fixed point theorems in complete metric spacesAlexander Decker
 
Hideitsu Hino
Hideitsu HinoHideitsu Hino
Hideitsu Hino
Suurist
 
Algebra and Trigonometry 9th Edition Larson Solutions Manual
Algebra and Trigonometry 9th Edition Larson Solutions ManualAlgebra and Trigonometry 9th Edition Larson Solutions Manual
Algebra and Trigonometry 9th Edition Larson Solutions Manual
kejeqadaqo
 
Trial terengganu 2014 spm add math k2 skema
Trial terengganu 2014 spm add math k2 skemaTrial terengganu 2014 spm add math k2 skema
Trial terengganu 2014 spm add math k2 skema
Cikgu Pejal
 
Identification of unknown parameters and prediction with hierarchical matrice...
Identification of unknown parameters and prediction with hierarchical matrice...Identification of unknown parameters and prediction with hierarchical matrice...
Identification of unknown parameters and prediction with hierarchical matrice...
Alexander Litvinenko
 
Lecture 5: Stochastic Hydrology
Lecture 5: Stochastic Hydrology Lecture 5: Stochastic Hydrology
Lecture 5: Stochastic Hydrology
Amro Elfeki
 
Tetsunao Matsuta
Tetsunao MatsutaTetsunao Matsuta
Tetsunao Matsuta
Suurist
 
Solution Manual : Chapter - 01 Functions
Solution Manual : Chapter - 01 FunctionsSolution Manual : Chapter - 01 Functions
Solution Manual : Chapter - 01 Functions
Hareem Aslam
 
Hiroyuki Sato
Hiroyuki SatoHiroyuki Sato
Hiroyuki Sato
Suurist
 
Digital signal processing (2nd ed) (mitra) solution manual
Digital signal processing (2nd ed) (mitra) solution manualDigital signal processing (2nd ed) (mitra) solution manual
Digital signal processing (2nd ed) (mitra) solution manual
Ramesh Sundar
 
Numerical integration
Numerical integrationNumerical integration
Numerical integration
vishal kathiriya
 
Skema SPM SBP Add Maths Paper 2012
Skema SPM SBP Add Maths Paper 2012Skema SPM SBP Add Maths Paper 2012
Skema SPM SBP Add Maths Paper 2012
Tuisyen Geliga
 

What's hot (20)

Trial terengganu 2014 spm add math k1 skema [scan]
Trial terengganu 2014 spm add math k1 skema [scan]Trial terengganu 2014 spm add math k1 skema [scan]
Trial terengganu 2014 spm add math k1 skema [scan]
 
Oct8 - 131 slid
Oct8 - 131 slidOct8 - 131 slid
Oct8 - 131 slid
 
Application of parallel hierarchical matrices and low-rank tensors in spatial...
Application of parallel hierarchical matrices and low-rank tensors in spatial...Application of parallel hierarchical matrices and low-rank tensors in spatial...
Application of parallel hierarchical matrices and low-rank tensors in spatial...
 
Igraph
IgraphIgraph
Igraph
 
Deep genenergyprobdoc
Deep genenergyprobdocDeep genenergyprobdoc
Deep genenergyprobdoc
 
ppt_tech
ppt_techppt_tech
ppt_tech
 
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
 
Fixed point and common fixed point theorems in complete metric spaces
Fixed point and common fixed point theorems in complete metric spacesFixed point and common fixed point theorems in complete metric spaces
Fixed point and common fixed point theorems in complete metric spaces
 
Hideitsu Hino
Hideitsu HinoHideitsu Hino
Hideitsu Hino
 
Algebra and Trigonometry 9th Edition Larson Solutions Manual
Algebra and Trigonometry 9th Edition Larson Solutions ManualAlgebra and Trigonometry 9th Edition Larson Solutions Manual
Algebra and Trigonometry 9th Edition Larson Solutions Manual
 
Trial terengganu 2014 spm add math k2 skema
Trial terengganu 2014 spm add math k2 skemaTrial terengganu 2014 spm add math k2 skema
Trial terengganu 2014 spm add math k2 skema
 
Identification of unknown parameters and prediction with hierarchical matrice...
Identification of unknown parameters and prediction with hierarchical matrice...Identification of unknown parameters and prediction with hierarchical matrice...
Identification of unknown parameters and prediction with hierarchical matrice...
 
Lecture 5: Stochastic Hydrology
Lecture 5: Stochastic Hydrology Lecture 5: Stochastic Hydrology
Lecture 5: Stochastic Hydrology
 
Tetsunao Matsuta
Tetsunao MatsutaTetsunao Matsuta
Tetsunao Matsuta
 
Solution Manual : Chapter - 01 Functions
Solution Manual : Chapter - 01 FunctionsSolution Manual : Chapter - 01 Functions
Solution Manual : Chapter - 01 Functions
 
Hiroyuki Sato
Hiroyuki SatoHiroyuki Sato
Hiroyuki Sato
 
Digital signal processing (2nd ed) (mitra) solution manual
Digital signal processing (2nd ed) (mitra) solution manualDigital signal processing (2nd ed) (mitra) solution manual
Digital signal processing (2nd ed) (mitra) solution manual
 
5431305022
54313050225431305022
5431305022
 
Numerical integration
Numerical integrationNumerical integration
Numerical integration
 
Skema SPM SBP Add Maths Paper 2012
Skema SPM SBP Add Maths Paper 2012Skema SPM SBP Add Maths Paper 2012
Skema SPM SBP Add Maths Paper 2012
 

Similar to Hmotif vldb2020 slide

A computational method for system of linear fredholm integral equations
A computational method for system of linear fredholm integral equationsA computational method for system of linear fredholm integral equations
A computational method for system of linear fredholm integral equationsAlexander Decker
 
Robots, Small Molecules & R
Robots, Small Molecules & RRobots, Small Molecules & R
Robots, Small Molecules & RRajarshi Guha
 
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Simplilearn
 
Lecture-1-Algorithms.pptx
Lecture-1-Algorithms.pptxLecture-1-Algorithms.pptx
Lecture-1-Algorithms.pptx
xalahama3
 
Abductive commonsense reasoning
Abductive commonsense reasoningAbductive commonsense reasoning
Abductive commonsense reasoning
San Kim
 
A new CPXR Based Logistic Regression Method and Clinical Prognostic Modeling ...
A new CPXR Based Logistic Regression Method and Clinical Prognostic Modeling ...A new CPXR Based Logistic Regression Method and Clinical Prognostic Modeling ...
A new CPXR Based Logistic Regression Method and Clinical Prognostic Modeling ...
Vahid Taslimitehrani
 
PREDICTION MODELS BASED ON MAX-STEMS Episode Two: Combinatorial Approach
PREDICTION MODELS BASED ON MAX-STEMS Episode Two: Combinatorial ApproachPREDICTION MODELS BASED ON MAX-STEMS Episode Two: Combinatorial Approach
PREDICTION MODELS BASED ON MAX-STEMS Episode Two: Combinatorial Approach
ahmet furkan emrehan
 
Transformers.pdf
Transformers.pdfTransformers.pdf
Transformers.pdf
Ali Zoljodi
 
thmotif_icdm2021_slide.pdf
thmotif_icdm2021_slide.pdfthmotif_icdm2021_slide.pdf
thmotif_icdm2021_slide.pdf
Geon Lee
 
Faster, More Effective Flowgraph-based Malware Classification
Faster, More Effective Flowgraph-based Malware ClassificationFaster, More Effective Flowgraph-based Malware Classification
Faster, More Effective Flowgraph-based Malware ClassificationSilvio Cesare
 
Curvefitting
CurvefittingCurvefitting
Curvefitting
Philberto Saroni
 
Grds international conference on pure and applied science (5)
Grds international conference on pure and applied science (5)Grds international conference on pure and applied science (5)
Grds international conference on pure and applied science (5)
Global R & D Services
 
Graph Summarization with Quality Guarantees
Graph Summarization with Quality GuaranteesGraph Summarization with Quality Guarantees
Graph Summarization with Quality Guarantees
Two Sigma
 
Smart Room Gesture Control
Smart Room Gesture ControlSmart Room Gesture Control
Smart Room Gesture Control
Giwrgos Paraskevopoulos
 
AINL 2016: Alekseev, Nikolenko
AINL 2016: Alekseev, NikolenkoAINL 2016: Alekseev, Nikolenko
AINL 2016: Alekseev, Nikolenko
Lidia Pivovarova
 
Wenying Yan Heuristics for Truck Scheduling at Cross Docking Terminals
Wenying Yan Heuristics for Truck Scheduling at Cross Docking TerminalsWenying Yan Heuristics for Truck Scheduling at Cross Docking Terminals
Wenying Yan Heuristics for Truck Scheduling at Cross Docking TerminalsWenying Yan
 
ML基本からResNetまで
ML基本からResNetまでML基本からResNetまで
ML基本からResNetまで
Institute of Agricultural Machinery, NARO
 
Slides for "Do Deep Generative Models Know What They Don't know?"
Slides for "Do Deep Generative Models Know What They Don't know?"Slides for "Do Deep Generative Models Know What They Don't know?"
Slides for "Do Deep Generative Models Know What They Don't know?"
Julius Hietala
 
Computer Vision: Correlation, Convolution, and Gradient
Computer Vision: Correlation, Convolution, and GradientComputer Vision: Correlation, Convolution, and Gradient
Computer Vision: Correlation, Convolution, and Gradient
Ahmed Gad
 
Symbolic Computation via Gröbner Basis
Symbolic Computation via Gröbner BasisSymbolic Computation via Gröbner Basis
Symbolic Computation via Gröbner Basis
IJERA Editor
 

Similar to Hmotif vldb2020 slide (20)

A computational method for system of linear fredholm integral equations
A computational method for system of linear fredholm integral equationsA computational method for system of linear fredholm integral equations
A computational method for system of linear fredholm integral equations
 
Robots, Small Molecules & R
Robots, Small Molecules & RRobots, Small Molecules & R
Robots, Small Molecules & R
 
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
 
Lecture-1-Algorithms.pptx
Lecture-1-Algorithms.pptxLecture-1-Algorithms.pptx
Lecture-1-Algorithms.pptx
 
Abductive commonsense reasoning
Abductive commonsense reasoningAbductive commonsense reasoning
Abductive commonsense reasoning
 
A new CPXR Based Logistic Regression Method and Clinical Prognostic Modeling ...
A new CPXR Based Logistic Regression Method and Clinical Prognostic Modeling ...A new CPXR Based Logistic Regression Method and Clinical Prognostic Modeling ...
A new CPXR Based Logistic Regression Method and Clinical Prognostic Modeling ...
 
PREDICTION MODELS BASED ON MAX-STEMS Episode Two: Combinatorial Approach
PREDICTION MODELS BASED ON MAX-STEMS Episode Two: Combinatorial ApproachPREDICTION MODELS BASED ON MAX-STEMS Episode Two: Combinatorial Approach
PREDICTION MODELS BASED ON MAX-STEMS Episode Two: Combinatorial Approach
 
Transformers.pdf
Transformers.pdfTransformers.pdf
Transformers.pdf
 
thmotif_icdm2021_slide.pdf
thmotif_icdm2021_slide.pdfthmotif_icdm2021_slide.pdf
thmotif_icdm2021_slide.pdf
 
Faster, More Effective Flowgraph-based Malware Classification
Faster, More Effective Flowgraph-based Malware ClassificationFaster, More Effective Flowgraph-based Malware Classification
Faster, More Effective Flowgraph-based Malware Classification
 
Curvefitting
CurvefittingCurvefitting
Curvefitting
 
Grds international conference on pure and applied science (5)
Grds international conference on pure and applied science (5)Grds international conference on pure and applied science (5)
Grds international conference on pure and applied science (5)
 
Graph Summarization with Quality Guarantees
Graph Summarization with Quality GuaranteesGraph Summarization with Quality Guarantees
Graph Summarization with Quality Guarantees
 
Smart Room Gesture Control
Smart Room Gesture ControlSmart Room Gesture Control
Smart Room Gesture Control
 
AINL 2016: Alekseev, Nikolenko
AINL 2016: Alekseev, NikolenkoAINL 2016: Alekseev, Nikolenko
AINL 2016: Alekseev, Nikolenko
 
Wenying Yan Heuristics for Truck Scheduling at Cross Docking Terminals
Wenying Yan Heuristics for Truck Scheduling at Cross Docking TerminalsWenying Yan Heuristics for Truck Scheduling at Cross Docking Terminals
Wenying Yan Heuristics for Truck Scheduling at Cross Docking Terminals
 
ML基本からResNetまで
ML基本からResNetまでML基本からResNetまで
ML基本からResNetまで
 
Slides for "Do Deep Generative Models Know What They Don't know?"
Slides for "Do Deep Generative Models Know What They Don't know?"Slides for "Do Deep Generative Models Know What They Don't know?"
Slides for "Do Deep Generative Models Know What They Don't know?"
 
Computer Vision: Correlation, Convolution, and Gradient
Computer Vision: Correlation, Convolution, and GradientComputer Vision: Correlation, Convolution, and Gradient
Computer Vision: Correlation, Convolution, and Gradient
 
Symbolic Computation via Gröbner Basis
Symbolic Computation via Gröbner BasisSymbolic Computation via Gröbner Basis
Symbolic Computation via Gröbner Basis
 

Recently uploaded

一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 

Recently uploaded (20)

一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 

Hmotif vldb2020 slide

  • 1. Hypergraph Motifs: Concepts, Algorithms, and Discoveries Kijung ShinJihoon KoGeon Lee
  • 2. Hypergraphs are Everywhere Icon made by Freepik from www.flaticon.com Collaborations of Researchers Co-purchases of Items Joint Interactions of Proteins • Hypergraphs consist of nodes and hyperedges. • Each hyperedge is a subset of any number of nodes.
  • 3. Our Questions Q1 What are structural design principles of real-world hypergraphs? Real-world Hypergraph Randomized Hypergraph VS
  • 4. Our Questions (cont.) Q2 How can we compare local structures of hypergraphs of different sizes? VS Small Hypergraph Large Hypergraph
  • 5. Our Questions (cont.) Q3 How can we identify domains which hypergraphs are from?
  • 6. Roadmap • Hypergraph Motif • Proposed Method: MoCHy • Experimental Results • Conclusions
  • 7. Hypergraph Motifs: Definition • Hypergraph motifs (h-motifs) describe connectivity patterns of three connected hyperedges. • H-motifs describe the connectivity pattern of hyperedges 𝑒", 𝑒#, and 𝑒$ by the emptiness of seven subsets. 1 2 3 4 6 5 7 𝑒" 𝑒# 𝑒$ (1) 𝑒"𝑒#𝑒$ (2) 𝑒#𝑒$𝑒" (3) 𝑒$𝑒"𝑒# (4) 𝑒" ∩ 𝑒#𝑒$ (5) 𝑒# ∩ 𝑒$𝑒" (6) 𝑒$ ∩ 𝑒"𝑒# (7) 𝑒" ∩ 𝑒# ∩ 𝑒$
  • 8. Hypergraph Motifs: Definition (cont.) • While there can exist 2( h-motifs, 26 h-motifs remain once we exclude: 1. symmetric ones 2. those with duplicated hyperedges 3. those cannot be obtained from connected hyperedges. h-motif 1 h-motif 2 h-motif 3 h-motif 4 h-motif 5 h-motif 6 h-motif 7 h-motif 8 h-motif 9 h-motif 10 h-motif 11 h-motif 12 h-motif 13 h-motif 14 h-motif 15 h-motif 16 h-motif 17 h-motif 18 h-motif 19 h-motif 20 h-motif 21 h-motif 22 h-motif 23 h-motif 24 h-motif 25 h-motif 26 Open h-motifs contain two non-adjacent hyperedges. Others are closed h-motifs.
  • 9. Hypergraph Motifs: Example 1 2 54 36 7 1 2 54 36 7 1 2 54 36 7 𝒆 𝟑 𝒆 𝟐 𝒆 𝟏 𝒆 𝟐 𝒆 𝟏 𝒆 𝟒 𝒆 𝟑 𝒆 𝟒 𝒆 𝟏 FH L K B G S R 𝒆 𝟏 𝒆 𝟐 𝒆 𝟑 𝒆 𝟒 • Hypergraph motifs (h-motifs) describe connectivity patterns of three connected hyperedges.
  • 10. Hypergraph Motifs: Properties Exhaustive H-motifs capture connectivity patterns of all possible three connected hyperedges. Unique Connectivity pattern of any three connected hyperedges is captured by at most one h-motif. Size Independent H-motifs capture connectivity patterns independently of the sizes of hyperedges.
  • 11. Hypergraph Motifs: Properties Question: Why are non-pairwise relations considered? Answer: Non-pairwise relations play a key role in capturing the local structural patterns of real-world hypergraphs. 𝒆 𝟐 𝒆 𝟏 𝒆 𝟒 𝒆 𝟑 𝒆 𝟏 𝒆 𝟒 𝑒), 𝑒*, 𝑒+ and 𝑒), 𝑒,, 𝑒+ have same pairwise relations, while their connectivity patterns are distinguished by h-motifs.
  • 12. Characteristic Profiles Question: How can we summarize structural properties of hypergraphs? Answer: We compute a compact vector of normalized significance of every h-motif.
  • 13. Characteristic Profiles (cont.) • Significance of h-motifs ∆0≔ 𝑀 𝑡 − 𝑀5678 𝑡 𝑀 𝑡 + 𝑀5678 𝑡 + 𝜖 • Characteristic Profiles (CPs) 𝐶𝑃0 ≔ ∆0 ∑0>? @A ∆0 @ # of instances of h-motif 𝑡 in the given hypergraph # of instances of h-motif 𝑡 in randomized hypergraphs
  • 14. Roadmap • Hypergraph Motif • Proposed Method: MoCHy • Experimental Results • Conclusions
  • 15. MoCHy: Motif Counting in Hypergraphs • Given a hypergraph, how can we count the instances of each h-motif? • We present MoCHy (Motif Counting in Hypergraphs), a family of parallel algorithms for counting the instances of h-motifs. Hypergraph Projection (Preprocessing Step) MoCHy-A MoCHy-E MoCHy-A+ Exact Counting Approximate Counting
  • 16. MoCHy: Hypergraph Projection • Every version of MoCHy builds the projected graph ̅𝐺 = 𝐸,∧, 𝜔 of the input hypergraph 𝐺 = 𝑉, 𝐸 . • To find the neighbors of each hyperedge 𝑒", find hyperedge 𝑒# that contains any node 𝑣 ∈ 𝑒". FH L K B G S R 𝒆 𝟏 𝒆 𝟐 𝒆 𝟑 𝒆 𝟒 𝒆 𝟐 𝒆 𝟑 𝒆 𝟏 𝒆 𝟒 hyperwedge ∧*,
  • 17. MoCHy-E: Exact Counting 𝒆 𝟐 𝒆 𝟒 𝒆 𝟑 𝒆 𝟓 𝒆 𝟏 • Enumerate all three connected hyperedges in the hypergraph. • Increment the count of h-motif corresponding to each instance. 𝑒?, 𝑒@, 𝑒L 𝑒?, 𝑒@, 𝑒M 𝑒@, 𝑒L, 𝑒M 𝑒@, 𝑒L, 𝑒N 𝑒L, 𝑒M, 𝑒N Definition 𝒉 𝒆𝒊, 𝒆𝒋, 𝒆 𝒌 : h-motif corresponding to an instance 𝑒", 𝑒#, 𝑒$ 𝑴 𝒕 : count of h-motif 𝑡’s instances 𝑀 ℎ 𝑒?, 𝑒@, 𝑒L ↑ 𝑀 ℎ 𝑒?, 𝑒@, 𝑒M ↑ 𝑀 ℎ 𝑒@, 𝑒L, 𝑒M ↑ 𝑀 ℎ 𝑒@, 𝑒L, 𝑒N ↑ 𝑀 ℎ 𝑒L, 𝑒M, 𝑒N ↑
  • 18. MoCHy-E: Exact Counting Question: How to compute ℎ 𝑒", 𝑒#, 𝑒$ ? Answer: Check the emptiness of seven sets by computing the cardinalities of each set ( 𝑂 min 𝑒! , 𝑒" , 𝑒# ). (1) 𝑒"𝑒#𝑒$ = 𝑒" − 𝑒" ∩ 𝑒# − 𝑒$ ∩ 𝑒" + 𝑒" ∩ 𝑒# ∩ 𝑒# (2) 𝑒#𝑒$𝑒" = 𝑒# − 𝑒" ∩ 𝑒# − 𝑒# ∩ 𝑒$ + 𝑒" ∩ 𝑒# ∩ 𝑒# (3) 𝑒$𝑒"𝑒# = 𝑒$ − 𝑒$ ∩ 𝑒" − 𝑒# ∩ 𝑒$ + 𝑒" ∩ 𝑒# ∩ 𝑒# (4) 𝑒" ∩ 𝑒#𝑒$ = 𝑒" ∩ 𝑒# − 𝑒" ∩ 𝑒# ∩ 𝑒# (5) 𝑒# ∩ 𝑒$𝑒" = 𝑒# ∩ 𝑒$ − 𝑒" ∩ 𝑒# ∩ 𝑒# (6) 𝑒$ ∩ 𝑒"𝑒# = 𝑒$ ∩ 𝑒" − 𝑒" ∩ 𝑒# ∩ 𝑒# (7) 𝑒"𝑒#𝑒$ 1 2 3 4 6 5 7 𝑒# 𝑒$ 𝑒"
  • 19. MoCHy-A: Hyperedge Sampling 𝒆 𝒔 𝒆 𝟑 𝒆 𝟐 𝒆 𝟒 𝒆 𝟏 Sampled hyperedge 𝑒U, 𝑒?, 𝑒@ 𝑒U, 𝑒?, 𝑒L 𝑒U, 𝑒@, 𝑒L 𝑒U, 𝑒@, 𝑒M • Sample 𝑠 hyperedges from the hyperedge set 𝐸 uniformly at random. • For each sampled hyperedge 𝑒U, count the number of instances of each h-motif 𝑡 that contains 𝑒U. • Rescale the total approximate counts based on the sample size. 𝒔 Rescale
  • 20. MoCHy-A+: Hyperwedge Sampling 𝒆𝒊 𝒆 𝟐 𝒆𝒋 𝒆 𝟑 𝒆 𝟏 Sampled hyperwedge 𝑒", 𝑒#, 𝑒? 𝑒", 𝑒#, 𝑒@ 𝑒", 𝑒#, 𝑒L • Sample 𝑟 hyperwedges from the hyperwedge set ∧ uniformly at random. • For each sampled hyperwedge ∧"#= 𝑒", 𝑒# , count the number of instances of each h-motif 𝑡 that contains ∧"#. • Rescale the total approximate counts based on the sample size. 𝒓 Rescale
  • 21. Roadmap • Hypergraph Motif • Proposed Method: MoCHy • Experimental Results • Conclusions
  • 22. Experimental Settings • 11 real-world hypergraphs from 5 different domains • All versions of MoCHy implemented using C++ and OpenMP. Co-authorship E-mail Contact Tags Threads & for parallelization
  • 23. EXP1. Comparison with Random • Real-world and random hypergraphs have distinct distributions of h- motif instances. real random real random real random real random real random 1 9.6E07 (7) 1.3E09 (4) 3 -0.86 4.8E04 (16) 2.8E07 (5) 11 -1.00 7.5E06 (13) 1.7E08 (7) 6 -0.91 9.0E08 (13) 2.2E11 (6) 7 -0.99 6.4E08 (7) 2.4E11 (4) 3 -0.99 2 7.0E09 (2) 7.2E09 (2) 0 -0.01 1.1E08 (3) 8.6E07 (3) 0 0.12 6.3E08 (2) 8.2E08 (3) 1 -0.13 1.6E12 (2) 1.6E12 (2) 0 0.02 1.1E12 (2) 7.7E11 (2) 0 0.16 3 2.2E06 (17) 6.1E03 (14) 3 0.99 2.8E03 (21) 1.7E05 (16) 5 -0.97 1.6E06 (21) 7.8E05 (17) 4 0.34 3.0E06 (20) 1.1E09 (15) 5 -0.99 1.7E05 (20) 1.7E08 (14) 6 -1.00 4 9.6E06 (11) 1.1E05 (12) 1 0.98 8.4E02 (24) 9.2E05 (12) 12 -1.00 4.3E06 (16) 1.5E07 (12) 4 -0.55 1.5E08 (17) 1.6E10 (12) 5 -0.98 3.1E06 (13) 1.2E09 (11) 2 -0.99 5 1.5E08 (6) 1.2E05 (11) 5 1.00 4.6E06 (5) 1.6E06 (11) 6 0.49 7.5E07 (7) 1.1E07 (13) 6 0.74 7.4E09 (8) 2.5E10 (8) 0 -0.54 4.1E08 (8) 1.7E09 (10) 2 -0.61 6 9.9E08 (3) 1.8E06 (9) 6 1.00 1.3E07 (4) 8.2E06 (7) 3 0.24 3.9E08 (4) 1.9E08 (6) 2 0.34 6.8E11 (3) 3.3E11 (4) 1 0.35 1.4E10 (4) 1.1E10 (8) 4 0.11 7 1.9E05 (23) 0.0E00 (20) 3 1.00 1.6E04 (17) 2.0E02 (24) 7 0.98 7.5E04 (24) 1.2E02 (25) 1 1.00 8.3E05 (25) 9.1E05 (25) 0 -0.05 8.8E03 (24) 1.7E04 (24) 0 -0.32 8 3.9E05 (22) 0.0E00 (20) 2 1.00 4.6E03 (20) 2.6E03 (22) 2 0.27 4.2E06 (17) 2.5E04 (21) 4 0.99 2.0E06 (23) 3.4E07 (22) 1 -0.89 2.2E04 (23) 3.5E05 (21) 2 -0.88 9 2.4E06 (16) 0.0E00 (20) 4 1.00 1.7E05 (12) 4.6E03 (20) 8 0.95 1.8E06 (20) 1.1E04 (22) 2 0.99 1.4E08 (18) 5.4E07 (21) 3 0.45 5.1E05 (17) 4.5E05 (20) 3 0.06 10 7.6E06 (13) 7.5E00 (18) 5 1.00 5.7E04 (15) 5.5E04 (17) 2 0.03 2.8E07 (10) 1.7E06 (14) 4 0.88 7.1E08 (14) 1.9E09 (14) 0 -0.45 2.3E06 (15) 9.4E06 (17) 2 -0.61 11 8.6E06 (12) 0.9E00 (19) 7 1.00 4.1E05 (11) 2.4E04 (18) 7 0.89 9.0E06 (11) 1.9E05 (19) 8 0.96 3.5E09 (10) 7.4E08 (16) 6 0.65 2.8E06 (14) 3.1E06 (18) 4 -0.05 12 6.4E07 (8) 1.9E02 (16) 8 1.00 1.7E05 (13) 2.7E05 (14) 1 -0.24 8.2E07 (6) 2.4E07 (10) 4 0.55 6.9E10 (6) 2.4E10 (10) 4 0.49 8.2E07 (10) 6.2E07 (15) 5 0.14 13 1.6E04 (26) 0.0E00 (20) 6 1.00 5.5E03 (19) 1.6E00 (26) 7 1.00 2.7E04 (26) 0.4E00 (26) 0 1.00 1.1E06 (24) 1.7E04 (26) 2 0.97 1.5E02 (26) 8.6E00 (26) 0 0.89 14 1.4E05 (24) 0.0E00 (20) 4 1.00 6.0E03 (18) 7.1E01 (25) 7 0.98 7.2E05 (22) 3.7E02 (24) 2 1.00 2.8E07 (19) 1.8E06 (24) 5 0.88 3.9E03 (25) 9.3E02 (25) 0 0.61 15 6.5E05 (19) 0.0E00 (20) 1 1.00 1.7E03 (22) 8.6E02 (23) 1 0.34 3.6E06 (19) 5.0E04 (20) 1 0.97 2.9E08 (15) 5.7E07 (20) 5 0.67 2.7E04 (22) 2.0E04 (23) 1 0.16 16 2.0E06 (18) 0.0E00 (20) 2 1.00 1.4E02 (25) 3.2E03 (21) 4 -0.92 6.7E06 (14) 1.7E06 (15) 1 0.60 1.9E09 (11) 5.8E08 (18) 7 0.53 2.4E05 (18) 1.3E05 (22) 4 0.29 17 4.2E05 (21) 2.0E06 (8) 13 -0.65 1.0E03 (23) 6.3E05 (13) 10 -1.00 3.8E04 (25) 8.7E05 (16) 9 -0.92 5.1E05 (26) 5.0E08 (19) 7 -1.00 2.3E05 (19) 9.2E08 (12) 7 -1.00 18 2.6E06 (15) 6.4E07 (7) 8 -0.92 1.2E02 (26) 7.0E06 (8) 18 -1.00 6.0E06 (15) 4.0E07 (8) 7 -0.74 2.5E06 (22) 1.6E10 (13) 9 -1.00 8.3E05 (16) 1.3E10 (7) 9 -1.00 19 3.6E07 (9) 6.7E07 (6) 3 -0.30 2.0E06 (6) 1.2E07 (6) 0 -0.72 8.7E06 (12) 2.9E07 (9) 3 -0.54 9.4E08 (12) 2.4E10 (9) 3 -0.93 3.5E08 (9) 1.8E10 (6) 3 -0.96 20 3.4E08 (5) 2.2E09 (3) 2 -0.73 6.0E05 (10) 1.3E08 (2) 8 -0.99 2.2E08 (5) 1.2E09 (2) 3 -0.69 9.2E09 (7) 7.2E11 (3) 4 -0.97 1.9E09 (5) 2.4E11 (3) 2 -0.98 21 7.9E08 (4) 5.6E08 (5) 1 0.17 1.7E08 (2) 5.7E07 (4) 2 0.50 5.3E08 (3) 2.3E08 (4) 1 0.39 1.2E11 (5) 2.8E11 (5) 0 -0.40 2.8E10 (3) 8.6E10 (5) 2 -0.51 22 1.7E10 (1) 1.8E10 (1) 0 -0.03 3.1E08 (1) 5.8E08 (1) 0 -0.30 4.9E09 (1) 8.5E09 (1) 0 -0.27 6.6E12 (1) 7.6E12 (1) 0 -0.07 1.1E12 (1) 1.2E12 (1) 0 -0.02 23 2.4E04 (25) 1.5E01 (17) 8 1.00 1.2E05 (14) 5.4E03 (19) 5 0.91 8.8E04 (23) 4.0E03 (23) 0 0.91 2.6E06 (21) 7.9E06 (23) 2 -0.51 1.4E05 (21) 7.8E05 (19) 2 -0.70 24 4.4E05 (20) 1.4E03 (15) 5 0.99 7.7E05 (9) 1.8E05 (15) 6 0.63 4.2E06 (18) 5.4E05 (18) 0 0.77 2.2E08 (16) 7.2E08 (17) 1 -0.53 7.5E06 (12) 3.1E07 (16) 4 -0.61 25 3.8E06 (14) 4.6E04 (13) 1 0.98 1.7E06 (8) 1.8E06 (10) 2 -0.03 3.2E07 (9) 2.0E07 (11) 2 0.23 6.0E09 (9) 2.0E10 (11) 2 -0.54 8.0E07 (11) 4.2E08 (13) 2 -0.68 26 2.3E07 (10) 4.9E05 (10) 0 0.96 1.8E06 (7) 6.14E06 (9) 2 -0.54 7.5E07 (8) 2.1E08 (5) 3 -0.48 1.3E11 (4) 1.8E11 (7) 3 -0.14 1.2E09 (6) 1.9E09 (9) 3 -0.21 h-motif contact-primary email-EU tags-math threads-math count (rank) count (rank) coauth-DBLP RD RC RD RC RD RC count (rank) count (rank) count (rank) RD RC RD RC
  • 24. 0 5 10 15 20 1 5 10 15 20 25 Hypergraph Motif Index AbsoluteCounts (logscaled) 0 5 10 15 20 1 5 10 15 20 25 Hypergraph Motif Index AbsoluteCounts (logscaled) 0 5 10 15 20 1 5 10 15 20 25 Hypergraph Motif Index AbsoluteCounts (logscaled) 0 10 20 30 1 5 10 15 20 25 Hypergraph Motif Index AbsoluteCounts (logscaled) 0 10 20 1 5 10 15 20 25 Hypergraph Motif Index AbsoluteCounts (logscaled) EXP1. Comparison with Random • Real-world and random hypergraphs have distinct distributions of h- motif instances. coauth-DBLP email-Eu contact-primary tags-math threads-math
  • 25. EXP2. Comparison across Domains • The CPs are similar within domains but different across domains. -0.2 0.0 0.2 1 5 10 15 20 25 Hypergraph Motif Index Normalized Significance coauth-DBLP coauth-geology coauth-history Co-authorship -0.2 0.0 0.2 1 5 10 15 20 25 Hypergraph Motif Index Normalized Significance contact-primary contact-high Contact -0.2 0.0 0.2 1 5 10 15 20 25 Hypergraph Motif Index Normalized Significance email-Enron email-EU Email -0.2 0.0 0.2 1 5 10 15 20 25 Hypergraph Motif Index Normalized Significance threads-ubuntu threads-math Threads -0.2 0.0 0.2 1 5 10 15 20 25 Hypergraph Motif Index Normalized Significance tags-ubuntu tags-math Tags
  • 26. EXP2. Comparison across Domains (cont.) • Characteristic profiles (CPs) based on hypergraph motifs (h-motifs) capture local structural patterns more accurately than CPs based on network motifs. 1.0 0.9 0.8 0.7 0.6 0.5 0.4 coauth-DBLP coauth-geology coauth-history contact-primary contact-high email-Enron email-EU tags-ubuntu tags-math threads-ubuntu threads-math 1.0 0.9 0.8 0.7 CorrelationCoefficient
  • 27. EXP3. Observations and Applications • Trends in the formation of collaborations are captured by h-motifs. (a) The fractions of the instances of h-motifs 2 and 22 have increased rapidly. (b) The fraction of the instances of open h-motifs increased steadily since 2001. 0.3 0.4 0.5 0.6 0.7 1984 1992 2000 2008 2016 Year Proportion Closed h-motifs Open h-motifs 0.00 0.25 0.50 0.75 1.00 1984 1992 2000 2008 2016 Year Ratio h-motif 1 h-motif 2 h-motif 3 h-motif 4 h-motif 5 h-motif 6 h-motif 7 h-motif 8 h-motif 9 h-motif 10 h-motif 11 h-motif 12 h-motif 13 h-motif 14 h-motif 15 h-motif 16 h-motif 23 h-motif 24 h-motif 25 h-motif 26 h-motif 17 h-motif 18 h-motif 19 h-motif 20 h-motif 21 h-motif 22 0.00 0.25 0.50 0.75 1.00 1984 1992 2000 2008 2016 Year h-motif 1 h-motif 2 h-motif 3 h-motif 4 h-motif 5 h-motif 6 h-motif 7 h-motif 8 h-motif 9 h-motif 10 h-motif 11 h-motif 12 h-motif 13 h-motif 14 h-motif 15 h-motif 16 h-motif 23 h-motif 24 h-motif 25 h-motif 26 h-motif 17 h-motif 18 h-motif 19 h-motif 20 h-motif 21 h-motif 22 .25 .50 .75 .00 h-motif 1 h-motif 2 h-motif 3 h-motif 4 h-motif 5 h-motif 6 h-motif 7 h-motif 8 h-motif 9 h-motif 10 h-motif 11 h-motif 12 h-motif 13 h-motif 14 h-motif 15 h-motif 16 h-motif 23 h-motif 24 h-motif 25 h-motif 26 h-motif 17 h-motif 18 h-motif 19 h-motif 20 h-motif 21 h-motif 22 0.00 0.25 0.50 0.75 1.00 1984 1992 2000 2008 2016 Year Ratio h-motif 1 h-motif 2 h-motif 3 h-motif 4 h-motif 5 h-motif 6 h-motif 7 h-motif 8 h-motif 9 h-motif 10 h-motif 11 h-motif 12 h-motif 13 h-motif 14 h-motif 15 h-motif 16 h-motif 23 h-motif 24 h-motif 25 h-motif 26 h-motif 17 h-motif 18 h-motif 19 h-motif 20 h-motif 21 h-motif 22 .00 .25 .50 .75 .00 1984 1992 2000 2008 2016 Year h-motif 1 h-motif 2 h-motif 3 h-motif 4 h-motif 5 h-motif 6 h-motif 7 h-motif 8 h-motif 9 h-motif 10 h-motif 11 h-motif 12 h-motif 13 h-motif 14 h-motif 15 h-motif 16 h-motif 23 h-motif 24 h-motif 25 h-motif 26 h-motif 17 h-motif 18 h-motif 19 h-motif 20 h-motif 21 h-motif 22 0.00 0.25 0.50 0.75 1.00 1984 1992 2000 2008 2016 Year Ratio h-motif 1 h-motif 2 h-motif 3 h-motif 4 h-motif 5 h-motif 6 h-motif 7 h-motif 8 h-motif 9 h-motif 10 h-motif 11 h-motif 12 h-motif 13 h-motif 14 h-motif 15 h-motif 16 h-motif 23 h-motif 24 h-motif 25 h-motif 26 h-motif 17 h-motif 18 h-motif 19 h-motif 20 h-motif 21 h-motif 22 0.00 0.25 0.50 0.75 1.00 1984 1992 2000 2008 2016 Year Ratio h-motif 1 h-motif 2 h-motif 3 h-motif 4 h-motif 5 h-motif 6 h-motif 7 h-motif 8 h-motif 9 h-motif 10 h-motif 11 h-motif 12 h-motif 13 h-motif 14 h-motif 15 h-motif 16 h-motif 23 h-motif 24 h-motif 25 h-motif 26 h-motif 17 h-motif 18 h-motif 19 h-motif 20 h-motif 21 h-motif 22 (a) Fraction of the instances of each h-motif in the coauth-DBLP over time. (b) Open and closed h-motifs.
  • 28. EXP3. Observations and Applications (cont.) • Hyperedge prediction: classify real hyperedges and fake ones. • H-motifs give informative features. HM26 HM7 HC Logistic Regression ACC 0.754 0.656 0.636 AUC 0.813 0.693 0.691 Random Forest ACC 0.768 0.741 0.639 AUC 0.852 0.779 0.692 Decision Tree ACC 0.731 0.684 0.613 AUC 0.732 0.685 0.616 K-Nearest Neighbors ACC 0.694 0.689 0.640 AUC 0.750 0.743 0.684 MLP Classifier ACC 0.795 0.762 0.646 AUC 0.875 0.841 0.701 ACC: accuracy, AUC: area under the ROC curve The number of each h-motif’s instances that contain each hyperedge. HM26 (∈ ℝ&' ) The seven features with the largest variance among those in HM26. HM7 (∈ ℝ( ) The mean, maximum, and minimum degree and the mean, maximum, and minimum number of neighbors of the nodes in each hyperedge and its size. HC (∈ ℝ()
  • 29. EXP4. Performance of Counting Algorithms • MoCHy-A+ is up to 25X more accurate than MoCHy-A. • MoCHy-A+ is up to 32.5X faster than MoCHy-E. 26.3X 5.9X 0.00 0.01 0.02 0.03 0.04 0 2 4 6 Elapsed Time (sec) RelativeError 28.2X 10.2X 0.00 0.01 0.02 0.03 0 20 40 60 Elapsed Time (sec) RelativeError 11.1X 8.4X 0.00 0.02 0.04 0.06 0.08 0 2 4 6 8 10 Elapsed Time (sec) RelativeError 32.5X 24.6X 0.00 0.01 0.02 0.03 0.04 0 1000 2000 Elapsed Time (sec) RelativeError 24.2X 25X 0.00 0.02 0.04 0.06 0 200 400 Elapsed Time (sec) RelativeError 7.6X 7.6X 0.00 0.05 0.10 0.15 0.0 0.1 0.2 Elapsed Time (sec) RelativeError Definition Relative Error: ∑'() *+ * + , -* + ∑'() *+ * + threads-ubuntu contact-high email-Enron coauth-history contact-primary email-Eu
  • 30. EXP4. Performance of Counting Algorithms (cont.) • CPs obtained by MoCHy-A+ are estimated near perfectly even with a smaller number of samples. -0.2 0.0 0.2 1 6 11 16 21 26 Hypergraph Motif Index Normalized Significance -0.2 0.0 0.2 1 6 11 16 21 26 Hypergraph Motif Index Normalized Significance -0.2 0.0 0.2 1 6 11 16 21 26 Hypergraph Motif Index Normalized Significance MoCHy-A+ (! = ∧ ⋅ 0.01) MoCHy-A+ (! = ∧ ⋅ 0.05) MoCHy-A+ (! = ∧ ⋅ 0.001) MoCHy-A+ (! = ∧ ⋅ 0.005) MoCHy-E coauth-history contact-primary email-Eu
  • 31. EXP4. Performance of Counting Algorithms (cont.) • Both MoCHy-E and MoCHy-A+ achieve significant speedups with multiple threads. 0 1000 2000 1 2 3 4 5 6 7 8 Number of Threads ElapsedTime(sec) 1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8 Number of Threads Speedup MoCHy-A+ (! = 1M) MoCHy-A+ (! = 2M) MoCHy-A+ (! = 4M) MoCHy-A+ (! = 8M) MoCHy-E
  • 32. MoCHy-A+ (! = 1M) MoCHy-A+ (! = 2M) MoCHy-A+ (! = 4M) MoCHy-A+ (! = 8M) 0 500 1000 1500 2000 0 0.1 1 10 100 Memory (% of Edges) ElapsedTime(sec) 1 2 3 4 5 0 0.1 1 10 100 Memory (% of Edges) Speedup EXP4. Performance of Counting Algorithms (cont.) • Memoizing a small fraction of projected graphs leads to significant speedups of MoCHy-A+.
  • 33. Roadmap • Hypergraph Motif • Proposed Method: MoCHy • Experimental Results • Conclusions
  • 34. Conclusions • We propose hypergraph motifs (h-motifs) for describing the connectivity patterns of hypergraphs. Novel Concepts Fast and Provable Algorithms Discoveries in 11 Real-world Hypergraphs -0.2 0.0 0.2 1 5 10 15 20 25 Hypergraph Motif Index Normalized Significance coauth-DBLP coauth-geology coauth-history Co-authorship -0.2 0.0 0.2 1 5 10 15 20 25 Hypergraph Motif IndexNormalized Significance tags-ubuntu tags-math Tags
  • 35. Hypergraph Motifs: Concepts, Algorithms, and Discoveries Kijung ShinJihoon KoGeon Lee