Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Summary of MAST
1. Sangjun Son (SNU) 1
Multi-Aspect Streaming
Tensor Completion
(Qingquan Song et al., KDD 2017)
Sangjun Son
Data Mining Lab
Dept. of CSE
Seoul National University
4. Sangjun Son (SNU) 4
Tensor
◼ An N-way array which is a generalization of
vectors and matrices.
5. Sangjun Son (SNU) 5
Real-world Tensor
◼ Recommendation
❑ A 3rd order tensor
Index: (user, movie, time)
❑ Value: movie rating
◼ Link Prediction
❑ A 3rd order tensor
Index: (user, user, interaction)
❑ Value: connectivity
Missing entries of partially observed tensors
6. Sangjun Son (SNU) 6
Tensor Completion (1/2)
◼ Real-world tensors are often incomplete.
❑ Due to missing at random,
limited permissions and maloperations.
◼ Tensor Decomposition (CP)
user
topic
𝒖𝑖
≈
𝑖=1
𝑅
𝒕𝑖
=
𝐔
𝐓
𝐄
𝓣
𝑖=1
𝑅
𝒖𝑖 ∘ 𝒕𝑖 ∘ 𝒆𝑖≈ = 𝐔, 𝐄, 𝐓
7. Sangjun Son (SNU) 7
Tensor Completion (2/2)
user
topic
≈
𝐔
𝐓
𝐄
user
topic
=
8. Sangjun Son (SNU) 8
Tensor Completion (2/2)
user
topic
≈
𝐔
𝐓
𝐄
user
topic
=
9. Sangjun Son (SNU) 9
Tensor Completion (2/2)
user
topic
≈
𝐔
𝐓
𝐄
user
topic
=
10. Sangjun Son (SNU) 10
Tensor Completion (2/2)
user
topic
≈
𝐔
𝐓
𝐄
user
topic
=
◼ Fill missing entries of partially observed tensors.
❑ Recommender systems
❑ Image recovery
❑ Clinical data analysis
11. Sangjun Son (SNU) 11
Streaming Tensor
◼ High velocity streaming tensors in real world.
❑ Due to popularity of online information systems.
❑ Develop in one temporal mode.
user
item
𝓣 𝑡
user
item
user
item
𝓣 𝑡+1 𝓣 𝑡+2
12. Sangjun Son (SNU) 12
Multi-Aspect Streaming Tensor
◼ Existing methods ignore that,
❑ A tensor may develop in multiple dimensions.
user
item
𝓣 𝑡 𝓣 𝑡+1 𝓣 𝑡+2
item
user
item
user
13. Sangjun Son (SNU) 13
Multi-Aspect Streaming Tensor
◼ Existing methods ignore that,
❑ A tensor may develop in multiple dimensions.
user
item
𝓣 𝑡 𝓣 𝑡+1 𝓣 𝑡+2
item
user
item
user
⊆ ⊆
22. Sangjun Son (SNU) 22
Problem Definition
◼ Multi-Aspect Steaming Tensor Completion
❑ Given MAST sequence 𝓧(𝑇) with missing entries,
recover the missing data in current snapshot 𝓧(𝑇).
❑ Input
◼ Previously recovered MAST
𝓧(𝑇−1)
⊆ 𝓧 𝑇
❑ Output
◼ Relative complement of
𝓧(𝑇−1)
in 𝓧 𝑇
, 𝓧(𝑇)
𝓧 𝑇−1
❑ Goal
◼ Maximize the effectiveness
and efficiency
user
item
item
user
𝓧(𝑇−1)
𝓧(𝑇)
34. Sangjun Son (SNU) 34
Low Rank Tensor Completion
◼ Generalized from matrix completion problem.
minimize
𝓧
𝑟𝑎𝑛𝑘(𝓧) subject to 𝛀 ⊛ 𝓧 = 𝓣
❑ 𝓧 is the complete tensor and
𝓣 denotes the practical observations of 𝓧.
𝛀 is a binary tensor indicating whether
each corresponding entry is observed or not.
❑ Rank Calculation is NP-hard!
35. Sangjun Son (SNU) 35
Low Rank Tensor Completion
◼ Relaxation objective function
minimize
𝓧
𝑟𝑎𝑛𝑘(𝓧) subject to 𝛀 ⊛ 𝓧 = 𝓣
minimize
𝓧, 𝐀1,𝐀2,… ,𝐀 𝑁
𝑛=1
𝑁
𝛼 𝑛 𝐀 𝑛 ∗ subject to 𝛀 ⊛ 𝓧 = 𝓣
where 𝛼 𝑛 are trade-offs to balance the significance of each mode
and ∙ ∗ is a nuclear norm of a matrix.
36. Sangjun Son (SNU) 36
MAST Framework
◼ Loss function
❑ 𝓛DTD = 𝜇 ෪𝐀1, ෪𝐀2, … , ෪𝐀 𝑁 − 𝐀1
(0)
, 𝐀2
(0)
, … , 𝐀 𝑁
(0)
F
2
+ 𝓛0
❑ 𝓛LRTC = σ 𝑛=1
𝑁
𝛼 𝑛 𝐀 𝑛 ∗
❑ 𝓛 = 𝓛DTD + 𝓛LRTC
◼ Optimization method
❑ Alternating Direction Method of Multipliers
❑ Recover missing entries with an EM-like approach.
41. Sangjun Son (SNU) 41
Experimental Setup
◼ Datasets
◼ Baselines
❑ Static CP-ALS: CPD with ALS
❑ TNCP: trace norm based CPD using ADMM
❑ OLSTEC: online CPD with recursive least square
42. Sangjun Son (SNU) 42
Evaluation Metric
◼ How effective?
❑ Running Average Area Under Curve, RA-AUC
❑
1
𝑇
σ 𝑡=1
𝑇
AUC 𝑡
◼ How efficient?
❑ Average Running Time
❑
1
𝑇
σ 𝑡=1
𝑇
RT𝑡
Figure from GLASS BOX
Machine Learning and Medicine,
by Rachel Lea Ballantyne Draelos
43. Sangjun Son (SNU) 43
Evaluation of Effectiveness
◼ MAST has commensurate performance w. static CPD
method.
◼ T-MAST outperforms MAST on temporal tensor
completion.
44. Sangjun Son (SNU) 44
Evaluation of Efficiency
◼ MAST outperforms all the others.
◼ Computation of T-MAST is faster than static methods,
but slower than MAST.
46. Sangjun Son (SNU) 46
Conclusion
◼ Define the problem of MAST completion.
❑ Propose a CP-based general algorithm MAST.
❑ Propose a modified model T-MAST for a special case.
◼ Empirically validate the effectiveness
and efficiency on real-world datasets.
❑ MAST, T-MAST outperforms in speed,
maintaining decomposition accuracy.
47. Sangjun Son (SNU) 47
Relevance to My Research
◼ This is the first work on streaming analysis
which solved multi-aspect streaming problem.
◼ I’m also working on streaming tensor.
❑ Online tensor analysis when drastic data incomes.
◼ I will implement this approach.
❑ Get intuitions to improve my model.
❑ Experiment with real world streaming datasets.