SlideShare a Scribd company logo
1 KYOTO UNIVERSITY
KYOTO UNIVERSITY
Fast Unbalanced Optimal Transport on a Tree
Ryoma Sato
Kyoto University / RIKEN AIP
2 / 13 KYOTO UNIVERSITY
Self Introduction

I am a second year master's student at
Kyoto University

I’m interested in algorithmic aspects of machine
learning and data mining for structured data, including
Graph neural networks:

Ryoma Sato, Makoto Yamada, Hisashi Kashima. Approximation Ratios of
Graph Neural Networks for Combinatorial Problems. NeurIPS 2019.

Ryoma Sato, Makoto Yamada, Hisashi Kashima. Random Features
Strengthen Graph Neural Networks. SDM 2021
Optimal transport:

Ryoma Sato, Makoto Yamada, Hisashi Kashima. Fast Unbalanced Optimal
Transport on a Tree. NeurIPS 2020.
 Today’s topic
3 / 13 KYOTO UNIVERSITY
Background: optimal transport is useful

The optimal transport (OT) distance measures
the discrepancy of two distributions.
We consider discrete distributions in this presentation.

The OT distance is the minimum cumulative
distance that all masses need to travel from
one distribution to another distribution

In generative modeling, a mass is a sample.
discrepancy of model  sample distribution

In text classification, a mass is a word.
 OT does not require the same support  KL divergence
 OT exploits the underground geometry
From Word Embeddings To
Document Distances, ICML 2015
4 / 13 KYOTO UNIVERSITY
Background: sliced OT is computationally cheap

OT is formulated as a linear program  cubic cost

Sliced OT projects distributions to random
1D spaces and computes OT there

Greedy matching solves 1D OT exactly  linear cost
: distance matrix (input), : matching matrix (variable)
: 1st mass vector (input), : 2nd mass vector (input)
The leftmost mass should be matched to
the leftmost mass
The second leftmost mass should be matched
to the second leftmost mass ...
https://www.programmersought.com/article/67174999352/
https://analyticsindiamag.com/how-to-establish-domain-transferability-in-neural-models/
5 / 13 KYOTO UNIVERSITY
Background: unbalanced OT is robust

OT is sensitive to outliers because transporting outliers
becomes the dominating term

Unbalanced OT (UOT) allows to discard and create
masses by paying some penalties
 We can discard outliers  robust to outliers

UOT is also formulated by a linear program
 cubic cost
6 / 13 KYOTO UNIVERSITY
Background: UOT is difficult even in 1D spaces

We want to make a cheap alternative of UOT as 1D OT

But the greedy matching fails to solve 1D UOT

Let’s consider the following instance with discard cost λ

The following plan costs 3λ.

The following plan costs 2λ + 2ε. Thus this is better.
λ λ λ
7 / 13 KYOTO UNIVERSITY
Background: UOT is difficult even in 1D spaces

Let’s consider the following instance with discard cost λ

The following plan costs λ + 2ε.

The following plan costs 2λ + 2ε. Thus this is worse.
λ ε
8 / 13 KYOTO UNIVERSITY
Background: UOT is difficult even in 1D spaces

Although these two instances share the leftmost part,
the leftmost mass in the first instance should be
discarded while that in the second instance should not

The optimal UOT plan cannot be determined locally
 The optimal OT plan is determined locally

Thus the greedy algorithm fails to solve 1D UOT

We proposed how to solve 1D UOT efficiently
λ λ λ
λ ε
9 / 13 KYOTO UNIVERSITY
Algorithm: prune redundant plans

Our proposed method determines assignments from
left to right (as the greedy algorithm)

Although there are exponentially many plans, most of
them are redundant.
We proved that only O(n) plans are non-redundant
 Only one plan is non-redundant (thus greedy is valid) in the standard OT
not yet
not yet
 non redundant
 redundant
10 / 13 KYOTO UNIVERSITY
Algorithm: we solve 1D UOT in O(n log2
n) time

A naive algorithm requires cubic time even with this
(non redundant plan) observation

More algorithmic techniques are required for further
speedup (skipped in this presentation)

Dynamic programming

Fast convex min-sum convolution

Efficient data structure (BBST)

Weighted union heuristics

Finally, we derived a quasi-linear time algorithm
which runs in O(n log2
n) time in the worst case
11 / 13 KYOTO UNIVERSITY
Algorithm: tree UOT generalizes 1D UOT

Our method can be extended to tree spaces
A 1D space (path) is a special case of tree spaces

In text classification, the word
space can be represented by a
word tree. Each mass (word)
travels on the word tree to a
nearby (semantically similar) word.

We can “tree-slice” high dimensional
spaces instead of 1D-slicing,
which captures richer structures
http://www.sthda.com/english/articles/31-principal-component-methods-in-r-practical-guide/117-hcpc-hierarchical-clustering-on-principal-components-essentials/
a → b
12 / 13 KYOTO UNIVERSITY
Experiments: our algorithm is empirically fast

We confirmed that our algorithm could compute tree
UOT with one million masses within one second

We also confirmed that tree-slicing high dimensional
spaces could approximate the original UOT problem
13 / 13 KYOTO UNIVERSITY
Conclusion: fast computation of tree UOT

Sliced OT is a fast alternative of OT

UOT is a robust variant of OT

1D UOT is more difficult than 1D OT

We proposed an efficient algorithm for 1D UOT for the
first time

Our method can be extended to tree spaces

Our method is empirically fast (1M masses in 1 sec)

More Related Content

What's hot

Sliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデルSliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデル
ohken
 
4 データ間の距離と類似度
4 データ間の距離と類似度4 データ間の距離と類似度
4 データ間の距離と類似度
Seiichi Uchida
 
EMアルゴリズム
EMアルゴリズムEMアルゴリズム
[DL輪読会]GENESIS: Generative Scene Inference and Sampling with Object-Centric L...
[DL輪読会]GENESIS: Generative Scene Inference and Sampling with Object-Centric L...[DL輪読会]GENESIS: Generative Scene Inference and Sampling with Object-Centric L...
[DL輪読会]GENESIS: Generative Scene Inference and Sampling with Object-Centric L...
Deep Learning JP
 
大規模凸最適化問題に対する勾配法
大規模凸最適化問題に対する勾配法大規模凸最適化問題に対する勾配法
大規模凸最適化問題に対する勾配法
京都大学大学院情報学研究科数理工学専攻
 
Curriculum Learning (関東CV勉強会)
Curriculum Learning (関東CV勉強会)Curriculum Learning (関東CV勉強会)
Curriculum Learning (関東CV勉強会)
Yoshitaka Ushiku
 
【論文読み会】Deep Clustering for Unsupervised Learning of Visual Features
【論文読み会】Deep Clustering for Unsupervised Learning of Visual Features【論文読み会】Deep Clustering for Unsupervised Learning of Visual Features
【論文読み会】Deep Clustering for Unsupervised Learning of Visual Features
ARISE analytics
 
【DL輪読会】Is Conditional Generative Modeling All You Need For Decision-Making?
【DL輪読会】Is Conditional Generative Modeling All You Need For Decision-Making?【DL輪読会】Is Conditional Generative Modeling All You Need For Decision-Making?
【DL輪読会】Is Conditional Generative Modeling All You Need For Decision-Making?
Deep Learning JP
 
[DL輪読会]マテリアルズインフォマティクスにおける深層学習の応用
[DL輪読会]マテリアルズインフォマティクスにおける深層学習の応用[DL輪読会]マテリアルズインフォマティクスにおける深層学習の応用
[DL輪読会]マテリアルズインフォマティクスにおける深層学習の応用
Deep Learning JP
 
Transformer メタサーベイ
Transformer メタサーベイTransformer メタサーベイ
Transformer メタサーベイ
cvpaper. challenge
 
[DL輪読会]data2vec: A General Framework for Self-supervised Learning in Speech,...
[DL輪読会]data2vec: A General Framework for  Self-supervised Learning in Speech,...[DL輪読会]data2vec: A General Framework for  Self-supervised Learning in Speech,...
[DL輪読会]data2vec: A General Framework for Self-supervised Learning in Speech,...
Deep Learning JP
 
PRML輪読#14
PRML輪読#14PRML輪読#14
PRML輪読#14
matsuolab
 
【DL輪読会】Dropout Reduces Underfitting
【DL輪読会】Dropout Reduces Underfitting【DL輪読会】Dropout Reduces Underfitting
【DL輪読会】Dropout Reduces Underfitting
Deep Learning JP
 
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
SSII
 
Wasserstein GANを熟読する
Wasserstein GANを熟読するWasserstein GANを熟読する
Wasserstein GANを熟読する
ssusera4bf2d
 
[DL輪読会]Estimating Predictive Uncertainty via Prior Networks
[DL輪読会]Estimating Predictive Uncertainty via Prior Networks[DL輪読会]Estimating Predictive Uncertainty via Prior Networks
[DL輪読会]Estimating Predictive Uncertainty via Prior Networks
Deep Learning JP
 
最適輸送入門
最適輸送入門最適輸送入門
最適輸送入門
joisino
 
PRML学習者から入る深層生成モデル入門
PRML学習者から入る深層生成モデル入門PRML学習者から入る深層生成モデル入門
PRML学習者から入る深層生成モデル入門
tmtm otm
 
自動微分変分ベイズ法の紹介
自動微分変分ベイズ法の紹介自動微分変分ベイズ法の紹介
自動微分変分ベイズ法の紹介
Taku Yoshioka
 

What's hot (20)

Sliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデルSliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデル
 
4 データ間の距離と類似度
4 データ間の距離と類似度4 データ間の距離と類似度
4 データ間の距離と類似度
 
EMアルゴリズム
EMアルゴリズムEMアルゴリズム
EMアルゴリズム
 
[DL輪読会]GENESIS: Generative Scene Inference and Sampling with Object-Centric L...
[DL輪読会]GENESIS: Generative Scene Inference and Sampling with Object-Centric L...[DL輪読会]GENESIS: Generative Scene Inference and Sampling with Object-Centric L...
[DL輪読会]GENESIS: Generative Scene Inference and Sampling with Object-Centric L...
 
大規模凸最適化問題に対する勾配法
大規模凸最適化問題に対する勾配法大規模凸最適化問題に対する勾配法
大規模凸最適化問題に対する勾配法
 
Curriculum Learning (関東CV勉強会)
Curriculum Learning (関東CV勉強会)Curriculum Learning (関東CV勉強会)
Curriculum Learning (関東CV勉強会)
 
【論文読み会】Deep Clustering for Unsupervised Learning of Visual Features
【論文読み会】Deep Clustering for Unsupervised Learning of Visual Features【論文読み会】Deep Clustering for Unsupervised Learning of Visual Features
【論文読み会】Deep Clustering for Unsupervised Learning of Visual Features
 
【DL輪読会】Is Conditional Generative Modeling All You Need For Decision-Making?
【DL輪読会】Is Conditional Generative Modeling All You Need For Decision-Making?【DL輪読会】Is Conditional Generative Modeling All You Need For Decision-Making?
【DL輪読会】Is Conditional Generative Modeling All You Need For Decision-Making?
 
[DL輪読会]マテリアルズインフォマティクスにおける深層学習の応用
[DL輪読会]マテリアルズインフォマティクスにおける深層学習の応用[DL輪読会]マテリアルズインフォマティクスにおける深層学習の応用
[DL輪読会]マテリアルズインフォマティクスにおける深層学習の応用
 
Transformer メタサーベイ
Transformer メタサーベイTransformer メタサーベイ
Transformer メタサーベイ
 
PRML_2.3.1~2.3.3
PRML_2.3.1~2.3.3PRML_2.3.1~2.3.3
PRML_2.3.1~2.3.3
 
[DL輪読会]data2vec: A General Framework for Self-supervised Learning in Speech,...
[DL輪読会]data2vec: A General Framework for  Self-supervised Learning in Speech,...[DL輪読会]data2vec: A General Framework for  Self-supervised Learning in Speech,...
[DL輪読会]data2vec: A General Framework for Self-supervised Learning in Speech,...
 
PRML輪読#14
PRML輪読#14PRML輪読#14
PRML輪読#14
 
【DL輪読会】Dropout Reduces Underfitting
【DL輪読会】Dropout Reduces Underfitting【DL輪読会】Dropout Reduces Underfitting
【DL輪読会】Dropout Reduces Underfitting
 
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
 
Wasserstein GANを熟読する
Wasserstein GANを熟読するWasserstein GANを熟読する
Wasserstein GANを熟読する
 
[DL輪読会]Estimating Predictive Uncertainty via Prior Networks
[DL輪読会]Estimating Predictive Uncertainty via Prior Networks[DL輪読会]Estimating Predictive Uncertainty via Prior Networks
[DL輪読会]Estimating Predictive Uncertainty via Prior Networks
 
最適輸送入門
最適輸送入門最適輸送入門
最適輸送入門
 
PRML学習者から入る深層生成モデル入門
PRML学習者から入る深層生成モデル入門PRML学習者から入る深層生成モデル入門
PRML学習者から入る深層生成モデル入門
 
自動微分変分ベイズ法の紹介
自動微分変分ベイズ法の紹介自動微分変分ベイズ法の紹介
自動微分変分ベイズ法の紹介
 

Similar to Fast Unbalanced Optimal Transport on a Tree

IRJET- A Particle Swarm Optimization Algorithm for Total Cost Minimization in...
IRJET- A Particle Swarm Optimization Algorithm for Total Cost Minimization in...IRJET- A Particle Swarm Optimization Algorithm for Total Cost Minimization in...
IRJET- A Particle Swarm Optimization Algorithm for Total Cost Minimization in...
IRJET Journal
 
Huge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei
Huge-Scale Molecular Dynamics Simulation of Multi-bubble NucleiHuge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei
Huge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei
Hiroshi Watanabe
 
NON - EUCLIDEAN METRIC AND PATH PLANNING
NON - EUCLIDEAN METRIC AND PATH PLANNINGNON - EUCLIDEAN METRIC AND PATH PLANNING
NON - EUCLIDEAN METRIC AND PATH PLANNING
IJCSES Journal
 
Nano mos25
Nano mos25Nano mos25
Citython presentation
Citython presentationCitython presentation
Citython presentation
Ankit Tewari
 
Aggregation computation over distributed data streams(the final version)
Aggregation computation over distributed data streams(the final version)Aggregation computation over distributed data streams(the final version)
Aggregation computation over distributed data streams(the final version)
Yueshen Xu
 
Algorithms And Optimization Techniques For Solving TSP
Algorithms And Optimization Techniques For Solving TSPAlgorithms And Optimization Techniques For Solving TSP
Algorithms And Optimization Techniques For Solving TSP
Carrie Romero
 
Optimised Calculation of Symmetries for State Space Reduction
Optimised Calculation of Symmetries for State Space ReductionOptimised Calculation of Symmetries for State Space Reduction
Optimised Calculation of Symmetries for State Space Reduction
Universität Rostock
 
18 786
18 78618 786
18 786
hoailinhtinh
 
Comparing reinforcement learning and access points with rowel
Comparing reinforcement learning and access points with rowelComparing reinforcement learning and access points with rowel
Comparing reinforcement learning and access points with rowel
ijcseit
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...
ijcseit
 
som
somsom
High-Speed Neural Network Controller for Autonomous Robot Navigation using FPGA
High-Speed Neural Network Controller for Autonomous Robot Navigation using FPGAHigh-Speed Neural Network Controller for Autonomous Robot Navigation using FPGA
High-Speed Neural Network Controller for Autonomous Robot Navigation using FPGA
iosrjce
 
Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016
ijcsbi
 
H011114758
H011114758H011114758
H011114758
IOSR Journals
 
Tuple-Based Coordination in Large-Scale Situated Systems
Tuple-Based Coordination in Large-Scale Situated SystemsTuple-Based Coordination in Large-Scale Situated Systems
Tuple-Based Coordination in Large-Scale Situated Systems
Roberto Casadei
 
THE IMPACT OF MOBILE NODES ARRIVAL PATTERNS IN MANETS USING POISSON MODELS
THE IMPACT OF MOBILE NODES ARRIVAL PATTERNS IN MANETS USING POISSON MODELSTHE IMPACT OF MOBILE NODES ARRIVAL PATTERNS IN MANETS USING POISSON MODELS
THE IMPACT OF MOBILE NODES ARRIVAL PATTERNS IN MANETS USING POISSON MODELS
IJMIT JOURNAL
 
EFFECTS OF THE DIFFERENT MIGRATION PERIODS ON PARALLEL MULTI-SWARM PSO
EFFECTS OF THE DIFFERENT MIGRATION PERIODS ON PARALLEL MULTI-SWARM PSOEFFECTS OF THE DIFFERENT MIGRATION PERIODS ON PARALLEL MULTI-SWARM PSO
EFFECTS OF THE DIFFERENT MIGRATION PERIODS ON PARALLEL MULTI-SWARM PSO
cscpconf
 
Effects of The Different Migration Periods on Parallel Multi-Swarm PSO
Effects of The Different Migration Periods on Parallel Multi-Swarm PSO Effects of The Different Migration Periods on Parallel Multi-Swarm PSO
Effects of The Different Migration Periods on Parallel Multi-Swarm PSO
csandit
 
Using Genetic Algorithm for Shortest Path Selection with Real Time Traffic Flow
Using Genetic Algorithm for Shortest Path Selection with Real Time Traffic FlowUsing Genetic Algorithm for Shortest Path Selection with Real Time Traffic Flow
Using Genetic Algorithm for Shortest Path Selection with Real Time Traffic Flow
IJCSIS Research Publications
 

Similar to Fast Unbalanced Optimal Transport on a Tree (20)

IRJET- A Particle Swarm Optimization Algorithm for Total Cost Minimization in...
IRJET- A Particle Swarm Optimization Algorithm for Total Cost Minimization in...IRJET- A Particle Swarm Optimization Algorithm for Total Cost Minimization in...
IRJET- A Particle Swarm Optimization Algorithm for Total Cost Minimization in...
 
Huge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei
Huge-Scale Molecular Dynamics Simulation of Multi-bubble NucleiHuge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei
Huge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei
 
NON - EUCLIDEAN METRIC AND PATH PLANNING
NON - EUCLIDEAN METRIC AND PATH PLANNINGNON - EUCLIDEAN METRIC AND PATH PLANNING
NON - EUCLIDEAN METRIC AND PATH PLANNING
 
Nano mos25
Nano mos25Nano mos25
Nano mos25
 
Citython presentation
Citython presentationCitython presentation
Citython presentation
 
Aggregation computation over distributed data streams(the final version)
Aggregation computation over distributed data streams(the final version)Aggregation computation over distributed data streams(the final version)
Aggregation computation over distributed data streams(the final version)
 
Algorithms And Optimization Techniques For Solving TSP
Algorithms And Optimization Techniques For Solving TSPAlgorithms And Optimization Techniques For Solving TSP
Algorithms And Optimization Techniques For Solving TSP
 
Optimised Calculation of Symmetries for State Space Reduction
Optimised Calculation of Symmetries for State Space ReductionOptimised Calculation of Symmetries for State Space Reduction
Optimised Calculation of Symmetries for State Space Reduction
 
18 786
18 78618 786
18 786
 
Comparing reinforcement learning and access points with rowel
Comparing reinforcement learning and access points with rowelComparing reinforcement learning and access points with rowel
Comparing reinforcement learning and access points with rowel
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...
 
som
somsom
som
 
High-Speed Neural Network Controller for Autonomous Robot Navigation using FPGA
High-Speed Neural Network Controller for Autonomous Robot Navigation using FPGAHigh-Speed Neural Network Controller for Autonomous Robot Navigation using FPGA
High-Speed Neural Network Controller for Autonomous Robot Navigation using FPGA
 
Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016
 
H011114758
H011114758H011114758
H011114758
 
Tuple-Based Coordination in Large-Scale Situated Systems
Tuple-Based Coordination in Large-Scale Situated SystemsTuple-Based Coordination in Large-Scale Situated Systems
Tuple-Based Coordination in Large-Scale Situated Systems
 
THE IMPACT OF MOBILE NODES ARRIVAL PATTERNS IN MANETS USING POISSON MODELS
THE IMPACT OF MOBILE NODES ARRIVAL PATTERNS IN MANETS USING POISSON MODELSTHE IMPACT OF MOBILE NODES ARRIVAL PATTERNS IN MANETS USING POISSON MODELS
THE IMPACT OF MOBILE NODES ARRIVAL PATTERNS IN MANETS USING POISSON MODELS
 
EFFECTS OF THE DIFFERENT MIGRATION PERIODS ON PARALLEL MULTI-SWARM PSO
EFFECTS OF THE DIFFERENT MIGRATION PERIODS ON PARALLEL MULTI-SWARM PSOEFFECTS OF THE DIFFERENT MIGRATION PERIODS ON PARALLEL MULTI-SWARM PSO
EFFECTS OF THE DIFFERENT MIGRATION PERIODS ON PARALLEL MULTI-SWARM PSO
 
Effects of The Different Migration Periods on Parallel Multi-Swarm PSO
Effects of The Different Migration Periods on Parallel Multi-Swarm PSO Effects of The Different Migration Periods on Parallel Multi-Swarm PSO
Effects of The Different Migration Periods on Parallel Multi-Swarm PSO
 
Using Genetic Algorithm for Shortest Path Selection with Real Time Traffic Flow
Using Genetic Algorithm for Shortest Path Selection with Real Time Traffic FlowUsing Genetic Algorithm for Shortest Path Selection with Real Time Traffic Flow
Using Genetic Algorithm for Shortest Path Selection with Real Time Traffic Flow
 

More from joisino

キャッシュオブリビアスアルゴリズム
キャッシュオブリビアスアルゴリズムキャッシュオブリビアスアルゴリズム
キャッシュオブリビアスアルゴリズム
joisino
 
Metric Recovery from Unweighted k-NN Graphs
Metric Recovery from Unweighted k-NN GraphsMetric Recovery from Unweighted k-NN Graphs
Metric Recovery from Unweighted k-NN Graphs
joisino
 
Towards Principled User-side Recommender Systems
Towards Principled User-side Recommender SystemsTowards Principled User-side Recommender Systems
Towards Principled User-side Recommender Systems
joisino
 
CLEAR: A Fully User-side Image Search System
CLEAR: A Fully User-side Image Search SystemCLEAR: A Fully User-side Image Search System
CLEAR: A Fully User-side Image Search System
joisino
 
Private Recommender Systems: How Can Users Build Their Own Fair Recommender S...
Private Recommender Systems: How Can Users Build Their Own Fair Recommender S...Private Recommender Systems: How Can Users Build Their Own Fair Recommender S...
Private Recommender Systems: How Can Users Build Their Own Fair Recommender S...
joisino
 
An Introduction to Spectral Graph Theory
An Introduction to Spectral Graph TheoryAn Introduction to Spectral Graph Theory
An Introduction to Spectral Graph Theory
joisino
 
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem...
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem...Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem...
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem...
joisino
 
ユーザーサイド情報検索システム
ユーザーサイド情報検索システムユーザーサイド情報検索システム
ユーザーサイド情報検索システム
joisino
 
最適輸送の解き方
最適輸送の解き方最適輸送の解き方
最適輸送の解き方
joisino
 
Random Features Strengthen Graph Neural Networks
Random Features Strengthen Graph Neural NetworksRandom Features Strengthen Graph Neural Networks
Random Features Strengthen Graph Neural Networks
joisino
 
グラフニューラルネットワークとグラフ組合せ問題
グラフニューラルネットワークとグラフ組合せ問題グラフニューラルネットワークとグラフ組合せ問題
グラフニューラルネットワークとグラフ組合せ問題
joisino
 
死にたくない
死にたくない死にたくない
死にたくない
joisino
 

More from joisino (12)

キャッシュオブリビアスアルゴリズム
キャッシュオブリビアスアルゴリズムキャッシュオブリビアスアルゴリズム
キャッシュオブリビアスアルゴリズム
 
Metric Recovery from Unweighted k-NN Graphs
Metric Recovery from Unweighted k-NN GraphsMetric Recovery from Unweighted k-NN Graphs
Metric Recovery from Unweighted k-NN Graphs
 
Towards Principled User-side Recommender Systems
Towards Principled User-side Recommender SystemsTowards Principled User-side Recommender Systems
Towards Principled User-side Recommender Systems
 
CLEAR: A Fully User-side Image Search System
CLEAR: A Fully User-side Image Search SystemCLEAR: A Fully User-side Image Search System
CLEAR: A Fully User-side Image Search System
 
Private Recommender Systems: How Can Users Build Their Own Fair Recommender S...
Private Recommender Systems: How Can Users Build Their Own Fair Recommender S...Private Recommender Systems: How Can Users Build Their Own Fair Recommender S...
Private Recommender Systems: How Can Users Build Their Own Fair Recommender S...
 
An Introduction to Spectral Graph Theory
An Introduction to Spectral Graph TheoryAn Introduction to Spectral Graph Theory
An Introduction to Spectral Graph Theory
 
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem...
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem...Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem...
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem...
 
ユーザーサイド情報検索システム
ユーザーサイド情報検索システムユーザーサイド情報検索システム
ユーザーサイド情報検索システム
 
最適輸送の解き方
最適輸送の解き方最適輸送の解き方
最適輸送の解き方
 
Random Features Strengthen Graph Neural Networks
Random Features Strengthen Graph Neural NetworksRandom Features Strengthen Graph Neural Networks
Random Features Strengthen Graph Neural Networks
 
グラフニューラルネットワークとグラフ組合せ問題
グラフニューラルネットワークとグラフ組合せ問題グラフニューラルネットワークとグラフ組合せ問題
グラフニューラルネットワークとグラフ組合せ問題
 
死にたくない
死にたくない死にたくない
死にたくない
 

Recently uploaded

Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
sachin783648
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Studia Poinsotiana
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
Areesha Ahmad
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
sonaliswain16
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
alishadewangan1
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 

Recently uploaded (20)

Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 

Fast Unbalanced Optimal Transport on a Tree

  • 1. 1 KYOTO UNIVERSITY KYOTO UNIVERSITY Fast Unbalanced Optimal Transport on a Tree Ryoma Sato Kyoto University / RIKEN AIP
  • 2. 2 / 13 KYOTO UNIVERSITY Self Introduction  I am a second year master's student at Kyoto University  I’m interested in algorithmic aspects of machine learning and data mining for structured data, including Graph neural networks:  Ryoma Sato, Makoto Yamada, Hisashi Kashima. Approximation Ratios of Graph Neural Networks for Combinatorial Problems. NeurIPS 2019.  Ryoma Sato, Makoto Yamada, Hisashi Kashima. Random Features Strengthen Graph Neural Networks. SDM 2021 Optimal transport:  Ryoma Sato, Makoto Yamada, Hisashi Kashima. Fast Unbalanced Optimal Transport on a Tree. NeurIPS 2020.  Today’s topic
  • 3. 3 / 13 KYOTO UNIVERSITY Background: optimal transport is useful  The optimal transport (OT) distance measures the discrepancy of two distributions. We consider discrete distributions in this presentation.  The OT distance is the minimum cumulative distance that all masses need to travel from one distribution to another distribution  In generative modeling, a mass is a sample. discrepancy of model  sample distribution  In text classification, a mass is a word.  OT does not require the same support  KL divergence  OT exploits the underground geometry From Word Embeddings To Document Distances, ICML 2015
  • 4. 4 / 13 KYOTO UNIVERSITY Background: sliced OT is computationally cheap  OT is formulated as a linear program  cubic cost  Sliced OT projects distributions to random 1D spaces and computes OT there  Greedy matching solves 1D OT exactly  linear cost : distance matrix (input), : matching matrix (variable) : 1st mass vector (input), : 2nd mass vector (input) The leftmost mass should be matched to the leftmost mass The second leftmost mass should be matched to the second leftmost mass ... https://www.programmersought.com/article/67174999352/ https://analyticsindiamag.com/how-to-establish-domain-transferability-in-neural-models/
  • 5. 5 / 13 KYOTO UNIVERSITY Background: unbalanced OT is robust  OT is sensitive to outliers because transporting outliers becomes the dominating term  Unbalanced OT (UOT) allows to discard and create masses by paying some penalties  We can discard outliers  robust to outliers  UOT is also formulated by a linear program  cubic cost
  • 6. 6 / 13 KYOTO UNIVERSITY Background: UOT is difficult even in 1D spaces  We want to make a cheap alternative of UOT as 1D OT  But the greedy matching fails to solve 1D UOT  Let’s consider the following instance with discard cost λ  The following plan costs 3λ.  The following plan costs 2λ + 2ε. Thus this is better. λ λ λ
  • 7. 7 / 13 KYOTO UNIVERSITY Background: UOT is difficult even in 1D spaces  Let’s consider the following instance with discard cost λ  The following plan costs λ + 2ε.  The following plan costs 2λ + 2ε. Thus this is worse. λ ε
  • 8. 8 / 13 KYOTO UNIVERSITY Background: UOT is difficult even in 1D spaces  Although these two instances share the leftmost part, the leftmost mass in the first instance should be discarded while that in the second instance should not  The optimal UOT plan cannot be determined locally  The optimal OT plan is determined locally  Thus the greedy algorithm fails to solve 1D UOT  We proposed how to solve 1D UOT efficiently λ λ λ λ ε
  • 9. 9 / 13 KYOTO UNIVERSITY Algorithm: prune redundant plans  Our proposed method determines assignments from left to right (as the greedy algorithm)  Although there are exponentially many plans, most of them are redundant. We proved that only O(n) plans are non-redundant  Only one plan is non-redundant (thus greedy is valid) in the standard OT not yet not yet  non redundant  redundant
  • 10. 10 / 13 KYOTO UNIVERSITY Algorithm: we solve 1D UOT in O(n log2 n) time  A naive algorithm requires cubic time even with this (non redundant plan) observation  More algorithmic techniques are required for further speedup (skipped in this presentation)  Dynamic programming  Fast convex min-sum convolution  Efficient data structure (BBST)  Weighted union heuristics  Finally, we derived a quasi-linear time algorithm which runs in O(n log2 n) time in the worst case
  • 11. 11 / 13 KYOTO UNIVERSITY Algorithm: tree UOT generalizes 1D UOT  Our method can be extended to tree spaces A 1D space (path) is a special case of tree spaces  In text classification, the word space can be represented by a word tree. Each mass (word) travels on the word tree to a nearby (semantically similar) word.  We can “tree-slice” high dimensional spaces instead of 1D-slicing, which captures richer structures http://www.sthda.com/english/articles/31-principal-component-methods-in-r-practical-guide/117-hcpc-hierarchical-clustering-on-principal-components-essentials/ a → b
  • 12. 12 / 13 KYOTO UNIVERSITY Experiments: our algorithm is empirically fast  We confirmed that our algorithm could compute tree UOT with one million masses within one second  We also confirmed that tree-slicing high dimensional spaces could approximate the original UOT problem
  • 13. 13 / 13 KYOTO UNIVERSITY Conclusion: fast computation of tree UOT  Sliced OT is a fast alternative of OT  UOT is a robust variant of OT  1D UOT is more difficult than 1D OT  We proposed an efficient algorithm for 1D UOT for the first time  Our method can be extended to tree spaces  Our method is empirically fast (1M masses in 1 sec)