SlideShare a Scribd company logo
Riemannian gossip algorithms for
decentralized matrix completion
Hiroyuki Kasai†
, Bamdev Mishra‡
, and Atul Saroop‡
†The University of Electro-Communications, Japan
‡Amazon Development Centre India, India
IEICE meeting 2016
Motivation
The matrix completion problem
? ? * ?
* * ? *
? * * ?
* ? * ?
≈
Low-rank prior
m Movies
n Users
m
nr
(n + m − r)r, r ≪ (m, n)
WT
U
X?
U and W factor matrices.
[Netflix Challenge, 2006]
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 2 / 25
Motivation
Our interest is to look at the decentralized scenario
? ? * ?
* * ? *
? * * ?
* ? * ?
m Movies
n1 Users n2 Users
X?
1 X?
2
U[WT
1 WT
2 ]≈
An agent i has access to its own data matrix Xi .
The matrix U is common across all the agents.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 3 / 25
Motivation
Contributions
We develop a nonlinear gossip algorithm with minimal
communication between agents.
The optimization formulation is based on a weighted combination of
matrix completion and consensus terms.
We develop a parallel variant of the proposed gossip algorithm.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 4 / 25
Motivation
Paper and codes available online
at
www.bamdevmishra.com.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 5 / 25
Motivation
Outline
Problem formulation on the Riemannian Grassmann manifold.
Proposed gossip algorithms.
Numerical comparisons on synthetic and Netflix data.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 6 / 25
Problem formulation
Outline
Problem formulation on the Riemannian Grassmann manifold.
Proposed gossip algorithms.
Numerical comparisons on synthetic and Netflix data.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 7 / 25
Problem formulation
Batch problem formulation
min
U∈St(r,m)
min
W∈Rn×r
PΩ(UWT
) − PΩ(X ) 2
F .
W ∈ Rn×r
and
U ∈ St(r, m), the set of m × r matrices with orthonormal columns.
PΩ is the sampling operator, a convenient way to denote known
entries.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 8 / 25
Problem formulation
Eliminate W
minU∈St(r,m) minW∈Rn×r PΩ(UWT
) − PΩ(X ) 2
F
≡
minU∈St(r,m) f (U, WU), a Grassmann optimization problem.
Solve blue problem in closed form to obtain WU.
Final optimization problem is on Grassmann manifold, i.e.,
variable is ‘column space’ of U.
[Boumal and Absil, LAA, 2015]
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 9 / 25
Problem formulation
Decentralized problem formulation
X = [X1, X2, . . . , XN].
i
min
U∈St(r,m),Wi ∈Rni ×r
1
2
PΩi
(UWi
T
) − PΩi
(Xi ) 2
F
= min
U∈St(r,m)
1
2 i
PΩi
(UWT
iU) − PΩi
(Xi ) 2
F ,
where WiU is computed by agent i independently.
Although the problem is distributed, we still need to learn a common
U.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 10 / 25
Problem formulation
We add a consensus term to our optimization
formulation
Key idea: introduce multiple copies of U among N agents, but allow
them to reach consensus.
min
U1,...,UN ∈St(r,m)
1
2 i
PΩi
(Ui WT
iUi
) − PΩi
(Xi ) 2
F
completion task handled by agent i
+
ρ
2
(d(U1, U2)2
+ d(U2, U3)2
+ . . . + d(UN−1, UN)2
)
consensus among agents
.
d is the Riemannian distance on the Grassmann manifold.
A large ρ trades-off completion with consensus.
Minimizing only consensus ⇒ U1 = U2 = . . . = UN−1 = UN.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 11 / 25
Proposed Riemannian gossip algorithms
Outline
Problem formulation on the Riemannian Grassmann manifold.
Proposed gossip algorithms.
Numerical comparisons on synthetic and Netflix data.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 12 / 25
Proposed Riemannian gossip algorithms
Riemannian online gossip on Grassmann
1 Agents i and i + 1 are neighbors for all i N − 1. (ordering of
agents)
2 At each time slot, say t, we pick an agent i N − 1 randomly
with uniform probability. (SGD updates)
Equivalently, we also pick agent i + 1 (the neighbor of agent i).
Agents i and i + 1 update Ui and Ui+1, respectively, by taking
a gradient descent step with stepsize γt on Grassmann manifold.
γ2
t < ∞ and γt = +∞.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 13 / 25
Proposed Riemannian gossip algorithms
A graphical illustration
Agent 1 Agent 2 Agent 3 Agent N-1 Agent N
Universal clock
Each pair is chosen
. . .
with probability 1=(N − 1)
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 14 / 25
Proposed Riemannian gossip algorithms
Convergence of Riemannian online gossip
Asymptotic convergence follows standard SGD analysis on manifold.
The proposed algorithm is readily implementable, e.g., with the
toolbox Manopt.
[Bonnabel, IEEE TAC, 2013; Absil, Mahoney, and Sepulchre,
Princeton Press, 2008; Boumal et al., JMLR, 2014]
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 15 / 25
Proposed Riemannian gossip algorithms
Parallelizing Riemannian gossip with particular
sampling
Agent 1 Agent 2 Agent 3 Agent 4 Agent 5
Universal clock
Solid" pairs chosen at same time.
. . .
Dotted' pairs chosen at same time.
“Dotted’ and “solid” groups are chosen with probability 1/2.
Convergence guarantees remain the same.
(N − 1)/2 times faster.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 16 / 25
Numerical comparisons
Outline
Problem formulation on the Riemannian Grassmann manifold.
Proposed gossip algorithms.
Numerical comparisons on synthetic and Netflix data.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 17 / 25
Numerical comparisons
Effect of ρ
10 000×100 000 matrix with N = 6.
0 10 20 30 40
Every 10th update
10-10
10-8
10
-6
10-4
10
-2
100
Meansquareerrorontrainingset
Agent 1 rho=103
Agent 2, rho=103
Distance 1-2, rho=103
Agent 1, rho=10
10
Agent 2, rho=1010
Distance 1-2, rho=1010
A very large ρ minimizes only achieves consensus among agents.
A tuned ρ achieves both completion and consensus.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 18 / 25
Numerical comparisons
Performance of online and parallel variants
10 000×100 000 matrix with N = 6 and ρ = 103
.
0 10 20 30 40
Every 10th update
10-10
10-8
10
-6
10-4
10
-2
100
Meansquareerrorontrainingset
Onl. agent 1
Onl. agent 2
Onl. distance 1 - 2
Para. agent 1
Para. agent 2
Para. distance 1 - 2
There is no loss of performance in parallelizing the updates.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 19 / 25
Numerical comparisons
Comparison with D-LMaFit
500 × 12000 and N = 6.
0 10 20 30 40
Every 10th update
10-12
10-10
10-8
10-6
10
-4
10-2
100
102Meansquareerrorontrainingset
Online agent 1
Online agent 2
Online distance 1 - 2
D-LMaFit agent 1
D-LMaFit agent 2
D-LMaFit distance 1 - 2
D-LMaFit code is not scalable to large data.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 20 / 25
Numerical comparisons
Netflix data: different number of agents
Rank = 10
N = {2, 5, 10, 15, 20} agents
10 random 80/20 - train/test - 80/20 million split
Online gossip:
N = 2 N = 5 N = 10 N = 15 N = 20
Test
RMSE
0.877 0.885 0.891 0.894 0.900
Batch gradient descent algorithm RTRMC benchmark: 0.873.
[Boumal and Absil, LAA, 2015]
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 21 / 25
Numerical comparisons
Netflix data: consensus of agents
0 10 20 30 40 50
10
−6
10
−4
10
−2
10
0
10
2
Every 10th update
Consensusofagents
Distance agents 1 − 2
Distance agents 2 − 3
Distance agents 3 − 4
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 22 / 25
Numerical comparisons
Netflix data: test RMSE with updates
0 20 40 60 80
0.85
0.9
0.95
1
1.05
1.1
TestRMSE
Agent 1
Agent 2
Agent 3
RTRMC−1
Every 10th update
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 23 / 25
Summary and future work
Summary and future work
We proposed a Riemannian gossip approach to the decentralized
matrix completion problem.
We minimize weighted sum of completion and consensus terms
on Grassmann manifold.
Numerical comparisons show good performance of the proposed
algorithms, e.g., on Netflix dataset.
Currently, we intend to explore asynchronous updating of agents
on Grassmann manifold.
Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 24 / 25
Riemannian gossip algorithms for
decentralized matrix completion
Hiroyuki Kasai†
, Bamdev Mishra‡
, and Atul Saroop‡
†The University of Electro-Communications, Japan
‡Amazon Development Centre India, India
IEICE meeting 2016

More Related Content

Viewers also liked

Gossip
GossipGossip
JPCL 10.1021/jz3004427 Whitten Presentation
JPCL 10.1021/jz3004427 Whitten PresentationJPCL 10.1021/jz3004427 Whitten Presentation
JPCL 10.1021/jz3004427 Whitten Presentation
jpcoffice
 
System Center 2012 SP1 - Overview - EPC Group
System Center 2012 SP1 - Overview - EPC GroupSystem Center 2012 SP1 - Overview - EPC Group
System Center 2012 SP1 - Overview - EPC Group
EPC Group
 
Le bon coin
Le bon coinLe bon coin
Le bon coin
denis
 
Alabama State Economic Report 2015
Alabama State Economic Report 2015Alabama State Economic Report 2015
Alabama State Economic Report 2015Zach Hupy
 
Inondations et système d
Inondations et système dInondations et système d
Inondations et système d
denis
 
方博享學(2):我可以有什麼貢獻
方博享學(2):我可以有什麼貢獻方博享學(2):我可以有什麼貢獻
方博享學(2):我可以有什麼貢獻
funpokm
 
Letty's group
Letty's groupLetty's group
Letty's group
gjj97
 
Trabajo redes sociales
Trabajo redes socialesTrabajo redes sociales
Trabajo redes socialesCKFIL
 
Une Offre D Emploi Interessante Pps
Une Offre D Emploi Interessante PpsUne Offre D Emploi Interessante Pps
Une Offre D Emploi Interessante PpsMon nom
 
Ucesce na likovnom konkursu
Ucesce na likovnom konkursuUcesce na likovnom konkursu
Ucesce na likovnom konkursuVesnaDzinovic
 

Viewers also liked (19)

Gossip
GossipGossip
Gossip
 
JPCL 10.1021/jz3004427 Whitten Presentation
JPCL 10.1021/jz3004427 Whitten PresentationJPCL 10.1021/jz3004427 Whitten Presentation
JPCL 10.1021/jz3004427 Whitten Presentation
 
System Center 2012 SP1 - Overview - EPC Group
System Center 2012 SP1 - Overview - EPC GroupSystem Center 2012 SP1 - Overview - EPC Group
System Center 2012 SP1 - Overview - EPC Group
 
Metro uni
Metro uniMetro uni
Metro uni
 
PUBLICATIONS
PUBLICATIONSPUBLICATIONS
PUBLICATIONS
 
+
++
+
 
Le bon coin
Le bon coinLe bon coin
Le bon coin
 
Alabama State Economic Report 2015
Alabama State Economic Report 2015Alabama State Economic Report 2015
Alabama State Economic Report 2015
 
Doc4
Doc4Doc4
Doc4
 
Inondations et système d
Inondations et système dInondations et système d
Inondations et système d
 
Trabalho de fisica suelane
Trabalho de fisica suelaneTrabalho de fisica suelane
Trabalho de fisica suelane
 
Riesling
RieslingRiesling
Riesling
 
Power Point Blog
Power Point BlogPower Point Blog
Power Point Blog
 
方博享學(2):我可以有什麼貢獻
方博享學(2):我可以有什麼貢獻方博享學(2):我可以有什麼貢獻
方博享學(2):我可以有什麼貢獻
 
Letty's group
Letty's groupLetty's group
Letty's group
 
Trabajo redes sociales
Trabajo redes socialesTrabajo redes sociales
Trabajo redes sociales
 
Une Offre D Emploi Interessante Pps
Une Offre D Emploi Interessante PpsUne Offre D Emploi Interessante Pps
Une Offre D Emploi Interessante Pps
 
Ucesce na likovnom konkursu
Ucesce na likovnom konkursuUcesce na likovnom konkursu
Ucesce na likovnom konkursu
 
JORGE DONAIRE
JORGE DONAIREJORGE DONAIRE
JORGE DONAIRE
 

Similar to Riemannian gossip algorithms for decentralized matrix completion

Scenario Generation Algortihm using R
Scenario Generation Algortihm using RScenario Generation Algortihm using R
Scenario Generation Algortihm using RAshu Prakash
 
Constrained Support Vector Quantile Regression for Conditional Quantile Estim...
Constrained Support Vector Quantile Regression for Conditional Quantile Estim...Constrained Support Vector Quantile Regression for Conditional Quantile Estim...
Constrained Support Vector Quantile Regression for Conditional Quantile Estim...
Kostas Hatalis, PhD
 
Riemannian stochastic variance reduced gradient on Grassmann manifold (ICCOPT...
Riemannian stochastic variance reduced gradient on Grassmann manifold (ICCOPT...Riemannian stochastic variance reduced gradient on Grassmann manifold (ICCOPT...
Riemannian stochastic variance reduced gradient on Grassmann manifold (ICCOPT...
Hiroyuki KASAI
 
Low power cost rns comparison via partitioning the dynamic range
Low power cost rns comparison via partitioning the dynamic rangeLow power cost rns comparison via partitioning the dynamic range
Low power cost rns comparison via partitioning the dynamic range
Nexgen Technology
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
Ben Mabey
 
06-07 Chapter interpolation in MATLAB
06-07 Chapter interpolation in MATLAB06-07 Chapter interpolation in MATLAB
06-07 Chapter interpolation in MATLAB
Dr. Mohammed Danish
 
Processing Large ToF-SIMS Datasets
Processing Large ToF-SIMS DatasetsProcessing Large ToF-SIMS Datasets
Processing Large ToF-SIMS Datasets
Gustavo Ferraz Trindade
 
QMC: Undergraduate Workshop, Introduction to Monte Carlo Methods with 'R' Sof...
QMC: Undergraduate Workshop, Introduction to Monte Carlo Methods with 'R' Sof...QMC: Undergraduate Workshop, Introduction to Monte Carlo Methods with 'R' Sof...
QMC: Undergraduate Workshop, Introduction to Monte Carlo Methods with 'R' Sof...
The Statistical and Applied Mathematical Sciences Institute
 
Pca ppt
Pca pptPca ppt
Pca ppt
Alaa Tharwat
 
RBM from Scratch
RBM from Scratch RBM from Scratch
RBM from Scratch
Hadi Sinaee
 
Regression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network ApproachRegression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network Approach
Khulna University
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
Statistical Decision Theory
Statistical Decision TheoryStatistical Decision Theory
Statistical Decision Theory
Sangwoo Mo
 
A lambda calculus for density matrices with classical and probabilistic controls
A lambda calculus for density matrices with classical and probabilistic controlsA lambda calculus for density matrices with classical and probabilistic controls
A lambda calculus for density matrices with classical and probabilistic controls
Alejandro Díaz-Caro
 
Efficient Computation of Regret-ratio Minimizing Set: A Compact Maxima Repres...
Efficient Computation ofRegret-ratio Minimizing Set:A Compact Maxima Repres...Efficient Computation ofRegret-ratio Minimizing Set:A Compact Maxima Repres...
Efficient Computation of Regret-ratio Minimizing Set: A Compact Maxima Repres...
Abolfazl Asudeh
 
QMC: Transition Workshop - Importance Sampling the Union of Rare Events with ...
QMC: Transition Workshop - Importance Sampling the Union of Rare Events with ...QMC: Transition Workshop - Importance Sampling the Union of Rare Events with ...
QMC: Transition Workshop - Importance Sampling the Union of Rare Events with ...
The Statistical and Applied Mathematical Sciences Institute
 
Role of optimization techniques in agriculture jaslam
Role of optimization techniques in agriculture jaslamRole of optimization techniques in agriculture jaslam
Role of optimization techniques in agriculture jaslam
jaslam pk
 
d-VMP: Distributed Variational Message Passing (PGM2016)
d-VMP: Distributed Variational Message Passing (PGM2016)d-VMP: Distributed Variational Message Passing (PGM2016)
d-VMP: Distributed Variational Message Passing (PGM2016)
AMIDST Toolbox
 

Similar to Riemannian gossip algorithms for decentralized matrix completion (20)

Scenario Generation Algortihm using R
Scenario Generation Algortihm using RScenario Generation Algortihm using R
Scenario Generation Algortihm using R
 
MSRtalk
MSRtalkMSRtalk
MSRtalk
 
Constrained Support Vector Quantile Regression for Conditional Quantile Estim...
Constrained Support Vector Quantile Regression for Conditional Quantile Estim...Constrained Support Vector Quantile Regression for Conditional Quantile Estim...
Constrained Support Vector Quantile Regression for Conditional Quantile Estim...
 
Riemannian stochastic variance reduced gradient on Grassmann manifold (ICCOPT...
Riemannian stochastic variance reduced gradient on Grassmann manifold (ICCOPT...Riemannian stochastic variance reduced gradient on Grassmann manifold (ICCOPT...
Riemannian stochastic variance reduced gradient on Grassmann manifold (ICCOPT...
 
Low power cost rns comparison via partitioning the dynamic range
Low power cost rns comparison via partitioning the dynamic rangeLow power cost rns comparison via partitioning the dynamic range
Low power cost rns comparison via partitioning the dynamic range
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
 
06-07 Chapter interpolation in MATLAB
06-07 Chapter interpolation in MATLAB06-07 Chapter interpolation in MATLAB
06-07 Chapter interpolation in MATLAB
 
Processing Large ToF-SIMS Datasets
Processing Large ToF-SIMS DatasetsProcessing Large ToF-SIMS Datasets
Processing Large ToF-SIMS Datasets
 
QMC: Undergraduate Workshop, Introduction to Monte Carlo Methods with 'R' Sof...
QMC: Undergraduate Workshop, Introduction to Monte Carlo Methods with 'R' Sof...QMC: Undergraduate Workshop, Introduction to Monte Carlo Methods with 'R' Sof...
QMC: Undergraduate Workshop, Introduction to Monte Carlo Methods with 'R' Sof...
 
Pca ppt
Pca pptPca ppt
Pca ppt
 
RBM from Scratch
RBM from Scratch RBM from Scratch
RBM from Scratch
 
Regression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network ApproachRegression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network Approach
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
SASA 2016
SASA 2016SASA 2016
SASA 2016
 
Statistical Decision Theory
Statistical Decision TheoryStatistical Decision Theory
Statistical Decision Theory
 
A lambda calculus for density matrices with classical and probabilistic controls
A lambda calculus for density matrices with classical and probabilistic controlsA lambda calculus for density matrices with classical and probabilistic controls
A lambda calculus for density matrices with classical and probabilistic controls
 
Efficient Computation of Regret-ratio Minimizing Set: A Compact Maxima Repres...
Efficient Computation ofRegret-ratio Minimizing Set:A Compact Maxima Repres...Efficient Computation ofRegret-ratio Minimizing Set:A Compact Maxima Repres...
Efficient Computation of Regret-ratio Minimizing Set: A Compact Maxima Repres...
 
QMC: Transition Workshop - Importance Sampling the Union of Rare Events with ...
QMC: Transition Workshop - Importance Sampling the Union of Rare Events with ...QMC: Transition Workshop - Importance Sampling the Union of Rare Events with ...
QMC: Transition Workshop - Importance Sampling the Union of Rare Events with ...
 
Role of optimization techniques in agriculture jaslam
Role of optimization techniques in agriculture jaslamRole of optimization techniques in agriculture jaslam
Role of optimization techniques in agriculture jaslam
 
d-VMP: Distributed Variational Message Passing (PGM2016)
d-VMP: Distributed Variational Message Passing (PGM2016)d-VMP: Distributed Variational Message Passing (PGM2016)
d-VMP: Distributed Variational Message Passing (PGM2016)
 

Recently uploaded

SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
zeex60
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
SSR02
 
Mudde & Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
Mudde &  Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...Mudde &  Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
Mudde & Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
frank0071
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
pablovgd
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 

Recently uploaded (20)

SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
 
Mudde & Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
Mudde &  Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...Mudde &  Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
Mudde & Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 

Riemannian gossip algorithms for decentralized matrix completion

  • 1. Riemannian gossip algorithms for decentralized matrix completion Hiroyuki Kasai† , Bamdev Mishra‡ , and Atul Saroop‡ †The University of Electro-Communications, Japan ‡Amazon Development Centre India, India IEICE meeting 2016
  • 2. Motivation The matrix completion problem ? ? * ? * * ? * ? * * ? * ? * ? ≈ Low-rank prior m Movies n Users m nr (n + m − r)r, r ≪ (m, n) WT U X? U and W factor matrices. [Netflix Challenge, 2006] Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 2 / 25
  • 3. Motivation Our interest is to look at the decentralized scenario ? ? * ? * * ? * ? * * ? * ? * ? m Movies n1 Users n2 Users X? 1 X? 2 U[WT 1 WT 2 ]≈ An agent i has access to its own data matrix Xi . The matrix U is common across all the agents. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 3 / 25
  • 4. Motivation Contributions We develop a nonlinear gossip algorithm with minimal communication between agents. The optimization formulation is based on a weighted combination of matrix completion and consensus terms. We develop a parallel variant of the proposed gossip algorithm. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 4 / 25
  • 5. Motivation Paper and codes available online at www.bamdevmishra.com. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 5 / 25
  • 6. Motivation Outline Problem formulation on the Riemannian Grassmann manifold. Proposed gossip algorithms. Numerical comparisons on synthetic and Netflix data. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 6 / 25
  • 7. Problem formulation Outline Problem formulation on the Riemannian Grassmann manifold. Proposed gossip algorithms. Numerical comparisons on synthetic and Netflix data. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 7 / 25
  • 8. Problem formulation Batch problem formulation min U∈St(r,m) min W∈Rn×r PΩ(UWT ) − PΩ(X ) 2 F . W ∈ Rn×r and U ∈ St(r, m), the set of m × r matrices with orthonormal columns. PΩ is the sampling operator, a convenient way to denote known entries. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 8 / 25
  • 9. Problem formulation Eliminate W minU∈St(r,m) minW∈Rn×r PΩ(UWT ) − PΩ(X ) 2 F ≡ minU∈St(r,m) f (U, WU), a Grassmann optimization problem. Solve blue problem in closed form to obtain WU. Final optimization problem is on Grassmann manifold, i.e., variable is ‘column space’ of U. [Boumal and Absil, LAA, 2015] Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 9 / 25
  • 10. Problem formulation Decentralized problem formulation X = [X1, X2, . . . , XN]. i min U∈St(r,m),Wi ∈Rni ×r 1 2 PΩi (UWi T ) − PΩi (Xi ) 2 F = min U∈St(r,m) 1 2 i PΩi (UWT iU) − PΩi (Xi ) 2 F , where WiU is computed by agent i independently. Although the problem is distributed, we still need to learn a common U. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 10 / 25
  • 11. Problem formulation We add a consensus term to our optimization formulation Key idea: introduce multiple copies of U among N agents, but allow them to reach consensus. min U1,...,UN ∈St(r,m) 1 2 i PΩi (Ui WT iUi ) − PΩi (Xi ) 2 F completion task handled by agent i + ρ 2 (d(U1, U2)2 + d(U2, U3)2 + . . . + d(UN−1, UN)2 ) consensus among agents . d is the Riemannian distance on the Grassmann manifold. A large ρ trades-off completion with consensus. Minimizing only consensus ⇒ U1 = U2 = . . . = UN−1 = UN. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 11 / 25
  • 12. Proposed Riemannian gossip algorithms Outline Problem formulation on the Riemannian Grassmann manifold. Proposed gossip algorithms. Numerical comparisons on synthetic and Netflix data. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 12 / 25
  • 13. Proposed Riemannian gossip algorithms Riemannian online gossip on Grassmann 1 Agents i and i + 1 are neighbors for all i N − 1. (ordering of agents) 2 At each time slot, say t, we pick an agent i N − 1 randomly with uniform probability. (SGD updates) Equivalently, we also pick agent i + 1 (the neighbor of agent i). Agents i and i + 1 update Ui and Ui+1, respectively, by taking a gradient descent step with stepsize γt on Grassmann manifold. γ2 t < ∞ and γt = +∞. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 13 / 25
  • 14. Proposed Riemannian gossip algorithms A graphical illustration Agent 1 Agent 2 Agent 3 Agent N-1 Agent N Universal clock Each pair is chosen . . . with probability 1=(N − 1) Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 14 / 25
  • 15. Proposed Riemannian gossip algorithms Convergence of Riemannian online gossip Asymptotic convergence follows standard SGD analysis on manifold. The proposed algorithm is readily implementable, e.g., with the toolbox Manopt. [Bonnabel, IEEE TAC, 2013; Absil, Mahoney, and Sepulchre, Princeton Press, 2008; Boumal et al., JMLR, 2014] Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 15 / 25
  • 16. Proposed Riemannian gossip algorithms Parallelizing Riemannian gossip with particular sampling Agent 1 Agent 2 Agent 3 Agent 4 Agent 5 Universal clock Solid" pairs chosen at same time. . . . Dotted' pairs chosen at same time. “Dotted’ and “solid” groups are chosen with probability 1/2. Convergence guarantees remain the same. (N − 1)/2 times faster. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 16 / 25
  • 17. Numerical comparisons Outline Problem formulation on the Riemannian Grassmann manifold. Proposed gossip algorithms. Numerical comparisons on synthetic and Netflix data. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 17 / 25
  • 18. Numerical comparisons Effect of ρ 10 000×100 000 matrix with N = 6. 0 10 20 30 40 Every 10th update 10-10 10-8 10 -6 10-4 10 -2 100 Meansquareerrorontrainingset Agent 1 rho=103 Agent 2, rho=103 Distance 1-2, rho=103 Agent 1, rho=10 10 Agent 2, rho=1010 Distance 1-2, rho=1010 A very large ρ minimizes only achieves consensus among agents. A tuned ρ achieves both completion and consensus. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 18 / 25
  • 19. Numerical comparisons Performance of online and parallel variants 10 000×100 000 matrix with N = 6 and ρ = 103 . 0 10 20 30 40 Every 10th update 10-10 10-8 10 -6 10-4 10 -2 100 Meansquareerrorontrainingset Onl. agent 1 Onl. agent 2 Onl. distance 1 - 2 Para. agent 1 Para. agent 2 Para. distance 1 - 2 There is no loss of performance in parallelizing the updates. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 19 / 25
  • 20. Numerical comparisons Comparison with D-LMaFit 500 × 12000 and N = 6. 0 10 20 30 40 Every 10th update 10-12 10-10 10-8 10-6 10 -4 10-2 100 102Meansquareerrorontrainingset Online agent 1 Online agent 2 Online distance 1 - 2 D-LMaFit agent 1 D-LMaFit agent 2 D-LMaFit distance 1 - 2 D-LMaFit code is not scalable to large data. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 20 / 25
  • 21. Numerical comparisons Netflix data: different number of agents Rank = 10 N = {2, 5, 10, 15, 20} agents 10 random 80/20 - train/test - 80/20 million split Online gossip: N = 2 N = 5 N = 10 N = 15 N = 20 Test RMSE 0.877 0.885 0.891 0.894 0.900 Batch gradient descent algorithm RTRMC benchmark: 0.873. [Boumal and Absil, LAA, 2015] Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 21 / 25
  • 22. Numerical comparisons Netflix data: consensus of agents 0 10 20 30 40 50 10 −6 10 −4 10 −2 10 0 10 2 Every 10th update Consensusofagents Distance agents 1 − 2 Distance agents 2 − 3 Distance agents 3 − 4 Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 22 / 25
  • 23. Numerical comparisons Netflix data: test RMSE with updates 0 20 40 60 80 0.85 0.9 0.95 1 1.05 1.1 TestRMSE Agent 1 Agent 2 Agent 3 RTRMC−1 Every 10th update Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 23 / 25
  • 24. Summary and future work Summary and future work We proposed a Riemannian gossip approach to the decentralized matrix completion problem. We minimize weighted sum of completion and consensus terms on Grassmann manifold. Numerical comparisons show good performance of the proposed algorithms, e.g., on Netflix dataset. Currently, we intend to explore asynchronous updating of agents on Grassmann manifold. Kasai, Mishra, and Saroop Riemannian gossip 20 September 2016 24 / 25
  • 25. Riemannian gossip algorithms for decentralized matrix completion Hiroyuki Kasai† , Bamdev Mishra‡ , and Atul Saroop‡ †The University of Electro-Communications, Japan ‡Amazon Development Centre India, India IEICE meeting 2016