SlideShare a Scribd company logo
QR Factorizations and SVDs for Tall-and-skinny 
Matrices in MapReduce Architectures 
Austin Benson 
ICME, Stanford University 
Work completed in part at UC-Berkeley 
SIAM CSE 2013 
February 27, 2013
Collaborators 
James Demmel, UC-Berkeley David Gleich, Purdue 
Paul Constantine, Stanford 
Thanks!
Quick QR and SVD review 
A Q 
R 
VT 
n 
n 
n 
n 
n 
n 
n 
n A U 
n 
n 
n 
n 
Σ 
n 
n 
Figure: Q, U, and V are orthogonal matrices. R is upper triangular and 
 is diagonal with positive entries.
Tall-and-skinny QR 
A Q 
R 
m 
n 
m 
n 
n 
n 
Tall-and-skinny (TS): m  n
TS-QR ! TS-SVD 
R is small, so computing its SVD is cheap.
Why Tall-and-skinny QR and SVD? 
1. Regression with many samples 
2. Principle Component Analysis (PCA) 
3. Model Reduction 
Pressure, Dilation, Jet Engine 
Figure: Dynamic mode decomposition of a rectangular supersonic 
screeching jet. Joe Nichols, Stanford University.
MapReduce overview 
Two functions that operate on key value pairs: 
(key; value) 
map 
! (key; value) 
(key; hvalue1; : : : ; valueni) reduce 
! (key; value) 
A shue stage runs between map and reduce to sort the values by 
key.
MapReduce overview 
Scalability: many map tasks and many reduce tasks are used 
https://developers.google.com/appengine/docs/python/images/mapreduce_mapshuffle.png
MapReduce overview 
The idea is data-local computations. The programmer implements: 
I map(key, value) 
I reduce(key, h value1, : : :, valuen i) 
The shue and data I/O is implemented by the MapReduce 
framework, e.g., Hadoop. 
This is a very restrictive programming environment! We sacri
ce 
program control for structure, scalability, fault tolerance, etc.
Matrix representation 
We have matrices, so what are the key-value pairs? We can just 
have unique row identi
ers: 
A = 
2 
1:0 0:0 
2:4 3:7 
0:8 4:2 
9:0 9:0 
664 
3 
775 
! 
2 
(a3320354; [1:0; 0:0]) 
(5da011e2; [2:4; 3:7]) 
(94d80025; [0:8; 4:2]) 
(90764917; [9:0; 9:0]) 
664 
3 
775 
Each value in a key-value pair is a row of the matrix.
Matrix representation: an example 
We can encode more information in the keys, e.g., (x, y, z) 
coordinates and model number: 
((47570,103.429767811242,0,-16.525510963787,iDV7), [0.00019924 
-4.706066e-05 2.875293979e-05 2.456653e-05 -8.436627e-06 -1.508808e-05 
3.731976e-06 -1.048795e-05 5.229153e-06 6.323812e-06]) 
Figure: Aircraft simulation data. Paul Constantine, Stanford University
Why MapReduce? 
1. Fault tolerance 
2. Structured data I/O 
3. Cheaper clusters with more data storage 
4. Easy to program 
Generate lots of 
data on 
supercomputer 
Post-process and 
analyze on 
MapReduce cluster 
http://www.nersc.gov/assets/About-Us/hopper1.jpg
Fault tolerance 
1550 
1500 
1450 
1400 
1350 
1300 
1250 
1200 
0 50 100 150 200 
1/(probability of task fault) 
job time (secs.) 
Performance with Injected Faults 
Running time with no faults 
Running times with injected faults 
 25% performance penalty with 1 out 5 tasks failing
Why MapReduce? 
Not convinced MapReduce is good for computational science? 
MS218: Is MapReduce Good for Science and Simulation Data?, 
Thursday, 4:30-6:30pm.
Communication-avoiding TSQR 
A = 
2 
A1 
A2 
A3 
A4 
664 
3 
775 
| {z } 
8n4n 
= 
2 
Q1 
664 
Q2 
Q3 
Q4 
3 
775 
| {z } 
8n4n 
2 
R1 
R2 
R3 
R4 
664 
3 
775 
| {z } 
4nn 
= 
=Q z2 }| { 
Q1 
664 
Q2 
Q3 
Q4 
3 
775 
| {z } 
8n4n 
~Q 
|{z} 
4nn 
|{Rz} 
nn 
(Demmel et al. 2008)
Communication-avoiding TSQR 
A = 
2 
A1 
A2 
A3 
A4 
664 
3 
775 | {z } 
8n4n 
= 
2 
Q1 
664 
Q2 
Q3 
Q4 
3 
775 
| {z } 
8n4n 
2 
R1 
R2 
R3 
R4 
664 
3 
775 
| {z } 
4nn 
Ai = QiRi can be computed in parallel. If we only need R, then 
we can throw out the Q factors.
MapReduce TSQR 
S(1) 
A 
A1 
A3 
A3 
R1 map 
A2 
R emit 2 map 
R emit 3 map 
A4 
R emit 4 map 
shuffle 
S1 
A2 
reduce 
S2 
R2,1 emit 
reduce R2,2 
emit 
emit 
shuffle 
AS23 
reduce R2,3 emit 
Local TSQR 
identity map 
SA(22 ) 
reduce R emit 
Local TSQR Local TSQR 
Figure: S(1) is the matrix consisting of the rows of all of the Ri factors. 
Similarly, S(2) consists of all of the rows of the R2;j factors.
MapReduce TSQR: the implementation 
1 import random, numpy, hadoopy 
2 class SerialTSQR: 
3 def i n i t ( sel f , blocksize , isreducer ) : 
4 sel f . bsize , sel f . data =blocksize , [ ] 
5 sel f . c a l l = sel f . reducer i f isreducer else sel f .mapper 
6 
7 def compress( sel f ) : 
8 R = numpy. l inalg . qr (numpy. array ( sel f . data) , ' r ' ) 
9 sel f . data = [ ] 
10 for row in R: sel f . data .append( [ f loat (v) for v in row] ) 
11 
12 def col lect ( sel f , key , value ) : 
13 sel f . data .append(value ) 
14 i f len ( sel f . data)  sel f . bsize  len ( sel f . data [ 0 ] ) : 
15 sel f . compress() 
16 
17 def close ( sel f ) : 
18 sel f . compress() 
19 for row in sel f . data : yield random. randint (0,2000000000), row 
20 
21 def mapper( sel f , key , value ) : 
22 sel f . col lect (key , value ) 
23 
24 def reducer ( sel f , key , values ) : 
25 for value in values : sel f .mapper(key , value ) 
26 
27 i f name ==' main ' : 
28 mapper = SerialTSQR( blocksize=3, isreducer=False ) 
29 reducer = SerialTSQR( blocksize=3, isreducer=True) 
30 hadoopy. run(mapper , reducer )
MapReduce TSQR: Getting Q 
We have an ecient way of getting R in QR (and  in the SVD). 
What if I want Q (or my left singular vectors)? 
A = QR ! Q = AR1 
This is a numerically unstable computation, and Q can be far from 
orthogonal. (This may be OK in some cases).
Indirect TSQR 
We call this method Indirect TSQR.
Indirect TSQR: Iterative Re
nement 
We can repeat the TSQR computation on Q to get a more 
orthogonal matrix. This is called iterative re
nement. 
A 
A1 
R-1 map 
R 
Q1 
A2 
R-1 map 
Q2 
A3 
R-1 map 
Q3 
A4 
R-1 map 
Q4 
emit 
emit 
emit 
emit 
distribute 
TSQR 
Q 
Q1 
R1 
-1 map 
R1 
Q1 
Q2 
R1 
-1 map 
Q2 
Q3 
R1 
-1 map 
Q3 
Q4 
R1 
-1 map 
Q4 
emit 
emit 
emit 
emit 
distribute 
Local MatMul Local MatMul 
Iterative Refinement step
Direct TSQR 
Why is it hard to reconstruct Q? 
I Only maintained state is the data written to disk at the end 
of each map/reduce iteration 
I No controlled communication between processors ! can only 
control data movement via the shue 
I File names and keys are the only methods of labeling data 
components
Direct TSQR: Steps 1 and 2 
A 
A1 
map R1 
Q11 emit 
emit 
A2 
map R2 
Q21 emit 
emit 
A3 
map R3 
Q31 emit 
emit 
A4 
map R4 
Q41 emit 
emit 
First step 
R1 
R2 
R3 
R4 
Q12 
Q22 
Q32 
Q42 
R 
reduce 
emit 
emit 
emit 
emit 
emit 
Second step 
shuffle
Direct TSQR: Step 3
Cholesky QR 
We also have a Cholesky QR: 
ATA = (QR)T (QR) = RTQTQR = RTR 
Computing ATA translates nicely to MapReduce. This method has 
worse stability issues but requires fewer 
ops.
Performance Data 
matrix size Time to completion (seconds) 
4Bx4 (135 GB) 2.5Bx10 (193 GB) 600Mx25 (112 GB) 500Mx50 (184 GB) 150Mx100 (110 GB) 
8000 
7000 
6000 
5000 
4000 
3000 
2000 
1000 
0 
Performance of various MapReduce algorithms for computing QR 
Cholesky 
Indirect TSQR 
Cholesky + IR 
Indirect TSQR + IR 
Direct TSQR

More Related Content

What's hot

Single source stortest path bellman ford and dijkstra
Single source stortest path bellman ford and dijkstraSingle source stortest path bellman ford and dijkstra
Single source stortest path bellman ford and dijkstra
Roshan Tailor
 
Linear transformation and application
Linear transformation and applicationLinear transformation and application
Linear transformation and application
shreyansp
 
It elective-4-buendia lagua
It elective-4-buendia laguaIt elective-4-buendia lagua
It elective-4-buendia lagua
John Mark Lagua
 
Topological Sort
Topological SortTopological Sort
Topological Sort
Dr Sandeep Kumar Poonia
 
Isomorphism
IsomorphismIsomorphism
Isomorphism
Raj Parekh
 
CSE633
CSE633CSE633
Prim Algorithm and kruskal algorithm
Prim Algorithm and kruskal algorithmPrim Algorithm and kruskal algorithm
Prim Algorithm and kruskal algorithm
Acad
 
Signal Flow Graph ( control system)
Signal Flow Graph ( control system)Signal Flow Graph ( control system)
Signal Flow Graph ( control system)
Gourab Ghosh
 
Shortest path (Dijkistra's Algorithm) & Spanning Tree (Prim's Algorithm)
Shortest path (Dijkistra's Algorithm) & Spanning Tree (Prim's Algorithm)Shortest path (Dijkistra's Algorithm) & Spanning Tree (Prim's Algorithm)
Shortest path (Dijkistra's Algorithm) & Spanning Tree (Prim's Algorithm)
Mohanlal Sukhadia University (MLSU)
 
Direct current machine
Direct current machineDirect current machine
Direct current machine
tahseen alshmary
 
All pairs shortest path algorithm
All pairs shortest path algorithmAll pairs shortest path algorithm
All pairs shortest path algorithm
Srikrishnan Suresh
 
Algorithms of graph
Algorithms of graphAlgorithms of graph
Algorithms of graph
getacew
 
Shortest path problem
Shortest path problemShortest path problem
Shortest path problem
Ifra Ilyas
 
Block diagram Examples
Block diagram ExamplesBlock diagram Examples
Block diagram Examples
Sagar Kuntumal
 
Skiena algorithm 2007 lecture12 topological sort connectivity
Skiena algorithm 2007 lecture12 topological sort connectivitySkiena algorithm 2007 lecture12 topological sort connectivity
Skiena algorithm 2007 lecture12 topological sort connectivity
zukun
 
Ameer 14208
Ameer 14208Ameer 14208
Ameer 14208
Ameer Mehmood
 
Linear Transformations, Matrix Algebra
Linear Transformations, Matrix AlgebraLinear Transformations, Matrix Algebra
Linear Transformations, Matrix Algebra
Prasanth George
 
Bellman Ford's Algorithm
Bellman Ford's AlgorithmBellman Ford's Algorithm
Bellman Ford's Algorithm
Tanmay Baranwal
 
Chapter 8 Root Locus Techniques
Chapter 8 Root Locus TechniquesChapter 8 Root Locus Techniques
Chapter 8 Root Locus Techniques
guesta0c38c3
 
Bellman ford Algorithm
Bellman ford AlgorithmBellman ford Algorithm
Bellman ford Algorithm
taimurkhan803
 

What's hot (20)

Single source stortest path bellman ford and dijkstra
Single source stortest path bellman ford and dijkstraSingle source stortest path bellman ford and dijkstra
Single source stortest path bellman ford and dijkstra
 
Linear transformation and application
Linear transformation and applicationLinear transformation and application
Linear transformation and application
 
It elective-4-buendia lagua
It elective-4-buendia laguaIt elective-4-buendia lagua
It elective-4-buendia lagua
 
Topological Sort
Topological SortTopological Sort
Topological Sort
 
Isomorphism
IsomorphismIsomorphism
Isomorphism
 
CSE633
CSE633CSE633
CSE633
 
Prim Algorithm and kruskal algorithm
Prim Algorithm and kruskal algorithmPrim Algorithm and kruskal algorithm
Prim Algorithm and kruskal algorithm
 
Signal Flow Graph ( control system)
Signal Flow Graph ( control system)Signal Flow Graph ( control system)
Signal Flow Graph ( control system)
 
Shortest path (Dijkistra's Algorithm) & Spanning Tree (Prim's Algorithm)
Shortest path (Dijkistra's Algorithm) & Spanning Tree (Prim's Algorithm)Shortest path (Dijkistra's Algorithm) & Spanning Tree (Prim's Algorithm)
Shortest path (Dijkistra's Algorithm) & Spanning Tree (Prim's Algorithm)
 
Direct current machine
Direct current machineDirect current machine
Direct current machine
 
All pairs shortest path algorithm
All pairs shortest path algorithmAll pairs shortest path algorithm
All pairs shortest path algorithm
 
Algorithms of graph
Algorithms of graphAlgorithms of graph
Algorithms of graph
 
Shortest path problem
Shortest path problemShortest path problem
Shortest path problem
 
Block diagram Examples
Block diagram ExamplesBlock diagram Examples
Block diagram Examples
 
Skiena algorithm 2007 lecture12 topological sort connectivity
Skiena algorithm 2007 lecture12 topological sort connectivitySkiena algorithm 2007 lecture12 topological sort connectivity
Skiena algorithm 2007 lecture12 topological sort connectivity
 
Ameer 14208
Ameer 14208Ameer 14208
Ameer 14208
 
Linear Transformations, Matrix Algebra
Linear Transformations, Matrix AlgebraLinear Transformations, Matrix Algebra
Linear Transformations, Matrix Algebra
 
Bellman Ford's Algorithm
Bellman Ford's AlgorithmBellman Ford's Algorithm
Bellman Ford's Algorithm
 
Chapter 8 Root Locus Techniques
Chapter 8 Root Locus TechniquesChapter 8 Root Locus Techniques
Chapter 8 Root Locus Techniques
 
Bellman ford Algorithm
Bellman ford AlgorithmBellman ford Algorithm
Bellman ford Algorithm
 

Similar to QR Factorizations and SVDs for Tall-and-skinny Matrices in MapReduce Architectures (SIAM CSE)

Tall-and-skinny Matrix Computations in MapReduce (ICME MR 2013)
Tall-and-skinny Matrix Computations in MapReduce (ICME MR 2013)Tall-and-skinny Matrix Computations in MapReduce (ICME MR 2013)
Tall-and-skinny Matrix Computations in MapReduce (ICME MR 2013)
Austin Benson
 
Big data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsBig data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphs
David Gleich
 
12. Linear models
12. Linear models12. Linear models
12. Linear models
ExternalEvents
 
Wiener Filter Hardware Realization
Wiener Filter Hardware RealizationWiener Filter Hardware Realization
Wiener Filter Hardware Realization
Sayan Chaudhuri
 
Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...
Usatyuk Vasiliy
 
Estimating structured vector autoregressive models
Estimating structured vector autoregressive modelsEstimating structured vector autoregressive models
Estimating structured vector autoregressive models
Akira Tanimoto
 
Learning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for GraphsLearning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for Graphs
pione30
 
Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...
Usatyuk Vasiliy
 
Parallel Evaluation of Multi-Semi-Joins
Parallel Evaluation of Multi-Semi-JoinsParallel Evaluation of Multi-Semi-Joins
Parallel Evaluation of Multi-Semi-Joins
Jonny Daenen
 
GRAPH - DISCRETE STRUCTURE AND ALGORITHM
GRAPH - DISCRETE STRUCTURE AND ALGORITHMGRAPH - DISCRETE STRUCTURE AND ALGORITHM
GRAPH - DISCRETE STRUCTURE AND ALGORITHM
himanshumishra19dec
 
Random Number Generators 2018
Random Number Generators 2018Random Number Generators 2018
Random Number Generators 2018
rinnocente
 
MapReduce Tall-and-skinny QR and applications
MapReduce Tall-and-skinny QR and applicationsMapReduce Tall-and-skinny QR and applications
MapReduce Tall-and-skinny QR and applications
David Gleich
 
New data structures and algorithms for \\post-processing large data sets and ...
New data structures and algorithms for \\post-processing large data sets and ...New data structures and algorithms for \\post-processing large data sets and ...
New data structures and algorithms for \\post-processing large data sets and ...
Alexander Litvinenko
 
R Language Introduction
R Language IntroductionR Language Introduction
R Language Introduction
Khaled Al-Shamaa
 
11. Linear Models
11. Linear Models11. Linear Models
11. Linear Models
FAO
 
RBootcam Day 2
RBootcam Day 2RBootcam Day 2
RBootcam Day 2
Olga Scrivner
 
Passive network-redesign-ntua
Passive network-redesign-ntuaPassive network-redesign-ntua
Passive network-redesign-ntua
IEEE NTUA SB
 
Dsp manual
Dsp manualDsp manual
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation ModelDesign of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
IDES Editor
 
Linear models
Linear modelsLinear models
Linear models
FAO
 

Similar to QR Factorizations and SVDs for Tall-and-skinny Matrices in MapReduce Architectures (SIAM CSE) (20)

Tall-and-skinny Matrix Computations in MapReduce (ICME MR 2013)
Tall-and-skinny Matrix Computations in MapReduce (ICME MR 2013)Tall-and-skinny Matrix Computations in MapReduce (ICME MR 2013)
Tall-and-skinny Matrix Computations in MapReduce (ICME MR 2013)
 
Big data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsBig data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphs
 
12. Linear models
12. Linear models12. Linear models
12. Linear models
 
Wiener Filter Hardware Realization
Wiener Filter Hardware RealizationWiener Filter Hardware Realization
Wiener Filter Hardware Realization
 
Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...
 
Estimating structured vector autoregressive models
Estimating structured vector autoregressive modelsEstimating structured vector autoregressive models
Estimating structured vector autoregressive models
 
Learning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for GraphsLearning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for Graphs
 
Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...
 
Parallel Evaluation of Multi-Semi-Joins
Parallel Evaluation of Multi-Semi-JoinsParallel Evaluation of Multi-Semi-Joins
Parallel Evaluation of Multi-Semi-Joins
 
GRAPH - DISCRETE STRUCTURE AND ALGORITHM
GRAPH - DISCRETE STRUCTURE AND ALGORITHMGRAPH - DISCRETE STRUCTURE AND ALGORITHM
GRAPH - DISCRETE STRUCTURE AND ALGORITHM
 
Random Number Generators 2018
Random Number Generators 2018Random Number Generators 2018
Random Number Generators 2018
 
MapReduce Tall-and-skinny QR and applications
MapReduce Tall-and-skinny QR and applicationsMapReduce Tall-and-skinny QR and applications
MapReduce Tall-and-skinny QR and applications
 
New data structures and algorithms for \\post-processing large data sets and ...
New data structures and algorithms for \\post-processing large data sets and ...New data structures and algorithms for \\post-processing large data sets and ...
New data structures and algorithms for \\post-processing large data sets and ...
 
R Language Introduction
R Language IntroductionR Language Introduction
R Language Introduction
 
11. Linear Models
11. Linear Models11. Linear Models
11. Linear Models
 
RBootcam Day 2
RBootcam Day 2RBootcam Day 2
RBootcam Day 2
 
Passive network-redesign-ntua
Passive network-redesign-ntuaPassive network-redesign-ntua
Passive network-redesign-ntua
 
Dsp manual
Dsp manualDsp manual
Dsp manual
 
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation ModelDesign of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
 
Linear models
Linear modelsLinear models
Linear models
 

More from Austin Benson

Hypergraph Cuts with General Splitting Functions (JMM)
Hypergraph Cuts with General Splitting Functions (JMM)Hypergraph Cuts with General Splitting Functions (JMM)
Hypergraph Cuts with General Splitting Functions (JMM)
Austin Benson
 
Spectral embeddings and evolving networks
Spectral embeddings and evolving networksSpectral embeddings and evolving networks
Spectral embeddings and evolving networks
Austin Benson
 
Computational Frameworks for Higher-order Network Data Analysis
Computational Frameworks for Higher-order Network Data AnalysisComputational Frameworks for Higher-order Network Data Analysis
Computational Frameworks for Higher-order Network Data Analysis
Austin Benson
 
Higher-order link prediction and other hypergraph modeling
Higher-order link prediction and other hypergraph modelingHigher-order link prediction and other hypergraph modeling
Higher-order link prediction and other hypergraph modeling
Austin Benson
 
Hypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting FunctionsHypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting Functions
Austin Benson
 
Hypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting FunctionsHypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting Functions
Austin Benson
 
Higher-order link prediction
Higher-order link predictionHigher-order link prediction
Higher-order link prediction
Austin Benson
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
Austin Benson
 
Three hypergraph eigenvector centralities
Three hypergraph eigenvector centralitiesThree hypergraph eigenvector centralities
Three hypergraph eigenvector centralities
Austin Benson
 
Semi-supervised learning of edge flows
Semi-supervised learning of edge flowsSemi-supervised learning of edge flows
Semi-supervised learning of edge flows
Austin Benson
 
Choosing to grow a graph
Choosing to grow a graphChoosing to grow a graph
Choosing to grow a graph
Austin Benson
 
Link prediction in networks with core-fringe structure
Link prediction in networks with core-fringe structureLink prediction in networks with core-fringe structure
Link prediction in networks with core-fringe structure
Austin Benson
 
Higher-order Link Prediction GraphEx
Higher-order Link Prediction GraphExHigher-order Link Prediction GraphEx
Higher-order Link Prediction GraphEx
Austin Benson
 
Higher-order Link Prediction Syracuse
Higher-order Link Prediction SyracuseHigher-order Link Prediction Syracuse
Higher-order Link Prediction Syracuse
Austin Benson
 
Random spatial network models for core-periphery structure
Random spatial network models for core-periphery structureRandom spatial network models for core-periphery structure
Random spatial network models for core-periphery structure
Austin Benson
 
Random spatial network models for core-periphery structure.
Random spatial network models for core-periphery structure.Random spatial network models for core-periphery structure.
Random spatial network models for core-periphery structure.
Austin Benson
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
Austin Benson
 
Simplicial closure and simplicial diffusions
Simplicial closure and simplicial diffusionsSimplicial closure and simplicial diffusions
Simplicial closure and simplicial diffusions
Austin Benson
 
Sampling methods for counting temporal motifs
Sampling methods for counting temporal motifsSampling methods for counting temporal motifs
Sampling methods for counting temporal motifs
Austin Benson
 
Set prediction three ways
Set prediction three waysSet prediction three ways
Set prediction three ways
Austin Benson
 

More from Austin Benson (20)

Hypergraph Cuts with General Splitting Functions (JMM)
Hypergraph Cuts with General Splitting Functions (JMM)Hypergraph Cuts with General Splitting Functions (JMM)
Hypergraph Cuts with General Splitting Functions (JMM)
 
Spectral embeddings and evolving networks
Spectral embeddings and evolving networksSpectral embeddings and evolving networks
Spectral embeddings and evolving networks
 
Computational Frameworks for Higher-order Network Data Analysis
Computational Frameworks for Higher-order Network Data AnalysisComputational Frameworks for Higher-order Network Data Analysis
Computational Frameworks for Higher-order Network Data Analysis
 
Higher-order link prediction and other hypergraph modeling
Higher-order link prediction and other hypergraph modelingHigher-order link prediction and other hypergraph modeling
Higher-order link prediction and other hypergraph modeling
 
Hypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting FunctionsHypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting Functions
 
Hypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting FunctionsHypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting Functions
 
Higher-order link prediction
Higher-order link predictionHigher-order link prediction
Higher-order link prediction
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
 
Three hypergraph eigenvector centralities
Three hypergraph eigenvector centralitiesThree hypergraph eigenvector centralities
Three hypergraph eigenvector centralities
 
Semi-supervised learning of edge flows
Semi-supervised learning of edge flowsSemi-supervised learning of edge flows
Semi-supervised learning of edge flows
 
Choosing to grow a graph
Choosing to grow a graphChoosing to grow a graph
Choosing to grow a graph
 
Link prediction in networks with core-fringe structure
Link prediction in networks with core-fringe structureLink prediction in networks with core-fringe structure
Link prediction in networks with core-fringe structure
 
Higher-order Link Prediction GraphEx
Higher-order Link Prediction GraphExHigher-order Link Prediction GraphEx
Higher-order Link Prediction GraphEx
 
Higher-order Link Prediction Syracuse
Higher-order Link Prediction SyracuseHigher-order Link Prediction Syracuse
Higher-order Link Prediction Syracuse
 
Random spatial network models for core-periphery structure
Random spatial network models for core-periphery structureRandom spatial network models for core-periphery structure
Random spatial network models for core-periphery structure
 
Random spatial network models for core-periphery structure.
Random spatial network models for core-periphery structure.Random spatial network models for core-periphery structure.
Random spatial network models for core-periphery structure.
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
 
Simplicial closure and simplicial diffusions
Simplicial closure and simplicial diffusionsSimplicial closure and simplicial diffusions
Simplicial closure and simplicial diffusions
 
Sampling methods for counting temporal motifs
Sampling methods for counting temporal motifsSampling methods for counting temporal motifs
Sampling methods for counting temporal motifs
 
Set prediction three ways
Set prediction three waysSet prediction three ways
Set prediction three ways
 

Recently uploaded

一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
74nqk8xf
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
zsjl4mimo
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 

Recently uploaded (20)

一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 

QR Factorizations and SVDs for Tall-and-skinny Matrices in MapReduce Architectures (SIAM CSE)

  • 1. QR Factorizations and SVDs for Tall-and-skinny Matrices in MapReduce Architectures Austin Benson ICME, Stanford University Work completed in part at UC-Berkeley SIAM CSE 2013 February 27, 2013
  • 2. Collaborators James Demmel, UC-Berkeley David Gleich, Purdue Paul Constantine, Stanford Thanks!
  • 3. Quick QR and SVD review A Q R VT n n n n n n n n A U n n n n Σ n n Figure: Q, U, and V are orthogonal matrices. R is upper triangular and is diagonal with positive entries.
  • 4. Tall-and-skinny QR A Q R m n m n n n Tall-and-skinny (TS): m n
  • 5. TS-QR ! TS-SVD R is small, so computing its SVD is cheap.
  • 6. Why Tall-and-skinny QR and SVD? 1. Regression with many samples 2. Principle Component Analysis (PCA) 3. Model Reduction Pressure, Dilation, Jet Engine Figure: Dynamic mode decomposition of a rectangular supersonic screeching jet. Joe Nichols, Stanford University.
  • 7. MapReduce overview Two functions that operate on key value pairs: (key; value) map ! (key; value) (key; hvalue1; : : : ; valueni) reduce ! (key; value) A shue stage runs between map and reduce to sort the values by key.
  • 8. MapReduce overview Scalability: many map tasks and many reduce tasks are used https://developers.google.com/appengine/docs/python/images/mapreduce_mapshuffle.png
  • 9. MapReduce overview The idea is data-local computations. The programmer implements: I map(key, value) I reduce(key, h value1, : : :, valuen i) The shue and data I/O is implemented by the MapReduce framework, e.g., Hadoop. This is a very restrictive programming environment! We sacri
  • 10. ce program control for structure, scalability, fault tolerance, etc.
  • 11. Matrix representation We have matrices, so what are the key-value pairs? We can just have unique row identi
  • 12. ers: A = 2 1:0 0:0 2:4 3:7 0:8 4:2 9:0 9:0 664 3 775 ! 2 (a3320354; [1:0; 0:0]) (5da011e2; [2:4; 3:7]) (94d80025; [0:8; 4:2]) (90764917; [9:0; 9:0]) 664 3 775 Each value in a key-value pair is a row of the matrix.
  • 13. Matrix representation: an example We can encode more information in the keys, e.g., (x, y, z) coordinates and model number: ((47570,103.429767811242,0,-16.525510963787,iDV7), [0.00019924 -4.706066e-05 2.875293979e-05 2.456653e-05 -8.436627e-06 -1.508808e-05 3.731976e-06 -1.048795e-05 5.229153e-06 6.323812e-06]) Figure: Aircraft simulation data. Paul Constantine, Stanford University
  • 14. Why MapReduce? 1. Fault tolerance 2. Structured data I/O 3. Cheaper clusters with more data storage 4. Easy to program Generate lots of data on supercomputer Post-process and analyze on MapReduce cluster http://www.nersc.gov/assets/About-Us/hopper1.jpg
  • 15. Fault tolerance 1550 1500 1450 1400 1350 1300 1250 1200 0 50 100 150 200 1/(probability of task fault) job time (secs.) Performance with Injected Faults Running time with no faults Running times with injected faults 25% performance penalty with 1 out 5 tasks failing
  • 16. Why MapReduce? Not convinced MapReduce is good for computational science? MS218: Is MapReduce Good for Science and Simulation Data?, Thursday, 4:30-6:30pm.
  • 17. Communication-avoiding TSQR A = 2 A1 A2 A3 A4 664 3 775 | {z } 8n4n = 2 Q1 664 Q2 Q3 Q4 3 775 | {z } 8n4n 2 R1 R2 R3 R4 664 3 775 | {z } 4nn = =Q z2 }| { Q1 664 Q2 Q3 Q4 3 775 | {z } 8n4n ~Q |{z} 4nn |{Rz} nn (Demmel et al. 2008)
  • 18. Communication-avoiding TSQR A = 2 A1 A2 A3 A4 664 3 775 | {z } 8n4n = 2 Q1 664 Q2 Q3 Q4 3 775 | {z } 8n4n 2 R1 R2 R3 R4 664 3 775 | {z } 4nn Ai = QiRi can be computed in parallel. If we only need R, then we can throw out the Q factors.
  • 19. MapReduce TSQR S(1) A A1 A3 A3 R1 map A2 R emit 2 map R emit 3 map A4 R emit 4 map shuffle S1 A2 reduce S2 R2,1 emit reduce R2,2 emit emit shuffle AS23 reduce R2,3 emit Local TSQR identity map SA(22 ) reduce R emit Local TSQR Local TSQR Figure: S(1) is the matrix consisting of the rows of all of the Ri factors. Similarly, S(2) consists of all of the rows of the R2;j factors.
  • 20. MapReduce TSQR: the implementation 1 import random, numpy, hadoopy 2 class SerialTSQR: 3 def i n i t ( sel f , blocksize , isreducer ) : 4 sel f . bsize , sel f . data =blocksize , [ ] 5 sel f . c a l l = sel f . reducer i f isreducer else sel f .mapper 6 7 def compress( sel f ) : 8 R = numpy. l inalg . qr (numpy. array ( sel f . data) , ' r ' ) 9 sel f . data = [ ] 10 for row in R: sel f . data .append( [ f loat (v) for v in row] ) 11 12 def col lect ( sel f , key , value ) : 13 sel f . data .append(value ) 14 i f len ( sel f . data) sel f . bsize len ( sel f . data [ 0 ] ) : 15 sel f . compress() 16 17 def close ( sel f ) : 18 sel f . compress() 19 for row in sel f . data : yield random. randint (0,2000000000), row 20 21 def mapper( sel f , key , value ) : 22 sel f . col lect (key , value ) 23 24 def reducer ( sel f , key , values ) : 25 for value in values : sel f .mapper(key , value ) 26 27 i f name ==' main ' : 28 mapper = SerialTSQR( blocksize=3, isreducer=False ) 29 reducer = SerialTSQR( blocksize=3, isreducer=True) 30 hadoopy. run(mapper , reducer )
  • 21. MapReduce TSQR: Getting Q We have an ecient way of getting R in QR (and in the SVD). What if I want Q (or my left singular vectors)? A = QR ! Q = AR1 This is a numerically unstable computation, and Q can be far from orthogonal. (This may be OK in some cases).
  • 22. Indirect TSQR We call this method Indirect TSQR.
  • 24. nement We can repeat the TSQR computation on Q to get a more orthogonal matrix. This is called iterative re
  • 25. nement. A A1 R-1 map R Q1 A2 R-1 map Q2 A3 R-1 map Q3 A4 R-1 map Q4 emit emit emit emit distribute TSQR Q Q1 R1 -1 map R1 Q1 Q2 R1 -1 map Q2 Q3 R1 -1 map Q3 Q4 R1 -1 map Q4 emit emit emit emit distribute Local MatMul Local MatMul Iterative Refinement step
  • 26. Direct TSQR Why is it hard to reconstruct Q? I Only maintained state is the data written to disk at the end of each map/reduce iteration I No controlled communication between processors ! can only control data movement via the shue I File names and keys are the only methods of labeling data components
  • 27. Direct TSQR: Steps 1 and 2 A A1 map R1 Q11 emit emit A2 map R2 Q21 emit emit A3 map R3 Q31 emit emit A4 map R4 Q41 emit emit First step R1 R2 R3 R4 Q12 Q22 Q32 Q42 R reduce emit emit emit emit emit Second step shuffle
  • 29. Cholesky QR We also have a Cholesky QR: ATA = (QR)T (QR) = RTQTQR = RTR Computing ATA translates nicely to MapReduce. This method has worse stability issues but requires fewer ops.
  • 30. Performance Data matrix size Time to completion (seconds) 4Bx4 (135 GB) 2.5Bx10 (193 GB) 600Mx25 (112 GB) 500Mx50 (184 GB) 150Mx100 (110 GB) 8000 7000 6000 5000 4000 3000 2000 1000 0 Performance of various MapReduce algorithms for computing QR Cholesky Indirect TSQR Cholesky + IR Indirect TSQR + IR Direct TSQR
  • 31. End Questions?? I code: https://github.com/arbenson/mrtsqr I arXiv paper: http://arxiv.org/abs/1301.1071 I my email: arbenson@stanford.edu
  • 33. Cholesky QR A A1 A1 TA1 map emit A2 map A2 TA2 emit A3 map A3 TA3 emit A4 map A4 TA4 emit shuffle (A1 TA1)1 (A2 TA2)1 (A3 TA3)1 (A4 TA4)1 (ATA)1 (A1 TA1)2 (A2 TA2)2 (A3 TA3)2 (A4 TA4)2 (ATA)2 (A1 TA1)3 (A2 TA2)3 (A3 TA3)3 (A4 TA4)3 (ATA)3 (A1 TA1)4 (A2 TA2)4 (A3 TA3)4 (A4 TA4)4 (ATA)4 emit emit emit emit ATA RTR emit reduce reduce reduce reduce Local ATA Row Sum Cholesky
  • 34. Stability 5 10 0 10 −5 10 −10 10 10 0 20 10 5 10 10 10 15 10 10 −15 cond(A) 2 ||QTQ − I|| Numerical stability: 10000x10 matrices Dir. TSQR Indir. TSQR Indir. TSQR + I.R. Chol. Chol. + I.R.