SlideShare a Scribd company logo
Center for Uncertainty
Quantification
C
Q
LikelihoodApproximationWithParallelHierarchicalMatrices
ForLargeSpatialDatasets
A. Litvinenko, Y. Sun, M. Genton, D. Keyes, CEMSE, KAUST
HIERARCHICAL LIKELIHOOD APPROXIMATION
Suppose we observe a mean-zero, stationary and isotropic Gaussian process Z with a Matérn covari-
ance at n irregularly spaced locations. Let Z = (Z(s1), ..., Z(sn))T
then Z ∼ N(0, C(θ)), θ ∈ Rq
is an
unknown parameter vector of interest, where
Cij(θ) = cov(Z(si), Z(sj)) = C( si − sj , θ), and
C(r) := Cθ(r) =
2σ2
Γ(ν)
r
2
ν
Kν
r
, θ = (σ2
, ν, )T
is the Matérn covariance function. The MLE of
θ is obtained by maximizing the Gaussian log-
likelihood function:
L(θ) = −
n
2
log(2π)−
1
2
log |C(θ)|−
1
2
Z C(θ)−1
Z.
We approximate C ≈ C in the H-matrix format
with cost and storage O(kn log n), k n.
Theorem 1 1. Let ρ(C−1
C − I) < ε < 1. It holds
| log |C| − log |C|| ≤ −n log(1 − ε). Let C−1
≤ c1,
then |L(θ; k) − L(θ)| ≤ c2
0 · c1 · ε + n log(1 − ε)
Operation Sequen. Compl. Parallel Compl. (shared mem.)
building( ˜C) O(n log n) O(n log n)
p
+ O(|V (T)L(T)|)
storage( ˜C) O(kn log n) O(kn log n)
˜Cz O(kn log n) O(kn log n)
p
+ n
√
p
H-Cholesky O(k2
n log2
n) O(n log n)
p
+ O(k2
n log2
n
n1/d )
Daily soil moisture, Mississippi basin.
H-matrix rank
3 7 9
cov.length
0.02
0.025
0.03
0.035
0.04
0.045
0.05
0.055
0.06
Box-plots for different
H-ranks k = {3, 7, 9}, = 0.0334.
ℓ, ν = 0.325, σ2
= 0.98
0 0.2 0.4 0.6 0.8 1
×10 4
-5
-4
-3
-2
-1
0
1
2
3
log(| ˜C|)
zT ˜C−1
z
− ˜L
Moisture, n = 66049, rank k = 11.
PARALLEL HIERARCHICAL MATRICES (HACKBUSCH, KRIEMANN’05)
Advantages to approximate C by C: H-approximation is cheap; storage and matrix-vector product
cost O(kn log n); LU and inverse cost O(k2
n log2
n); efficient parallel implementations exists.
54
54
86
86
68
67 67
20
19 112
28 26
15 24
112
112 123
26 107
19 17
107
107
82
67 67
25
25
25
32 66
20 20
28 24
25 26
16 25
66
66 73
44 63
30 23
63
63 67
29
15 67
14 27
22 18
69
60 60
25 60
25 25
79
63 63
24
14
27 63
19 29
16 21
23 30
64
64 79
19 66
23 29
66
66 66
24
11 65
21 52
28 35
65
65 70
112 112
19
24
35
21 112
19 8
23 35
18 15
12 20
79
71 71
115 115
33 115
23 29
127
125
66
66 67
24
25 125
27 25
25 30
125
125
81
64 64
17 125
26 19
125
125
65
65 73
17
35
22
14 65
20 30
25 26
25 21
15 28
65
65 82
21 70
22 49
70
60 60
30
29 60
25 16
17 13
71
64 64
15 61
14 20
61
61 70
19
20
18 59
23 23
17 25
21 25
59
59 85
119 119
16 119
23 21
70
62 62
120 120
12
17
16
13
20 120
14 17
25 23
17 24
13 20
8 12
80
67 67
19 65
11 15
65
64 64
17 122
18 15
122
122 122
34
23 112
11 26
24 23
113
113
74
74 84
21 104
17 23
104
104 120
28
24
18 120
29 22
26 31
18 27
78
57 57
125 125
28 125
19 30
62
62 77
122 122
23
32 121
15 15
26 33
122
122 123
25 123
24 19
70
63 63
29 63
24 18
75
65 65
18
28
35
33
28 65
18 24
24 35
16 14
19 21
12 20
78
66 66
124 124
18 123
21 16
123
123
66
66 70
18
27 127
29 19
21 23
127
127
77
63 63
21 123
11 32
123
123
61
61 82
28
26
16 123
26 30
19 31
18 26
123
113 113
22 113
16 25
73
59 59
29 59
26 29
76
76 79
26
16
19 62
20 19
28 23
24 26
62
62 73
18 69
41 20
69
65 66
28 123
25 12
123
26 26
54
54
86
86
68
67 67
20
39 112
48 73
15 39
112
112 123
63 107
56 65
107
107
82
67 67
25
50
67
58 66
59 58
71 76
50 75
16 41
66
66 73
63 63
62 66
63
63 67
84
58 67
54 55
75 76
69
60 60
59 60
56 55
79
63 63
65
70
58 63
59 57
67 69
63 80
64
64 79
59 66
54 56
66
66 66
82
57 65
60 66
79 83
65
65 70
112 112
19
43
76
82 112
73 72
62 83
37 51
12 32
79
71 71
115 115
87 115
78 77
127
127
66
66 67
63
73 125
70 77
59 75
125
125
81
64 64
76 125
78 84
125
125
65
65 73
48
71
79
56 65
61 67
75 82
59 71
43 61
65
65 82
59 70
54 60
70
60 60
80
52 60
48 54
67 52
71
64 64
54 61
61 55
61
61 70
69
74
48 59
52 63
67 67
64 78
59
59 85
119 119
64 119
65 74
70
62 62
120 120
12
29
45
47
59 120
51 58
53 69
44 52
25 40
8 20
80
67 67
46 65
50 56
65
64 64
67 122
62 61
122
122 122
69
71 113
63 71
58 71
113
113
74
74 84
64 104
59 72
104
104 120
55
69
63 120
66 80
66 74
43 60
78
57 57
125 125
81 125
69 78
62
62 77
122 122
68
77 122
62 65
64 81
122
122 123
76 123
71 80
70
63 63
63 63
56 55
75
65 65
34
56
73
85
70 65
63 60
76 93
55 61
45 54
27 40
78
66 66
124 124
68 123
63 71
123
123
66
66 70
61
69 127
68 74
60 74
127
127
77
63 63
73 123
63 74
123
123
61
61 82
57
72
66 123
67 88
63 74
45 61
123
113 113
72 113
65 75
73
59 59
62 59
59 57
76
76 79
69
68
52 62
54 64
72 83
62 74
62
62 73
58 69
62 58
69
66 66
83 123
26 26
123
26 26
(1st) Matérn H-matrix approximations for moisture example, n = 8000, ε = 10−3
, = 0.64, ν = 0.325,
σ2
= 0.98, 29.3MB vs 488.3MB for dense, set up time 0.4 sec.; (2nd) Cholesky factor L, with accuracy in
each block ε = 10−8
, 4.8 sec., storage 52.8 MB.; (3rd) Distribution across p processors; (4) Kronecker prod-
uct of H-matrices, n = 381K; (5) Discretization of Mississippi basin, [−84.8◦
−72.9◦
]×[32.446◦
, 43.4044◦
].
NUMERICAL EXAMPLES
H-matrix approximation, ν = 0.5, domain G = [0, 1]2
, C(0.25,0.75) 2 = {212, 568}, n = 16049.
k KLD C − C 2 CC−1
− I 2
= 0.25 = 0.75 = 0.25 = 0.75 = 0.25 = 0.75
10 2.6e-3 0.2 7.7e-4 7.0e-4 6.0e-2 3.1
50 3.4e-13 5e-12 2.0e-13 2.4e-13 4e-11 2.7e-9
Computing time and number of iterations for maximization of log-likelihood L(θ; k), n = 66049.
k size, GB C, set up time, s. compute L, s. maximizing, s. # iters
10 1 7 115 1994 13
20 1.7 11 370 5445 9
dense 38 42 657 ∞ -
Moisture data. We used adaptive rank arithmetics with ε = 10−4
for each block of C and ε = 10−8
for
each block of C−1
. Number of processing cores is 40.
n compute C LLT
inverse
Compr. time size time size I − (LLT
)−1
C 2 time size I − C−1
C 2
rate % sec. MB sec. MB sec. MB
10000 86% 0.9 106 4.1 109 7.7e-6 44 230 7.8e-5
30000 92.5% 4.3 515 25 557 1.1e-3 316 1168 1.1e-1
n = 512K, accuracy inside each block 10−8
, matrix setup 261 sec., compression rate 99.98% (0.4
GB against 2006 GB). H-LU is done in 843 sec., required 5.8 GB RAM, inversion LU error 2 · 10−3
.
(1st) −L vs. ν; (2nd) with nuggets {0.01, 0.005, 0.001} for Gaussian covariance, n = 2000, k = 14,
σ2
= 1; (3rd) Zoom of 2nd figure; (4th) box-plots for ν vs number of locations n.
REFERENCES AND ACKNOWLEDGEMENTS
[1] B. N. KHOROMSKIJ, A. LITVINENKO, H. G. MATTHIES, Application of hierarchical matrices for computing the
Karhunen-Loéve expan-sion, Computing, Vol. 84, Issue 1-2, pp 49-67, 2008.
[2] Y. SUN, M. STEIN, Statistically and computationally efficient estimating equations for large spatial datasets, JCGS, 2016,
[3] A. LITVINENKO, M. GENTON, Y. SUN, D. KEYES, ??matrix techniques for approximating large covariance matrices
and estimating its parameters, PAMM 16 (1), 731-732, 2016
[4] W. NOWAK, A. LITVINENKO, Kriging and spatial design accelerated by orders of magnitude: combining low-rank
covariance approximations with FFT-techniques, J. Mathematical Geosciences, Vol. 45, N4, pp 411-435, 2013.
Work supported by SRI-UQ and ECRC, KAUST. Thanks to Ronald Kriemann for HLIBPro .

More Related Content

What's hot

What's hot (18)

Capitulo 2, 7ma edición
Capitulo 2, 7ma ediciónCapitulo 2, 7ma edición
Capitulo 2, 7ma edición
 
Table of transformation of laplace & z
Table of transformation of laplace & zTable of transformation of laplace & z
Table of transformation of laplace & z
 
Tabla trasformada z
Tabla trasformada zTabla trasformada z
Tabla trasformada z
 
Capitulo 1, 7ma edición
Capitulo 1, 7ma ediciónCapitulo 1, 7ma edición
Capitulo 1, 7ma edición
 
Families of Triangular Norm Based Kernel Function and Its Application to Kern...
Families of Triangular Norm Based Kernel Function and Its Application to Kern...Families of Triangular Norm Based Kernel Function and Its Application to Kern...
Families of Triangular Norm Based Kernel Function and Its Application to Kern...
 
Data sparse approximation of the Karhunen-Loeve expansion
Data sparse approximation of the Karhunen-Loeve expansionData sparse approximation of the Karhunen-Loeve expansion
Data sparse approximation of the Karhunen-Loeve expansion
 
Resistencia de materiales
Resistencia de materialesResistencia de materiales
Resistencia de materiales
 
Capitulo 3, 7ma edición
Capitulo 3, 7ma ediciónCapitulo 3, 7ma edición
Capitulo 3, 7ma edición
 
Capitulo 10, 7ma edición
Capitulo 10, 7ma ediciónCapitulo 10, 7ma edición
Capitulo 10, 7ma edición
 
Capitulo 5, 7ma edición
Capitulo 5, 7ma ediciónCapitulo 5, 7ma edición
Capitulo 5, 7ma edición
 
Tutorial no. 1.doc
Tutorial no. 1.docTutorial no. 1.doc
Tutorial no. 1.doc
 
Capitulo 12, 7ma edición
Capitulo 12, 7ma ediciónCapitulo 12, 7ma edición
Capitulo 12, 7ma edición
 
Capitulo 9, 7ma edición
Capitulo 9, 7ma ediciónCapitulo 9, 7ma edición
Capitulo 9, 7ma edición
 
Solution of simplified neutron diffusion equation by FDM
Solution of simplified neutron diffusion equation by FDMSolution of simplified neutron diffusion equation by FDM
Solution of simplified neutron diffusion equation by FDM
 
Tutorial no. 2
Tutorial no. 2Tutorial no. 2
Tutorial no. 2
 
Tutorial no. 3(1)
Tutorial no. 3(1)Tutorial no. 3(1)
Tutorial no. 3(1)
 
Integral calculus formula sheet
Integral calculus formula sheetIntegral calculus formula sheet
Integral calculus formula sheet
 
Capitulo 13, 7ma edición
Capitulo 13, 7ma ediciónCapitulo 13, 7ma edición
Capitulo 13, 7ma edición
 

Viewers also liked

B2B Marketing and The Power of Twitter
B2B Marketing and The Power of TwitterB2B Marketing and The Power of Twitter
B2B Marketing and The Power of Twitter
Steve Yanor
 

Viewers also liked (20)

Minimum mean square error estimation and approximation of the Bayesian update
Minimum mean square error estimation and approximation of the Bayesian updateMinimum mean square error estimation and approximation of the Bayesian update
Minimum mean square error estimation and approximation of the Bayesian update
 
Hierarchical matrix approximation of large covariance matrices
Hierarchical matrix approximation of large covariance matricesHierarchical matrix approximation of large covariance matrices
Hierarchical matrix approximation of large covariance matrices
 
Data sparse approximation of the Karhunen-Loeve expansion
Data sparse approximation of the Karhunen-Loeve expansionData sparse approximation of the Karhunen-Loeve expansion
Data sparse approximation of the Karhunen-Loeve expansion
 
Low-rank methods for analysis of high-dimensional data (SIAM CSE talk 2017)
Low-rank methods for analysis of high-dimensional data (SIAM CSE talk 2017) Low-rank methods for analysis of high-dimensional data (SIAM CSE talk 2017)
Low-rank methods for analysis of high-dimensional data (SIAM CSE talk 2017)
 
Vascularised Ulnar Nerve Graft
Vascularised Ulnar Nerve GraftVascularised Ulnar Nerve Graft
Vascularised Ulnar Nerve Graft
 
A small introduction into H-matrices which I gave for my colleagues
A small introduction into H-matrices which I gave for my colleaguesA small introduction into H-matrices which I gave for my colleagues
A small introduction into H-matrices which I gave for my colleagues
 
Top 5 Deep Learning and AI Stories 3/9
Top 5 Deep Learning and AI Stories 3/9Top 5 Deep Learning and AI Stories 3/9
Top 5 Deep Learning and AI Stories 3/9
 
Possible applications of low-rank tensors in statistics and UQ (my talk in Bo...
Possible applications of low-rank tensors in statistics and UQ (my talk in Bo...Possible applications of low-rank tensors in statistics and UQ (my talk in Bo...
Possible applications of low-rank tensors in statistics and UQ (my talk in Bo...
 
RS
RSRS
RS
 
My PhD on 4 pages
My PhD on 4 pagesMy PhD on 4 pages
My PhD on 4 pages
 
Litvinenko nlbu2016
Litvinenko nlbu2016Litvinenko nlbu2016
Litvinenko nlbu2016
 
My paper for Domain Decomposition Conference in Strobl, Austria, 2005
My paper for Domain Decomposition Conference in Strobl, Austria, 2005My paper for Domain Decomposition Conference in Strobl, Austria, 2005
My paper for Domain Decomposition Conference in Strobl, Austria, 2005
 
AWSでアプリ開発するなら 知っておくべこと
AWSでアプリ開発するなら 知っておくべことAWSでアプリ開発するなら 知っておくべこと
AWSでアプリ開発するなら 知っておくべこと
 
B2B Marketing and The Power of Twitter
B2B Marketing and The Power of TwitterB2B Marketing and The Power of Twitter
B2B Marketing and The Power of Twitter
 
The Next Tsunami AI Blockchain IOT and Our Swarm Evolutionary Singularity
The Next Tsunami AI Blockchain IOT and Our Swarm Evolutionary SingularityThe Next Tsunami AI Blockchain IOT and Our Swarm Evolutionary Singularity
The Next Tsunami AI Blockchain IOT and Our Swarm Evolutionary Singularity
 
Design in Tech Report 2017
Design in Tech Report 2017Design in Tech Report 2017
Design in Tech Report 2017
 
Tensor Completion for PDEs with uncertain coefficients and Bayesian Update te...
Tensor Completion for PDEs with uncertain coefficients and Bayesian Update te...Tensor Completion for PDEs with uncertain coefficients and Bayesian Update te...
Tensor Completion for PDEs with uncertain coefficients and Bayesian Update te...
 
Scalable hierarchical algorithms for stochastic PDEs and UQ
Scalable hierarchical algorithms for stochastic PDEs and UQScalable hierarchical algorithms for stochastic PDEs and UQ
Scalable hierarchical algorithms for stochastic PDEs and UQ
 
Future of construction World Economic Forum moladi
Future of construction World Economic Forum moladiFuture of construction World Economic Forum moladi
Future of construction World Economic Forum moladi
 
KelvinT_Directors Treatment
KelvinT_Directors TreatmentKelvinT_Directors Treatment
KelvinT_Directors Treatment
 

Similar to Likelihood approximation with parallel hierarchical matrices for large spatial datasets

Similar to Likelihood approximation with parallel hierarchical matrices for large spatial datasets (20)

Application of parallel hierarchical matrices and low-rank tensors in spatial...
Application of parallel hierarchical matrices and low-rank tensors in spatial...Application of parallel hierarchical matrices and low-rank tensors in spatial...
Application of parallel hierarchical matrices and low-rank tensors in spatial...
 
Numerical Methods: curve fitting and interpolation
Numerical Methods: curve fitting and interpolationNumerical Methods: curve fitting and interpolation
Numerical Methods: curve fitting and interpolation
 
Application of parallel hierarchical matrices for parameter inference and pre...
Application of parallel hierarchical matrices for parameter inference and pre...Application of parallel hierarchical matrices for parameter inference and pre...
Application of parallel hierarchical matrices for parameter inference and pre...
 
Identification of unknown parameters and prediction with hierarchical matrice...
Identification of unknown parameters and prediction with hierarchical matrice...Identification of unknown parameters and prediction with hierarchical matrice...
Identification of unknown parameters and prediction with hierarchical matrice...
 
Interface 2010
Interface 2010Interface 2010
Interface 2010
 
Root locus
Root locusRoot locus
Root locus
 
Future cosmology with CMB lensing and galaxy clustering
Future cosmology with CMB lensing and galaxy clusteringFuture cosmology with CMB lensing and galaxy clustering
Future cosmology with CMB lensing and galaxy clustering
 
Time series data mining techniques
Time series data mining techniquesTime series data mining techniques
Time series data mining techniques
 
Propagation of Uncertainties in Density Driven Groundwater Flow
Propagation of Uncertainties in Density Driven Groundwater FlowPropagation of Uncertainties in Density Driven Groundwater Flow
Propagation of Uncertainties in Density Driven Groundwater Flow
 
Talk litvinenko prior_cov
Talk litvinenko prior_covTalk litvinenko prior_cov
Talk litvinenko prior_cov
 
Project_Report.pdf
Project_Report.pdfProject_Report.pdf
Project_Report.pdf
 
assignemts.pdf
assignemts.pdfassignemts.pdf
assignemts.pdf
 
Computation of electromagnetic_fields_scattered_from_dielectric_objects_of_un...
Computation of electromagnetic_fields_scattered_from_dielectric_objects_of_un...Computation of electromagnetic_fields_scattered_from_dielectric_objects_of_un...
Computation of electromagnetic_fields_scattered_from_dielectric_objects_of_un...
 
A kernel-free particle method: Smile Problem Resolved
A kernel-free particle method: Smile Problem ResolvedA kernel-free particle method: Smile Problem Resolved
A kernel-free particle method: Smile Problem Resolved
 
Dynamic stiffness and eigenvalues of nonlocal nano beams
Dynamic stiffness and eigenvalues of nonlocal nano beamsDynamic stiffness and eigenvalues of nonlocal nano beams
Dynamic stiffness and eigenvalues of nonlocal nano beams
 
Anatomy of human motion
Anatomy of human motionAnatomy of human motion
Anatomy of human motion
 
Semi-Classical Transport Theory.ppt
Semi-Classical Transport Theory.pptSemi-Classical Transport Theory.ppt
Semi-Classical Transport Theory.ppt
 
Talk at SciCADE2013 about "Accelerated Multiple Precision ODE solver base on ...
Talk at SciCADE2013 about "Accelerated Multiple Precision ODE solver base on ...Talk at SciCADE2013 about "Accelerated Multiple Precision ODE solver base on ...
Talk at SciCADE2013 about "Accelerated Multiple Precision ODE solver base on ...
 
Identification of unknown parameters and prediction of missing values. Compar...
Identification of unknown parameters and prediction of missing values. Compar...Identification of unknown parameters and prediction of missing values. Compar...
Identification of unknown parameters and prediction of missing values. Compar...
 
Data sparse approximation of Karhunen-Loeve Expansion
Data sparse approximation of Karhunen-Loeve ExpansionData sparse approximation of Karhunen-Loeve Expansion
Data sparse approximation of Karhunen-Loeve Expansion
 

More from Alexander Litvinenko

Poster_density_driven_with_fracture_MLMC.pdf
Poster_density_driven_with_fracture_MLMC.pdfPoster_density_driven_with_fracture_MLMC.pdf
Poster_density_driven_with_fracture_MLMC.pdf
Alexander Litvinenko
 
Density Driven Groundwater Flow with Uncertain Porosity and Permeability
Density Driven Groundwater Flow with Uncertain Porosity and PermeabilityDensity Driven Groundwater Flow with Uncertain Porosity and Permeability
Density Driven Groundwater Flow with Uncertain Porosity and Permeability
Alexander Litvinenko
 
Computing f-Divergences and Distances of High-Dimensional Probability Density...
Computing f-Divergences and Distances of High-Dimensional Probability Density...Computing f-Divergences and Distances of High-Dimensional Probability Density...
Computing f-Divergences and Distances of High-Dimensional Probability Density...
Alexander Litvinenko
 

More from Alexander Litvinenko (20)

Poster_density_driven_with_fracture_MLMC.pdf
Poster_density_driven_with_fracture_MLMC.pdfPoster_density_driven_with_fracture_MLMC.pdf
Poster_density_driven_with_fracture_MLMC.pdf
 
litvinenko_Henry_Intrusion_Hong-Kong_2024.pdf
litvinenko_Henry_Intrusion_Hong-Kong_2024.pdflitvinenko_Henry_Intrusion_Hong-Kong_2024.pdf
litvinenko_Henry_Intrusion_Hong-Kong_2024.pdf
 
litvinenko_Intrusion_Bari_2023.pdf
litvinenko_Intrusion_Bari_2023.pdflitvinenko_Intrusion_Bari_2023.pdf
litvinenko_Intrusion_Bari_2023.pdf
 
Density Driven Groundwater Flow with Uncertain Porosity and Permeability
Density Driven Groundwater Flow with Uncertain Porosity and PermeabilityDensity Driven Groundwater Flow with Uncertain Porosity and Permeability
Density Driven Groundwater Flow with Uncertain Porosity and Permeability
 
litvinenko_Gamm2023.pdf
litvinenko_Gamm2023.pdflitvinenko_Gamm2023.pdf
litvinenko_Gamm2023.pdf
 
Litvinenko_Poster_Henry_22May.pdf
Litvinenko_Poster_Henry_22May.pdfLitvinenko_Poster_Henry_22May.pdf
Litvinenko_Poster_Henry_22May.pdf
 
Uncertain_Henry_problem-poster.pdf
Uncertain_Henry_problem-poster.pdfUncertain_Henry_problem-poster.pdf
Uncertain_Henry_problem-poster.pdf
 
Litvinenko_RWTH_UQ_Seminar_talk.pdf
Litvinenko_RWTH_UQ_Seminar_talk.pdfLitvinenko_RWTH_UQ_Seminar_talk.pdf
Litvinenko_RWTH_UQ_Seminar_talk.pdf
 
Litv_Denmark_Weak_Supervised_Learning.pdf
Litv_Denmark_Weak_Supervised_Learning.pdfLitv_Denmark_Weak_Supervised_Learning.pdf
Litv_Denmark_Weak_Supervised_Learning.pdf
 
Computing f-Divergences and Distances of High-Dimensional Probability Density...
Computing f-Divergences and Distances of High-Dimensional Probability Density...Computing f-Divergences and Distances of High-Dimensional Probability Density...
Computing f-Divergences and Distances of High-Dimensional Probability Density...
 
Computing f-Divergences and Distances of\\ High-Dimensional Probability Densi...
Computing f-Divergences and Distances of\\ High-Dimensional Probability Densi...Computing f-Divergences and Distances of\\ High-Dimensional Probability Densi...
Computing f-Divergences and Distances of\\ High-Dimensional Probability Densi...
 
Low rank tensor approximation of probability density and characteristic funct...
Low rank tensor approximation of probability density and characteristic funct...Low rank tensor approximation of probability density and characteristic funct...
Low rank tensor approximation of probability density and characteristic funct...
 
Computation of electromagnetic fields scattered from dielectric objects of un...
Computation of electromagnetic fields scattered from dielectric objects of un...Computation of electromagnetic fields scattered from dielectric objects of un...
Computation of electromagnetic fields scattered from dielectric objects of un...
 
Low-rank tensor approximation (Introduction)
Low-rank tensor approximation (Introduction)Low-rank tensor approximation (Introduction)
Low-rank tensor approximation (Introduction)
 
Computation of electromagnetic fields scattered from dielectric objects of un...
Computation of electromagnetic fields scattered from dielectric objects of un...Computation of electromagnetic fields scattered from dielectric objects of un...
Computation of electromagnetic fields scattered from dielectric objects of un...
 
Computation of electromagnetic fields scattered from dielectric objects of un...
Computation of electromagnetic fields scattered from dielectric objects of un...Computation of electromagnetic fields scattered from dielectric objects of un...
Computation of electromagnetic fields scattered from dielectric objects of un...
 
Simulation of propagation of uncertainties in density-driven groundwater flow
Simulation of propagation of uncertainties in density-driven groundwater flowSimulation of propagation of uncertainties in density-driven groundwater flow
Simulation of propagation of uncertainties in density-driven groundwater flow
 
Semi-Supervised Regression using Cluster Ensemble
Semi-Supervised Regression using Cluster EnsembleSemi-Supervised Regression using Cluster Ensemble
Semi-Supervised Regression using Cluster Ensemble
 
Talk Alexander Litvinenko on SIAM GS Conference in Houston
Talk Alexander Litvinenko on SIAM GS Conference in HoustonTalk Alexander Litvinenko on SIAM GS Conference in Houston
Talk Alexander Litvinenko on SIAM GS Conference in Houston
 
Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...
Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...
Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...
 

Recently uploaded

Micro RNA genes and their likely influence in rice (Oryza sativa L.) dynamic ...
Micro RNA genes and their likely influence in rice (Oryza sativa L.) dynamic ...Micro RNA genes and their likely influence in rice (Oryza sativa L.) dynamic ...
Micro RNA genes and their likely influence in rice (Oryza sativa L.) dynamic ...
Open Access Research Paper
 
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business VenturesWillie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
greendigital
 
一比一原版EUR毕业证鹿特丹伊拉斯姆斯大学毕业证成绩单如何办理
一比一原版EUR毕业证鹿特丹伊拉斯姆斯大学毕业证成绩单如何办理一比一原版EUR毕业证鹿特丹伊拉斯姆斯大学毕业证成绩单如何办理
一比一原版EUR毕业证鹿特丹伊拉斯姆斯大学毕业证成绩单如何办理
exehay
 
一比一原版(Monash毕业证)莫纳什大学毕业证成绩单
一比一原版(Monash毕业证)莫纳什大学毕业证成绩单一比一原版(Monash毕业证)莫纳什大学毕业证成绩单
一比一原版(Monash毕业证)莫纳什大学毕业证成绩单
pcoow
 
Genetic diversity and association analysis for different morphological traits...
Genetic diversity and association analysis for different morphological traits...Genetic diversity and association analysis for different morphological traits...
Genetic diversity and association analysis for different morphological traits...
Open Access Research Paper
 
Prevalence, biochemical and hematological study of diabetic patients
Prevalence, biochemical and hematological study of diabetic patientsPrevalence, biochemical and hematological study of diabetic patients
Prevalence, biochemical and hematological study of diabetic patients
Open Access Research Paper
 
Green house gases GlobalWarmingPotential.pptx
Green house gases GlobalWarmingPotential.pptxGreen house gases GlobalWarmingPotential.pptx
Green house gases GlobalWarmingPotential.pptx
ViniHema
 
Use of Raffias’ species (Raphia spp.) and its impact on socioeconomic charact...
Use of Raffias’ species (Raphia spp.) and its impact on socioeconomic charact...Use of Raffias’ species (Raphia spp.) and its impact on socioeconomic charact...
Use of Raffias’ species (Raphia spp.) and its impact on socioeconomic charact...
Open Access Research Paper
 

Recently uploaded (20)

DRAFT NRW Recreation Strategy - People and Nature thriving together
DRAFT NRW Recreation Strategy - People and Nature thriving togetherDRAFT NRW Recreation Strategy - People and Nature thriving together
DRAFT NRW Recreation Strategy - People and Nature thriving together
 
Micro RNA genes and their likely influence in rice (Oryza sativa L.) dynamic ...
Micro RNA genes and their likely influence in rice (Oryza sativa L.) dynamic ...Micro RNA genes and their likely influence in rice (Oryza sativa L.) dynamic ...
Micro RNA genes and their likely influence in rice (Oryza sativa L.) dynamic ...
 
Navigating the complex landscape of AI governance
Navigating the complex landscape of AI governanceNavigating the complex landscape of AI governance
Navigating the complex landscape of AI governance
 
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business VenturesWillie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
 
Sustainable farming practices in India .pptx
Sustainable farming  practices in India .pptxSustainable farming  practices in India .pptx
Sustainable farming practices in India .pptx
 
Bhopal Gas Leak Tragedy - A Night of death
Bhopal Gas Leak Tragedy - A Night of deathBhopal Gas Leak Tragedy - A Night of death
Bhopal Gas Leak Tragedy - A Night of death
 
NRW Board Paper - DRAFT NRW Recreation Strategy
NRW Board Paper - DRAFT NRW Recreation StrategyNRW Board Paper - DRAFT NRW Recreation Strategy
NRW Board Paper - DRAFT NRW Recreation Strategy
 
一比一原版EUR毕业证鹿特丹伊拉斯姆斯大学毕业证成绩单如何办理
一比一原版EUR毕业证鹿特丹伊拉斯姆斯大学毕业证成绩单如何办理一比一原版EUR毕业证鹿特丹伊拉斯姆斯大学毕业证成绩单如何办理
一比一原版EUR毕业证鹿特丹伊拉斯姆斯大学毕业证成绩单如何办理
 
一比一原版(Monash毕业证)莫纳什大学毕业证成绩单
一比一原版(Monash毕业证)莫纳什大学毕业证成绩单一比一原版(Monash毕业证)莫纳什大学毕业证成绩单
一比一原版(Monash毕业证)莫纳什大学毕业证成绩单
 
Genetic diversity and association analysis for different morphological traits...
Genetic diversity and association analysis for different morphological traits...Genetic diversity and association analysis for different morphological traits...
Genetic diversity and association analysis for different morphological traits...
 
Artificial Reefs by Kuddle Life Foundation - May 2024
Artificial Reefs by Kuddle Life Foundation - May 2024Artificial Reefs by Kuddle Life Foundation - May 2024
Artificial Reefs by Kuddle Life Foundation - May 2024
 
Major-Environmental-Problems and Proven Solutions.pdf
Major-Environmental-Problems and Proven Solutions.pdfMajor-Environmental-Problems and Proven Solutions.pdf
Major-Environmental-Problems and Proven Solutions.pdf
 
Natural farming @ Dr. Siddhartha S. Jena.pptx
Natural farming @ Dr. Siddhartha S. Jena.pptxNatural farming @ Dr. Siddhartha S. Jena.pptx
Natural farming @ Dr. Siddhartha S. Jena.pptx
 
Prevalence, biochemical and hematological study of diabetic patients
Prevalence, biochemical and hematological study of diabetic patientsPrevalence, biochemical and hematological study of diabetic patients
Prevalence, biochemical and hematological study of diabetic patients
 
Green house gases GlobalWarmingPotential.pptx
Green house gases GlobalWarmingPotential.pptxGreen house gases GlobalWarmingPotential.pptx
Green house gases GlobalWarmingPotential.pptx
 
Use of Raffias’ species (Raphia spp.) and its impact on socioeconomic charact...
Use of Raffias’ species (Raphia spp.) and its impact on socioeconomic charact...Use of Raffias’ species (Raphia spp.) and its impact on socioeconomic charact...
Use of Raffias’ species (Raphia spp.) and its impact on socioeconomic charact...
 
International+e-Commerce+Platform-www.cfye-commerce.shop
International+e-Commerce+Platform-www.cfye-commerce.shopInternational+e-Commerce+Platform-www.cfye-commerce.shop
International+e-Commerce+Platform-www.cfye-commerce.shop
 
Environmental Impact Assessment (EIA) in Nepal.pptx
Environmental Impact Assessment (EIA) in Nepal.pptxEnvironmental Impact Assessment (EIA) in Nepal.pptx
Environmental Impact Assessment (EIA) in Nepal.pptx
 
@@how to Join @occult for money ritual..☎️+2349022657119.
@@how to Join @occult for money ritual..☎️+2349022657119.@@how to Join @occult for money ritual..☎️+2349022657119.
@@how to Join @occult for money ritual..☎️+2349022657119.
 
IPCC Vice Chair Ladislaus Change Central Asia Climate Change Conference 27 Ma...
IPCC Vice Chair Ladislaus Change Central Asia Climate Change Conference 27 Ma...IPCC Vice Chair Ladislaus Change Central Asia Climate Change Conference 27 Ma...
IPCC Vice Chair Ladislaus Change Central Asia Climate Change Conference 27 Ma...
 

Likelihood approximation with parallel hierarchical matrices for large spatial datasets

  • 1. Center for Uncertainty Quantification C Q LikelihoodApproximationWithParallelHierarchicalMatrices ForLargeSpatialDatasets A. Litvinenko, Y. Sun, M. Genton, D. Keyes, CEMSE, KAUST HIERARCHICAL LIKELIHOOD APPROXIMATION Suppose we observe a mean-zero, stationary and isotropic Gaussian process Z with a Matérn covari- ance at n irregularly spaced locations. Let Z = (Z(s1), ..., Z(sn))T then Z ∼ N(0, C(θ)), θ ∈ Rq is an unknown parameter vector of interest, where Cij(θ) = cov(Z(si), Z(sj)) = C( si − sj , θ), and C(r) := Cθ(r) = 2σ2 Γ(ν) r 2 ν Kν r , θ = (σ2 , ν, )T is the Matérn covariance function. The MLE of θ is obtained by maximizing the Gaussian log- likelihood function: L(θ) = − n 2 log(2π)− 1 2 log |C(θ)|− 1 2 Z C(θ)−1 Z. We approximate C ≈ C in the H-matrix format with cost and storage O(kn log n), k n. Theorem 1 1. Let ρ(C−1 C − I) < ε < 1. It holds | log |C| − log |C|| ≤ −n log(1 − ε). Let C−1 ≤ c1, then |L(θ; k) − L(θ)| ≤ c2 0 · c1 · ε + n log(1 − ε) Operation Sequen. Compl. Parallel Compl. (shared mem.) building( ˜C) O(n log n) O(n log n) p + O(|V (T)L(T)|) storage( ˜C) O(kn log n) O(kn log n) ˜Cz O(kn log n) O(kn log n) p + n √ p H-Cholesky O(k2 n log2 n) O(n log n) p + O(k2 n log2 n n1/d ) Daily soil moisture, Mississippi basin. H-matrix rank 3 7 9 cov.length 0.02 0.025 0.03 0.035 0.04 0.045 0.05 0.055 0.06 Box-plots for different H-ranks k = {3, 7, 9}, = 0.0334. ℓ, ν = 0.325, σ2 = 0.98 0 0.2 0.4 0.6 0.8 1 ×10 4 -5 -4 -3 -2 -1 0 1 2 3 log(| ˜C|) zT ˜C−1 z − ˜L Moisture, n = 66049, rank k = 11. PARALLEL HIERARCHICAL MATRICES (HACKBUSCH, KRIEMANN’05) Advantages to approximate C by C: H-approximation is cheap; storage and matrix-vector product cost O(kn log n); LU and inverse cost O(k2 n log2 n); efficient parallel implementations exists. 54 54 86 86 68 67 67 20 19 112 28 26 15 24 112 112 123 26 107 19 17 107 107 82 67 67 25 25 25 32 66 20 20 28 24 25 26 16 25 66 66 73 44 63 30 23 63 63 67 29 15 67 14 27 22 18 69 60 60 25 60 25 25 79 63 63 24 14 27 63 19 29 16 21 23 30 64 64 79 19 66 23 29 66 66 66 24 11 65 21 52 28 35 65 65 70 112 112 19 24 35 21 112 19 8 23 35 18 15 12 20 79 71 71 115 115 33 115 23 29 127 125 66 66 67 24 25 125 27 25 25 30 125 125 81 64 64 17 125 26 19 125 125 65 65 73 17 35 22 14 65 20 30 25 26 25 21 15 28 65 65 82 21 70 22 49 70 60 60 30 29 60 25 16 17 13 71 64 64 15 61 14 20 61 61 70 19 20 18 59 23 23 17 25 21 25 59 59 85 119 119 16 119 23 21 70 62 62 120 120 12 17 16 13 20 120 14 17 25 23 17 24 13 20 8 12 80 67 67 19 65 11 15 65 64 64 17 122 18 15 122 122 122 34 23 112 11 26 24 23 113 113 74 74 84 21 104 17 23 104 104 120 28 24 18 120 29 22 26 31 18 27 78 57 57 125 125 28 125 19 30 62 62 77 122 122 23 32 121 15 15 26 33 122 122 123 25 123 24 19 70 63 63 29 63 24 18 75 65 65 18 28 35 33 28 65 18 24 24 35 16 14 19 21 12 20 78 66 66 124 124 18 123 21 16 123 123 66 66 70 18 27 127 29 19 21 23 127 127 77 63 63 21 123 11 32 123 123 61 61 82 28 26 16 123 26 30 19 31 18 26 123 113 113 22 113 16 25 73 59 59 29 59 26 29 76 76 79 26 16 19 62 20 19 28 23 24 26 62 62 73 18 69 41 20 69 65 66 28 123 25 12 123 26 26 54 54 86 86 68 67 67 20 39 112 48 73 15 39 112 112 123 63 107 56 65 107 107 82 67 67 25 50 67 58 66 59 58 71 76 50 75 16 41 66 66 73 63 63 62 66 63 63 67 84 58 67 54 55 75 76 69 60 60 59 60 56 55 79 63 63 65 70 58 63 59 57 67 69 63 80 64 64 79 59 66 54 56 66 66 66 82 57 65 60 66 79 83 65 65 70 112 112 19 43 76 82 112 73 72 62 83 37 51 12 32 79 71 71 115 115 87 115 78 77 127 127 66 66 67 63 73 125 70 77 59 75 125 125 81 64 64 76 125 78 84 125 125 65 65 73 48 71 79 56 65 61 67 75 82 59 71 43 61 65 65 82 59 70 54 60 70 60 60 80 52 60 48 54 67 52 71 64 64 54 61 61 55 61 61 70 69 74 48 59 52 63 67 67 64 78 59 59 85 119 119 64 119 65 74 70 62 62 120 120 12 29 45 47 59 120 51 58 53 69 44 52 25 40 8 20 80 67 67 46 65 50 56 65 64 64 67 122 62 61 122 122 122 69 71 113 63 71 58 71 113 113 74 74 84 64 104 59 72 104 104 120 55 69 63 120 66 80 66 74 43 60 78 57 57 125 125 81 125 69 78 62 62 77 122 122 68 77 122 62 65 64 81 122 122 123 76 123 71 80 70 63 63 63 63 56 55 75 65 65 34 56 73 85 70 65 63 60 76 93 55 61 45 54 27 40 78 66 66 124 124 68 123 63 71 123 123 66 66 70 61 69 127 68 74 60 74 127 127 77 63 63 73 123 63 74 123 123 61 61 82 57 72 66 123 67 88 63 74 45 61 123 113 113 72 113 65 75 73 59 59 62 59 59 57 76 76 79 69 68 52 62 54 64 72 83 62 74 62 62 73 58 69 62 58 69 66 66 83 123 26 26 123 26 26 (1st) Matérn H-matrix approximations for moisture example, n = 8000, ε = 10−3 , = 0.64, ν = 0.325, σ2 = 0.98, 29.3MB vs 488.3MB for dense, set up time 0.4 sec.; (2nd) Cholesky factor L, with accuracy in each block ε = 10−8 , 4.8 sec., storage 52.8 MB.; (3rd) Distribution across p processors; (4) Kronecker prod- uct of H-matrices, n = 381K; (5) Discretization of Mississippi basin, [−84.8◦ −72.9◦ ]×[32.446◦ , 43.4044◦ ]. NUMERICAL EXAMPLES H-matrix approximation, ν = 0.5, domain G = [0, 1]2 , C(0.25,0.75) 2 = {212, 568}, n = 16049. k KLD C − C 2 CC−1 − I 2 = 0.25 = 0.75 = 0.25 = 0.75 = 0.25 = 0.75 10 2.6e-3 0.2 7.7e-4 7.0e-4 6.0e-2 3.1 50 3.4e-13 5e-12 2.0e-13 2.4e-13 4e-11 2.7e-9 Computing time and number of iterations for maximization of log-likelihood L(θ; k), n = 66049. k size, GB C, set up time, s. compute L, s. maximizing, s. # iters 10 1 7 115 1994 13 20 1.7 11 370 5445 9 dense 38 42 657 ∞ - Moisture data. We used adaptive rank arithmetics with ε = 10−4 for each block of C and ε = 10−8 for each block of C−1 . Number of processing cores is 40. n compute C LLT inverse Compr. time size time size I − (LLT )−1 C 2 time size I − C−1 C 2 rate % sec. MB sec. MB sec. MB 10000 86% 0.9 106 4.1 109 7.7e-6 44 230 7.8e-5 30000 92.5% 4.3 515 25 557 1.1e-3 316 1168 1.1e-1 n = 512K, accuracy inside each block 10−8 , matrix setup 261 sec., compression rate 99.98% (0.4 GB against 2006 GB). H-LU is done in 843 sec., required 5.8 GB RAM, inversion LU error 2 · 10−3 . (1st) −L vs. ν; (2nd) with nuggets {0.01, 0.005, 0.001} for Gaussian covariance, n = 2000, k = 14, σ2 = 1; (3rd) Zoom of 2nd figure; (4th) box-plots for ν vs number of locations n. REFERENCES AND ACKNOWLEDGEMENTS [1] B. N. KHOROMSKIJ, A. LITVINENKO, H. G. MATTHIES, Application of hierarchical matrices for computing the Karhunen-Loéve expan-sion, Computing, Vol. 84, Issue 1-2, pp 49-67, 2008. [2] Y. SUN, M. STEIN, Statistically and computationally efficient estimating equations for large spatial datasets, JCGS, 2016, [3] A. LITVINENKO, M. GENTON, Y. SUN, D. KEYES, ??matrix techniques for approximating large covariance matrices and estimating its parameters, PAMM 16 (1), 731-732, 2016 [4] W. NOWAK, A. LITVINENKO, Kriging and spatial design accelerated by orders of magnitude: combining low-rank covariance approximations with FFT-techniques, J. Mathematical Geosciences, Vol. 45, N4, pp 411-435, 2013. Work supported by SRI-UQ and ECRC, KAUST. Thanks to Ronald Kriemann for HLIBPro .