(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
Litvinenko, Uncertainty Quantification - an Overview
1. Uncertainty Quantification and related areas -
An overview
Alexander Litvinenko,
talk given at Department of Mathematical Sciences,
Durham University
Bayesian Computational Statistics & Modeling, KAUST
https://bayescomp.kaust.edu.sa/
Stochastic Numerics Group
http://sri-uq.kaust.edu.sa/
Alexander Litvinenko, talk given at Department of Mathematical Sciences, Durham UniversityUncertainty Quantification and related areas - An overview
4. 4*
Motivation to do Uncertainty Quantification (UQ)
Motivation: there is an urgent need to quantify and reduce the
uncertainty in multiscale-multiphysics applications.
Nowadays computational predictions
are used in critical engineering de-
cisions. But, how reliable are these
predictions?
Example: Saudi Aramco currently
has a simulator, TeraPOWERS, which
runs trillion-cell simulation. How
sensitive are these simulations w.r.t.
unknown reservoir properties?
My goal is development of UQ
methods and low-rank algo-
rithms relevant for applications. (pictures are taken from internet)
4
5. 4*
Uncertainty Quantification
Consider
− div(κ(x, ω) u(x, ω)) = f(x, ω) in G × Ω, G ⊂ Rd ,
u = 0 on ∂G,
where κ(x, ω) - uncertain diffusion coefficient.
1. Efficient Analysis of High Dimensional Data in Tensor
Formats, Espig, Hackbusch, Litvinenko., Matthies, Zander,
2012.
2. Efficient low-rank approx. of the stoch. Galerkin ma-
trix in tensor formats, W¨ahnert, Espig, Hackbusch, A.L.,
Matthies, 2013.
3. PCE of random coefficients and the solution of stochas-
tic PDEs in the Tensor Train format, Dolgov, Litvinenko,
Khoromskij, Matthies, 2016.
4. Application of H-matrices for computing the KL expan-
sion, Khoromskij, Litvinenko, Matthies Computing 84 (1-2),
49-67, 2009
0 0.5 1
-20
0
20
40
60
50 realizations of the solution u,
the mean and quantiles
Related work by R. Scheichl, Chr. Schwab, A. Teckentrup, F. Nobile, D. Kressner,...
5
6. 4*
Discretisation techniques
Truncated Karhunen Lo`eve Expansion
(KLE):
κ(x, ω) ≈ κ0(x) +
L
j=1
κjgj(x)ξj(θ(ω)),
where θ = θ(ω) = (θ1(ω), θ2(ω), ..., ),
ξj(θ) = 1
κj G (κ(x, ω) − κ0(x)) gj(x)dx.
1. S. Dolgov, B.N. Khoromskij, A. Litvinenko, H.G. Matthies, Computation of the Response Surface in the
Tensor Train data format, arXiv preprint arXiv:1406.2816, 2014
2. S. Dolgov, B.N. Khoromskij, A. Litvinenko, H.G. Matthies, Polynomial Chaos Expansion of Random Co-
efficients and the Solution of Stochastic Partial Differential Equations in the Tensor Train Format, IAM/ASA
J. Uncertainty Quantification 3 (1), 1109-1135, 2015
6
7. 4*
Generalized Polynomial Chaos Expansions (gPCE)
Decompose ξj(θ) from above:
ξj(θ) ≈
α∈JM,p
ξ
(α)
j Hα(θ),
Hα(θ) could be multivariate Hermite polynomi-
als, α = (α1, ..., αM).
And, combining with KLE above, obtain
κ(x, ω) ≈ κ0(x) +
L
j=1
κjgj(x)
α∈JM,p
ξ
(α)
j Hα(θ)
Need a compact representation of a sum above. → TENSORS!
Can also apply gPCE directly
κ(x, ω) ≈
α∈JM,p
κ(α)
(x)Hα(θ).
7
8. Part II. Low-rank tensor approximations
of big data
Goal: To provide fast and cheap numerical techniques for
working with big and high-dimensional data.
How ?: By reducing linear algebra cost from O(nd ) to
O(drn).
8
9. 4*
Curse of dimensionality
Assume we have nd data. Our aim is to reduce
storage/complexity from O(nd ) to O(dn).
For n = 100 and d = 10, then just to store one needs
8 · 10010 ≈ 8 · 1020 = 8 · 108 TB. If we assume that a modern
computer compares 107 numbers per second, then the total
time for comparison 1020 elements will be 1013 seconds or
≈ 3 ∗ 105 years. In some chemical applications we had n = 100
and d = 800.
how to compute the mean ?
how to compute maxima and minima ?
how to compute level sets, i.e. all elements from an interval
[a, b] ?
how to compute the number of elements in an interval
[a, b] ?
9
10. 4*
Canonical (CP) and Tucker tensor formats
Tensor of order d is a multidimensional array over a d-tuple
index set I = I1 × · · · × Id ,
A = [ai1...id
: i ∈ I ] ∈ RI
, I = {1, ..., n }, = 1, .., d.
Storage: O(nd ) → O(dRn) and O(Rd + dRn).
A. Litvinenko, D. Keyes, V. Khoromskaia, B.N. Khoromskij, H.G. Matthies, Tucker Tensor analysis of Matern
functions in spatial statistics, preprint arXiv:1711.06874, 2017
10
11. 4*
Discretization of elliptic PDE
Ku = f, where
K :=
L
=1
K ⊗
M
µ=1
∆ µ, K ∈ RN×N
, ∆ µ ∈ RRµ×Rµ
,
u :=
r
j=1
uj ⊗
M
µ=1
ujµ, uj ∈ RN
, ujµ ∈ RRµ
,
f :=
R
k=1
fk ⊗
M
µ=1
gkµ, fk ∈ RN
, gkµ ∈ RRµ
.
Efficient low-rank approximation of the stochastic Galerkin matrix in tensor formats, W¨ahnert, Espig,
Hackbusch, Litvinenko, Matthies, 2013.
We analyzed tensor ranks (compression properties) of the
stochastic Galerkin operator K.
11
12. 4*
Numerical Experiments
2D L-shape domain, N = 557 dofs.
Total stochastic dimension is Mu =
Mk + Mf = 20, there are |JM,p| = 231
PCE coefficients
u =
231
j=1
uj,0 ⊗
20
µ=1
ujµ ∈ R557
⊗
20
µ=1
R3
.
Tensor u has 320 · 557 ≈ 2 · 1012 entries
≈ 16 TB of memory.
Instead we store only
231 · (557 + 20 · 3) ≈ 144000 entries
≈ 1.14 MB.
12
13. 4*
How to compute the variance in CP format
Let u ∈ Rr and
˜u := u − u
d
µ=1
1
nµ
1 =
r+1
j=1
d
µ=1
˜ujµ ∈ Rr+1, (1)
then the variance var(u) of u can be computed as follows
var(u) =
˜u, ˜u
d
µ=1 nµ
=
1
d
µ=1 nµ
r+1
i=1
d
µ=1
˜uiµ
,
r+1
j=1
d
ν=1
˜ujν
=
r+1
i=1
r+1
j=1
d
µ=1
1
nµ
˜uiµ, ˜ujµ .
Numerical cost is O (r + 1)2 · d
µ=1 nµ .
14. 4*
Level sets
Now we compute level sets
{ui : ui > b · max
i
u},
i := (i1, ..., iM+1)
for b ∈ {0.2, 0.4, 0.6, 0.8}.
The computing time for each b was 10 minutes.
Key words: u is tensor, Newton methods, inverse u−1, sign(u),
frequency, characteristic function.
13
15. 4*
Computation of level sets and frequency
1. To compute level sets and frequencies we need
characteristic function.
2. To compute characteristic function we need sign(u) function.
3. To compute sign(u) function we need u−1.
Proposition
Let I ⊂ R, u ∈ T , and χI(u) its characteristic. We have
LI(u) = χI(u) u
and rank(LI(u)) ≤ rank(χI(u)) rank(u).
The frequency FI(u) ∈ N of u respect to I is
FI(u) = χI(u), 1 ,
where 1 = d
µ=1
˜1µ, ˜1µ := (1, . . . , 1)T ∈ Rnµ .
17. 4*
Developing of cheap Bayesian update surrogate
1. H.G. Matthies, E. Zander, B.V. Rosic, A. Litvinenko, Parameter estimation via conditional expectation:
a Bayesian inversion, Advanced modeling and simulation in engineering sciences 3 (1), 24, 2016
2. Inverse Problems in a Bayesian Setting, H.G. Matthies, E. Zander, O. Pajonk, B. Rosic, A. Litvinenko.
Computational Methods for Solids and Fluids Multiscale Analysis, ISSN: 1871-3033, 2016
Related work by A. Stuart, Chr. Schwab, A. El Sheikh, Y. Marzouk, H. Najm, O. Ernst
10 0 10
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
x
20 0 20
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
y
0 10 20
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
z
xf
xa
yf
ya
zf
za
18. 4*
Numerical computation of Bayesian Update surrogate
Notation: ˆy – measurements from engineers, y(ξ) – forecast
from the simulator, ε(ω) – the Gaussian noise.
Look for ϕ such that q(ξ) = ϕ(z(ξ)), z(ξ) = y(ξ) + ε(ω):
ϕ ≈ ˜ϕ =
α∈Jp
ϕαΦα(z(ξ))
and minimize q(ξ) − ˜ϕ(z(ξ)) 2
L2
, where Φα are known
polynomials (e.g. Hermite).
Taking derivatives with respect to ϕα:
∂
∂ϕα
q(ξ) − ˜ϕ(z(ξ)), q(ξ) − ˜ϕ(z(ξ)) = 0 ∀α ∈ Jp
16
19. 4*
Numerical computation of NLBU
∂
∂ϕα
E
q2
(ξ) − 2
β∈J
qϕβΦβ(z) +
β,γ∈J
ϕβϕγΦβ(z)Φγ(z)
= 2E
−qΦα(z) +
β∈J
ϕβΦβ(z)Φα(z)
= 2
β∈J
E [Φβ(z)Φα(z)] ϕβ − E [qΦα(z)]
= 0 ∀α ∈ J .
17
20. 4*
Numerical computation of NLBU
Now, rewriting the last sum in a matrix form, obtain the linear
system of equations (=: A) to compute coefficients ϕβ:
... ... ...
... E [Φα(z(ξ))Φβ(z(ξ))]
...
... ... ...
...
ϕβ
...
=
...
E [q(ξ)Φα(z(ξ))]
...
,
where α, β ∈ J , A is of size |J | × |J |.
A. Litvinenko, H.G. Matthies, Inverse problems and uncertainty quantification, arXiv preprint arXiv:1312.5048, 2013
18
21. 4*
Numerical computation of NLBU
Finally, the assimilated parameter qa will be
qa = qf + ˜ϕ(ˆy) − ˜ϕ(z), (2)
z(ξ) = y(ξ) + ε(ω),
˜ϕ = β∈Jp
ϕβΦβ(z(ξ))
0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
0
2
4
6
κ
PDF
κ
f
κ
a
19
22. 4*
Example: 1D elliptic PDE with uncertain coeffs
− · (κ(x, ξ) u(x, ξ)) = f(x, ξ), x ∈ [0, 1]
+ Dirichlet random b.c. g(0, ξ) and g(1, ξ).
3 measurements: u(0.3) = 22, s.d. 0.2, x(0.5) = 28, s.d. 0.3,
x(0.8) = 18, s.d. 0.3.
κ(x, ξ): N = 100 dofs, M = 5, number of KLE terms 35, beta distribution for κ, Gaussian covκ, cov.
length 0.1, multi-variate Hermite polynomial of order pκ = 2;
RHS f(x, ξ): Mf = 5, number of KLE terms 40, beta distribution for κ, exponential covf , cov. length 0.03,
multi-variate Hermite polynomial of order pf = 2;
b.c. g(x, ξ): Mg = 2, number of KLE terms 2, normal distribution for g, Gaussian covg , cov. length 10,
multi-variate Hermite polynomial of order pg = 1;
pφ = 3 and pu = 3
20
23. 4*
Example: Updating of the parameter
Figure: Prior and posterior (updated) parameter κ.
Collaboration with Y. Marzouk, MIT, and TU Braunschweig.
Together with H. Najm, Sandia Lab, we try to compare our
technique with his advanced MCMC technique for chemical
combustion eqn.
21
24. 4*
Example: updating of the solution u
0 0.5 1
-20
0
20
40
60
0 0.5 1
-20
0
20
40
60
0 0.5 1
-20
0
20
40
60
0 0.5 1
-20
0
20
40
60
0 0.5 1
-20
0
20
40
60
Figure: Original and updated solutions, mean value plus/minus 1,2,3
standard deviations. Number of available measurements {0, 1, 2, 3, 5}
[graphics are built in the stochastic Galerkin library sglib, written by E. Zander in TU Braunschweig]
22
25. 4*
Minimization of Uncert. in Num. Aerodynamics
Involved: 7 German Universities,
DLR, Airbus
Duration: 2007-2012.
1. Quantification of airfoil geometry-induced aerodynamic
uncertainties-comparison of approaches, Liu, Litvinenko,
Schillings, Schulz, JUQ 2017
2. Numerical Methods for Uncertainty Quantification and
Bayesian Update in Aerodynamics Litvinenko, Matthies,
chapter in Springer book, Vol 122, pp 262-285, 2013.
23
26. 4*
Example: uncertainties in free stream turbulence
α
v
v
u
u’
α’
v1
2
Random vectors v1(θ) and v2(θ) model free stream turbulence
24
27. 4*
Example: 3sigma intervals
Figure: 3σ interval, σ standard deviation, in each point of RAE2822
airfoil for the pressure (cp) and friction (cf) coefficients.
25
28. 4*
Mean and variance of density, tke, xv, zv, pressure
mean density, variance of density, mean turbul. kinetic energy,
x-velocity, z-velocity, a pressure (failed to compute)
26
29. Part IV. INLA and advance numerics for
spatio-temporal statistics
27
33. 4*
Improving of the moisture model
Task: to improve statistical model, which predicts moisture
Given: 2D-Grid with n ≈ 2.5Mi locations
−120 −110 −100 −90 −80 −70
253035404550
Soil moisture
longitude
latitude
0.15
0.20
0.25
0.30
0.35
0.40
0.45
0.50
High-resolution daily soil moisture data at the top layer of the
Mississippi basin, U.S.A., 01.01.2014
Important for agriculture, defense, ...
1. A. Litvinenko, HLIBCov: Parallel Hierarchical Matrix Approximation of Large Covariance Matrices and Likelihoods
with Applications in Parameter Identification, preprint arXiv:1709.08625, 2017
2. A. Litvinenko, Y. Sun, M.G. Genton, D. Keyes, Likelihood Approximation With Hierarchical Matrices For Large
Spatial Datasets, preprint arXiv:1709.04419, 2017
31
34. 4*
Improving of the moisture model
Goal: To improve estimation of un-
known statistical parameters in a spa-
tial soil moisture field.
Log-likelihood function with Mat´ern C(θ) and available data Z:
L(θ) = −
n
2
log(2π) −
1
2
log |C(θ)| −
1
2
Z C(θ)−1
Z.
To identify: unknown parameters θ := (σ2, ν, ).
Collaboration with statisticians: M. Genton, Y. Sun, R. Huser, from KAUST.
n = 512K, matrix setup 261 sec., compression rate 99.98% (0.4 GB against 2006 GB). H-LU is done in
843 sec., error 2 · 10−3
.
32
35. 4*
Parameter identification
64 32 16 8 4 2
n, samples in thousands
0.07
0.08
0.09
0.1
0.11
0.12
ℓ
64 32 16 8 4 2
n, samples in thousands
0.3
0.35
0.4
0.45
0.5
0.55
0.6
0.65
0.7
νSynthetic data with known parameters
( ∗, ν∗, σ2∗
) = (0.5, 1, 0.5). Boxplots for and ν for
n = 1, 000 × {64, 32, ..., 4, 2}; 100 replicates
33
37. 4*
Parallel algorithms with distributed data
Consider density driven groundwater flow with uncertain
porosity and permeability
· (ρq) = 0, (3)
∂t (φc) + · (cq − D c) = 0. (4)
0
200
400
600
0
50
100
150
0.09
0.095
0.1
0.105
0.11
3
150
4
600
×10 -5
100
5
400
6
50 200
0 0
Mean and variance of porosity
The mean (left) and the variances (right) of the porosity,
[0, 600] × [0, 150], φ(x) ≈ 0.1,
var(φ(x)) ∈ (3.2 · 10−5, 5.2 · 10−5).
We compute ≈ 800 scenarios in parallel, each scenario on 32
cores in parallel. Total: 800 × 32 = 25600 cores.
35
38. 4*
Mean and variance of concentration
The mean and variance of the concentration, computed via QMC.
The number of time steps is 1500; E(c) ∈ (0, 1), var(c) ∈ (0, 0.068).
Two different realizations of concentration: with 5 fingers (left) and 4
fingers (right).
36
39. 4*
Conclusion
Introduced:
1. UQ and many examples,
2. Big data require a compact representation. We suggest
low-rank tensor data techniques.
3. Tensor techniques requires to redefine standard statistical
algorithms
4. Bayesian update surrogate ϕ (as a linear, quadratic,...
approximation)
5. Applications in aerodynamics, geostatistics, reservoirs,
environmental statistics
6. HPC applications: parallel algorithms for UQ on a
supercomputer
37
40. 4*
Literature
1. A. Litvinenko, Application of Hierarchical matrices for solving multiscale
problems, PhD Thesis, Leipzig University, Germany,
https://www.wire.tu-bs.de/mitarbeiter/litvinen/diss.pdf, 2006
2. B.N. Khoromskij, A. Litvinenko, Domain decomposition based H-matrix
preconditioners for the skin problem, Domain Decomposition Methods
in Science and Engineering XVII, pp 175-182, 2006
3. A. Litvinenko, Documentation for the Hierarchical Domain
Decomposition (HDD) method, Technical report, Vol 5, pp 1-33,
Max-Planck Institute for applied mathematics in the science in Leipzig,
http://www.mis.mpg.de/publications/other-series/tr/report-0506.html,
2006
4. A. Litvinenko, Partial inversion of elliptic operator to speed up
computation of likelihood in Bayesian inference, arXiv preprint
1708.02207, https://arxiv.org/abs/1708.02207, 2017
5. W. Nowak, A. Litvinenko, Kriging and spatial design accelerated by
orders of magnitude: Combining low-rank covariance approximations
with FFT-techniques, Mathematical Geosciences 45 (4), 411-435, 2013
38