ICISA 2010 Conference Presentation

ubi-logo
Introduction Stable Fluids NVIDIA Compute Unified Device Architecture (CUDA) Results Conclusions
CUDA-based Linear Solvers for Stable
Fluids
G. Amador and A. Gomes
Departamento de Informática
Universidade da Beira Interior
Covilhã, Portugal
m1420@ubi.pt, agomes@di.ubi.pt
April, 2010

ubi-logo
1 Introduction
2 Stable Fluids
The Eulerian approach
Physics Model
3 NVIDIA Compute Uniﬁed Device Architecture (CUDA)
Workﬂow
Iterative solvers
Jacobi
Gauss-Seidel red-black
Conjugate gradient
4 Results
Jacobi performance
Gauss-Seidel performance
Conjugate gradient performance
5 Conclusions
Conclusions
Future Work

ubi-logo
Overview
The study of ﬂuid simulation (e.g., water) is important
for two industries:
(real-time ≥ 30 fps) (off-line ≤ 30 fps)

ubi-logo
Overview
The study of fluid simulation (e.g., water) is important
for two industries:
(real-time ≥ 30 fps) (off-line ≤ 30 fps)
Problems:
How to implement (specifically for 3D stable fluids) the
CUDA-based versions of the Jacobi, Gauss-Seidel,
and conjugate gradient iterative solvers?
What are the real-time performance limitations of
these solvers implementations?

ubi-logo
Space partitioning:
Variations of velocity and density are observed at the
center of each cell.
Velocities and densities are updated through an im-
plicit method (Stam stable ﬂuids, 1999), i.e., uncondi-
tionally stable for any time step.

ubi-logo
Physics Model
Navier-Stokes equations for incompressible fluids
Mass conservation: −→
u = 0
Velocity evolution:
∂
−→
u
∂t
= −
−→
u ·
−→
u + v 2−→
u +
−→
f
Density evolution:
∂ρ
∂t
= −
−→
u · ρ + k 2
ρ + S
−→
u : velocity field.
v: fluids viscosity.
ρ: density of the field.
k: density diffusion rate.
−→
f : external forces added to the velocity field.
S: external sources added to the density field.
=
∂
∂x
,
∂
∂y
,
∂
∂z
: gradient.

ubi-logo
Physics Model
Navier-Stokes equations implementation
Update velocity:
Add external forces (
−→
f ).
Velocity Diffusion (v 2−→
u ).
Move (−
−→
u .
−→
u e
−→
u = 0).
Update density:
Add external sources (S).
Density advection (−
−→
u . ρ).
Density diffusion (k 2
ρ).

ubi-logo
Physics Model
Diffusion
Exchanges of density
or velocity between
neighbours (2D).
Solve a sparse linear system (Ax = b), using an iter-
ative method (e.g., Jacobi, Gauss-Seidel, conjugate
gradient, etc.).

ubi-logo
Physics Model
Move
Ensure mass conservation and the fluid’s incom-
pressibility.
Hodge decomposition:
Conservative field = our field - gradient
Determine the gradient using diffusion’s iterative
method (e.g., Jacobi, Gauss-Seidel, conjugate gradi-
ent, etc.).

ubi-logo
Workﬂow
Workﬂow

ubi-logo
Iterative solvers
Jacobi

ubi-logo
Iterative solvers
Gauss-Seidel red-black

ubi-logo
Iterative solvers
Conjugate gradient

ubi-logo
Jacobi performance
Jacobi performance

ubi-logo

ubi-logo
Conclusions
Conclusions
The CUDA-based implementation of the Gauss-
Seidel solver allows more iterations than the CPU-
based implementation, however it converges two
times slower.
The CUDA-based implementations of the Jacobi and
Gauss-Seidel iterative solvers achieved better perfor-
mances (i.e. faster in processing time) than the CPU-
based implementations.
The CUDA-based implementation of the conjugate
gradient, for grid sizes superior to 643, due to global
memory latency, performs worst than the CPU-based
version.

ubi-logo
Future Work
Future Work
Search ways, implementable using CUDA, to reduce
global memory accesses (e.g., data structures, dy-
namic memory, etc.).
Implement the CPU-based multi-core versions of
the solvers and compare their performance with the
CUDA-based versions.
Search new solvers implementable using CUDA, with
better convergence rate than relaxation techniques
(Jacobi and Gauss-Seidel), with no signiﬁcant extra
computational effort such as the conjugate gradient.

ubi-logo
Future Work
Questions???

ICISA 2010 Conference Presentation

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (7)

Similar to ICISA 2010 Conference Presentation

Similar to ICISA 2010 Conference Presentation (20)

Recently uploaded

Recently uploaded (20)

ICISA 2010 Conference Presentation