Lifting Scheme Cores for Wavelet Transform
David Barina
(supervised by Pavel Zemcik)
1 / 24
DWT in image processing
can be found in many image-processing tasks
analysis
(edge detection, feature extraction, multiscale representation),
compression (JPEG 2000, Dirac),
watermarking, edge sharpening, contrast enhancement,
tone mapping, denoising, fusion, etc.
2 / 24
Filter bank
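S. Mallat, "A theory for multiresolution signal decomposition: The wavelet representation" (1989)
[Figure: two-channel filter bank. Analysis: the filters $\tilde{H}(z^{-1})$ and $\tilde{G}(z^{-1})$, each followed by ↓2, yield a and d. Synthesis: a and d pass through ↑2 and the filters $H(z)$ and $G(z)$, and the two branches are summed.]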
decomposition: two complementary filters,
high number of operations
3 / 24
Lifting scheme
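I. Daubechies, W. Sweldens, "Factoring wavelet transforms into lifting steps" (1998)
[Figure: split → $\tilde{P}(z^{-1})^T$ → a, d → $P(z)$ → merge]
$$P(z) = \prod_{i=0}^{I-1} \begin{bmatrix} 1 & S_i(z) \\ 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 \\ T_i(z) & 1 \end{bmatrix} \begin{bmatrix} K & 0 \\ 0 & 1/K \end{bmatrix}$$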
decomposition: sequence of simple filtering steps,
reduces the number of operations, split: even, odd
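The split, lifting-step, and merge structure can be illustrated with a minimal sketch. It assumes the simplest possible pair of steps (a Haar-like predict and update, chosen here only for brevity, not taken from the slides) and an even-length input; the function names are hypothetical.

```python
def split(signal):
    """Split a signal into its even-indexed and odd-indexed samples."""
    return signal[0::2], signal[1::2]


def merge(even, odd):
    """Interleave even and odd samples back into one signal."""
    out = [0.0] * (len(even) + len(odd))
    out[0::2] = even
    out[1::2] = odd
    return out


def forward(signal):
    """One level of a Haar-like lifting transform (even-length input assumed)."""
    even, odd = split(signal)
    # Predict step: the detail is the odd sample minus its prediction (the even sample).
    d = [o - e for e, o in zip(even, odd)]
    # Update step: correct the even sample so that it becomes the pair average.
    a = [e + 0.5 * t for e, t in zip(even, d)]
    return a, d


def inverse(a, d):
    """Undo the lifting steps in reverse order with flipped signs."""
    even = [s - 0.5 * t for s, t in zip(a, d)]
    odd = [t + e for e, t in zip(even, d)]
    return merge(even, odd)
```

Because every step only adds a function of one channel to the other, the inverse is obtained by running the same steps backwards with subtraction, which is why lifting gives perfect reconstruction by construction.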
4 / 24
CDF 9/7 wavelet
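I. Daubechies, W. Sweldens, "Factoring wavelet transforms into lifting steps" (1998)
[Figure: lifting data flow from input to output; even and odd samples pass through four steps α, β, γ, δ]
$$\tilde{P}(z) = \begin{bmatrix} 1 & \alpha\,(1 + z^{-1}) \\ 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 \\ \beta\,(1 + z) & 1 \end{bmatrix} \begin{bmatrix} 1 & \gamma\,(1 + z^{-1}) \\ 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 \\ \delta\,(1 + z) & 1 \end{bmatrix} \begin{bmatrix} \zeta & 0 \\ 0 & 1/\zeta \end{bmatrix}$$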
four two-tap symmetric filters
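A minimal sketch of how the four two-tap steps and the final scaling might look in code. The coefficient values are those published by Daubechies and Sweldens; the even-length restriction, the whole-sample symmetric border extension, and the function names are assumptions made for this illustration.

```python
# Lifting coefficients of the CDF 9/7 wavelet (Daubechies and Sweldens, 1998).
ALPHA = -1.586134342
BETA = -0.05298011854
GAMMA = 0.8829110762
DELTA = 0.4435068522
ZETA = 1.149604398


def cdf97_forward(x):
    """One level of the 1-D CDF 9/7 transform of an even-length list.

    Borders use whole-sample symmetric extension (an assumption of this sketch).
    Returns (a, d): approximation and detail coefficients.
    """
    s, d = list(x[0::2]), list(x[1::2])
    n = len(s)
    for l in range(n):  # step alpha: odd samples from neighbouring evens
        d[l] += ALPHA * (s[l] + s[min(l + 1, n - 1)])
    for l in range(n):  # step beta: even samples from neighbouring odds
        s[l] += BETA * (d[max(l - 1, 0)] + d[l])
    for l in range(n):  # step gamma
        d[l] += GAMMA * (s[l] + s[min(l + 1, n - 1)])
    for l in range(n):  # step delta
        s[l] += DELTA * (d[max(l - 1, 0)] + d[l])
    return [v * ZETA for v in s], [v / ZETA for v in d]


def cdf97_inverse(a, d):
    """Invert cdf97_forward by undoing the scaling and the steps in reverse order."""
    s = [v / ZETA for v in a]
    d = [v * ZETA for v in d]
    n = len(s)
    for l in range(n):
        s[l] -= DELTA * (d[max(l - 1, 0)] + d[l])
    for l in range(n):
        d[l] -= GAMMA * (s[l] + s[min(l + 1, n - 1)])
    for l in range(n):
        s[l] -= BETA * (d[max(l - 1, 0)] + d[l])
    for l in range(n):
        d[l] -= ALPHA * (s[l] + s[min(l + 1, n - 1)])
    out = [0.0] * (2 * n)
    out[0::2] = s
    out[1::2] = d
    return out
```

Note that this version makes four separate passes over the data, which is exactly the repeated visiting of samples that the later slides set out to avoid.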
5 / 24
2-D decomposition
S. Mallat, "A theory for multiresolution signal decomposition: The wavelet representation" (1989)
[Figure: a horizontal pass followed by a vertical pass yields four subbands, laid out as a, h (top row) and v, d (bottom row)]
image: 2-D signal, by a series of 1-D transforms, four subbands,
multi-scale decomposition
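The row-then-column idea can be sketched as follows. This is an illustrative, unoptimized version: it assumes even image dimensions, uses a Haar-like lifting step as a stand-in for the 1-D transform, and follows the quadrant layout of the slide (a and h on top, v and d below); all function names are hypothetical.

```python
def lift_1d(x):
    """Stand-in 1-D transform (Haar-like lifting): returns low half then high half."""
    even, odd = x[0::2], x[1::2]
    d = [o - e for e, o in zip(even, odd)]
    a = [e + 0.5 * t for e, t in zip(even, d)]
    return a + d


def transpose(m):
    """Transpose a list-of-lists matrix."""
    return [list(col) for col in zip(*m)]


def dwt2(image):
    """One level of a separable 2-D transform: horizontal pass, then vertical pass."""
    rows = [lift_1d(r) for r in image]            # horizontal filtering
    cols = [lift_1d(c) for c in transpose(rows)]  # vertical filtering
    out = transpose(cols)
    h2, w2 = len(out) // 2, len(out[0]) // 2
    a = [r[:w2] for r in out[:h2]]  # top-left quadrant
    h = [r[w2:] for r in out[:h2]]  # top-right quadrant
    v = [r[:w2] for r in out[h2:]]  # bottom-left quadrant
    d = [r[w2:] for r in out[h2:]]  # bottom-right quadrant
    return a, h, v, d
```

A multi-scale decomposition would then call `dwt2` again on the returned `a` subband.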
6 / 24
Lenna
how to calculate this as efficiently as possible
7 / 24
Strategies and issues
R. Kutil, "A single-loop approach to SIMD parallelization of 2-D wavelet lifting" (2006)
[Figure: horizontal and vertical passes producing the subbands a, h, v, d]
strategies: row-column, block-based, and line-based
cache issues: cache line, limited size, set associativity, prefetching
techniques: padding, aggregation, memory layouts, interleaved loops, parallelization
the approaches have to repeatedly visit samples,
memory access is expensive ⇒ CPU cache, limitations,
existing techniques, single-loop approach
8 / 24
Unsolved issues
[Figure: 2 × 2 processing scheme divided into prolog, core, and epilog phases in both directions]
complicated border treatment (prolog/epilog phases)
suspend/resume processing
arbitrary processing order (scan order)
interleave the transform and a subsequent processing
multi-scale decomposition
reorganization of underlying scheme
9 / 24
Objectives of the thesis
Aims: improve image transform performance and resource consumption
Objectives: eliminate the shortcomings of existing methods (see the previous slide)
Evaluation: prove experimentally (performance, memory requirements)
10 / 24
Lifting core
D. Barina, P. Zemcik, "Vectorization and parallelization of 2-D wavelet lifting" (in press)
solution: a processing unit
continuously consumes an input and produces an output
which visits every image sample only once (cache friendly)
which is aware of image coordinates (can handle the borders)
whose configuration (state) can be saved/restored
which can be run in any direction
which can be SIMD vectorized
which can run in parallel (on independent parts of the image)
$$y = C\,x, \qquad x \overset{\mathrm{def}}{=} I_n \; B, \qquad y \overset{\mathrm{def}}{=} O_n \; B$$
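The properties listed on this slide can be made concrete with a small streaming sketch. It is not the author's implementation: for brevity it assumes a two-step lifting scheme with CDF 5/3 weights instead of the four-step CDF 9/7, symmetric border extension, and hypothetical names (`LiftingCore`, `step`, `flush`, `stream`, `batch_reference`). Each call consumes one (even, odd) pair, emits one (a, d) pair delayed by one step, and keeps all of its state in a small tuple that can be saved and restored.

```python
class LiftingCore:
    """Streaming core for a two-step lifting scheme (CDF 5/3 weights assumed)."""

    P = -0.5   # predict weight (assumption: CDF 5/3)
    U = 0.25   # update weight (assumption: CDF 5/3)

    def __init__(self, state=None):
        # state = (previous even sample, previous odd sample, previous detail)
        self.state = state

    def _emit(self, next_even):
        prev_even, prev_odd, prev_d = self.state
        d = prev_odd + self.P * (prev_even + next_even)
        if prev_d is None:           # left border: mirror, so d[-1] = d[0]
            prev_d = d
        a = prev_even + self.U * (prev_d + d)
        return a, d

    def step(self, even, odd):
        """Consume one input pair; return one output pair, or None on the first call."""
        if self.state is None:       # first pair only fills the buffer
            self.state = (even, odd, None)
            return None
        a, d = self._emit(even)
        self.state = (even, odd, d)
        return a, d

    def flush(self):
        """Right border: mirror the missing even sample and emit the last pair."""
        return self._emit(self.state[0])


def stream(core, pairs):
    """Feed pairs through the core and collect the outputs it produces."""
    outputs = [core.step(e, o) for e, o in pairs]
    return [p for p in outputs if p is not None]


def batch_reference(x):
    """Conventional two-pass version of the same transform, for comparison."""
    s, d = list(x[0::2]), list(x[1::2])
    n = len(s)
    for l in range(n):
        d[l] += -0.5 * (s[l] + s[min(l + 1, n - 1)])
    for l in range(n):
        s[l] += 0.25 * (d[max(l - 1, 0)] + d[l])
    return list(zip(s, d))
```

Because the state is an immutable tuple, suspending amounts to copying `core.state` and resuming to constructing `LiftingCore(state=saved)`; every input sample is read exactly once, in contrast to the two-pass reference.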
11 / 24
Core examples
D. Barina, P. Zemcik, "Vectorization and parallelization of 2-D wavelet lifting" (in press)
[Figure: core examples; lifting steps α, β, γ, δ with the core, its inputs, and its outputs marked]
12 / 24
Processing orders
D. Barina, P. Zemcik, "Vectorization and parallelization of 2-D wavelet lifting" (in press)
[Figure: six processing orders: horizontal, horizontal strips, horizontal blocks, vertical, vertical strips, vertical blocks]
13 / 24
Borders treatment
D. Barina, P. Zemcik, "Vectorization and parallelization of 2-D wavelet lifting" (in press)
[Figure: rows of alternating d and a coefficients; cores at positions n along the signal, with the borders at 0 and N marked]
$$y = C_n\,x$$
the cores treat the boundaries gracefully
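The slide does not name the border extension, so the following is only an assumption: a common choice, and the one used by JPEG 2000, is whole-sample symmetric extension. A core that knows its coordinate n could use an index mapping like this hypothetical `mirror` helper to fetch neighbours that fall outside the signal.

```python
def mirror(i, n):
    """Map any integer index i onto 0..n-1 by whole-sample symmetric extension.

    The signal is reflected about its first and last samples without
    repeating them, so the extended signal has period 2 * (n - 1).
    """
    if n == 1:
        return 0
    period = 2 * (n - 1)
    i = abs(i) % period          # the extension is even-symmetric about 0
    return period - i if i >= n else i
```

For a signal of five samples, index -1 maps to 1 and index 5 maps to 3, so a position-dependent core near the border simply reads an already-seen sample instead of requiring a separate prolog or epilog loop.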
14 / 24
Parallel cores and reorganization
M. Kula, D. Barina, et al., "Block-based Approach to 2-D Wavelet Transform on GPUs" (2016)
[Figure: schemes compared by their steps: Sweldens1995 (steps 1 to 4), Iwahashi2007 (steps 1 to 3), proposed (steps 1 to 2)]
15 / 24
3-D core
D. Barina, P. Zemcik, "Real-Time 3-D Wavelet Lifting" (2015)
[Figure: 3-D core with axes x, y, z and buffers x, y, z on its sides]
extended into more dimensions, buffers on the sides
16 / 24
CPU implementation
D. Barina, P. Zemcik, "Vectorization and parallelization of 2-D wavelet lifting" (in press)
[Plot: time per pixel (0 to 50 ns) versus number of pixels (1 k to 100 M); series: separable approach, core approach]
an evaluation of the approaches;
the separable, single-loop, and core approaches were implemented
17 / 24
3-D CPU implementation
D. Barina, P. Zemcik, "Real-Time 3-D Wavelet Lifting" (2015)
[Figure: 3-D core with axes x, y, z and buffers x, y, z]
[Plot: time per voxel (0 to 160 ns) versus number of voxels (0 to 250 M); series: naive horizontal, naive vertical, core 4², core 2³, core 4³]
performance of 3-D transform: separable, 2-D core, 3-D core
18 / 24
GPU implementation
M. Kula, D. Barina, et al., "Block-based Approach to 2-D Wavelet Transform on GPUs" (2016)
[Plot: throughput (GB/s, 80 to 260) versus number of pixels (0 to 70 M); series: Kucis2014, Separable Block, Non-Separable Block]
[Plot: throughput (GB/s, 0 to 60) versus image size (100 kpel to 100 Mpel); series by scheme: Sweldens, Iwahashi*, Explosive*, Monolithic*, Polyphase*]
left: SotA is in red, block methods in blue/green, reorganization
right: block methods, separable in black, ours in blue/green
19 / 24
FPGA implementation
D. Barina, et al., "Single-Loop Approach to 2-D Wavelet Lifting with JPEG 2000 Compatibility" (2015)
[Figure: block diagram with input, horizontal (H) and vertical (V) transform stages, and BRAM]

core        FF               LUT             BRAM
latency 4   441 (0.1 %)      399 (0.18 %)    6 (1.1 %)
latency 2   391 (< 0.1 %)    592 (0.27 %)    6 (1.1 %)

architecture    device                   BRAM [bits]   clocks/pel   time [ms]
Dillen2003      VirtexE1000-8            50K           0.50         1.20
Descampe2004    Virtex-II XC2V6000       N/A           0.60         1.75
Seo2007         Altera Stratix           128K          2.64         6.02
Zhang2012       Virtex-II Pro XC2VP30    6 × 18K       0.50         0.97
the cores       Zynq XC7Z045             1 × 36K       0.26         0.27
20 / 24
JPEG 2000 implementation
D. Barina, O. Klima, P. Zemcik, "Single-Loop Architecture for JPEG 2000" (2016)
[Figure: core and code-block layout (labels: 2 × 2cn, 2 × 2cm); a_j is decomposed into a_{j+1}, h, v, d]
[Plot: time (0 to 140 ns) versus resolution (100 k to 1 G pel); series: proposed, OpenJPEG, JasPer, FFmpeg]
21 / 24
Contributions of the thesis
Aims: improved image transform performance and resource consumption
Objectives: eliminated the shortcomings of existing methods
Evaluation: assessed experimentally (performance, memory requirements)
evaluation performed:
2-D on CPU, 3-D on CPU, 2-D on GPU, 2-D on FPGA,
JPEG 2000 on CPU
22 / 24
Selected papers
Barina, D.; Klima, O.; Zemcik, P.: Single-Loop Software Architecture for JPEG 2000. In
Data Compression Conference (DCC), 2016
Barina, D.; Musil, M.; Musil, P.; et al.: Single-Loop Approach to 2-D Wavelet Lifting with
JPEG 2000 Compatibility. In Workshop on Applications for MultiCore Architectures
(WAMCA), 2015
Barina, D.; Zemcik, P.: Minimum Memory Vectorisation of Wavelet Lifting. In Advanced
Concepts for Intelligent Vision Systems (ACIVS), 2013
Barina, D.; Zemcik, P.: Wavelet Lifting on Application Specific Vector Processor. In
GraphiCon, 2013
Barina, D.; Zemcik, P.: Diagonal Vectorisation of 2-D Wavelet Lifting. In IEEE International
Conference on Image Processing (ICIP), 2014
Barina, D.; Zemcik, P.: Real-Time 3-D Wavelet Lifting. In International Conference in
Central Europe on Computer Graphics, Visualization and Computer Vision (WSCG), 2015
Barina, D.; Zemcik, P.: Vectorization and parallelization of 2-D wavelet lifting. Journal of
Real-Time Image Processing (JRTIP), in press
Barina, D.; Klima, O.; Zemcik, P.: Single-Loop Architecture for JPEG 2000. In: Image and
Signal Processing (ICISP), 2016
Kula, M.; Barina, D.; Zemcik, P.: Block-based Approach to 2-D Wavelet Transform on GPUs.
In International Conference on Information Technology – New Generations (ITNG), 2016
Kucis, M.; Barina, D.; Kula, M.; et al.: 2-D Discrete Wavelet Transform Using GPU. In
Workshop on Application for Multi-Core Architectures (WAMCA), 2014
23 / 24
Summary
the core
computing unit which processes the data in a single pass,
can suspend/resume execution,
can process the data in many different orders,
can handle signal boundaries (is aware of coordinates),
can be easily SIMD vectorized and parallelized,
and whose underlying scheme can be reorganized.
24 / 24

Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
seandesed
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 

Recently uploaded (20)

power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 

Lifting Scheme Cores for Wavelet Transform

  • 1. Lifting Scheme Cores for Wavelet Transform. David Barina (supervised by Pavel Zemcik).
  • 2. DWT in image processing. The DWT can be found in many image-processing tasks: analysis (edge detection, feature extraction, multiscale representation), compression (JPEG 2000, Dirac), watermarking, edge sharpening, contrast enhancement, tone mapping, denoising, fusion, etc.
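  • 3. Filter bank. S. Mallat, "A theory for multiresolution signal decomposition: The wavelet representation" (1989). [Diagram: the analysis filters $\tilde{H}(z^{-1})$ and $\tilde{G}(z^{-1})$, each followed by downsampling ($\downarrow 2$), produce the coefficients $a$ and $d$; the synthesis side upsamples ($\uparrow 2$), filters with $H(z)$ and $G(z)$, and sums the two branches.] Decomposition: two complementary filters, high number of operations.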
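  • 4. Lifting scheme. I. Daubechies, W. Sweldens, "Factoring wavelet transforms into lifting steps" (1998). [Diagram: split, then $\tilde{P}(z^{-1})^T$ producing $a$ and $d$, then $P(z)$, then merge.] $P(z) = \prod_{i=0}^{I-1} \begin{bmatrix} 1 & S_i(z) \\ 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 \\ T_i(z) & 1 \end{bmatrix} \begin{bmatrix} K & 0 \\ 0 & 1/K \end{bmatrix}$. Decomposition: a sequence of simple filtering steps, which reduces the number of operations. Split: even and odd samples.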
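
The split, lifting-step, and merge structure on slide 4 can be illustrated with a minimal Python sketch. It uses the LeGall 5/3 wavelet (one predict step with coefficient -1/2, one update step with coefficient 1/4) because it is the shortest non-trivial case. The function names and the periodic border extension are assumptions made for this illustration, not something taken from the slides, and the signal length is assumed to be even.

```python
def split(x):
    """Split a signal into its even and odd samples."""
    return list(x[0::2]), list(x[1::2])


def merge(even, odd):
    """Interleave even and odd samples back into one signal."""
    out = [0.0] * (len(even) + len(odd))
    out[0::2] = even
    out[1::2] = odd
    return out


def predict(even, odd, c):
    """Lift the odd samples: odd[n] += c * (even[n] + even[n+1]), periodic."""
    n = len(even)
    return [odd[i] + c * (even[i] + even[(i + 1) % n]) for i in range(n)]


def update(even, odd, c):
    """Lift the even samples: even[n] += c * (odd[n-1] + odd[n]), periodic."""
    return [even[i] + c * (odd[i - 1] + odd[i]) for i in range(len(even))]


def forward_53(x):
    """Forward LeGall 5/3 transform; returns (approximation, detail)."""
    even, odd = split(x)
    d = predict(even, odd, -0.5)
    a = update(even, d, 0.25)
    return a, d


def inverse_53(a, d):
    """Undo the lifting steps in reverse order with negated coefficients."""
    even = update(a, d, -0.25)
    odd = predict(even, d, 0.5)
    return merge(even, odd)
```

Each lifting step changes only one of the two channels, so the inverse is obtained by running the same steps backwards with the sign of the coefficient flipped. This is the property that makes the factorization on the slide invertible by construction.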
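  • 5. CDF 9/7 wavelet. I. Daubechies, W. Sweldens, "Factoring wavelet transforms into lifting steps" (1998). [Diagram: the lifting steps α, β, γ, δ applied between the input and the output, alternating between even and odd samples.] $\tilde{P}(z) = \begin{bmatrix} 1 & \alpha (1 + z^{-1}) \\ 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 \\ \beta (1 + z) & 1 \end{bmatrix} \begin{bmatrix} 1 & \gamma (1 + z^{-1}) \\ 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 \\ \delta (1 + z) & 1 \end{bmatrix} \begin{bmatrix} \zeta & 0 \\ 0 & 1/\zeta \end{bmatrix}$. Four two-tap symmetric filters.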
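
As a hedged illustration of the four two-tap steps on slide 5, the sketch below applies α, β, γ, δ and the scaling ζ in sequence. The numeric constants are the commonly cited values from the Daubechies and Sweldens factorization. The helper names, the periodic border extension, the even signal length, and the choice of which channel is multiplied by ζ are assumptions of this sketch and may differ from the convention used in the thesis.

```python
# Commonly cited CDF 9/7 lifting constants (Daubechies and Sweldens, 1998).
ALPHA = -1.586134342059924
BETA = -0.052980118572961
GAMMA = 0.882911075530934
DELTA = 0.443506852043971
ZETA = 1.1496043988602418


def lift_odd(s, d, c):
    """d[n] += c * (s[n] + s[n+1]) with periodic extension."""
    n = len(s)
    return [d[i] + c * (s[i] + s[(i + 1) % n]) for i in range(n)]


def lift_even(s, d, c):
    """s[n] += c * (d[n-1] + d[n]) with periodic extension."""
    return [s[i] + c * (d[i - 1] + d[i]) for i in range(len(s))]


def forward_97(x):
    """Forward CDF 9/7 transform of an even-length signal."""
    s, d = list(x[0::2]), list(x[1::2])
    d = lift_odd(s, d, ALPHA)
    s = lift_even(s, d, BETA)
    d = lift_odd(s, d, GAMMA)
    s = lift_even(s, d, DELTA)
    # Scaling convention assumed here: low-pass times zeta, high-pass divided.
    return [v * ZETA for v in s], [v / ZETA for v in d]


def inverse_97(s, d):
    """Inverse transform: undo scaling, then the steps in reverse order."""
    s = [v / ZETA for v in s]
    d = [v * ZETA for v in d]
    s = lift_even(s, d, -DELTA)
    d = lift_odd(s, d, -GAMMA)
    s = lift_even(s, d, -BETA)
    d = lift_odd(s, d, -ALPHA)
    out = [0.0] * (len(s) + len(d))
    out[0::2] = s
    out[1::2] = d
    return out
```

A quick sanity check is that a constant signal produces detail coefficients that are zero up to rounding, and that `inverse_97(*forward_97(x))` reproduces `x` up to floating-point error.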
  • 6. 2-D decomposition. S. Mallat, "A theory for multiresolution signal decomposition: The wavelet representation" (1989). [Diagram: a horizontal pass followed by a vertical pass splits the image into the subbands a, h, v, d; the decomposition is then repeated on the a subband.] The image is a 2-D signal transformed by a series of 1-D transforms into four subbands; repeating the process gives a multi-scale decomposition.
  • 7. Lenna. How to calculate this as efficiently as possible?
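
The separable construction on slide 6 (1-D transforms along rows, then along columns) can be sketched as follows. For brevity the sketch uses an unnormalized Haar lifting step instead of CDF 9/7. The function names, the list-of-lists image representation, and the assumption of even image dimensions are choices made for this illustration only. The subband names follow the slide's layout: a top-left, h top-right, v bottom-left, d bottom-right.

```python
def haar_1d(x):
    """One level of unnormalized Haar lifting on an even-length sequence."""
    even, odd = x[0::2], x[1::2]
    d = [o - e for e, o in zip(even, odd)]      # predict: difference
    a = [e + di / 2 for e, di in zip(even, d)]  # update: pairwise mean
    return a, d


def transpose(m):
    """Swap rows and columns of a list-of-lists matrix."""
    return [list(col) for col in zip(*m)]


def transform_columns(block):
    """Apply haar_1d to every column; return (low, high) blocks."""
    low, high = [], []
    for col in transpose(block):
        a, d = haar_1d(col)
        low.append(a)
        high.append(d)
    return transpose(low), transpose(high)


def dwt2d(image):
    """One level of separable 2-D decomposition into (a, h, v, d)."""
    low_rows, high_rows = [], []
    for row in image:                            # horizontal pass
        a, d = haar_1d(row)
        low_rows.append(a)
        high_rows.append(d)
    a, v = transform_columns(low_rows)           # vertical pass, low half
    h, d = transform_columns(high_rows)          # vertical pass, high half
    return a, h, v, d
```

Note that this row-column strategy visits every sample twice, once per pass, which is the repeated memory access that the later slides set out to avoid. A multi-scale decomposition would call `dwt2d` again on the returned `a` subband.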
  • 8. Strategies and issues. R. Kutil, "A single-loop approach to SIMD parallelization of 2-D wavelet lifting" (2006). [Diagram: horizontal and vertical passes producing the subbands a, h, v, d.] Strategies: row-column, block-based, and line-based. Cache issues: cache line, limited size, set associativity, prefetching. Techniques: padding, aggregation, memory layouts, interleaved loops, parallelization. The approaches have to visit samples repeatedly, and memory access is expensive ⇒ CPU cache and its limitations, existing techniques, single-loop approach.
  • 9. Unsolved issues. [Diagram: a 2 × 2 core preceded by a prolog and followed by an epilog in each direction.] Complicated border treatment (prolog/epilog phases); suspending and resuming the processing; arbitrary processing order (scan order); interleaving the transform with subsequent processing; multi-scale decomposition; reorganization of the underlying scheme.
  • 10. Objectives of the thesis. Aims: improve image transform performance and resource consumption. Objectives: eliminate the shortcomings of existing methods (listed on the previous slide). Evaluation: demonstrate experimentally the performance and memory requirements.
  • 11. Lifting core. D. Barina, P. Zemcik, "Vectorization and parallelization of 2-D wavelet lifting" (in press). Solution: a processing unit that continuously consumes an input and produces an output, and which: visits every image sample only once (cache friendly); is aware of image coordinates (can handle the borders); has a configuration (state) that can be saved and restored; can be run in any direction; can be SIMD vectorized; can run in parallel (on independent parts of the image). Formally, y = C x, where x is defined by In and B, and y by On and B.
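
The properties listed on slide 11 (single pass, coordinate awareness, saveable state) can be illustrated with a much simplified 1-D sketch. It is not the authors' 2-D core: it assumes the LeGall 5/3 wavelet, an even-length signal with at least one sample pair, and symmetric border extension, and the class and method names are hypothetical. The core consumes one (even, odd) pair per call and emits one (a, d) pair with a lag of one pair.

```python
class Core53:
    """Streaming 1-D LeGall 5/3 lifting core (illustrative sketch only)."""

    def __init__(self):
        self.prev_even = None  # x[2n-2], waiting for its right neighbor
        self.prev_odd = None   # x[2n-1]
        self.prev_d = None     # d[n-2], None until the first output

    def _emit(self, next_even):
        # Predict: d[n-1] needs the stored pair and the next even sample.
        d = self.prev_odd - 0.5 * (self.prev_even + next_even)
        # Left border: with symmetric extension d[-1] equals d[0].
        left_d = d if self.prev_d is None else self.prev_d
        # Update: a[n-1] needs d[n-2] and d[n-1].
        a = self.prev_even + 0.25 * (left_d + d)
        self.prev_d = d
        return a, d

    def step(self, even, odd):
        """Consume one pair; return an (a, d) pair, or None while priming."""
        out = None if self.prev_even is None else self._emit(even)
        self.prev_even, self.prev_odd = even, odd
        return out

    def flush(self):
        """Right border: the missing even sample mirrors the last one."""
        return self._emit(self.prev_even)

    def save(self):
        """Return the complete state so that processing can be suspended."""
        return (self.prev_even, self.prev_odd, self.prev_d)

    def restore(self, state):
        """Resume processing from a previously saved state."""
        self.prev_even, self.prev_odd, self.prev_d = state


def run_core(x):
    """Drive the core over a whole signal; returns (a, d) lists."""
    core, out = Core53(), []
    for i in range(0, len(x), 2):
        result = core.step(x[i], x[i + 1])
        if result is not None:
            out.append(result)
    out.append(core.flush())
    return [a for a, _ in out], [d for _, d in out]


def reference_53(x):
    """Two-pass reference with the same symmetric border handling."""
    n = len(x) // 2
    even, odd = x[0::2], x[1::2]
    d = [odd[i] - 0.5 * (even[i] + even[min(i + 1, n - 1)]) for i in range(n)]
    a = [even[i] + 0.25 * (d[max(i - 1, 0)] + d[i]) for i in range(n)]
    return a, d
```

The whole state is three numbers, so suspending and resuming amounts to copying a small tuple, and each input sample is read exactly once. `run_core(x)` and `reference_53(x)` should agree for any even-length input.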
  • 12. Core examples. D. Barina, P. Zemcik, "Vectorization and parallelization of 2-D wavelet lifting" (in press). [Figure: example cores built from the lifting steps α, β, γ, δ, showing the core, its inputs, and its outputs.]
  • 13. Processing orders. D. Barina, P. Zemcik, "Vectorization and parallelization of 2-D wavelet lifting" (in press). [Figure: six processing orders: horizontal, horizontal strips, horizontal blocks, vertical, vertical strips, vertical blocks.]
  • 14. Border treatment. D. Barina, P. Zemcik, "Vectorization and parallelization of 2-D wavelet lifting" (in press). [Figure: rows of alternating d and a coefficients processed by cores at positions n between 0 and N, with modified cores near both borders.] y = Cn x: the cores treat the boundaries gracefully.
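
One way a coordinate-aware core can resolve samples that fall outside the signal is to remap the index before reading. The sketch below assumes whole-sample symmetric extension, which is the convention used by JPEG 2000; slide 14 does not state which extension the thesis uses, so treat this as an assumption. The function name `mirror` is hypothetical.

```python
def mirror(i, n):
    """Map any integer index i into 0..n-1 by whole-sample symmetric
    extension, so that x[-1] reads x[1] and x[n] reads x[n-2]."""
    if n == 1:
        return 0
    period = 2 * (n - 1)   # the mirrored signal repeats with this period
    i = abs(i) % period    # the extension is symmetric around index 0
    return period - i if i >= n else i
```

With such a mapping the same lifting arithmetic can be used at every position n, and only the addresses of the neighbors change near positions 0 and N.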
  • 15. Parallel cores and reorganization. M. Kula, D. Barina, et al., "Block-based Approach to 2-D Wavelet Transform on GPUs" (2016). [Figure: three schemes with numbered steps: Sweldens1995 (steps 1 to 4), Iwahashi2007 (steps 1 to 3), proposed (steps 1 to 2).]
  • 16. 3-D core. D. Barina, P. Zemcik, "Real-Time 3-D Wavelet Lifting" (2015). [Figure: a core in the x, y, z coordinate system with buffers x, y, and z.] The core is extended into more dimensions, with buffers on its sides.
  • 17. CPU implementation. D. Barina, P. Zemcik, "Vectorization and parallelization of 2-D wavelet lifting" (in press). [Plot: time per pixel (0 to 50 ns) against image size (1 k to 100 M pixels) for the separable approach and the core approach.] An evaluation of the approaches; the separable, single-loop, and core approaches were implemented.
  • 18. 3-D CPU implementation. D. Barina, P. Zemcik, "Real-Time 3-D Wavelet Lifting" (2015). [Plot: time per voxel (0 to 160 ns) against volume size (0 to 250 M voxels) for naive horizontal, naive vertical, core 42, core 23, and core 43.] Performance of the 3-D transform: separable, 2-D core, 3-D core.
  • 19. GPU implementation. M. Kula, D. Barina, et al., "Block-based Approach to 2-D Wavelet Transform on GPUs" (2016). [Left plot: throughput (80 to 260 GB/s) against image size (0 to 70 M pixels) for Kucis2014, Separable Block, and Non-Separable Block. Right plot: throughput (0 to 60 GB/s) against image size (100 kpel to 100 Mpel) for Sweldens, Iwahashi*, Explosive*, Monolithic*, and Polyphase*.] Left: the state of the art is in red, the block methods in blue/green. Right (reorganization of the scheme): block methods, with the separable one in black and ours in blue/green.
  • 20. FPGA implementation. D. Barina, et al., "Single-Loop Approach to 2-D Wavelet Lifting with JPEG 2000 Compatibility" (2015). [Diagram: input, H and V stages, BRAM, and the transform core.]

      Resource usage of the core:

      variant      FF               LUT             BRAM
      latency 4    441 (0.1 %)      399 (0.18 %)    6 (1.1 %)
      latency 2    391 (< 0.1 %)    592 (0.27 %)    6 (1.1 %)

      Comparison with other architectures:

      architecture    device                   BRAM [bits]    clocks/pel    time [ms]
      Dillen2003      VirtexE1000-8            50K            0.50          1.20
      Descampe2004    Virtex-II XC2V6000       N/A            0.60          1.75
      Seo2007         Altera Stratix           128K           2.64          6.02
      Zhang2012       Virtex-II Pro XC2VP30    6 × 18K        0.50          0.97
      the cores       Zynq XC7Z045             1 × 36K        0.26          0.27
  • 21. JPEG 2000 implementation. D. Barina, O. Klima, P. Zemcik, "Single-Loop Architecture for JPEG 2000" (2016). [Diagram: the core and the codeblocks (sizes 2 × 2cn and 2 × 2cm) between the levels aj and aj+1, with the subbands h, v, d. Plot: time in ns (0 to 140) against resolution (100 k to 1 G pel) for the proposed method, OpenJPEG, JasPer, and FFmpeg.]
  • 22. Contributions of the thesis. Aims: improved image transform performance and resource consumption. Objectives: eliminated the shortcomings of existing methods. Evaluation: assessed experimentally (performance, memory requirements). Evaluation performed: 2-D on CPU, 3-D on CPU, 2-D on GPU, 2-D on FPGA, JPEG 2000 on CPU.
  • 23. Selected papers.
      - Barina, D.; Klima, O.; Zemcik, P.: Single-Loop Software Architecture for JPEG 2000. In Data Compression Conference (DCC), 2016.
      - Barina, D.; Musil, M.; Musil, P.; et al.: Single-Loop Approach to 2-D Wavelet Lifting with JPEG 2000 Compatibility. In Workshop on Applications for Multi-Core Architectures (WAMCA), 2015.
      - Barina, D.; Zemcik, P.: Minimum Memory Vectorisation of Wavelet Lifting. In Advanced Concepts for Intelligent Vision Systems (ACIVS), 2013.
      - Barina, D.; Zemcik, P.: Wavelet Lifting on Application Specific Vector Processor. In GraphiCon, 2013.
      - Barina, D.; Zemcik, P.: Diagonal Vectorisation of 2-D Wavelet Lifting. In IEEE International Conference on Image Processing (ICIP), 2014.
      - Barina, D.; Zemcik, P.: Real-Time 3-D Wavelet Lifting. In International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision (WSCG), 2015.
      - Barina, D.; Zemcik, P.: Vectorization and parallelization of 2-D wavelet lifting. Journal of Real-Time Image Processing (JRTIP), in press.
      - Barina, D.; Klima, O.; Zemcik, P.: Single-Loop Architecture for JPEG 2000. In Image and Signal Processing (ICISP), 2016.
      - Kula, M.; Barina, D.; Zemcik, P.: Block-based Approach to 2-D Wavelet Transform on GPUs. In International Conference on Information Technology – New Generations (ITNG), 2016.
      - Kucis, M.; Barina, D.; Kula, M.; et al.: 2-D Discrete Wavelet Transform Using GPU. In Workshop on Applications for Multi-Core Architectures (WAMCA), 2014.
  • 24. Summary. The core is a computing unit which processes the data in a single pass, can suspend and resume execution, can process the data in many different orders, can handle signal boundaries (it is aware of coordinates), can be easily SIMD vectorized and parallelized, and whose underlying scheme can be reorganized.