SlideShare a Scribd company logo
1 of 13
Download to read offline
Wavelet Lifting on
Application Specific Vector Processor
David Barina Pavel Zemcik
Faculty of Information Technology, BUT, Czech Republic
September 17, 2013
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 1 / 13
Motivation
discrete wavelet transform (DWT)
compression, e.g. JPEG 2000, Dirac
lifting scheme is SIMD-friendly
feasible for embedded systems
evaluated on Application Specific Vector Processor (ASVP)
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 2 / 13
Separability
2D using 1D
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 3 / 13
Lifting CDF 9/7
lifting scheme
P(z) =
1 α(1 + z−1)
0 1
1 0
β(1 + z) 1
1 γ(1 + z−1)
0 1
1 0
δ(1 + z) 1
ζ 0
0 1/ζ
α, β, γ, δ
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 4 / 13
Lifting CDF 9/7
lifting scheme practically
s
(0)
l = x2l
d
(0)
l = x2l+1
d
(1)
l = d
(0)
l +α (s
(0)
l + s
(0)
l+1)
s
(1)
l = s
(0)
l +β (d
(1)
l + d
(1)
l−1)
d
(2)
l = d
(1)
l +γ (s
(1)
l + s
(1)
l+1)
s
(2)
l = s
(1)
l +δ (d
(2)
l + d
(2)
l−1)
sl = ζ s
(2)
l
dl = d
(2)
l /ζ
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 5 / 13
Data flow graph
α
β
γ
δ
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 6 / 13
Horizontal vectorisation
α
β
γ
δ
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 7 / 13
Vertical vectorisation
α
β
γ
δ
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 8 / 13
ASVP platform
bus
CPU
BCE
DMA
banks
SDRAM
sCPU VPU
memory
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 9 / 13
FPGA kit SP605
Figure : Xilinx Spartan-6 FPGA SP605 Evaluation Kit
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 10 / 13
ASVP platform
two C source codes (MicroBlaze + PicoBlaze)
accelerate operations on vectors of floats
Vector operations
VCOPY M0[i] ← M1[j]
VADD M0[i] ← M1[j] + M2[k]
VSUB M0[i] ← M1[j] − M2[k]
VMUL M0[i] ← M1[j] · M2[k]
VMAC M0[i] ← M3[l] + (M1[j] · M2[k])
VMSUBAC M0[i] ← M3[l] − (M1[j] · M2[k])
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 11 / 13
Evaluation
100n
1µ
10µ
100µ
10.0 100.0 1.0k 10.0k 100.0k 1.0M
seconds/sample
horizontal BCE
vertical
samples
horizontal CPU
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 12 / 13
Summary
two methods compared on ASVP platform
achieved speedup is up to 2.6×
next research: an adaptation to the 2-D wavelet transform
vectorisation µs/sample speedup
CPU horizontal 1.1 1.0
CPU vertical 0.8 1.4
BCE horizontal 0.4 2.6
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 13 / 13

More Related Content

Similar to Wavelet Lifting on Application Specific Vector Processor

Costin, francillon ghost is in the air(traffic)
Costin, francillon   ghost is in the air(traffic)Costin, francillon   ghost is in the air(traffic)
Costin, francillon ghost is in the air(traffic)DefconRussia
 
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdfDell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdfAhmedMaher181036
 
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015Sckipio
 
Assessment of Real-Time Networks and Timing for Process Bus Applications
Assessment of Real-Time Networks and Timing for Process Bus ApplicationsAssessment of Real-Time Networks and Timing for Process Bus Applications
Assessment of Real-Time Networks and Timing for Process Bus ApplicationsDavid Ingram
 
Booster fan cr1 fn02 platform & duct support calculation note
Booster fan cr1 fn02 platform & duct support calculation noteBooster fan cr1 fn02 platform & duct support calculation note
Booster fan cr1 fn02 platform & duct support calculation noteAli Elkashef
 

Similar to Wavelet Lifting on Application Specific Vector Processor (6)

Costin, francillon ghost is in the air(traffic)
Costin, francillon   ghost is in the air(traffic)Costin, francillon   ghost is in the air(traffic)
Costin, francillon ghost is in the air(traffic)
 
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdfDell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
 
RTL2838 DVB-T Deep dive
RTL2838 DVB-T Deep diveRTL2838 DVB-T Deep dive
RTL2838 DVB-T Deep dive
 
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015
 
Assessment of Real-Time Networks and Timing for Process Bus Applications
Assessment of Real-Time Networks and Timing for Process Bus ApplicationsAssessment of Real-Time Networks and Timing for Process Bus Applications
Assessment of Real-Time Networks and Timing for Process Bus Applications
 
Booster fan cr1 fn02 platform & duct support calculation note
Booster fan cr1 fn02 platform & duct support calculation noteBooster fan cr1 fn02 platform & duct support calculation note
Booster fan cr1 fn02 platform & duct support calculation note
 

More from David Bařina

Lossy Light Field Compression
Lossy Light Field CompressionLossy Light Field Compression
Lossy Light Field CompressionDavid Bařina
 
Mathematical curiosities
Mathematical curiositiesMathematical curiosities
Mathematical curiositiesDavid Bařina
 
New Transforms for JPEG Format
New Transforms for JPEG FormatNew Transforms for JPEG Format
New Transforms for JPEG FormatDavid Bařina
 
Discrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel ArchitecturesDiscrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel ArchitecturesDavid Bařina
 
Parallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet TransformParallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet TransformDavid Bařina
 
Parallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesParallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesDavid Bařina
 
Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000David Bařina
 
Lifting Scheme Cores for Wavelet Transform
Lifting Scheme Cores for Wavelet TransformLifting Scheme Cores for Wavelet Transform
Lifting Scheme Cores for Wavelet TransformDavid Bařina
 
Real-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet LiftingReal-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet LiftingDavid Bařina
 
IIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkceIIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkceDavid Bařina
 
Akcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMDAkcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMDDavid Bařina
 
Bit Twiddling Hacks: Integers
Bit Twiddling Hacks: IntegersBit Twiddling Hacks: Integers
Bit Twiddling Hacks: IntegersDavid Bařina
 
Fixed-point arithmetic
Fixed-point arithmeticFixed-point arithmetic
Fixed-point arithmeticDavid Bařina
 

More from David Bařina (20)

CCSDS 122.0
CCSDS 122.0CCSDS 122.0
CCSDS 122.0
 
Lossy Light Field Compression
Lossy Light Field CompressionLossy Light Field Compression
Lossy Light Field Compression
 
Mathematical curiosities
Mathematical curiositiesMathematical curiosities
Mathematical curiosities
 
C/C++ tricks
C/C++ tricksC/C++ tricks
C/C++ tricks
 
New Transforms for JPEG Format
New Transforms for JPEG FormatNew Transforms for JPEG Format
New Transforms for JPEG Format
 
JPEG
JPEGJPEG
JPEG
 
Discrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel ArchitecturesDiscrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel Architectures
 
Parallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet TransformParallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet Transform
 
Parallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesParallel Wavelet Schemes for Images
Parallel Wavelet Schemes for Images
 
Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000
 
Lifting Scheme Cores for Wavelet Transform
Lifting Scheme Cores for Wavelet TransformLifting Scheme Cores for Wavelet Transform
Lifting Scheme Cores for Wavelet Transform
 
Real-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet LiftingReal-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet Lifting
 
Wavelet News
Wavelet NewsWavelet News
Wavelet News
 
IIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkceIIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkce
 
Akcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMDAkcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMD
 
GStreamer
GStreamerGStreamer
GStreamer
 
FFmpeg
FFmpegFFmpeg
FFmpeg
 
Bit Twiddling Hacks: Integers
Bit Twiddling Hacks: IntegersBit Twiddling Hacks: Integers
Bit Twiddling Hacks: Integers
 
Fixed-point arithmetic
Fixed-point arithmeticFixed-point arithmetic
Fixed-point arithmetic
 
Wavelets @ CPU
Wavelets @ CPUWavelets @ CPU
Wavelets @ CPU
 

Recently uploaded

Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptxArvind Kumar
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Cherry
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCherry
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.Cherry
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxseri bangash
 
Terpineol and it's characterization pptx
Terpineol and it's characterization pptxTerpineol and it's characterization pptx
Terpineol and it's characterization pptxMuhammadRazzaq31
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
Cot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACherry
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.Cherry
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxRenuJangid3
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIADr. TATHAGAT KHOBRAGADE
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 

Recently uploaded (20)

Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
 
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Terpineol and it's characterization pptx
Terpineol and it's characterization pptxTerpineol and it's characterization pptx
Terpineol and it's characterization pptx
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Cot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNA
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 

Wavelet Lifting on Application Specific Vector Processor

  • 1. Wavelet Lifting on Application Specific Vector Processor David Barina Pavel Zemcik Faculty of Information Technology, BUT, Czech Republic September 17, 2013 David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 1 / 13
  • 2. Motivation discrete wavelet transform (DWT) compression, e.g. JPEG 2000, Dirac lifting scheme is SIMD-friendly feasible for embedded systems evaluated on Application Specific Vector Processor (ASVP) David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 2 / 13
  • 3. Separability 2D using 1D David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 3 / 13
  • 4. Lifting CDF 9/7 lifting scheme P(z) = 1 α(1 + z−1) 0 1 1 0 β(1 + z) 1 1 γ(1 + z−1) 0 1 1 0 δ(1 + z) 1 ζ 0 0 1/ζ α, β, γ, δ David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 4 / 13
  • 5. Lifting CDF 9/7 lifting scheme practically s (0) l = x2l d (0) l = x2l+1 d (1) l = d (0) l +α (s (0) l + s (0) l+1) s (1) l = s (0) l +β (d (1) l + d (1) l−1) d (2) l = d (1) l +γ (s (1) l + s (1) l+1) s (2) l = s (1) l +δ (d (2) l + d (2) l−1) sl = ζ s (2) l dl = d (2) l /ζ David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 5 / 13
  • 6. Data flow graph α β γ δ David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 6 / 13
  • 7. Horizontal vectorisation α β γ δ David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 7 / 13
  • 8. Vertical vectorisation α β γ δ David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 8 / 13
  • 9. ASVP platform bus CPU BCE DMA banks SDRAM sCPU VPU memory David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 9 / 13
  • 10. FPGA kit SP605 Figure : Xilinx Spartan-6 FPGA SP605 Evaluation Kit David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 10 / 13
  • 11. ASVP platform two C source codes (MicroBlaze + PicoBlaze) accelerate operations on vectors of floats Vector operations VCOPY M0[i] ← M1[j] VADD M0[i] ← M1[j] + M2[k] VSUB M0[i] ← M1[j] − M2[k] VMUL M0[i] ← M1[j] · M2[k] VMAC M0[i] ← M3[l] + (M1[j] · M2[k]) VMSUBAC M0[i] ← M3[l] − (M1[j] · M2[k]) David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 11 / 13
  • 12. Evaluation 100n 1µ 10µ 100µ 10.0 100.0 1.0k 10.0k 100.0k 1.0M seconds/sample horizontal BCE vertical samples horizontal CPU David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 12 / 13
  • 13. Summary two methods compared on ASVP platform achieved speedup is up to 2.6× next research: an adaptation to the 2-D wavelet transform vectorisation µs/sample speedup CPU horizontal 1.1 1.0 CPU vertical 0.8 1.4 BCE horizontal 0.4 2.6 David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 13 / 13