SlideShare a Scribd company logo
1 of 42
Visual Quality Enhancement in DCTDomain Spatial Downscaling Transcoding
Using Generalized DCT Decimation.

Presented by:
Marwa Ahmed
Mona Ragheb
Sara Serag
Yara Ali
Agenda
•

Introduction



Definition
What is meant by:






Transcoding
Spatial Domain & DCT Domain
Downscaling
Alias
Quantization





•

Video Adaptation

Frequency Synthesis

Generalized DCT Decimation For Spatial
Downscaling
Agenda
•

Computation Reduction Using Sparse matrix
representation

•

Analysis of the proposed DCT decimation filter

•

Experimental results


Optimal list squares Up-Scaling filters ( Steps )

•

Peak Signal to Noise ratio

•

Conclusion
Visual Quality Enhancement in DCT-Domain Spatial
Downscaling Transcoding Using Generalized DCT
Decimation.

•The goal image enhancement is to improve the
image quality so that the processed image PSNR is
high and less computational complexity
Abstract
1. we propose a generalized discrete cosine transform
(DCT) decimation scheme for DCT-domain spatial
downscaling which performs two-fold decimation on
subframes of a flexible size larger than the traditional
8 ᵡ block size to improve the visual quality.
8
2.Efficient sparse-matrix:
representations are then derived to reduce the
computation of the proposed DCT decimation method.
Abstract (cont.)
3.We compare
the filtering performances and computational complexities of the
proposed scheme and the pixel-domain downscaling schemes
Our analysis shows that :

 proposed scheme can reduce the aliasing artifact compared to
the pixel-domain downscaling schemes,
Where as the computational complexity may be increased We also:
 integrate the proposed decimation scheme into the cascaded
DCT-domain transcoder for spatial downscaling of a pre encoded
video into its quarter size
 Experiments show the proposed approach can achieve better
visual quality than the existing schemes
What is meant by:


Transcoding :

Video transcoding is an operation
of converting a video bit-stream into from one format
into another format (e.g., bit-rate , frame-rate, spatial
resolution, and coding syntax). It is an efficient means of
achieving fine and dynamic video adaptation.



Video adaptation :

convert the video bit rate
according to the channel conditions. Since the preencoded video is encoded at high quality and bit rate.
For low bandwidth connections, the video bit rate needs
to be converted to low bit rate.
Spatial Domain & DCT Domain
Spatial Domain (Image Enhancement):

Definition:
is manipulating or changing an image representing an object in
space to enhance the image for a given application.
•Techniques are based on direct manipulation of pixels in an
image
•Used for filtering basics, smoothing filters, sharpening
filters, unsharp masking and laplacian
Discrete Cosine Transform (DCT) domain
•This allows us to discard those equations involving the higher
frequency components, reducing the size of the equation set
considerably.
•in the DCT domain, each equation’s significance is
dependent on the corresponding DCT frequency
•Does not affect the compressibility of the original image
because it enhance the image in the decompression
Aliasing:
• When a signal is under-sampled, aliasing can result
•Aliasing is when high frequency components masquerade as low
frequency ones, and can result from improper image sampling

Downscailing : The operation of retaining the low-frequency
coefficients of aDCT sub-frame and taking the half-size IDCT
Each N M sub-frame is extracting only the (N/2) (M/2)
low-frequency.
Quantization: is the process of converting a continuous analog
audio signal to a digital signal with discrete numerical values.

Frequency synthesis : downscaling method first synthesizes an
incoming macroblock consisting of four 8 ᵡ DCT blocks into
8
one 16 ᵡ DCT block, and then obtains the downscaled 8 ᵡ
16
8
DCT block by extracting the 8 ᵡ low-frequency DCT coefficients
8
of the 16 ᵡ DCT block
16
•In realizing a transcoder, the computational cost and the
picture quality are usually the two most important concerns.
A cascaded DCT-domain transcoder (CDDT):
as depicted in Fig. 1, was first proposed in for spatial
downscaling where a DCT-domain bilinear filter was used as
the anti aliasing filter for the spatial downscaling.
•cascade a decoder followed by an encoder. This cascaded
pixel-domain architecture is flexible and can be used for bit
rate adaptation and spatio-temporal resolution conversion
without drift.
MC: reduces the temporal redundancy.
DCT: reduces the spatial redundancy and achieves energy compaction

Quantization is performed to achieve higher compression ratio.
Variable-length coding.
 VLC: is applied after the quantization to reduce the remaining
redundancy.
 decoder decodes the compressed input video
 encoder re encodes the decoded video into the target format
A video picture is predicted from its reference pictures and only the
prediction errors are coded.
the encoder reuses the motion vectors along with other information
extracted from the input video bit stream.
II. Generalized DCT decimation
for spatial downscaling
Formulation of generalized DCT
decimation
Formulation of generalized
DCT decimation
STEP 1:
A group of consecutive 8-samples DCT vectors are
first transformed into an N-pixel vector by 8-point
IDCT, Where N is a multiple of 8.
Formulation of generalized
DCT decimation
The N-pixel vector is then transformed
into its corresponding DCT vector by Npoint DCT
Formulation of generalized
DCT decimation
The N-point DCT representation of fN can be computed
by:

fN: N-pixel vector that’s composed of 8-pixel vectors bi , i=
1……, N/8

TN: N-point DCT transform matrix that’s divided into N/8
columns of submatrices TN,i of size Nx8
Formulation of generalized
DCT decimation
Formulation of generalized
DCT decimation
STEP 2:
DCT decimation is subsequently performed on
the N-sample DCT vector by extracting the N/2
low-frequency DCT coefficients followed by N/2point IDCT to obtain a downscaled N/2-pixel
vector
Formulation of generalized
DCT decimation
Formulation of generalized
DCT decimation
STEP 3:
The N/2-Pixel vector is transformed into a group
of consecutive 8-Sample DCT vectors by 8-Point
DCT to form the output DCT array
Formulation of generalized
DCT decimation
•Computation reduction using
sparse matrix representations

•Analyses of DCT-DECIMATION
downscaling filters


To reduce computation for matrix operations in
(4) and (7)

can be represented in sparse matrix form
The following characteristics have been noted in
with dimension (N/2) * 8:
1.
General case:
The entries of r th row in
are all zeros except the r th entry where r
= 0, N/8, 2N/8, 3N/8
About N/8 of the entries are zeros.
2.
Special case 1:
For K = N/8 is even


Where i = 1, …….., N/16 r = 0, ……, N/2 and c = 0, …, 7
3.
Special case 2:
for K = N/8 is odd
Where i = 1, …….., N/16 r = 0, ……,, N/2 and c = 0, …, 7
for matrix with i = N/16 + 1 , the entries of odd values of r + c is
zero for
r ≠ 0, N/8, 2N/8 , 3N/8
at most half of the entries are zeros.
Based on the previous facts:

are defined to reduce computations ,
For i = 1, …., k/2 where k is even :

Substituting in

:







The operation of retaining the low-frequency coefficients of a DCT subframe and taking the half-size IDCT is, in effect, to perform anti-aliasing
filtering and then followed by downsampling on the sub-frame in the pixel
domain.
Following is the analysis of the performances and complexities of various
downscaling filters for the 1-D case.
For N samples of 1-D signal x, when downscaled by a factor of two, the
downscaled N/2-sample signal y is obtained as follows:
The downscaling filter is defined as :

Which is considered as a linear filter.




The linear transform can be represented as
an N-band filter bank structure
the z-transform of the output y can be
obtained by:
N increases, the gain of
DCT decimation
filters, |F0(z)|, becomes
much flatter in the low
frequency part
As

(0~π/2), whereas the gain
decreases rapidly in the highfrequency part (π/2~π).
For the bilinear filter, the
gain in the high-frequency
part is always larger than its
counterparts of DCT
decimation filter and 7-tap
filter.
The smaller gain in the
high-frequency part implies
less visible aliasing artifacts
the magnitude responses of the two pixel-domain in the downscaledfilter
filters: the bilinear image.
and the 7-tap filter, and the generalized DCT decimation filters with N =
8, 16, 72, and 288
N

•increasing the sub-frame size for the
DCT decimation filters will lead to
better quality of downscaled image
but it will also increase the
computational complexity
significantly.
•The shown table lists computational
complexities with different N values
and different filter:

Tab length

Bilinear
7-tab
Gaussian

Avg.
Multiplicati
ons
8

Avg.
additions

56

48

8

Average computational complexity for
pixel-domain downscaling scheme

Avg.
Multiplications
Gen.
DCT
decima
tion

Avg. additions

Sparse
Gen.
Sparse
matrix
DCT
matrix
decom decima decom
positio
tion
positio
n
n
2788
4208
2792

352

4224

32

384

228

368

232

16

192

100

176

104

8

64

20

64

20

Average computational complexity
using generalized DCT-decimation
scheme








We use:
One CIF (352* 288)
Two ITUR(704*576)
In each video 150 frame
Encoded by front-end MPEG-2 encoder
Each coded video is transcoded by using
CDDT






Resulting in a spatially downscaled video
of quarter size
We implement bilinear filter and 7-tap
gaussian filter
Each downscaled image is decoded and
up-scaled to its original size


The optimal least- squares upscaling filter
matrix minimize the error between
original sized image & its reconstructed
(downscaled & then upscaling)







Steps:
Divide each downscaled pixel vector
into N/2 sample pixel vector then
transform it to N/2 sample DCT vector.
Expand the size of each N/2 sample DCT
to N sample by padding zero coefficient in
high frequency bands
Apply N-point IDCT






We compare PSNR values of o/p image of
downscaling filter followed by upscaling
filter & final o/p of transcoders.
PSNR, is an engineering term for the
ratio between the maximum possible
power of a signal and the power of
corrupting noise .
So if PSNR is high the noise is low so the
quality is high
PSNR= 20 log10 (255/ RMSE)






We experiment results in N=8,16,32

In N=16,32 we have better visual quality
over that N=8 but in the same time we
increased computational complexity
So we can use N=8 in low activity region &
N=16,32 in high activity region to achieve
good trade-off between computational
complexity & visual quality.
Thank You !

More Related Content

What's hot

Discrete cosine transform
Discrete cosine transform   Discrete cosine transform
Discrete cosine transform Rashmi Karkra
 
High Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image CompressionHigh Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image Compressionsipij
 
COMPARISON OF DENOISING ALGORITHMS FOR DEMOSACING LOW LIGHTING IMAGES USING C...
COMPARISON OF DENOISING ALGORITHMS FOR DEMOSACING LOW LIGHTING IMAGES USING C...COMPARISON OF DENOISING ALGORITHMS FOR DEMOSACING LOW LIGHTING IMAGES USING C...
COMPARISON OF DENOISING ALGORITHMS FOR DEMOSACING LOW LIGHTING IMAGES USING C...sipij
 
Empirical Evaluation of Decomposition Strategy for Wavelet Video Compression
Empirical Evaluation of Decomposition Strategy for Wavelet Video CompressionEmpirical Evaluation of Decomposition Strategy for Wavelet Video Compression
Empirical Evaluation of Decomposition Strategy for Wavelet Video CompressionCSCJournals
 
Labview with dwt for denoising the blurred biometric images
Labview with dwt for denoising the blurred biometric imagesLabview with dwt for denoising the blurred biometric images
Labview with dwt for denoising the blurred biometric imagesijcsa
 
Image Resolution Enhancement Using Undecimated Double Density Wavelet Transform
Image Resolution Enhancement Using Undecimated Double Density Wavelet TransformImage Resolution Enhancement Using Undecimated Double Density Wavelet Transform
Image Resolution Enhancement Using Undecimated Double Density Wavelet TransformCSCJournals
 
Local Vibrational Modes
Local Vibrational ModesLocal Vibrational Modes
Local Vibrational ModesDani Setiawan
 
Multimedia image compression standards
Multimedia image compression standardsMultimedia image compression standards
Multimedia image compression standardsMazin Alwaaly
 
Gray Image Watermarking using slant transform - digital image processing
Gray Image Watermarking using slant transform - digital image processingGray Image Watermarking using slant transform - digital image processing
Gray Image Watermarking using slant transform - digital image processingNITHIN KALLE PALLY
 
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...cscpconf
 
A Novel and Robust Wavelet based Super Resolution Reconstruction of Low Resol...
A Novel and Robust Wavelet based Super Resolution Reconstruction of Low Resol...A Novel and Robust Wavelet based Super Resolution Reconstruction of Low Resol...
A Novel and Robust Wavelet based Super Resolution Reconstruction of Low Resol...CSCJournals
 
Wavelet based image fusion
Wavelet based image fusionWavelet based image fusion
Wavelet based image fusionUmed Paliwal
 
Discrete wavelet transform using matlab
Discrete wavelet transform using matlabDiscrete wavelet transform using matlab
Discrete wavelet transform using matlabIAEME Publication
 
Lightspeed Preprint
Lightspeed PreprintLightspeed Preprint
Lightspeed Preprintjustanimate
 

What's hot (19)

Discrete cosine transform
Discrete cosine transform   Discrete cosine transform
Discrete cosine transform
 
High Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image CompressionHigh Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image Compression
 
COMPARISON OF DENOISING ALGORITHMS FOR DEMOSACING LOW LIGHTING IMAGES USING C...
COMPARISON OF DENOISING ALGORITHMS FOR DEMOSACING LOW LIGHTING IMAGES USING C...COMPARISON OF DENOISING ALGORITHMS FOR DEMOSACING LOW LIGHTING IMAGES USING C...
COMPARISON OF DENOISING ALGORITHMS FOR DEMOSACING LOW LIGHTING IMAGES USING C...
 
B042107012
B042107012B042107012
B042107012
 
wavelet packets
wavelet packetswavelet packets
wavelet packets
 
Empirical Evaluation of Decomposition Strategy for Wavelet Video Compression
Empirical Evaluation of Decomposition Strategy for Wavelet Video CompressionEmpirical Evaluation of Decomposition Strategy for Wavelet Video Compression
Empirical Evaluation of Decomposition Strategy for Wavelet Video Compression
 
Labview with dwt for denoising the blurred biometric images
Labview with dwt for denoising the blurred biometric imagesLabview with dwt for denoising the blurred biometric images
Labview with dwt for denoising the blurred biometric images
 
Image Resolution Enhancement Using Undecimated Double Density Wavelet Transform
Image Resolution Enhancement Using Undecimated Double Density Wavelet TransformImage Resolution Enhancement Using Undecimated Double Density Wavelet Transform
Image Resolution Enhancement Using Undecimated Double Density Wavelet Transform
 
Local Vibrational Modes
Local Vibrational ModesLocal Vibrational Modes
Local Vibrational Modes
 
Multimedia image compression standards
Multimedia image compression standardsMultimedia image compression standards
Multimedia image compression standards
 
Jpeg
JpegJpeg
Jpeg
 
Gray Image Watermarking using slant transform - digital image processing
Gray Image Watermarking using slant transform - digital image processingGray Image Watermarking using slant transform - digital image processing
Gray Image Watermarking using slant transform - digital image processing
 
Jpeg compression
Jpeg compressionJpeg compression
Jpeg compression
 
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...
 
A Novel and Robust Wavelet based Super Resolution Reconstruction of Low Resol...
A Novel and Robust Wavelet based Super Resolution Reconstruction of Low Resol...A Novel and Robust Wavelet based Super Resolution Reconstruction of Low Resol...
A Novel and Robust Wavelet based Super Resolution Reconstruction of Low Resol...
 
Wavelet based image fusion
Wavelet based image fusionWavelet based image fusion
Wavelet based image fusion
 
Discrete wavelet transform using matlab
Discrete wavelet transform using matlabDiscrete wavelet transform using matlab
Discrete wavelet transform using matlab
 
48
4848
48
 
Lightspeed Preprint
Lightspeed PreprintLightspeed Preprint
Lightspeed Preprint
 

Similar to Quality enhamcment

Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standardsComparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standardsRishab2612
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Design of Low Power Sigma Delta ADC
Design of Low Power Sigma Delta ADCDesign of Low Power Sigma Delta ADC
Design of Low Power Sigma Delta ADCVLSICS Design
 
Modified approximate 8-point multiplier less DCT like transform
Modified approximate 8-point multiplier less DCT like transformModified approximate 8-point multiplier less DCT like transform
Modified approximate 8-point multiplier less DCT like transformIJERA Editor
 
Video Compression Basics
Video Compression BasicsVideo Compression Basics
Video Compression BasicsSanjiv Malik
 
A Low Hardware Complex Bilinear Interpolation Algorithm of Image Scaling for ...
A Low Hardware Complex Bilinear Interpolation Algorithm of Image Scaling for ...A Low Hardware Complex Bilinear Interpolation Algorithm of Image Scaling for ...
A Low Hardware Complex Bilinear Interpolation Algorithm of Image Scaling for ...arpublication
 
A COMPARATIVE STUDY OF IMAGE COMPRESSION ALGORITHMS
A COMPARATIVE STUDY OF IMAGE COMPRESSION ALGORITHMSA COMPARATIVE STUDY OF IMAGE COMPRESSION ALGORITHMS
A COMPARATIVE STUDY OF IMAGE COMPRESSION ALGORITHMSKate Campbell
 
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...VLSICS Design
 
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...VLSICS Design
 
Design of Continuous Time Multibit Sigma Delta ADC for Next Generation Wirele...
Design of Continuous Time Multibit Sigma Delta ADC for Next Generation Wirele...Design of Continuous Time Multibit Sigma Delta ADC for Next Generation Wirele...
Design of Continuous Time Multibit Sigma Delta ADC for Next Generation Wirele...IJERA Editor
 
SECURED COLOR IMAGE WATERMARKING TECHNIQUE IN DWT-DCT DOMAIN
SECURED COLOR IMAGE WATERMARKING TECHNIQUE IN DWT-DCT DOMAIN SECURED COLOR IMAGE WATERMARKING TECHNIQUE IN DWT-DCT DOMAIN
SECURED COLOR IMAGE WATERMARKING TECHNIQUE IN DWT-DCT DOMAIN ijcseit
 
Design of 17-Bit Audio Band Delta-Sigma Analog to Digital Converter
Design of 17-Bit Audio Band Delta-Sigma Analog to Digital ConverterDesign of 17-Bit Audio Band Delta-Sigma Analog to Digital Converter
Design of 17-Bit Audio Band Delta-Sigma Analog to Digital ConverterKarthik Rathinavel
 
Aiar. unit v. machine vision 1462642546237
Aiar. unit v. machine vision 1462642546237Aiar. unit v. machine vision 1462642546237
Aiar. unit v. machine vision 1462642546237Kunal mane
 
DSP_FOEHU - Lec 11 - IIR Filter Design
DSP_FOEHU - Lec 11 - IIR Filter DesignDSP_FOEHU - Lec 11 - IIR Filter Design
DSP_FOEHU - Lec 11 - IIR Filter DesignAmr E. Mohamed
 
FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...
FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...
FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...IJERA Editor
 

Similar to Quality enhamcment (20)

I3602061067
I3602061067I3602061067
I3602061067
 
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standardsComparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Design of Low Power Sigma Delta ADC
Design of Low Power Sigma Delta ADCDesign of Low Power Sigma Delta ADC
Design of Low Power Sigma Delta ADC
 
Jv2517361741
Jv2517361741Jv2517361741
Jv2517361741
 
Jv2517361741
Jv2517361741Jv2517361741
Jv2517361741
 
Modified approximate 8-point multiplier less DCT like transform
Modified approximate 8-point multiplier less DCT like transformModified approximate 8-point multiplier less DCT like transform
Modified approximate 8-point multiplier less DCT like transform
 
Image compression Algorithms
Image compression AlgorithmsImage compression Algorithms
Image compression Algorithms
 
Video Compression Basics
Video Compression BasicsVideo Compression Basics
Video Compression Basics
 
A Low Hardware Complex Bilinear Interpolation Algorithm of Image Scaling for ...
A Low Hardware Complex Bilinear Interpolation Algorithm of Image Scaling for ...A Low Hardware Complex Bilinear Interpolation Algorithm of Image Scaling for ...
A Low Hardware Complex Bilinear Interpolation Algorithm of Image Scaling for ...
 
A COMPARATIVE STUDY OF IMAGE COMPRESSION ALGORITHMS
A COMPARATIVE STUDY OF IMAGE COMPRESSION ALGORITHMSA COMPARATIVE STUDY OF IMAGE COMPRESSION ALGORITHMS
A COMPARATIVE STUDY OF IMAGE COMPRESSION ALGORITHMS
 
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...
 
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...
 
Design of Continuous Time Multibit Sigma Delta ADC for Next Generation Wirele...
Design of Continuous Time Multibit Sigma Delta ADC for Next Generation Wirele...Design of Continuous Time Multibit Sigma Delta ADC for Next Generation Wirele...
Design of Continuous Time Multibit Sigma Delta ADC for Next Generation Wirele...
 
SECURED COLOR IMAGE WATERMARKING TECHNIQUE IN DWT-DCT DOMAIN
SECURED COLOR IMAGE WATERMARKING TECHNIQUE IN DWT-DCT DOMAIN SECURED COLOR IMAGE WATERMARKING TECHNIQUE IN DWT-DCT DOMAIN
SECURED COLOR IMAGE WATERMARKING TECHNIQUE IN DWT-DCT DOMAIN
 
F0213137
F0213137F0213137
F0213137
 
Design of 17-Bit Audio Band Delta-Sigma Analog to Digital Converter
Design of 17-Bit Audio Band Delta-Sigma Analog to Digital ConverterDesign of 17-Bit Audio Band Delta-Sigma Analog to Digital Converter
Design of 17-Bit Audio Band Delta-Sigma Analog to Digital Converter
 
Aiar. unit v. machine vision 1462642546237
Aiar. unit v. machine vision 1462642546237Aiar. unit v. machine vision 1462642546237
Aiar. unit v. machine vision 1462642546237
 
DSP_FOEHU - Lec 11 - IIR Filter Design
DSP_FOEHU - Lec 11 - IIR Filter DesignDSP_FOEHU - Lec 11 - IIR Filter Design
DSP_FOEHU - Lec 11 - IIR Filter Design
 
FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...
FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...
FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...
 

More from Yara Ali

Generating a time shrunk lecture video by event
Generating a time shrunk lecture video by eventGenerating a time shrunk lecture video by event
Generating a time shrunk lecture video by eventYara Ali
 
Localization in WSN
Localization in WSNLocalization in WSN
Localization in WSNYara Ali
 
Interference mitigation by dynamic self power control in femtocell
Interference mitigation by dynamic self power control in femtocellInterference mitigation by dynamic self power control in femtocell
Interference mitigation by dynamic self power control in femtocellYara Ali
 
Content based filtering, pub sub, bloom filters
Content based filtering, pub   sub, bloom filtersContent based filtering, pub   sub, bloom filters
Content based filtering, pub sub, bloom filtersYara Ali
 
Intel® core™ i5 700 desktop processor
Intel® core™ i5 700 desktop processorIntel® core™ i5 700 desktop processor
Intel® core™ i5 700 desktop processorYara Ali
 

More from Yara Ali (6)

Generating a time shrunk lecture video by event
Generating a time shrunk lecture video by eventGenerating a time shrunk lecture video by event
Generating a time shrunk lecture video by event
 
Sudoku
SudokuSudoku
Sudoku
 
Localization in WSN
Localization in WSNLocalization in WSN
Localization in WSN
 
Interference mitigation by dynamic self power control in femtocell
Interference mitigation by dynamic self power control in femtocellInterference mitigation by dynamic self power control in femtocell
Interference mitigation by dynamic self power control in femtocell
 
Content based filtering, pub sub, bloom filters
Content based filtering, pub   sub, bloom filtersContent based filtering, pub   sub, bloom filters
Content based filtering, pub sub, bloom filters
 
Intel® core™ i5 700 desktop processor
Intel® core™ i5 700 desktop processorIntel® core™ i5 700 desktop processor
Intel® core™ i5 700 desktop processor
 

Recently uploaded

Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationAadityaSharma884161
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxsqpmdrvczh
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 

Recently uploaded (20)

Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint Presentation
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 

Quality enhamcment

  • 1. Visual Quality Enhancement in DCTDomain Spatial Downscaling Transcoding Using Generalized DCT Decimation. Presented by: Marwa Ahmed Mona Ragheb Sara Serag Yara Ali
  • 2. Agenda • Introduction   Definition What is meant by:      Transcoding Spatial Domain & DCT Domain Downscaling Alias Quantization   • Video Adaptation Frequency Synthesis Generalized DCT Decimation For Spatial Downscaling
  • 3. Agenda • Computation Reduction Using Sparse matrix representation • Analysis of the proposed DCT decimation filter • Experimental results  Optimal list squares Up-Scaling filters ( Steps ) • Peak Signal to Noise ratio • Conclusion
  • 4. Visual Quality Enhancement in DCT-Domain Spatial Downscaling Transcoding Using Generalized DCT Decimation. •The goal image enhancement is to improve the image quality so that the processed image PSNR is high and less computational complexity
  • 5. Abstract 1. we propose a generalized discrete cosine transform (DCT) decimation scheme for DCT-domain spatial downscaling which performs two-fold decimation on subframes of a flexible size larger than the traditional 8 ᵡ block size to improve the visual quality. 8 2.Efficient sparse-matrix: representations are then derived to reduce the computation of the proposed DCT decimation method.
  • 6. Abstract (cont.) 3.We compare the filtering performances and computational complexities of the proposed scheme and the pixel-domain downscaling schemes Our analysis shows that :  proposed scheme can reduce the aliasing artifact compared to the pixel-domain downscaling schemes, Where as the computational complexity may be increased We also:  integrate the proposed decimation scheme into the cascaded DCT-domain transcoder for spatial downscaling of a pre encoded video into its quarter size  Experiments show the proposed approach can achieve better visual quality than the existing schemes
  • 7. What is meant by:  Transcoding : Video transcoding is an operation of converting a video bit-stream into from one format into another format (e.g., bit-rate , frame-rate, spatial resolution, and coding syntax). It is an efficient means of achieving fine and dynamic video adaptation.  Video adaptation : convert the video bit rate according to the channel conditions. Since the preencoded video is encoded at high quality and bit rate. For low bandwidth connections, the video bit rate needs to be converted to low bit rate.
  • 8. Spatial Domain & DCT Domain Spatial Domain (Image Enhancement): Definition: is manipulating or changing an image representing an object in space to enhance the image for a given application. •Techniques are based on direct manipulation of pixels in an image •Used for filtering basics, smoothing filters, sharpening filters, unsharp masking and laplacian
  • 9. Discrete Cosine Transform (DCT) domain •This allows us to discard those equations involving the higher frequency components, reducing the size of the equation set considerably. •in the DCT domain, each equation’s significance is dependent on the corresponding DCT frequency •Does not affect the compressibility of the original image because it enhance the image in the decompression
  • 10. Aliasing: • When a signal is under-sampled, aliasing can result •Aliasing is when high frequency components masquerade as low frequency ones, and can result from improper image sampling Downscailing : The operation of retaining the low-frequency coefficients of aDCT sub-frame and taking the half-size IDCT Each N M sub-frame is extracting only the (N/2) (M/2) low-frequency.
  • 11. Quantization: is the process of converting a continuous analog audio signal to a digital signal with discrete numerical values. Frequency synthesis : downscaling method first synthesizes an incoming macroblock consisting of four 8 ᵡ DCT blocks into 8 one 16 ᵡ DCT block, and then obtains the downscaled 8 ᵡ 16 8 DCT block by extracting the 8 ᵡ low-frequency DCT coefficients 8 of the 16 ᵡ DCT block 16
  • 12. •In realizing a transcoder, the computational cost and the picture quality are usually the two most important concerns. A cascaded DCT-domain transcoder (CDDT): as depicted in Fig. 1, was first proposed in for spatial downscaling where a DCT-domain bilinear filter was used as the anti aliasing filter for the spatial downscaling. •cascade a decoder followed by an encoder. This cascaded pixel-domain architecture is flexible and can be used for bit rate adaptation and spatio-temporal resolution conversion without drift.
  • 13.
  • 14.
  • 15. MC: reduces the temporal redundancy. DCT: reduces the spatial redundancy and achieves energy compaction Quantization is performed to achieve higher compression ratio. Variable-length coding.  VLC: is applied after the quantization to reduce the remaining redundancy.  decoder decodes the compressed input video  encoder re encodes the decoded video into the target format A video picture is predicted from its reference pictures and only the prediction errors are coded. the encoder reuses the motion vectors along with other information extracted from the input video bit stream.
  • 16. II. Generalized DCT decimation for spatial downscaling
  • 17. Formulation of generalized DCT decimation
  • 18. Formulation of generalized DCT decimation STEP 1: A group of consecutive 8-samples DCT vectors are first transformed into an N-pixel vector by 8-point IDCT, Where N is a multiple of 8.
  • 19. Formulation of generalized DCT decimation The N-pixel vector is then transformed into its corresponding DCT vector by Npoint DCT
  • 20. Formulation of generalized DCT decimation The N-point DCT representation of fN can be computed by: fN: N-pixel vector that’s composed of 8-pixel vectors bi , i= 1……, N/8 TN: N-point DCT transform matrix that’s divided into N/8 columns of submatrices TN,i of size Nx8
  • 22. Formulation of generalized DCT decimation STEP 2: DCT decimation is subsequently performed on the N-sample DCT vector by extracting the N/2 low-frequency DCT coefficients followed by N/2point IDCT to obtain a downscaled N/2-pixel vector
  • 24. Formulation of generalized DCT decimation STEP 3: The N/2-Pixel vector is transformed into a group of consecutive 8-Sample DCT vectors by 8-Point DCT to form the output DCT array
  • 26. •Computation reduction using sparse matrix representations •Analyses of DCT-DECIMATION downscaling filters
  • 27.  To reduce computation for matrix operations in (4) and (7) can be represented in sparse matrix form
  • 28. The following characteristics have been noted in with dimension (N/2) * 8: 1. General case: The entries of r th row in are all zeros except the r th entry where r = 0, N/8, 2N/8, 3N/8 About N/8 of the entries are zeros. 2. Special case 1: For K = N/8 is even  Where i = 1, …….., N/16 r = 0, ……, N/2 and c = 0, …, 7 3. Special case 2: for K = N/8 is odd Where i = 1, …….., N/16 r = 0, ……,, N/2 and c = 0, …, 7 for matrix with i = N/16 + 1 , the entries of odd values of r + c is zero for r ≠ 0, N/8, 2N/8 , 3N/8 at most half of the entries are zeros.
  • 29. Based on the previous facts:  are defined to reduce computations , For i = 1, …., k/2 where k is even : Substituting in :
  • 30.     The operation of retaining the low-frequency coefficients of a DCT subframe and taking the half-size IDCT is, in effect, to perform anti-aliasing filtering and then followed by downsampling on the sub-frame in the pixel domain. Following is the analysis of the performances and complexities of various downscaling filters for the 1-D case. For N samples of 1-D signal x, when downscaled by a factor of two, the downscaled N/2-sample signal y is obtained as follows: The downscaling filter is defined as : Which is considered as a linear filter.
  • 31.   The linear transform can be represented as an N-band filter bank structure the z-transform of the output y can be obtained by:
  • 32. N increases, the gain of DCT decimation filters, |F0(z)|, becomes much flatter in the low frequency part As (0~π/2), whereas the gain decreases rapidly in the highfrequency part (π/2~π). For the bilinear filter, the gain in the high-frequency part is always larger than its counterparts of DCT decimation filter and 7-tap filter. The smaller gain in the high-frequency part implies less visible aliasing artifacts the magnitude responses of the two pixel-domain in the downscaledfilter filters: the bilinear image. and the 7-tap filter, and the generalized DCT decimation filters with N = 8, 16, 72, and 288
  • 33. N •increasing the sub-frame size for the DCT decimation filters will lead to better quality of downscaled image but it will also increase the computational complexity significantly. •The shown table lists computational complexities with different N values and different filter: Tab length Bilinear 7-tab Gaussian Avg. Multiplicati ons 8 Avg. additions 56 48 8 Average computational complexity for pixel-domain downscaling scheme Avg. Multiplications Gen. DCT decima tion Avg. additions Sparse Gen. Sparse matrix DCT matrix decom decima decom positio tion positio n n 2788 4208 2792 352 4224 32 384 228 368 232 16 192 100 176 104 8 64 20 64 20 Average computational complexity using generalized DCT-decimation scheme
  • 34.
  • 35.       We use: One CIF (352* 288) Two ITUR(704*576) In each video 150 frame Encoded by front-end MPEG-2 encoder Each coded video is transcoded by using CDDT
  • 36.    Resulting in a spatially downscaled video of quarter size We implement bilinear filter and 7-tap gaussian filter Each downscaled image is decoded and up-scaled to its original size
  • 37.  The optimal least- squares upscaling filter matrix minimize the error between original sized image & its reconstructed (downscaled & then upscaling)
  • 38.     Steps: Divide each downscaled pixel vector into N/2 sample pixel vector then transform it to N/2 sample DCT vector. Expand the size of each N/2 sample DCT to N sample by padding zero coefficient in high frequency bands Apply N-point IDCT
  • 39.
  • 40.    We compare PSNR values of o/p image of downscaling filter followed by upscaling filter & final o/p of transcoders. PSNR, is an engineering term for the ratio between the maximum possible power of a signal and the power of corrupting noise . So if PSNR is high the noise is low so the quality is high PSNR= 20 log10 (255/ RMSE)
  • 41.    We experiment results in N=8,16,32 In N=16,32 we have better visual quality over that N=8 but in the same time we increased computational complexity So we can use N=8 in low activity region & N=16,32 in high activity region to achieve good trade-off between computational complexity & visual quality.

Editor's Notes

  1. This cascaded architecture isflexible and can be used for bitrate adaptation, spatial andtemporal resolution-conversion without drift. It is, however,computationally intensive for real-time applications , eventhough the motion-vectors and coding-modes of the incomingbit-stream can be reused for fast processing.
  2. A cascaded DCT-domaindownscaling transcoder (CDDT) architecture was first proposedin [5] as depicted in Fig. 1, where a bilinear filtering scheme wasused for the spatial resolution downscaling in the DCT domain.
  3. As N increases, the gain of DCTdecimation filters, |F0(z)|, becomes much flatter in the low frequencypart (0~π/2), whereas the gain decreases rapidly in thehigh-frequency part (π/2~π). For the bilinear filter, the gain inthe high-frequency part is always larger than its counterparts ofDCT decimation filter and 7-tap filter. The smaller gain in thehigh-frequency part implies less visible aliasing artifacts in thedownscaled image.