SlideShare a Scribd company logo
1 of 4
Download to read offline
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE)
e-ISSN: 2278-2834,p- ISSN: 2278-8735.Volume 10, Issue 2, Ver. III (Mar - Apr.2015), PP 41-44
www.iosrjournals.org
DOI: 10.9790/2834-10234144 www.iosrjournals.org 41 | Page
Audio Compression Using Daubechie Wavelet
1
P.K.R.Manohar, 2
M.Pratyusha,3
R.Satheesh, 4
S. Geetanjali, 5
N.Rajasekhar
1,2,3,4,
Student, Department of ECE, Lendi Institute Of Engineering And Technology, Vizianagaram, India.
5.
Assistant professor, Department of ECE, Lendi Institute Of Engineering And Technology, Vizianagaram,
India.
Abstract: Compression of audio signal is one of the basic technologies in multimedia. This paper describes
about Daubechie wavelet based audio compression. This method uses optimal wavelet selection and wavelet
coefficients quantization procedure together with different Companding methods like A-law and U-law
Compandings. This paper presents a technique to incorporate psychoacoustic models into an adaptive wavelet
packet scheme to achieve perceptually transparent compression of high-quality audio signals [1].
Keywords: A-law companding, U-law companding, Daubechie wavelet, Bior Wavelet, Symlet Wavelet.
I. Introduction
Compression is process of converting an input data stream into another data stream that has smaller
size. Compression provides the reduction in redundancy also used to reduce storage requirements overall
program execution time may be reduced. This is because reduction in storage will result in reduction of disc
access attempts. The compression algorithmhelp to reduce the bandwidth requirements and also provide a level
of security for the data being transmitted. The wavelets consist of banks of low pass filters, high pass filters and
down sampling units. Half of the filter convolution results are discarded because of the down sampling at each
wavelet decomposition stage. Only the approximation part of the Daubechie wavelet results is kept so that the
number of samples is reduced by half[6].
Different Compression Techniques
Different types of compression techniques are available they are lossless compression and lossy
compression techniques. Redundancy information present in audio signal will removed in loss less compression.
It is disadvantages such as it doesn’t give the constant output data rate and very small compression ratio, and
advantage isit can be applied
To any data stream. In loss compression the information is irrelevant in that the receiver will not able to
recognize the missing.
Wavelet Transforms:
The wavelet theory allows a very general and flexible description to transform signal from time domain
to time-frequency domain, so called time-scale domain. Wavelet transform uses short window for high. Wavelet
Transform uses short window for high frequencies, leading to a good time resolution and larger windows for
low frequencies leading to a good frequency resolution.
II. Wavelet Compression
Wavelet compression is a form of data compression well suited for audio compression, video
compression,image compression. Wavelet compression methods are adequate for representing transients. Such
as percussion sounds in audio, high-frequency components in 2-D images. This means transient elements of data
signal can be represented by a smaller amount of information that would be the case of some other transforms
LikeSymletswfilters, Coifletswfilters, Biorthogonalwfilters, Reverse BiorthogonalFilters,Discrete
Meyer,Harr[5]
Wavelet Representation For Audio Signal
A wavelet transform can be defined as a “small wave” that has its energy concentrated in time, and it
provides a tool for the analysis of transient, non-stationary or time varying phenomenon. It has oscillating wave
like property. Wavelet is a waveform of limited duration having an average value zero. They are localized in
space. Wavelet transform provides a time-frequency representation of the signal. In wavelet transform, the
signal is decomposed into set of basic functions also known as WAVELETS. Wavelets with large number of
vanishing moments are useful for this audio compression method, because if a wavelet with a large number of
vanishing moments is used, a precise specification of the pass bands of each sub band in the wavelet
Audio Compression Using Daubechie Wavelet
DOI: 10.9790/2834-10234144 www.iosrjournals.org 42 | Page
decomposition is possible. Thus, we can approximate the critical band division given by the auditory system
with this structure and quantization noise power could be integrated [3][4].
Wavelet Based Compression Techniques:
Wavelets concentrate speech signals into a few neighbouring coefficients. By taking the wavelet
transform of a signal, many of its‟ coefficients will either be zero or have negligible magnitudes. Data
compression can then be done by treating the small valued coefficients as insignificant data and discarding
them. Compressing a speech signal using wavelets involves the following stages[2].
Figure 1: Compression System Design
a) Thresholding:
After the coefficients are received from different transforms, thresholding is done. Very few DCT
coefficients represent 99% of signal energy; hence thresholding is calculated and applied to the coefficients.
Coefficients having values less than threshold values are removed.
b) Quantization:
It is a process of mapping a set of continuous valued data to a set of discrete valued data. The aim of
quantization is to reduce the information found in threshold coefficients. This process makes sure that it
produces minimum errors. We basically perform uniform quantization process.
c) Encoding:
We use different encoding techniques like decomposition using N equal frames Encoding method is
used to remove data that are repetitively occurring. In encoding we can also reduce the number of coefficients
by removing the redundant data. Encoding can use any of the two compression techniques, lossless or lossy.
This helps in reducing the bandwidth of the signal hence compression can be achieved.The compressed speech
signal can be reconstructed to form the original signal by DECODING followed by DE-QUANTIZATION. This
would reproduce the original signal.
Psychoacoustic Model
Simplified masking model
Auditory masking depends on time and frequency of both the masking signal and the masked signal. It
is assumed in this paper that masking is additive, so they estimate the total masked power at any frequency by
adding the masked power due to the components of the signal at each frequency. Thus, minimum masked power
within each band can be calculated. From this, they get the final estimate of masked noise power. Then the idea
is that a listener will tolerate an additive noise power in the reproduced audio signal as long as the power
spectrum is less than the masked noise power at each frequency[1][2]. The authors noted that even though this
masking model has several drawbacks, it yields reasonable coding gains. The main problems that this
psychoacoustic model has are:
• The shape of the masking property used is valid for masking by tonal signals, which is not the same for
masking by noise.
• The model is based on psychoacoustic studies for the masking of a single tone like signal (quantization error
could happen if it contains several components).
• Masking is assumed to be additive (a power law rule of addition should be used instead).
Masking constraint in the Wavelet Domain
This masking model is incorporated within the framework of the wavelet transform based coder. The idea is to
convert the perceptual threshold of each sub-band into a wavelet constrain. To do that the authors defined e, an
N x 1 error vector consisting of the value of the Discrete Fourier Transform of the error in reconstructing the
signal from a sequence of approximate wavelet coefficients (N is the length of the audio frame). Also RD is
defined as a diagonal matrix with entries equal to discretized value of one over the masked noise power. The
psychoacoustic model implies that the reconstruction error due to the quantization or approximation of the
wavelet coefficients corresponding to the given audio signal may be made inaudible
Audio Compression Using Daubechie Wavelet
DOI: 10.9790/2834-10234144 www.iosrjournals.org 43 | Page
Reducing The Number Of Non-Zero Coefficients Optimization Criterion:
For each frame, an optimum wavelet representation is selected to minimize the number of bits required
to represent the frame while keeping any distortion inaudible. This wavelet selection is the strongest
compression technique, because it highly reduces the number of non-zero wavelet coefficients. In addition to
that, those coefficients may be encoded using a small number of bits. Therefore, this technique choosing an
analysis wavelet and allocating bits to each coefficient in the resulting wavelet representation. Figure explains
how this technique works. It shows a signal vector representation by a particular choice of a basis. The radius of
the sphere shown is equal of the norm of the time domain signal, and the error ellipse corresponds to the
perceptual semi-norm calculated by the psychoacoustic model. The audio segment can be represented using any
vector whose tip lies inside the error ellipse with no perceptual distortion. Hence, the projection of the error
ellipsoid along each coordinate axis specifies the coarsest quantization that can be used along the axis without
producing any perceptual degradation. Therefore, a large projection along a particular coordinate axis implies
that only a small number of bits to quantize that coordinate need to used. Exploiting this fact, a low bit rate
representation of the signal can be achieved by the rotation of the vector representation of the signal via a
unitary wavelet transformation.
To apply this technique, let ( ) be the number of bits assigned to the quantization ofthe kth
transform coefficient when the wavelet identified by the vector O is used todecompose frame x. The
goal is to minimize.
R( )=
by properly choosing O and the number of bits ( )assigned to the quantization of each transform
coefficient The minimization must be done under the constraint on the perceptual encoding error. It is
proven in the paper that, for a particular choice of a wavelet, the bit rate requirement may be computed using the
following formula directly from the transform coefficients.
III. Simulation Result
Figure: Fordb10wavelet
Audio Compression Using Daubechie Wavelet
DOI: 10.9790/2834-10234144 www.iosrjournals.org 44 | Page
Figure: For BIOR 1.5 wavelet
IV. Conclusion
A brief summary of the audio compression techniques using Daubechie Wavelets and the main
considerations that people do when evaluating their results has been presented.A MATLAB simulation of the
selected paper was successfully implemented, simplifying some of its features, but keeping its main structure
and contributions.The quality of the compressed signal obtained with the MATLAB implementation is lower
than any current standard compressing schemes, but considerable better than one obtained just by “blindly
compressing” the signal.
References
[1]. High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic
Modeling. IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 46, NO. 4, APRIL 1998.
[2]. Deepen Sinha. Low Bit Rate Transparent Audio Compression using Adapted Wavelets. IEEE
TRANSACTIONS ON SIGNAL PROCESSING.VOL. 41. NO. 12. DECEMBER 1993.
[3]. Johnson IhyehAgbinya.discrete wavelet transform techniques in speech processing 1996 IEEETENCON -
Digital Signal Processing Applications
[4]. M.V.PatilAudio and Speech Compression Using DCT and DWT Techniques.International Journal of
Innovative Research in Science, Engineering and Technology Vol. 2, Issue 5, May 2013
[5]. CiprianDoruGiurc˘aneanu ,IoanT˘abus¸ and JaakkoAstola.Integerwavelet Transform Based Lossless
Audio Compression.
[6]. C.D. Giurc˘aneanu, I. T˘abus¸, and J. Astola.Adaptivecontext based sequential prediction for lossless
audiocompression. In Proc. Eusipco-98, IX European SignalProcessing Conference, volume 4, pages
2349–2352, Rhodes, Greece, Sept. 1998

More Related Content

What's hot

44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognitionsunnysyed
 
IRJET- Compressed Sensing based Modified Orthogonal Matching Pursuit in DTTV ...
IRJET- Compressed Sensing based Modified Orthogonal Matching Pursuit in DTTV ...IRJET- Compressed Sensing based Modified Orthogonal Matching Pursuit in DTTV ...
IRJET- Compressed Sensing based Modified Orthogonal Matching Pursuit in DTTV ...IRJET Journal
 
Voice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringVoice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringTejus Adiga M
 
Adaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancementAdaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancementHarshal Ladhe
 
Data Compression using Multiple Transformation Techniques for Audio Applicati...
Data Compression using Multiple Transformation Techniques for Audio Applicati...Data Compression using Multiple Transformation Techniques for Audio Applicati...
Data Compression using Multiple Transformation Techniques for Audio Applicati...iosrjce
 
Improvement of minimum tracking in Minimum Statistics noise estimation method
Improvement of minimum tracking in Minimum Statistics noise estimation methodImprovement of minimum tracking in Minimum Statistics noise estimation method
Improvement of minimum tracking in Minimum Statistics noise estimation methodCSCJournals
 
Wavelet based image compression technique
Wavelet based image compression techniqueWavelet based image compression technique
Wavelet based image compression techniquePriyanka Pachori
 
Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
Speech Enhancement Using Spectral Flatness Measure Based Spectral SubtractionSpeech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
Speech Enhancement Using Spectral Flatness Measure Based Spectral SubtractionIOSRJVSP
 
OFDM PAPR Reduction Using Hybrid Partial Transmit Sequences Based On Cuckoo S...
OFDM PAPR Reduction Using Hybrid Partial Transmit Sequences Based On Cuckoo S...OFDM PAPR Reduction Using Hybrid Partial Transmit Sequences Based On Cuckoo S...
OFDM PAPR Reduction Using Hybrid Partial Transmit Sequences Based On Cuckoo S...IJERA Editor
 
Simulation of Direct Sequence Spread Spectrum for Wireless Communication Syst...
Simulation of Direct Sequence Spread Spectrum for Wireless Communication Syst...Simulation of Direct Sequence Spread Spectrum for Wireless Communication Syst...
Simulation of Direct Sequence Spread Spectrum for Wireless Communication Syst...ijtsrd
 
Signal to-noise-ratio of signal acquisition in global navigation satellite sy...
Signal to-noise-ratio of signal acquisition in global navigation satellite sy...Signal to-noise-ratio of signal acquisition in global navigation satellite sy...
Signal to-noise-ratio of signal acquisition in global navigation satellite sy...Alexander Decker
 
IRJET - Audio Emotion Analysis
IRJET - Audio Emotion AnalysisIRJET - Audio Emotion Analysis
IRJET - Audio Emotion AnalysisIRJET Journal
 
Audio Steganography Coding Using the Discreet Wavelet Transforms
Audio Steganography Coding Using the Discreet Wavelet TransformsAudio Steganography Coding Using the Discreet Wavelet Transforms
Audio Steganography Coding Using the Discreet Wavelet TransformsCSCJournals
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...IRJET Journal
 
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...CSCJournals
 
Introduction to wavelet transform
Introduction to wavelet transformIntroduction to wavelet transform
Introduction to wavelet transformRaj Endiran
 

What's hot (18)

44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition
 
IRJET- Compressed Sensing based Modified Orthogonal Matching Pursuit in DTTV ...
IRJET- Compressed Sensing based Modified Orthogonal Matching Pursuit in DTTV ...IRJET- Compressed Sensing based Modified Orthogonal Matching Pursuit in DTTV ...
IRJET- Compressed Sensing based Modified Orthogonal Matching Pursuit in DTTV ...
 
Voice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringVoice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency Filtering
 
Adaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancementAdaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancement
 
Data Compression using Multiple Transformation Techniques for Audio Applicati...
Data Compression using Multiple Transformation Techniques for Audio Applicati...Data Compression using Multiple Transformation Techniques for Audio Applicati...
Data Compression using Multiple Transformation Techniques for Audio Applicati...
 
Improvement of minimum tracking in Minimum Statistics noise estimation method
Improvement of minimum tracking in Minimum Statistics noise estimation methodImprovement of minimum tracking in Minimum Statistics noise estimation method
Improvement of minimum tracking in Minimum Statistics noise estimation method
 
Wavelet based image compression technique
Wavelet based image compression techniqueWavelet based image compression technique
Wavelet based image compression technique
 
Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
Speech Enhancement Using Spectral Flatness Measure Based Spectral SubtractionSpeech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
 
OFDM PAPR Reduction Using Hybrid Partial Transmit Sequences Based On Cuckoo S...
OFDM PAPR Reduction Using Hybrid Partial Transmit Sequences Based On Cuckoo S...OFDM PAPR Reduction Using Hybrid Partial Transmit Sequences Based On Cuckoo S...
OFDM PAPR Reduction Using Hybrid Partial Transmit Sequences Based On Cuckoo S...
 
Simulation of Direct Sequence Spread Spectrum for Wireless Communication Syst...
Simulation of Direct Sequence Spread Spectrum for Wireless Communication Syst...Simulation of Direct Sequence Spread Spectrum for Wireless Communication Syst...
Simulation of Direct Sequence Spread Spectrum for Wireless Communication Syst...
 
Signal to-noise-ratio of signal acquisition in global navigation satellite sy...
Signal to-noise-ratio of signal acquisition in global navigation satellite sy...Signal to-noise-ratio of signal acquisition in global navigation satellite sy...
Signal to-noise-ratio of signal acquisition in global navigation satellite sy...
 
Dq33705710
Dq33705710Dq33705710
Dq33705710
 
IRJET - Audio Emotion Analysis
IRJET - Audio Emotion AnalysisIRJET - Audio Emotion Analysis
IRJET - Audio Emotion Analysis
 
Audio Steganography Coding Using the Discreet Wavelet Transforms
Audio Steganography Coding Using the Discreet Wavelet TransformsAudio Steganography Coding Using the Discreet Wavelet Transforms
Audio Steganography Coding Using the Discreet Wavelet Transforms
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
 
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
 
D011132635
D011132635D011132635
D011132635
 
Introduction to wavelet transform
Introduction to wavelet transformIntroduction to wavelet transform
Introduction to wavelet transform
 

Viewers also liked (20)

N0175496100
N0175496100N0175496100
N0175496100
 
Normas apa sexta edición
Normas apa sexta ediciónNormas apa sexta edición
Normas apa sexta edición
 
S01232153159
S01232153159S01232153159
S01232153159
 
E017422935
E017422935E017422935
E017422935
 
A012120110
A012120110A012120110
A012120110
 
K1803027074
K1803027074K1803027074
K1803027074
 
E018142329
E018142329E018142329
E018142329
 
H017264852
H017264852H017264852
H017264852
 
D1803032632
D1803032632D1803032632
D1803032632
 
E010143037
E010143037E010143037
E010143037
 
A010610110
A010610110A010610110
A010610110
 
9555569222 Supertech Basera Affordable Housing Sector 79 Gurgaon, Haryana Aff...
9555569222 Supertech Basera Affordable Housing Sector 79 Gurgaon, Haryana Aff...9555569222 Supertech Basera Affordable Housing Sector 79 Gurgaon, Haryana Aff...
9555569222 Supertech Basera Affordable Housing Sector 79 Gurgaon, Haryana Aff...
 
I1803035762
I1803035762I1803035762
I1803035762
 
A017620123
A017620123A017620123
A017620123
 
K018146372
K018146372K018146372
K018146372
 
M0174491101
M0174491101M0174491101
M0174491101
 
M017419499
M017419499M017419499
M017419499
 
J017114852
J017114852J017114852
J017114852
 
G012134044
G012134044G012134044
G012134044
 
J1102026164
J1102026164J1102026164
J1102026164
 

Similar to Audio Compression Using Daubechie Wavelets

Speech Compression Using Wavelets
Speech Compression Using Wavelets Speech Compression Using Wavelets
Speech Compression Using Wavelets IJMER
 
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET Journal
 
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...IJMTST Journal
 
Novel Approach of Implementing Psychoacoustic model for MPEG-1 Audio
Novel Approach of Implementing Psychoacoustic model for MPEG-1 AudioNovel Approach of Implementing Psychoacoustic model for MPEG-1 Audio
Novel Approach of Implementing Psychoacoustic model for MPEG-1 Audioinventy
 
SPEECH COMPRESSION TECHNIQUES: A REVIEW
SPEECH COMPRESSION TECHNIQUES: A REVIEWSPEECH COMPRESSION TECHNIQUES: A REVIEW
SPEECH COMPRESSION TECHNIQUES: A REVIEWijiert bestjournal
 
Attention gated encoder-decoder for ultrasonic signal denoising
Attention gated encoder-decoder for ultrasonic signal denoisingAttention gated encoder-decoder for ultrasonic signal denoising
Attention gated encoder-decoder for ultrasonic signal denoisingIAESIJAI
 
Signal denoising techniques
Signal denoising techniquesSignal denoising techniques
Signal denoising techniquesShwetaRevankar4
 
Wavelet based denoisiong of acoustic signal
Wavelet based denoisiong of acoustic signalWavelet based denoisiong of acoustic signal
Wavelet based denoisiong of acoustic signaleSAT Publishing House
 
30 9762 extension paper id 0030 (edit i)
30 9762 extension paper id 0030 (edit i)30 9762 extension paper id 0030 (edit i)
30 9762 extension paper id 0030 (edit i)IAESIJEECS
 
20575-38936-1-PB.pdf
20575-38936-1-PB.pdf20575-38936-1-PB.pdf
20575-38936-1-PB.pdfIjictTeam
 
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...IJMER
 

Similar to Audio Compression Using Daubechie Wavelets (20)

I017535359
I017535359I017535359
I017535359
 
Speech Compression Using Wavelets
Speech Compression Using Wavelets Speech Compression Using Wavelets
Speech Compression Using Wavelets
 
N017657985
N017657985N017657985
N017657985
 
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
 
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
An Effective Approach for Colour Image Transmission using DWT Over OFDM for B...
 
Novel Approach of Implementing Psychoacoustic model for MPEG-1 Audio
Novel Approach of Implementing Psychoacoustic model for MPEG-1 AudioNovel Approach of Implementing Psychoacoustic model for MPEG-1 Audio
Novel Approach of Implementing Psychoacoustic model for MPEG-1 Audio
 
SPEECH COMPRESSION TECHNIQUES: A REVIEW
SPEECH COMPRESSION TECHNIQUES: A REVIEWSPEECH COMPRESSION TECHNIQUES: A REVIEW
SPEECH COMPRESSION TECHNIQUES: A REVIEW
 
Dq33705710
Dq33705710Dq33705710
Dq33705710
 
D1103023339
D1103023339D1103023339
D1103023339
 
Attention gated encoder-decoder for ultrasonic signal denoising
Attention gated encoder-decoder for ultrasonic signal denoisingAttention gated encoder-decoder for ultrasonic signal denoising
Attention gated encoder-decoder for ultrasonic signal denoising
 
B046040611
B046040611B046040611
B046040611
 
Signal denoising techniques
Signal denoising techniquesSignal denoising techniques
Signal denoising techniques
 
Wavelet based denoisiong of acoustic signal
Wavelet based denoisiong of acoustic signalWavelet based denoisiong of acoustic signal
Wavelet based denoisiong of acoustic signal
 
L010218691
L010218691L010218691
L010218691
 
G010513644
G010513644G010513644
G010513644
 
30 9762 extension paper id 0030 (edit i)
30 9762 extension paper id 0030 (edit i)30 9762 extension paper id 0030 (edit i)
30 9762 extension paper id 0030 (edit i)
 
20575-38936-1-PB.pdf
20575-38936-1-PB.pdf20575-38936-1-PB.pdf
20575-38936-1-PB.pdf
 
1801 1805
1801 18051801 1805
1801 1805
 
1801 1805
1801 18051801 1805
1801 1805
 
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 
C011121114
C011121114C011121114
C011121114
 

Recently uploaded

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 

Recently uploaded (20)

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 

Audio Compression Using Daubechie Wavelets

  • 1. IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-ISSN: 2278-2834,p- ISSN: 2278-8735.Volume 10, Issue 2, Ver. III (Mar - Apr.2015), PP 41-44 www.iosrjournals.org DOI: 10.9790/2834-10234144 www.iosrjournals.org 41 | Page Audio Compression Using Daubechie Wavelet 1 P.K.R.Manohar, 2 M.Pratyusha,3 R.Satheesh, 4 S. Geetanjali, 5 N.Rajasekhar 1,2,3,4, Student, Department of ECE, Lendi Institute Of Engineering And Technology, Vizianagaram, India. 5. Assistant professor, Department of ECE, Lendi Institute Of Engineering And Technology, Vizianagaram, India. Abstract: Compression of audio signal is one of the basic technologies in multimedia. This paper describes about Daubechie wavelet based audio compression. This method uses optimal wavelet selection and wavelet coefficients quantization procedure together with different Companding methods like A-law and U-law Compandings. This paper presents a technique to incorporate psychoacoustic models into an adaptive wavelet packet scheme to achieve perceptually transparent compression of high-quality audio signals [1]. Keywords: A-law companding, U-law companding, Daubechie wavelet, Bior Wavelet, Symlet Wavelet. I. Introduction Compression is process of converting an input data stream into another data stream that has smaller size. Compression provides the reduction in redundancy also used to reduce storage requirements overall program execution time may be reduced. This is because reduction in storage will result in reduction of disc access attempts. The compression algorithmhelp to reduce the bandwidth requirements and also provide a level of security for the data being transmitted. The wavelets consist of banks of low pass filters, high pass filters and down sampling units. Half of the filter convolution results are discarded because of the down sampling at each wavelet decomposition stage. Only the approximation part of the Daubechie wavelet results is kept so that the number of samples is reduced by half[6]. Different Compression Techniques Different types of compression techniques are available they are lossless compression and lossy compression techniques. Redundancy information present in audio signal will removed in loss less compression. It is disadvantages such as it doesn’t give the constant output data rate and very small compression ratio, and advantage isit can be applied To any data stream. In loss compression the information is irrelevant in that the receiver will not able to recognize the missing. Wavelet Transforms: The wavelet theory allows a very general and flexible description to transform signal from time domain to time-frequency domain, so called time-scale domain. Wavelet transform uses short window for high. Wavelet Transform uses short window for high frequencies, leading to a good time resolution and larger windows for low frequencies leading to a good frequency resolution. II. Wavelet Compression Wavelet compression is a form of data compression well suited for audio compression, video compression,image compression. Wavelet compression methods are adequate for representing transients. Such as percussion sounds in audio, high-frequency components in 2-D images. This means transient elements of data signal can be represented by a smaller amount of information that would be the case of some other transforms LikeSymletswfilters, Coifletswfilters, Biorthogonalwfilters, Reverse BiorthogonalFilters,Discrete Meyer,Harr[5] Wavelet Representation For Audio Signal A wavelet transform can be defined as a “small wave” that has its energy concentrated in time, and it provides a tool for the analysis of transient, non-stationary or time varying phenomenon. It has oscillating wave like property. Wavelet is a waveform of limited duration having an average value zero. They are localized in space. Wavelet transform provides a time-frequency representation of the signal. In wavelet transform, the signal is decomposed into set of basic functions also known as WAVELETS. Wavelets with large number of vanishing moments are useful for this audio compression method, because if a wavelet with a large number of vanishing moments is used, a precise specification of the pass bands of each sub band in the wavelet
  • 2. Audio Compression Using Daubechie Wavelet DOI: 10.9790/2834-10234144 www.iosrjournals.org 42 | Page decomposition is possible. Thus, we can approximate the critical band division given by the auditory system with this structure and quantization noise power could be integrated [3][4]. Wavelet Based Compression Techniques: Wavelets concentrate speech signals into a few neighbouring coefficients. By taking the wavelet transform of a signal, many of its‟ coefficients will either be zero or have negligible magnitudes. Data compression can then be done by treating the small valued coefficients as insignificant data and discarding them. Compressing a speech signal using wavelets involves the following stages[2]. Figure 1: Compression System Design a) Thresholding: After the coefficients are received from different transforms, thresholding is done. Very few DCT coefficients represent 99% of signal energy; hence thresholding is calculated and applied to the coefficients. Coefficients having values less than threshold values are removed. b) Quantization: It is a process of mapping a set of continuous valued data to a set of discrete valued data. The aim of quantization is to reduce the information found in threshold coefficients. This process makes sure that it produces minimum errors. We basically perform uniform quantization process. c) Encoding: We use different encoding techniques like decomposition using N equal frames Encoding method is used to remove data that are repetitively occurring. In encoding we can also reduce the number of coefficients by removing the redundant data. Encoding can use any of the two compression techniques, lossless or lossy. This helps in reducing the bandwidth of the signal hence compression can be achieved.The compressed speech signal can be reconstructed to form the original signal by DECODING followed by DE-QUANTIZATION. This would reproduce the original signal. Psychoacoustic Model Simplified masking model Auditory masking depends on time and frequency of both the masking signal and the masked signal. It is assumed in this paper that masking is additive, so they estimate the total masked power at any frequency by adding the masked power due to the components of the signal at each frequency. Thus, minimum masked power within each band can be calculated. From this, they get the final estimate of masked noise power. Then the idea is that a listener will tolerate an additive noise power in the reproduced audio signal as long as the power spectrum is less than the masked noise power at each frequency[1][2]. The authors noted that even though this masking model has several drawbacks, it yields reasonable coding gains. The main problems that this psychoacoustic model has are: • The shape of the masking property used is valid for masking by tonal signals, which is not the same for masking by noise. • The model is based on psychoacoustic studies for the masking of a single tone like signal (quantization error could happen if it contains several components). • Masking is assumed to be additive (a power law rule of addition should be used instead). Masking constraint in the Wavelet Domain This masking model is incorporated within the framework of the wavelet transform based coder. The idea is to convert the perceptual threshold of each sub-band into a wavelet constrain. To do that the authors defined e, an N x 1 error vector consisting of the value of the Discrete Fourier Transform of the error in reconstructing the signal from a sequence of approximate wavelet coefficients (N is the length of the audio frame). Also RD is defined as a diagonal matrix with entries equal to discretized value of one over the masked noise power. The psychoacoustic model implies that the reconstruction error due to the quantization or approximation of the wavelet coefficients corresponding to the given audio signal may be made inaudible
  • 3. Audio Compression Using Daubechie Wavelet DOI: 10.9790/2834-10234144 www.iosrjournals.org 43 | Page Reducing The Number Of Non-Zero Coefficients Optimization Criterion: For each frame, an optimum wavelet representation is selected to minimize the number of bits required to represent the frame while keeping any distortion inaudible. This wavelet selection is the strongest compression technique, because it highly reduces the number of non-zero wavelet coefficients. In addition to that, those coefficients may be encoded using a small number of bits. Therefore, this technique choosing an analysis wavelet and allocating bits to each coefficient in the resulting wavelet representation. Figure explains how this technique works. It shows a signal vector representation by a particular choice of a basis. The radius of the sphere shown is equal of the norm of the time domain signal, and the error ellipse corresponds to the perceptual semi-norm calculated by the psychoacoustic model. The audio segment can be represented using any vector whose tip lies inside the error ellipse with no perceptual distortion. Hence, the projection of the error ellipsoid along each coordinate axis specifies the coarsest quantization that can be used along the axis without producing any perceptual degradation. Therefore, a large projection along a particular coordinate axis implies that only a small number of bits to quantize that coordinate need to used. Exploiting this fact, a low bit rate representation of the signal can be achieved by the rotation of the vector representation of the signal via a unitary wavelet transformation. To apply this technique, let ( ) be the number of bits assigned to the quantization ofthe kth transform coefficient when the wavelet identified by the vector O is used todecompose frame x. The goal is to minimize. R( )= by properly choosing O and the number of bits ( )assigned to the quantization of each transform coefficient The minimization must be done under the constraint on the perceptual encoding error. It is proven in the paper that, for a particular choice of a wavelet, the bit rate requirement may be computed using the following formula directly from the transform coefficients. III. Simulation Result Figure: Fordb10wavelet
  • 4. Audio Compression Using Daubechie Wavelet DOI: 10.9790/2834-10234144 www.iosrjournals.org 44 | Page Figure: For BIOR 1.5 wavelet IV. Conclusion A brief summary of the audio compression techniques using Daubechie Wavelets and the main considerations that people do when evaluating their results has been presented.A MATLAB simulation of the selected paper was successfully implemented, simplifying some of its features, but keeping its main structure and contributions.The quality of the compressed signal obtained with the MATLAB implementation is lower than any current standard compressing schemes, but considerable better than one obtained just by “blindly compressing” the signal. References [1]. High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling. IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 46, NO. 4, APRIL 1998. [2]. Deepen Sinha. Low Bit Rate Transparent Audio Compression using Adapted Wavelets. IEEE TRANSACTIONS ON SIGNAL PROCESSING.VOL. 41. NO. 12. DECEMBER 1993. [3]. Johnson IhyehAgbinya.discrete wavelet transform techniques in speech processing 1996 IEEETENCON - Digital Signal Processing Applications [4]. M.V.PatilAudio and Speech Compression Using DCT and DWT Techniques.International Journal of Innovative Research in Science, Engineering and Technology Vol. 2, Issue 5, May 2013 [5]. CiprianDoruGiurc˘aneanu ,IoanT˘abus¸ and JaakkoAstola.Integerwavelet Transform Based Lossless Audio Compression. [6]. C.D. Giurc˘aneanu, I. T˘abus¸, and J. Astola.Adaptivecontext based sequential prediction for lossless audiocompression. In Proc. Eusipco-98, IX European SignalProcessing Conference, volume 4, pages 2349–2352, Rhodes, Greece, Sept. 1998