SlideShare a Scribd company logo
1 of 3
Download to read offline
Prof. Vijaya Sugandhi / International Journal of Engineering Research and Applications
                    (IJERA)          ISSN: 2248-9622        www.ijera.com
                      Vol. 3, Issue 1, January-February 2013, pp.074-076
             Spectral Analysis in Speech Processing Techniques
                                         Prof. Vijaya Sugandhi
          Deptt. of Electrical & Elex. Engg. Shri VaishnavSM Institute of Tech. & science,Indore(M.P.)


ABSTRACT
          The corruption of speech due to presence         corresponding frequency spectrum. This includes
of additive background noise causes severe                 familiar concepts such as visible light (color),
difficulties    in     various     communication           musical notes, radio/TV channels, and even the
environments. This paper addresses the problem             regular rotation of the earth. When these physical
of reduction of additive background noise in               phenomena are represented in the form of a
speech. The proposed approach is a frequency               frequency spectrum, certain physical descriptions of
dependent speech enhancement method based on               their internal processes become much simpler.
the proven spectral subtraction method. Most               Often,     the     frequency    spectrum     clearly
implementations and variations of the basic                shows harmonics, visible as distinct spikes or lines
spectral     subtraction    technique     advocate         that provide insight into the mechanisms that
subtraction of the noise spectrum estimate over            generate the entire signal.
the entire speech spectrum. However, real world
noise is mostly colored and does not affect the            II     BACKGROUND
speech signal uniformly over the entire spectrum.                    In the past decades, research in the field of
This method provides a greater degree of                   speech enhancement has focused on the suppression
flexibility and control on the noise subtraction           of additive background noise [3] [4] [5]. From the
levels that reduces artifacts in the enhanced              point of view of signal processing, additive noise is
speech, resulting in improved speech quality.              easier to deal with than convolutive noise or
                                                           nonlinear disturbances. The ultimate goal of speech
Key words: spectrum, noise subtraction.                    enhancement is to eliminate the additive noise
                                                           present in speech signal and restore the speech signal
I    INTRODUCTION                                          to its original form. Several methods have been
          Speech processing has been a growing and         developed as a result of these research efforts. Most
dynamic field for more than two decades and there is       of these methods have been developed with some or
every indication that this growth will continue and        the other auditory, perceptual or statistical
even accelerate. During this growth there has been a       constraints placed on the speech and noise signals.
close relationship between the development of new          However, in real world situations, it is very difficult
algorithms and theoretical results, new filtering          to reliably predict the characteristics of the
techniques are also of consideration to the success of     interfering noise signal or the exact characteristics of
speech processing. One of the common adaptive              the speech waveform. Hence, in effect, the speech
filtering techniques that are applied to speech is the     enhancement methods are sub-optimal and can only
Wiener filter.                                             reduce the amount of noise in the signal to some
A spectrogram is       a     time-varying       spectral   extent. Due to the sub-optimal nature of these
representation (forming an image) that shows how           methods, some of the speech signal can be distorted
the spectral density of a signal varies with time. Also    during the process. Hence, there is a trade-off
known                                        as spectral   between distortions in the processed speech and the
waterfalls, sonograms, voiceprints, or voicegrams,         amount of noise suppressed.
spectrograms are used to identify phonetic sounds, to
analyses the cries of animals; they were also used in      III.   PROPOSED METHODOLOGY
many                     other                    fields            The following simplified model for voiced
including music, sonar/radar, speech       processing,     speech production, where the speech signal s(n) is
seismology, etc.The instrument that generates a            formed as the convolution [1]
spectrogram is called a spectrograph.                                s(n) = e(n)*θ(n),                (1.1)
                                                           where e(n) is the excitation source and θ (n) the
The frequency spectrum of a time-domain signal is a        vocal tract response. For speech recognition,
representation of that signal in the frequency             extracting the vocal tract response and discarding the
domain. The frequency spectrum can be generated            excitation information from the resulting signal is
via a Fourier transform of the signal, and the             useful, as the information relevant for distinguishing
resulting    values      are     usually   presented       the spoken words is mainly in the vocal tract
as amplitude and phase,           both        plotted      response, while the excitation source primarily
versus frequency. Any signal that can be represented       contains the irrelevant pitch information.
as amplitude that varies with time has a

                                                                                                    74 | P a g e
Prof. Vijaya Sugandhi / International Journal of Engineering Research and Applications
                    (IJERA)          ISSN: 2248-9622        www.ijera.com
                      Vol. 3, Issue 1, January-February 2013, pp.074-076
If we assume that y(n) , the discrete noisy input          artifacts related to signal truncation. The specgram
signal, is composed of the clean speech signal s(n)        function computes a time-frequency plot of a signal
and the uncorrelated additive noise signal d(n) , then     where color represents spectral magnitude
we can represent it as:                                    amplitude.
y(n)=s(n)+d(n)                    (1.2)                                                            audio spectra1
                                                                             1
Processing is done on a frames-by-frame basis.
Analysis of overlapping frames of the noisy signal is                       0.9


implemented by using the Discrete Fourier                                   0.8


Transform (DFT) preceded by a Hamming window.                               0.7

The power spectrum of the noisy signal can be                               0.6

written as:




                                                                Frequency
                                                                            0.5
|Y(k)|2≈|S(k)|2+|D(k)|2              (1.3)
                                                                            0.4
Since the noise spectrum D(k) cannot be directly
obtained, a time-average of the power spectrum Dˆ                           0.3


(k) is calculated during a period of silence.                               0.2

Assuming that noise is uncorrelated with the speech                         0.1

signal, an estimate of the modified speech spectrum                          0
can be given as:                                                                    1     2    3       4
                                                                                                           Time
                                                                                                                  5       6       7       8
                                                                                                                                                 x 10
                                                                                                                                                     9
                                                                                                                                                     4


|Sˆ(k)|2= |Y (k) |2 - | Dˆ (k) |2       (1.5)
                                                                                         Fig 1 Audio Spectra
From Eq. (1.5) it can be seen that the subtraction
process involves the subtraction of an average                  1800
estimate of the noise from the instantaneous speech
                                                                1600
spectrum. Due to the error in computing the noise
spectrum, we may have some negative values in the               1400

modified spectrum. These values are set to zero.                1200
This process is called half-wave rectification.
                                                                1000

                                                                     800
Distortions due to half / full wave rectification
         The modified speech spectrum obtained                       600

from Eq. 1.5 may contain some negative values due                    400
to the errors in the estimated noise spectrum. These
                                                                     200
values are rectified using half wave rectification (set
to zero) or full-wave rectification (set to its absolute                    0
                                                                                0   1     2   3    4          5       6       7       8          9
value). This can lead to further distortions in the                                                                                              4
                                                                                                                                              x 10
resulting time signal.                                                                  Fig 2 Spectral Analysis

Modifications to spectral subtraction                      V. CONCLUSION
          Several variants of the spectral subtraction               The work in this paper addressed the
method originally proposed by Boll [2] have been           problem of enhancing speech in noisy conditions. A
developed to address the problems of the basic             multi-band spectral subtraction method, based on the
technique, especially the presence of musical noise.       direct estimation of the short-term spectral amplitude
Still other methods based on this method have been         of speech and the non-uniform effect of noise on
developed that perform noise suppression in the            speech, was proposed. The results establish the
autocorrelation, cepstral , logarithmic and sub-space      superiority of the proposed method over the
domains. A variety of                                      conventional spectral subtraction method with
pre and post processing methods have also proved to        respect to speech quality of the enhanced signal and
help reduce the presence of musical noise while            reduced residual noise. The major contributions of
minimizing speech distortion.                              this paper are development of a multi-band speech
                                                           enhancement strategy based on the spectral
IV. RESULTS                                                subtraction method. Speech processed by the new
         This result computes an Audio Spectra in          algorithm shows reduced levels of residual noise and
MAT LAB. The fft function computes the FFT of a            good speech quality.
specified signal. In general, either the magnitude or
phase values of the FFT coefficients are find, which       VI                Acknowledgment
are in Matlab can be determined using the abs and                   This author thanks to Principal and
angle functions. A variety of windows can be               Administrative       Officer      of     SVITS
applied to a signal before the computation of the          Indore,Prof.Hamant Chouhan, Prof. Dev Kumar Rai
FFT using the functions hann, hamming, blackman.           thanks to author’s family for their motivation
Time-domain windows can help minimize spectral             support and encouragement.

                                                                                                                              75 | P a g e
Prof. Vijaya Sugandhi / International Journal of Engineering Research and Applications
                   (IJERA)          ISSN: 2248-9622        www.ijera.com
                     Vol. 3, Issue 1, January-February 2013, pp.074-076
Reference
 [1]   Deller, J. R., Hansen, J. H., and Proaxis, J.
       G. Discrete-Time Processing of Speech
       Signals. IEEE Press, 2000.
 [2]   S. Boll, “Suppression of acoustic noise in
       speech using spectral subtraction,” IEEE
       Trans. Acoust., Speech, Signal Process.,
       vol.27, pp. 113-120, Apr. 1979.
 [3]   J. Deller Jr., J. Hansen and J. Proakis,
       “Discrete-Time Processing of Speech
       Signals”, NY: IEEE Press, 2000.
 [4]   H. Levitt, “Noise reduction in hearing aids:
       An overview”, Journal of Rehabilitation
       Research and Development, vol. 38, No. 1,
       January/February 2001.
 [5]   J. Lim and A. Oppenheim, “All-pole
       modeling of degraded speech,” IEEE
       Transactions on Acoustics, Speech and
       Signal Processing, vol. 26, No. 3, pp. 197-
       210, June 1978.

                    Ms.Vijaya sugandhi received
                    B.E.and           M.E.(digital
                    techniques &instrumentation)
                    Degree with in Electrical
                    engineering in 2006 and 2011
                    from Rajiv Gandhi Technical
       University, Bhopal, Madhypradesh, India




                                                                                 76 | P a g e

More Related Content

What's hot

PPT Image Analysis(IRDE, DRDO)
PPT Image Analysis(IRDE, DRDO)PPT Image Analysis(IRDE, DRDO)
PPT Image Analysis(IRDE, DRDO)Nidhi Gopal
 
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...IJORCS
 
Sound Source Localization with microphone arrays
Sound Source Localization with microphone arraysSound Source Localization with microphone arrays
Sound Source Localization with microphone arraysRamin Anushiravani
 
Lecture: Digital Signal Processing Batch 2009
Lecture: Digital Signal Processing Batch 2009Lecture: Digital Signal Processing Batch 2009
Lecture: Digital Signal Processing Batch 2009ubaidis
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement techniqueeSAT Publishing House
 
Wavelet based image compression technique
Wavelet based image compression techniqueWavelet based image compression technique
Wavelet based image compression techniquePriyanka Pachori
 
DIGITAL IMAGE WATERMARKING USING DFT ALGORITHM
DIGITAL IMAGE WATERMARKING USING DFT ALGORITHMDIGITAL IMAGE WATERMARKING USING DFT ALGORITHM
DIGITAL IMAGE WATERMARKING USING DFT ALGORITHMacijjournal
 
Data Compression using Multiple Transformation Techniques for Audio Applicati...
Data Compression using Multiple Transformation Techniques for Audio Applicati...Data Compression using Multiple Transformation Techniques for Audio Applicati...
Data Compression using Multiple Transformation Techniques for Audio Applicati...iosrjce
 
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...CSCJournals
 
Introduction to wavelet transform
Introduction to wavelet transformIntroduction to wavelet transform
Introduction to wavelet transformRaj Endiran
 
Voice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringVoice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringTejus Adiga M
 
Environmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniqueEnvironmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniquePankaj Kumar
 
Wavelets AND counterlets
Wavelets  AND  counterletsWavelets  AND  counterlets
Wavelets AND counterletsAvichal Sharma
 

What's hot (20)

F010334548
F010334548F010334548
F010334548
 
PPT Image Analysis(IRDE, DRDO)
PPT Image Analysis(IRDE, DRDO)PPT Image Analysis(IRDE, DRDO)
PPT Image Analysis(IRDE, DRDO)
 
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
 
Sound Source Localization with microphone arrays
Sound Source Localization with microphone arraysSound Source Localization with microphone arrays
Sound Source Localization with microphone arrays
 
Lecture: Digital Signal Processing Batch 2009
Lecture: Digital Signal Processing Batch 2009Lecture: Digital Signal Processing Batch 2009
Lecture: Digital Signal Processing Batch 2009
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement technique
 
Final presentation
Final presentationFinal presentation
Final presentation
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal Processing
 
Wavelet based image compression technique
Wavelet based image compression techniqueWavelet based image compression technique
Wavelet based image compression technique
 
DIGITAL IMAGE WATERMARKING USING DFT ALGORITHM
DIGITAL IMAGE WATERMARKING USING DFT ALGORITHMDIGITAL IMAGE WATERMARKING USING DFT ALGORITHM
DIGITAL IMAGE WATERMARKING USING DFT ALGORITHM
 
Data Compression using Multiple Transformation Techniques for Audio Applicati...
Data Compression using Multiple Transformation Techniques for Audio Applicati...Data Compression using Multiple Transformation Techniques for Audio Applicati...
Data Compression using Multiple Transformation Techniques for Audio Applicati...
 
Nd2421622165
Nd2421622165Nd2421622165
Nd2421622165
 
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
Real-time DSP Implementation of Audio Crosstalk Cancellation using Mixed Unif...
 
Iy2617051711
Iy2617051711Iy2617051711
Iy2617051711
 
Introduction to wavelet transform
Introduction to wavelet transformIntroduction to wavelet transform
Introduction to wavelet transform
 
Voice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringVoice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency Filtering
 
Speaker recognition.
Speaker recognition.Speaker recognition.
Speaker recognition.
 
Environmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniqueEnvironmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC technique
 
Wavelets AND counterlets
Wavelets  AND  counterletsWavelets  AND  counterlets
Wavelets AND counterlets
 
B046040611
B046040611B046040611
B046040611
 

Viewers also liked (20)

Fr3310111015
Fr3310111015Fr3310111015
Fr3310111015
 
Fn33984987
Fn33984987Fn33984987
Fn33984987
 
G33029037
G33029037G33029037
G33029037
 
Fw3310501057
Fw3310501057Fw3310501057
Fw3310501057
 
Ft3310201031
Ft3310201031Ft3310201031
Ft3310201031
 
Gb3310821084
Gb3310821084Gb3310821084
Gb3310821084
 
Fo33988994
Fo33988994Fo33988994
Fo33988994
 
Fb33925928
Fb33925928Fb33925928
Fb33925928
 
Gc3310851089
Gc3310851089Gc3310851089
Gc3310851089
 
Fh3110611064
Fh3110611064Fh3110611064
Fh3110611064
 
Ga3111671172
Ga3111671172Ga3111671172
Ga3111671172
 
Fi3110651072
Fi3110651072Fi3110651072
Fi3110651072
 
Fp3210491056
Fp3210491056Fp3210491056
Fp3210491056
 
Di32687692
Di32687692Di32687692
Di32687692
 
R34114118
R34114118R34114118
R34114118
 
Ag34200206
Ag34200206Ag34200206
Ag34200206
 
Dv34745751
Dv34745751Dv34745751
Dv34745751
 
Ae35171177
Ae35171177Ae35171177
Ae35171177
 
Aj35198205
Aj35198205Aj35198205
Aj35198205
 
Actividad 3_Impress
Actividad 3_ImpressActividad 3_Impress
Actividad 3_Impress
 

Similar to K31074076

Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing IJECEIAES
 
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...ijsrd.com
 
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...CSCJournals
 
Investigations on the role of analysis window shape parameter in speech enhan...
Investigations on the role of analysis window shape parameter in speech enhan...Investigations on the role of analysis window shape parameter in speech enhan...
Investigations on the role of analysis window shape parameter in speech enhan...karthik annam
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...IRJET Journal
 
Speech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise EnvironmentsSpeech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise Environmentssipij
 
A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdf
A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdfA_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdf
A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdfBala Murugan
 
Speech measurement using laser doppler vibrometer
Speech measurement using laser doppler vibrometerSpeech measurement using laser doppler vibrometer
Speech measurement using laser doppler vibrometerI'am Ajas
 
Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...eSAT Journals
 
IRJET- A Review for Reduction of Noise by Wavelet Transform in Audio Signals
IRJET- A Review for Reduction of Noise by Wavelet Transform in Audio SignalsIRJET- A Review for Reduction of Noise by Wavelet Transform in Audio Signals
IRJET- A Review for Reduction of Noise by Wavelet Transform in Audio SignalsIRJET Journal
 
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...Research: Applying Various DSP-Related Techniques for Robust Recognition of A...
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...Roman Atachiants
 
Artificial Intelligent Algorithm for the Analysis, Quality Speech & Different...
Artificial Intelligent Algorithm for the Analysis, Quality Speech & Different...Artificial Intelligent Algorithm for the Analysis, Quality Speech & Different...
Artificial Intelligent Algorithm for the Analysis, Quality Speech & Different...IRJET Journal
 
Audio Signal Processing
Audio Signal Processing Audio Signal Processing
Audio Signal Processing Ahmed A. Arefin
 
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive InstrumentsA Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive InstrumentsIJMTST Journal
 

Similar to K31074076 (20)

Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
 
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
 
B45010811
B45010811B45010811
B45010811
 
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
 
Investigations on the role of analysis window shape parameter in speech enhan...
Investigations on the role of analysis window shape parameter in speech enhan...Investigations on the role of analysis window shape parameter in speech enhan...
Investigations on the role of analysis window shape parameter in speech enhan...
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
 
Speech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise EnvironmentsSpeech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise Environments
 
A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdf
A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdfA_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdf
A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdf
 
D011132635
D011132635D011132635
D011132635
 
75 78
75 7875 78
75 78
 
75 78
75 7875 78
75 78
 
Speech measurement using laser doppler vibrometer
Speech measurement using laser doppler vibrometerSpeech measurement using laser doppler vibrometer
Speech measurement using laser doppler vibrometer
 
H010234144
H010234144H010234144
H010234144
 
Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...
 
IRJET- A Review for Reduction of Noise by Wavelet Transform in Audio Signals
IRJET- A Review for Reduction of Noise by Wavelet Transform in Audio SignalsIRJET- A Review for Reduction of Noise by Wavelet Transform in Audio Signals
IRJET- A Review for Reduction of Noise by Wavelet Transform in Audio Signals
 
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...Research: Applying Various DSP-Related Techniques for Robust Recognition of A...
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...
 
Artificial Intelligent Algorithm for the Analysis, Quality Speech & Different...
Artificial Intelligent Algorithm for the Analysis, Quality Speech & Different...Artificial Intelligent Algorithm for the Analysis, Quality Speech & Different...
Artificial Intelligent Algorithm for the Analysis, Quality Speech & Different...
 
Audio Signal Processing
Audio Signal Processing Audio Signal Processing
Audio Signal Processing
 
Ky2418521856
Ky2418521856Ky2418521856
Ky2418521856
 
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive InstrumentsA Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
 

K31074076

  • 1. Prof. Vijaya Sugandhi / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 3, Issue 1, January-February 2013, pp.074-076 Spectral Analysis in Speech Processing Techniques Prof. Vijaya Sugandhi Deptt. of Electrical & Elex. Engg. Shri VaishnavSM Institute of Tech. & science,Indore(M.P.) ABSTRACT The corruption of speech due to presence corresponding frequency spectrum. This includes of additive background noise causes severe familiar concepts such as visible light (color), difficulties in various communication musical notes, radio/TV channels, and even the environments. This paper addresses the problem regular rotation of the earth. When these physical of reduction of additive background noise in phenomena are represented in the form of a speech. The proposed approach is a frequency frequency spectrum, certain physical descriptions of dependent speech enhancement method based on their internal processes become much simpler. the proven spectral subtraction method. Most Often, the frequency spectrum clearly implementations and variations of the basic shows harmonics, visible as distinct spikes or lines spectral subtraction technique advocate that provide insight into the mechanisms that subtraction of the noise spectrum estimate over generate the entire signal. the entire speech spectrum. However, real world noise is mostly colored and does not affect the II BACKGROUND speech signal uniformly over the entire spectrum. In the past decades, research in the field of This method provides a greater degree of speech enhancement has focused on the suppression flexibility and control on the noise subtraction of additive background noise [3] [4] [5]. From the levels that reduces artifacts in the enhanced point of view of signal processing, additive noise is speech, resulting in improved speech quality. easier to deal with than convolutive noise or nonlinear disturbances. The ultimate goal of speech Key words: spectrum, noise subtraction. enhancement is to eliminate the additive noise present in speech signal and restore the speech signal I INTRODUCTION to its original form. Several methods have been Speech processing has been a growing and developed as a result of these research efforts. Most dynamic field for more than two decades and there is of these methods have been developed with some or every indication that this growth will continue and the other auditory, perceptual or statistical even accelerate. During this growth there has been a constraints placed on the speech and noise signals. close relationship between the development of new However, in real world situations, it is very difficult algorithms and theoretical results, new filtering to reliably predict the characteristics of the techniques are also of consideration to the success of interfering noise signal or the exact characteristics of speech processing. One of the common adaptive the speech waveform. Hence, in effect, the speech filtering techniques that are applied to speech is the enhancement methods are sub-optimal and can only Wiener filter. reduce the amount of noise in the signal to some A spectrogram is a time-varying spectral extent. Due to the sub-optimal nature of these representation (forming an image) that shows how methods, some of the speech signal can be distorted the spectral density of a signal varies with time. Also during the process. Hence, there is a trade-off known as spectral between distortions in the processed speech and the waterfalls, sonograms, voiceprints, or voicegrams, amount of noise suppressed. spectrograms are used to identify phonetic sounds, to analyses the cries of animals; they were also used in III. PROPOSED METHODOLOGY many other fields The following simplified model for voiced including music, sonar/radar, speech processing, speech production, where the speech signal s(n) is seismology, etc.The instrument that generates a formed as the convolution [1] spectrogram is called a spectrograph. s(n) = e(n)*θ(n), (1.1) where e(n) is the excitation source and θ (n) the The frequency spectrum of a time-domain signal is a vocal tract response. For speech recognition, representation of that signal in the frequency extracting the vocal tract response and discarding the domain. The frequency spectrum can be generated excitation information from the resulting signal is via a Fourier transform of the signal, and the useful, as the information relevant for distinguishing resulting values are usually presented the spoken words is mainly in the vocal tract as amplitude and phase, both plotted response, while the excitation source primarily versus frequency. Any signal that can be represented contains the irrelevant pitch information. as amplitude that varies with time has a 74 | P a g e
  • 2. Prof. Vijaya Sugandhi / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 3, Issue 1, January-February 2013, pp.074-076 If we assume that y(n) , the discrete noisy input artifacts related to signal truncation. The specgram signal, is composed of the clean speech signal s(n) function computes a time-frequency plot of a signal and the uncorrelated additive noise signal d(n) , then where color represents spectral magnitude we can represent it as: amplitude. y(n)=s(n)+d(n) (1.2) audio spectra1 1 Processing is done on a frames-by-frame basis. Analysis of overlapping frames of the noisy signal is 0.9 implemented by using the Discrete Fourier 0.8 Transform (DFT) preceded by a Hamming window. 0.7 The power spectrum of the noisy signal can be 0.6 written as: Frequency 0.5 |Y(k)|2≈|S(k)|2+|D(k)|2 (1.3) 0.4 Since the noise spectrum D(k) cannot be directly obtained, a time-average of the power spectrum Dˆ 0.3 (k) is calculated during a period of silence. 0.2 Assuming that noise is uncorrelated with the speech 0.1 signal, an estimate of the modified speech spectrum 0 can be given as: 1 2 3 4 Time 5 6 7 8 x 10 9 4 |Sˆ(k)|2= |Y (k) |2 - | Dˆ (k) |2 (1.5) Fig 1 Audio Spectra From Eq. (1.5) it can be seen that the subtraction process involves the subtraction of an average 1800 estimate of the noise from the instantaneous speech 1600 spectrum. Due to the error in computing the noise spectrum, we may have some negative values in the 1400 modified spectrum. These values are set to zero. 1200 This process is called half-wave rectification. 1000 800 Distortions due to half / full wave rectification The modified speech spectrum obtained 600 from Eq. 1.5 may contain some negative values due 400 to the errors in the estimated noise spectrum. These 200 values are rectified using half wave rectification (set to zero) or full-wave rectification (set to its absolute 0 0 1 2 3 4 5 6 7 8 9 value). This can lead to further distortions in the 4 x 10 resulting time signal. Fig 2 Spectral Analysis Modifications to spectral subtraction V. CONCLUSION Several variants of the spectral subtraction The work in this paper addressed the method originally proposed by Boll [2] have been problem of enhancing speech in noisy conditions. A developed to address the problems of the basic multi-band spectral subtraction method, based on the technique, especially the presence of musical noise. direct estimation of the short-term spectral amplitude Still other methods based on this method have been of speech and the non-uniform effect of noise on developed that perform noise suppression in the speech, was proposed. The results establish the autocorrelation, cepstral , logarithmic and sub-space superiority of the proposed method over the domains. A variety of conventional spectral subtraction method with pre and post processing methods have also proved to respect to speech quality of the enhanced signal and help reduce the presence of musical noise while reduced residual noise. The major contributions of minimizing speech distortion. this paper are development of a multi-band speech enhancement strategy based on the spectral IV. RESULTS subtraction method. Speech processed by the new This result computes an Audio Spectra in algorithm shows reduced levels of residual noise and MAT LAB. The fft function computes the FFT of a good speech quality. specified signal. In general, either the magnitude or phase values of the FFT coefficients are find, which VI Acknowledgment are in Matlab can be determined using the abs and This author thanks to Principal and angle functions. A variety of windows can be Administrative Officer of SVITS applied to a signal before the computation of the Indore,Prof.Hamant Chouhan, Prof. Dev Kumar Rai FFT using the functions hann, hamming, blackman. thanks to author’s family for their motivation Time-domain windows can help minimize spectral support and encouragement. 75 | P a g e
  • 3. Prof. Vijaya Sugandhi / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 3, Issue 1, January-February 2013, pp.074-076 Reference [1] Deller, J. R., Hansen, J. H., and Proaxis, J. G. Discrete-Time Processing of Speech Signals. IEEE Press, 2000. [2] S. Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust., Speech, Signal Process., vol.27, pp. 113-120, Apr. 1979. [3] J. Deller Jr., J. Hansen and J. Proakis, “Discrete-Time Processing of Speech Signals”, NY: IEEE Press, 2000. [4] H. Levitt, “Noise reduction in hearing aids: An overview”, Journal of Rehabilitation Research and Development, vol. 38, No. 1, January/February 2001. [5] J. Lim and A. Oppenheim, “All-pole modeling of degraded speech,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 26, No. 3, pp. 197- 210, June 1978. Ms.Vijaya sugandhi received B.E.and M.E.(digital techniques &instrumentation) Degree with in Electrical engineering in 2006 and 2011 from Rajiv Gandhi Technical University, Bhopal, Madhypradesh, India 76 | P a g e