SlideShare a Scribd company logo
Y Prasanthi, K Jyothi, GIET / International Journal of Engineering Research and Applications
                           (IJERA) ISSN: 2248-9622 www.ijera.com
                          Vol. 2, Issue4, July-August 2012, pp.2162-2165
           Frequency Depended Spectral Subtraction For Speech
                             Enhancement

                                      Y Prasanthi, K Jyothi, GIET

Abstract:
         The corruption of speech due to presence of         To understand what the speaker is saying. Noise
additive background noise causes severe difficulties in      reduction or speech enhancement algorithms are used to
various communications environments. This paper              suppress such background noise and improve the
addresses the problem of reduction of additive               perceptual quality and intelligibility of speech. Even
background noise in speech. The proposed approach is         though speech is perceptible in a moderately noisy
a frequency dependent speech enhancement method              environment, many applications like mobile
based on the proven spectral subtraction method. Most        communications, speech recognition and aids for the
implementations and variations of the basic spectral         hearing handicapped, to name a few, drive the effort to
subtraction technique advocate subtraction of the noise      build more effective noise reduction algorithms for
spectrum estimate over the entire speech spectrum.           better performance. Over the years engineers have
However real world noise is mostly colored and does          developed a variety of theoretical and relatively
not affect the speech signal uniformly over the entire       effective techniques to combat this issue. However, the
spectrum. In this paper, new spectral subtraction            problem of cleaning noisy speech still poses a challenge
method proposed which takes into account the fact that       to the area of signal processing. Removing various
colored noise affects the speech spectrum differently at     types of noise is difficult due to the random nature of
various frequencies. This method provides a greater          the noise and the inherent complexities of speech.
degree of flexibility and control on the noise subtraction   Noise reduction techniques usually have a trade off
levels that reduces artifacts in the enhanced speech,        between the amount of noise removal and speech
resulting in improved speech quality. This method            distortions introduced due the processing of the speech
outperforms the conventional spectral subtraction            signal. Complexity and ease of implementation of the
method with respect to speech quality and reduced            noise reduction algorithms is also of concern in
musical noise.                                               applications especially those related to portable devices
                                                             such as mobile communications and digital hearing
1. INTRODUCTION                                              aids. The spectral subtraction method is a well-known
          A major part of the interaction between            noise reduction technique. Most implementations and
humans takes place via speech communication.                 variations of the basic technique advocate subtraction
Hence, research in speech and hearing sciences has           of the noise spectrum estimate over the entire speech
been going on for centuries to understand the dynamics       spectrum. However, real world noise is mostly colored
and processes involved in the production and                 and does not affect the speech signal uniformly over the
perception of speech. The field of speech processing is      entire spectrum. In this paper, we propose a multi-band
essentially an application of signal processing              spectral subtraction approach that takes into account the
techniques to acoustic signals using the knowledge           fact that colored noise affects the speech spectrum
offered by researchers in the field of hearing sciences.     differently at various frequencies. This method
The explosive advances in recent years in the field of       outperforms the standard power spectral subtraction
digital computing have provided a tremendous boost to        method resulting in superior speech quality and largely
the field of speech processing. Digital signal processing    reduced musical noise.
techniques are more sophisticated and advanced as
compared to their analog counterparts. Ease and speed        1.2 Short-term Spectral Amplitude Techniques
of representing, storing, retrieving and processing          The short-term spectral amplitude (STSA) of speech
speech data has contributed to the development of            has been exploited successfully in the development of
efficient and effective speech processing techniques to      various speech enhancement algorithms. The basic idea
address the issues related to speech. The presence of        is to use the STSA of the noisy speech input and
background noise in speech significantly reduces the         recover an estimate of the clean STSA by removing the
intelligibility of speech. Degradation of speech severely    part contributed by the additive noise. Expressed as
affects the ability of person, whether impaired or
normal hearing,



                                                                                                     2162 | P a g e
Y Prasanthi, K Jyothi, GIET / International Journal of Engineering Research and Applications
                           (IJERA) ISSN: 2248-9622 www.ijera.com
                          Vol. 2, Issue4, July-August 2012, pp.2162-2165
                                                            values in the enhanced spectrum were floored to the
                                                            noisy spectrum as
                                               (1)
Where S(k) and Y(k) are magnitude spectrum of clean
speech and the noisy speech. Estimated noise
magnitude              is calculated during periods of
speech, α is an over-subtraction factor.                    (3)
A general representation of STSA technique is given in
Figure 1.                                                   Where The spectral floor parameter was set to β=0.001.

                                                            3.     IMPLEMENTATION                               AND
                                                            PERFORMANCE EVALUTION
                                                            3.1 Objective Measures for Performance Evaluation
                                                                      In the evaluation of speech enhancement
                                                            algorithm, it is necessary to identify the similarities and
                                                            differences in perceived quality and subjectively
                                                            measured intelligibility. Speech quality is an indicator
                                                            of the naturalness of the processed speech signal.
                                                            Intelligibility of speech signals is a measure of the
                                                            amount of speech information present in the signal that
                                                            is responsible for conveying what that speaker is
                                                            saying. The interrelationship between perceived speech
Figure 1: Diagrammatic Representation of the short-         and intelligibility is not clearly understood.
time Spectral Magnitude Enhancement System                  Performance evaluation tests can be done by subjective
                                                            quality measures or objective quality measures. While
         The input to the system is the noise-corrupted     subjective measures provide a broad measure of
signal y(n) . While there are many methods for the          performance since a large difference in quality is
analysis-synthesis processing, the Short-term Fourier       necessary to be distinguishable to the listener. Hence, it
Transform (STFT) of the signal with OverLap and Add         becomes difficult to get a reliable measure of changes
(OLA) is the most commonly used method. Spectral            due to algorithm parameters. Objective measures, on
subtraction is a well-known noise reduction method          the other hand, provide a measure that can be easily
based on the STSA estimation technique. The basic           implemented and reliably reproduced.
power spectral subtraction technique, as proposed by                  It is necessary to conduct off-line simulations
Boll, is popular due to its simple underlying concept       to check the validity and feasibility of an algorithm
and its effectiveness in enhancing speech degraded by       before it can be implemented on a real-time system.
additive noise but drawback of the power spectral           Implementation on a workstation permits modifications
subtraction technique is residual noise.                    and changes to the algorithm without constraints of
                                                            time, memory or computational power. The simulations
2. FREQUENCY DEPENDED SPECTRAL                              were carried out on an using Matlab, a technical
SUBTRACTION                                                 computing software. The speech signal is first
          Along with the actual noise suppression           Hamming windowed using a 20-ms window and a 10-
operation, some pre-processing methods are needed to        ms overlap between frames. The windowed speech
achieving good speech quality. In frequency depended        frame is then analyzed using the Fast Fourier Transform
spectral subtraction, the speech spectrum is divided into   (FFT). Smoothing of the magnitude spectrum was
non-overlapping bands, and spectral subtraction is          found to reduce the variance of the speech spectrum
performed independently in each band. The fact that         and contribute to the enhancement in speech quality. A
colored noise affects the speech spectrum differently at    weighted spectral average is taken over preceding and
various frequencies.                                        succeeding frames of data as given by Equation.


                                                                                                             (4)
(2)                                                         Where j is the frame index 0<Wi<1. The averaging is
         Where bi and ei are beginning and ending           done over M preceding and succeeding frames of
frequency bins of the ith frequency band.δi is tweaking     speech. The number of frames M is limited to 2 to
factor that can be individually set for each band to        prevent smearing of the speech spectral content. The
customize the noise removal properties.The negative



                                                                                                     2163 | P a g e
Y Prasanthi, K Jyothi, GIET / International Journal of Engineering Research and Applications
                           (IJERA) ISSN: 2248-9622 www.ijera.com
                          Vol. 2, Issue4, July-August 2012, pp.2162-2165
filter weights Wi were empirically determined and set
to W= [0.09, 0.25, 0.32, 0.25, 0.09]




                                                      (5)

         Where fi is the upper frequency of the ith
band, and Fs is the sampling frequency in Hz. The            Figure 2: Intelligibility Test Results for Seven Subjects
motivation for using smaller δi values for the low           scored on Percentage Words Correct
frequency bands is to minimize speech distortion, since
most of the speech energy is present in the lower
frequencies. Relaxed subtraction is also used for the
high frequency bands. Subtraction is performed over
each band as indicated in equation and the negative
values are rectified using the spectral floor. A small
amount of the original noisy spectrum can be
introduced back into the enhanced spectrum to mask
any remaining musical noise. In this implementation,
5% of the original noisy spectrum within each band is
combined, and the enhanced signal is obtained by
taking the IFFT of the enhanced spectrum using the           Figure 3: signal representation of the sentence ''I am
phase of the original noisy spectrum. Finally, the           David''. The top representation is the original signal, the
standard overlap-and-add method is used to obtain the        middle representation is corrupted signal, and the
enhanced signal.                                             bottom representation is the enhanced signal obtained
                                                             by the frequency depended spectral subtraction
3.2 Subjective Evaluation of Speech Intelligibility          method
          Subjective tests are conducted by having some
human subjects listen to the prepared test speech files
and evaluate based on some criteria. Intelligibility test
were carried out at the Callier Center for
Communication Disorders/UTD on seven hearing-
impaired subjects with severe to profound hearing loss.
Speech enhanced by the MBSS algorithm with four
linearly spaced frequency bands was evaluated against
the noisy speech. The sentences were corrupted using
speech-shaped noise at 0 dB SNR. The noise-corrupted
sentences were played in a random order through              Figure 4: Spectrogram of the sentence ''I am David''.
speakers in a sound insulated booth. The subjects were       The top spectrogram is the original signal, the middle
asked to repeat the sentence they heard. Intelligibility     spectrogram is the corrupted signal, the bottom
was measured in terms of percentage of words correct.        spectrogram is the enhanced signal obtained by
Figure 2 gives the bar plots of the intelligibility scores   frequency depended spectral subtraction method.
achieved by each subject and the average score for both
the test conditions. The results obtained by objective
evaluation are an indicator of the best speech quality       5. CONCLUSIONS
that can be obtained by the different configurations of               The work in this paper addressed the problem
the algorithm. The proposed multi-band spectral              of enhancing speech in noisy conditions. A multi-band
subtraction approach(with number of bands>3)                 spectral subtraction method, based on the direct
performed better than the PSS approach for both SNRs.        estimation of the short-term spectral amplitude of
                                                             speech and the non-uniform effect of noise on speech.
4. RESULTS                                                   The results establish the superiority of the proposed
                                                             method over the conventional spectral subtraction



                                                                                                       2164 | P a g e
Y Prasanthi, K Jyothi, GIET / International Journal of Engineering Research and Applications
                           (IJERA) ISSN: 2248-9622 www.ijera.com
                          Vol. 2, Issue4, July-August 2012, pp.2162-2165
method with respect to speech quality of the enhanced              Processing, vol.7, pp. 2819-2822, Sydney,
signal and reduced residual noise.                                 Australia, Dec.1998.
The major contributions of this paper are                   [11]   C. He and G. Zweig, “Adaptive two-band
(a) Development of a multi-band speech enhancement                 spectral subtraction with multi-window
strategy based on the spectral subtraction method.                 spectral estimation,” ICASSP, vol.2, pp. 793-
Speech processed by the new algorithm shows reduced                796, 1999.
levels of residual noise and good speech quality.           [12]   Y. Hu, M. Bhatnagar and P. Loizou, “A cross-
(b) Proposed a band subtraction factor that provides               correlation technique for enhancing speech
greater control over the subtraction process in each               corrupted with correlated noise,” ICASSP, vol.
band and can be tweaked to minimize speech distortion.             1, pp. 673-676, 2001.
(c) Evaluation of various pre-processing strategies for     [13]     S. Kamath and P. Loizou, “A multi-band
improving the output speech quality. It was shown that             spectral subtraction method for enhancing
spectral smoothing and weighted spectral averaging of              speech corrupted by colored noise,” submitted
the input speech spectrum helped preserve the speech               to ICASSP 2002.
content and improved speech quality.                        [14]   P.     Kasthuri,      “Multichannel      speech
                                                                   enhancement for a digital programmable
Reference/Bibliography                                             hearing aid,” Master thesis, University of New
  [1]   L. Arslan, A. McCree and V. Viswanathan, “                 Mexico, 1999.
        New methods for adaptive noise suppression,”        [15]    W. Kim, S. Kang and H. Ko, “Spectral
        ICASSP, vol.1, pp. 812-815, May 1995.                      subtraction based on phonetic dependency and
  [2]   M. Berouti, R. Schwartz and J. Makhoul,                    masking effects,” Proc. IEEE Vis. Image
        “Enhancement of speech corrupted by acoustic               Signal Procs., vol. 147, No. 5, Oct. 2000.
        noise,” Proc. IEEE Int. Conf. on Acoust.,           [16]   H. Levitt, “Noise reduction in hearing aids: An
        Speech, Signal Procs., pp. 208-211, Apr. 1979.             overview”, Journal of Rehabilitation Research
  [3]   S. Boll, “Suppression of acoustic noise in                 and       Development,        vol.      38,No.1
        speech using spectral subtraction,” IEEE                   January/February 2001.
        Trans. Acoust., Speech, Signal Process.,            [17]   J. Lim and A. Oppenheim, “All-pole modeling
        vol.27, pp. 113-120, Apr. 1979.                            of degraded speech,” IEEE Transactions on
  [4]   Y. Cheng and D. O'Shaughnessy, “Speech                     Acoustics, Speech and Signal Processing, vol.
        enhancement based conceptually on auditory                 26, No. 3, pp. 197-210, June 1978.
        evidence,” ICASSP, vol.2, pp. 961-964, Apr.         [18]   J. Lim and A. Oppenheim, “Enhancement and
        1991.                                                      bandwidth compression of noisy speech,”
  [5] J. Deller Jr., J. Hansen and J. Proakis, “Discrete-          Proc. IEEE, vol. 67, No. 12, pp. 221-239, Dec.
        Time Processing of Speech Signals”, NY:                    1979
        IEEE Press, 2000.
  [6]   Y. Ephraim, “Statistical-model-based speech
        enhancement systems,” Proc. IEEE,80, No.10,
        pp. 1526-1555, Oct.1992.
  [7] Y. Ephraim and D. Malah, “Speech enhancement
        using a minimum mean-square error short-
        term spectral amplitude        estimator,” IEEE
        Trans. on Acoust.,Speech,Signal Proc.,
        vol.ASSP-32, No.6, pp. 1109-1121, Dec.1984.
  [8]   Y. Ephraim and H. Van Trees, “A signal
        subspace approach for speech enhancement,”
        IEEE Trans. Speech Audio Procs., vol. 3, pp.
        251-266, Jul. 1995.
   [9] Z. Goh, K.Tan and T. Tan, “Postprocessing
        method for suppressing musical noise
        generated by spectral subtraction,” IEEE
        Trans. Speech Audio Procs., vol. 6, pp. 287-
        292, May 1998.
  [10] J. Hansen and B. Pellom, “An effective quality
        evaluation protocol for speech enhancements
        algorithms,” Inter. Conf. on Spoken Language




                                                                                                 2165 | P a g e

More Related Content

What's hot

Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189
Editor IJARCET
 
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABA GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
sipij
 
Paper id 28201448
Paper id 28201448Paper id 28201448
Paper id 28201448
IJRAT
 
article_EURASIP
article_EURASIParticle_EURASIP
article_EURASIP
Marie Séveno
 
75 78
75 7875 78
VAW-GAN for disentanglement and recomposition of emotional elements in speech
VAW-GAN for disentanglement and recomposition of emotional elements in speechVAW-GAN for disentanglement and recomposition of emotional elements in speech
VAW-GAN for disentanglement and recomposition of emotional elements in speech
KunZhou18
 
Review Paper on Noise Reduction Using Different Techniques
Review Paper on Noise Reduction Using Different TechniquesReview Paper on Noise Reduction Using Different Techniques
Review Paper on Noise Reduction Using Different Techniques
IRJET Journal
 
IEEE ICASSP 2021
IEEE ICASSP 2021IEEE ICASSP 2021
IEEE ICASSP 2021
KunZhou18
 
Oral Qualification Examination_Kun_Zhou
Oral Qualification Examination_Kun_ZhouOral Qualification Examination_Kun_Zhou
Oral Qualification Examination_Kun_Zhou
KunZhou18
 
Speaker recognition on matlab
Speaker recognition on matlabSpeaker recognition on matlab
Speaker recognition on matlab
Arcanjo Salazaku
 
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
sipij
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Review
ijiert bestjournal
 
G010424248
G010424248G010424248
G010424248
IOSR Journals
 
Dy36749754
Dy36749754Dy36749754
Dy36749754
IJERA Editor
 
Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...
eSAT Journals
 
Speech Signal Analysis
Speech Signal AnalysisSpeech Signal Analysis
Speech Signal Analysis
Pradeep Reddy Guvvala
 
Voice Identification And Recognition System, Matlab
Voice Identification And Recognition System, MatlabVoice Identification And Recognition System, Matlab
Voice Identification And Recognition System, Matlab
Sohaib Tallat
 

What's hot (17)

Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189
 
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABA GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
 
Paper id 28201448
Paper id 28201448Paper id 28201448
Paper id 28201448
 
article_EURASIP
article_EURASIParticle_EURASIP
article_EURASIP
 
75 78
75 7875 78
75 78
 
VAW-GAN for disentanglement and recomposition of emotional elements in speech
VAW-GAN for disentanglement and recomposition of emotional elements in speechVAW-GAN for disentanglement and recomposition of emotional elements in speech
VAW-GAN for disentanglement and recomposition of emotional elements in speech
 
Review Paper on Noise Reduction Using Different Techniques
Review Paper on Noise Reduction Using Different TechniquesReview Paper on Noise Reduction Using Different Techniques
Review Paper on Noise Reduction Using Different Techniques
 
IEEE ICASSP 2021
IEEE ICASSP 2021IEEE ICASSP 2021
IEEE ICASSP 2021
 
Oral Qualification Examination_Kun_Zhou
Oral Qualification Examination_Kun_ZhouOral Qualification Examination_Kun_Zhou
Oral Qualification Examination_Kun_Zhou
 
Speaker recognition on matlab
Speaker recognition on matlabSpeaker recognition on matlab
Speaker recognition on matlab
 
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Review
 
G010424248
G010424248G010424248
G010424248
 
Dy36749754
Dy36749754Dy36749754
Dy36749754
 
Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...
 
Speech Signal Analysis
Speech Signal AnalysisSpeech Signal Analysis
Speech Signal Analysis
 
Voice Identification And Recognition System, Matlab
Voice Identification And Recognition System, MatlabVoice Identification And Recognition System, Matlab
Voice Identification And Recognition System, Matlab
 

Viewers also liked

Humanismo y renacimiento
Humanismo y renacimientoHumanismo y renacimiento
Humanismo y renacimiento
elgranlato09
 
Rockin rio nova versão
Rockin rio nova versãoRockin rio nova versão
Rockin rio nova versão
TiagoChicoRei
 
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
Jansäter Kommunikation AB
 
Derecho a la vivienda
Derecho a la viviendaDerecho a la vivienda
Derecho a la vivienda
IU Badajoz Asamblea Local
 
el reciclaje
el reciclajeel reciclaje
el reciclaje
mcplaa
 
S solar m terrestres
S solar  m terrestresS solar  m terrestres
S solar m terrestres
SEDUC, CAMPECHE
 
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika AldabaLightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
ux singapore
 
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job? Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Stanford GSB Corporate Governance Research Initiative
 

Viewers also liked (8)

Humanismo y renacimiento
Humanismo y renacimientoHumanismo y renacimiento
Humanismo y renacimiento
 
Rockin rio nova versão
Rockin rio nova versãoRockin rio nova versão
Rockin rio nova versão
 
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
 
Derecho a la vivienda
Derecho a la viviendaDerecho a la vivienda
Derecho a la vivienda
 
el reciclaje
el reciclajeel reciclaje
el reciclaje
 
S solar m terrestres
S solar  m terrestresS solar  m terrestres
S solar m terrestres
 
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika AldabaLightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
 
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job? Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
 

Similar to Nd2421622165

75 78
75 7875 78
Speech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise EnvironmentsSpeech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise Environments
sipij
 
F010334548
F010334548F010334548
F010334548
IOSR Journals
 
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
CSCJournals
 
K31074076
K31074076K31074076
K31074076
IJERA Editor
 
Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...
eSAT Journals
 
Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189
Editor IJARCET
 
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
IJECEIAES
 
D011132635
D011132635D011132635
D011132635
IOSR Journals
 
An effective evaluation study of objective measures using spectral subtractiv...
An effective evaluation study of objective measures using spectral subtractiv...An effective evaluation study of objective measures using spectral subtractiv...
An effective evaluation study of objective measures using spectral subtractiv...
eSAT Journals
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement technique
eSAT Publishing House
 
Servey
ServeyServey
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALSA REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
ijiert bestjournal
 
Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...
IRJET Journal
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
IRJET Journal
 
A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function
A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence FunctionA Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function
A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function
VikashSharma325767
 
Приложение Petralex для обучения слабослышащих студентов
Приложение Petralex для обучения слабослышащих студентовПриложение Petralex для обучения слабослышащих студентов
Приложение Petralex для обучения слабослышащих студентов
Danil Dintsis, Ph. D., PgMP
 
Implementation of a Digital Hearing Aid with User-Settable Frequency Response...
Implementation of a Digital Hearing Aid with User-Settable Frequency Response...Implementation of a Digital Hearing Aid with User-Settable Frequency Response...
Implementation of a Digital Hearing Aid with User-Settable Frequency Response...
IAMCP MENTORING
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
Audio Noise Removal – The State of the Art
Audio Noise Removal – The State of the ArtAudio Noise Removal – The State of the Art
Audio Noise Removal – The State of the Art
ijceronline
 

Similar to Nd2421622165 (20)

75 78
75 7875 78
75 78
 
Speech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise EnvironmentsSpeech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise Environments
 
F010334548
F010334548F010334548
F010334548
 
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
 
K31074076
K31074076K31074076
K31074076
 
Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...
 
Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189Volume 2-issue-6-2186-2189
Volume 2-issue-6-2186-2189
 
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
 
D011132635
D011132635D011132635
D011132635
 
An effective evaluation study of objective measures using spectral subtractiv...
An effective evaluation study of objective measures using spectral subtractiv...An effective evaluation study of objective measures using spectral subtractiv...
An effective evaluation study of objective measures using spectral subtractiv...
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement technique
 
Servey
ServeyServey
Servey
 
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALSA REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
 
Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
 
A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function
A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence FunctionA Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function
A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function
 
Приложение Petralex для обучения слабослышащих студентов
Приложение Petralex для обучения слабослышащих студентовПриложение Petralex для обучения слабослышащих студентов
Приложение Petralex для обучения слабослышащих студентов
 
Implementation of a Digital Hearing Aid with User-Settable Frequency Response...
Implementation of a Digital Hearing Aid with User-Settable Frequency Response...Implementation of a Digital Hearing Aid with User-Settable Frequency Response...
Implementation of a Digital Hearing Aid with User-Settable Frequency Response...
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Audio Noise Removal – The State of the Art
Audio Noise Removal – The State of the ArtAudio Noise Removal – The State of the Art
Audio Noise Removal – The State of the Art
 

Nd2421622165

  • 1. Y Prasanthi, K Jyothi, GIET / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue4, July-August 2012, pp.2162-2165 Frequency Depended Spectral Subtraction For Speech Enhancement Y Prasanthi, K Jyothi, GIET Abstract: The corruption of speech due to presence of To understand what the speaker is saying. Noise additive background noise causes severe difficulties in reduction or speech enhancement algorithms are used to various communications environments. This paper suppress such background noise and improve the addresses the problem of reduction of additive perceptual quality and intelligibility of speech. Even background noise in speech. The proposed approach is though speech is perceptible in a moderately noisy a frequency dependent speech enhancement method environment, many applications like mobile based on the proven spectral subtraction method. Most communications, speech recognition and aids for the implementations and variations of the basic spectral hearing handicapped, to name a few, drive the effort to subtraction technique advocate subtraction of the noise build more effective noise reduction algorithms for spectrum estimate over the entire speech spectrum. better performance. Over the years engineers have However real world noise is mostly colored and does developed a variety of theoretical and relatively not affect the speech signal uniformly over the entire effective techniques to combat this issue. However, the spectrum. In this paper, new spectral subtraction problem of cleaning noisy speech still poses a challenge method proposed which takes into account the fact that to the area of signal processing. Removing various colored noise affects the speech spectrum differently at types of noise is difficult due to the random nature of various frequencies. This method provides a greater the noise and the inherent complexities of speech. degree of flexibility and control on the noise subtraction Noise reduction techniques usually have a trade off levels that reduces artifacts in the enhanced speech, between the amount of noise removal and speech resulting in improved speech quality. This method distortions introduced due the processing of the speech outperforms the conventional spectral subtraction signal. Complexity and ease of implementation of the method with respect to speech quality and reduced noise reduction algorithms is also of concern in musical noise. applications especially those related to portable devices such as mobile communications and digital hearing 1. INTRODUCTION aids. The spectral subtraction method is a well-known A major part of the interaction between noise reduction technique. Most implementations and humans takes place via speech communication. variations of the basic technique advocate subtraction Hence, research in speech and hearing sciences has of the noise spectrum estimate over the entire speech been going on for centuries to understand the dynamics spectrum. However, real world noise is mostly colored and processes involved in the production and and does not affect the speech signal uniformly over the perception of speech. The field of speech processing is entire spectrum. In this paper, we propose a multi-band essentially an application of signal processing spectral subtraction approach that takes into account the techniques to acoustic signals using the knowledge fact that colored noise affects the speech spectrum offered by researchers in the field of hearing sciences. differently at various frequencies. This method The explosive advances in recent years in the field of outperforms the standard power spectral subtraction digital computing have provided a tremendous boost to method resulting in superior speech quality and largely the field of speech processing. Digital signal processing reduced musical noise. techniques are more sophisticated and advanced as compared to their analog counterparts. Ease and speed 1.2 Short-term Spectral Amplitude Techniques of representing, storing, retrieving and processing The short-term spectral amplitude (STSA) of speech speech data has contributed to the development of has been exploited successfully in the development of efficient and effective speech processing techniques to various speech enhancement algorithms. The basic idea address the issues related to speech. The presence of is to use the STSA of the noisy speech input and background noise in speech significantly reduces the recover an estimate of the clean STSA by removing the intelligibility of speech. Degradation of speech severely part contributed by the additive noise. Expressed as affects the ability of person, whether impaired or normal hearing, 2162 | P a g e
  • 2. Y Prasanthi, K Jyothi, GIET / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue4, July-August 2012, pp.2162-2165 values in the enhanced spectrum were floored to the noisy spectrum as (1) Where S(k) and Y(k) are magnitude spectrum of clean speech and the noisy speech. Estimated noise magnitude is calculated during periods of speech, α is an over-subtraction factor. (3) A general representation of STSA technique is given in Figure 1. Where The spectral floor parameter was set to β=0.001. 3. IMPLEMENTATION AND PERFORMANCE EVALUTION 3.1 Objective Measures for Performance Evaluation In the evaluation of speech enhancement algorithm, it is necessary to identify the similarities and differences in perceived quality and subjectively measured intelligibility. Speech quality is an indicator of the naturalness of the processed speech signal. Intelligibility of speech signals is a measure of the amount of speech information present in the signal that is responsible for conveying what that speaker is saying. The interrelationship between perceived speech Figure 1: Diagrammatic Representation of the short- and intelligibility is not clearly understood. time Spectral Magnitude Enhancement System Performance evaluation tests can be done by subjective quality measures or objective quality measures. While The input to the system is the noise-corrupted subjective measures provide a broad measure of signal y(n) . While there are many methods for the performance since a large difference in quality is analysis-synthesis processing, the Short-term Fourier necessary to be distinguishable to the listener. Hence, it Transform (STFT) of the signal with OverLap and Add becomes difficult to get a reliable measure of changes (OLA) is the most commonly used method. Spectral due to algorithm parameters. Objective measures, on subtraction is a well-known noise reduction method the other hand, provide a measure that can be easily based on the STSA estimation technique. The basic implemented and reliably reproduced. power spectral subtraction technique, as proposed by It is necessary to conduct off-line simulations Boll, is popular due to its simple underlying concept to check the validity and feasibility of an algorithm and its effectiveness in enhancing speech degraded by before it can be implemented on a real-time system. additive noise but drawback of the power spectral Implementation on a workstation permits modifications subtraction technique is residual noise. and changes to the algorithm without constraints of time, memory or computational power. The simulations 2. FREQUENCY DEPENDED SPECTRAL were carried out on an using Matlab, a technical SUBTRACTION computing software. The speech signal is first Along with the actual noise suppression Hamming windowed using a 20-ms window and a 10- operation, some pre-processing methods are needed to ms overlap between frames. The windowed speech achieving good speech quality. In frequency depended frame is then analyzed using the Fast Fourier Transform spectral subtraction, the speech spectrum is divided into (FFT). Smoothing of the magnitude spectrum was non-overlapping bands, and spectral subtraction is found to reduce the variance of the speech spectrum performed independently in each band. The fact that and contribute to the enhancement in speech quality. A colored noise affects the speech spectrum differently at weighted spectral average is taken over preceding and various frequencies. succeeding frames of data as given by Equation. (4) (2) Where j is the frame index 0<Wi<1. The averaging is Where bi and ei are beginning and ending done over M preceding and succeeding frames of frequency bins of the ith frequency band.δi is tweaking speech. The number of frames M is limited to 2 to factor that can be individually set for each band to prevent smearing of the speech spectral content. The customize the noise removal properties.The negative 2163 | P a g e
  • 3. Y Prasanthi, K Jyothi, GIET / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue4, July-August 2012, pp.2162-2165 filter weights Wi were empirically determined and set to W= [0.09, 0.25, 0.32, 0.25, 0.09] (5) Where fi is the upper frequency of the ith band, and Fs is the sampling frequency in Hz. The Figure 2: Intelligibility Test Results for Seven Subjects motivation for using smaller δi values for the low scored on Percentage Words Correct frequency bands is to minimize speech distortion, since most of the speech energy is present in the lower frequencies. Relaxed subtraction is also used for the high frequency bands. Subtraction is performed over each band as indicated in equation and the negative values are rectified using the spectral floor. A small amount of the original noisy spectrum can be introduced back into the enhanced spectrum to mask any remaining musical noise. In this implementation, 5% of the original noisy spectrum within each band is combined, and the enhanced signal is obtained by taking the IFFT of the enhanced spectrum using the Figure 3: signal representation of the sentence ''I am phase of the original noisy spectrum. Finally, the David''. The top representation is the original signal, the standard overlap-and-add method is used to obtain the middle representation is corrupted signal, and the enhanced signal. bottom representation is the enhanced signal obtained by the frequency depended spectral subtraction 3.2 Subjective Evaluation of Speech Intelligibility method Subjective tests are conducted by having some human subjects listen to the prepared test speech files and evaluate based on some criteria. Intelligibility test were carried out at the Callier Center for Communication Disorders/UTD on seven hearing- impaired subjects with severe to profound hearing loss. Speech enhanced by the MBSS algorithm with four linearly spaced frequency bands was evaluated against the noisy speech. The sentences were corrupted using speech-shaped noise at 0 dB SNR. The noise-corrupted sentences were played in a random order through Figure 4: Spectrogram of the sentence ''I am David''. speakers in a sound insulated booth. The subjects were The top spectrogram is the original signal, the middle asked to repeat the sentence they heard. Intelligibility spectrogram is the corrupted signal, the bottom was measured in terms of percentage of words correct. spectrogram is the enhanced signal obtained by Figure 2 gives the bar plots of the intelligibility scores frequency depended spectral subtraction method. achieved by each subject and the average score for both the test conditions. The results obtained by objective evaluation are an indicator of the best speech quality 5. CONCLUSIONS that can be obtained by the different configurations of The work in this paper addressed the problem the algorithm. The proposed multi-band spectral of enhancing speech in noisy conditions. A multi-band subtraction approach(with number of bands>3) spectral subtraction method, based on the direct performed better than the PSS approach for both SNRs. estimation of the short-term spectral amplitude of speech and the non-uniform effect of noise on speech. 4. RESULTS The results establish the superiority of the proposed method over the conventional spectral subtraction 2164 | P a g e
  • 4. Y Prasanthi, K Jyothi, GIET / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue4, July-August 2012, pp.2162-2165 method with respect to speech quality of the enhanced Processing, vol.7, pp. 2819-2822, Sydney, signal and reduced residual noise. Australia, Dec.1998. The major contributions of this paper are [11] C. He and G. Zweig, “Adaptive two-band (a) Development of a multi-band speech enhancement spectral subtraction with multi-window strategy based on the spectral subtraction method. spectral estimation,” ICASSP, vol.2, pp. 793- Speech processed by the new algorithm shows reduced 796, 1999. levels of residual noise and good speech quality. [12] Y. Hu, M. Bhatnagar and P. Loizou, “A cross- (b) Proposed a band subtraction factor that provides correlation technique for enhancing speech greater control over the subtraction process in each corrupted with correlated noise,” ICASSP, vol. band and can be tweaked to minimize speech distortion. 1, pp. 673-676, 2001. (c) Evaluation of various pre-processing strategies for [13] S. Kamath and P. Loizou, “A multi-band improving the output speech quality. It was shown that spectral subtraction method for enhancing spectral smoothing and weighted spectral averaging of speech corrupted by colored noise,” submitted the input speech spectrum helped preserve the speech to ICASSP 2002. content and improved speech quality. [14] P. Kasthuri, “Multichannel speech enhancement for a digital programmable Reference/Bibliography hearing aid,” Master thesis, University of New [1] L. Arslan, A. McCree and V. Viswanathan, “ Mexico, 1999. New methods for adaptive noise suppression,” [15] W. Kim, S. Kang and H. Ko, “Spectral ICASSP, vol.1, pp. 812-815, May 1995. subtraction based on phonetic dependency and [2] M. Berouti, R. Schwartz and J. Makhoul, masking effects,” Proc. IEEE Vis. Image “Enhancement of speech corrupted by acoustic Signal Procs., vol. 147, No. 5, Oct. 2000. noise,” Proc. IEEE Int. Conf. on Acoust., [16] H. Levitt, “Noise reduction in hearing aids: An Speech, Signal Procs., pp. 208-211, Apr. 1979. overview”, Journal of Rehabilitation Research [3] S. Boll, “Suppression of acoustic noise in and Development, vol. 38,No.1 speech using spectral subtraction,” IEEE January/February 2001. Trans. Acoust., Speech, Signal Process., [17] J. Lim and A. Oppenheim, “All-pole modeling vol.27, pp. 113-120, Apr. 1979. of degraded speech,” IEEE Transactions on [4] Y. Cheng and D. O'Shaughnessy, “Speech Acoustics, Speech and Signal Processing, vol. enhancement based conceptually on auditory 26, No. 3, pp. 197-210, June 1978. evidence,” ICASSP, vol.2, pp. 961-964, Apr. [18] J. Lim and A. Oppenheim, “Enhancement and 1991. bandwidth compression of noisy speech,” [5] J. Deller Jr., J. Hansen and J. Proakis, “Discrete- Proc. IEEE, vol. 67, No. 12, pp. 221-239, Dec. Time Processing of Speech Signals”, NY: 1979 IEEE Press, 2000. [6] Y. Ephraim, “Statistical-model-based speech enhancement systems,” Proc. IEEE,80, No.10, pp. 1526-1555, Oct.1992. [7] Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error short- term spectral amplitude estimator,” IEEE Trans. on Acoust.,Speech,Signal Proc., vol.ASSP-32, No.6, pp. 1109-1121, Dec.1984. [8] Y. Ephraim and H. Van Trees, “A signal subspace approach for speech enhancement,” IEEE Trans. Speech Audio Procs., vol. 3, pp. 251-266, Jul. 1995. [9] Z. Goh, K.Tan and T. Tan, “Postprocessing method for suppressing musical noise generated by spectral subtraction,” IEEE Trans. Speech Audio Procs., vol. 6, pp. 287- 292, May 1998. [10] J. Hansen and B. Pellom, “An effective quality evaluation protocol for speech enhancements algorithms,” Inter. Conf. on Spoken Language 2165 | P a g e