SlideShare a Scribd company logo
1 of 5
Download to read offline
See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/224640875
Subjective Comparison of Speech Enhancement Algorithms
Conference Paper  in  Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on · June 2006
DOI: 10.1109/ICASSP.2006.1659980 · Source: IEEE Xplore
CITATIONS
201
READS
1,607
2 authors, including:
Yi Hu
University of Wisconsin - Milwaukee
47 PUBLICATIONS   4,812 CITATIONS   
SEE PROFILE
All content following this page was uploaded by Yi Hu on 21 May 2014.
The user has requested enhancement of the downloaded file.
SUBJECTIVE COMPARISON OF SPEECH ENHANCEMENT ALGORITHMS
Yi Hu and Philipos C. Loizou ∗
Department of Electrical Engineering
University of Texas at Dallas
Richardson, Texas 75083-0688
{yihuyxy, loizou}@utdallas.edu
ABSTRACT
We report on the development of a noisy speech corpus suit-
able for evaluation of speech enhancement algorithms. This
corpus is used for the subjective evaluation of 13 speech en-
hancement methods encompassing four classes of algorithms:
spectral subtractive, subspace, statistical-model based and Wi-
ener algorithms. The subjective evaluation was performed by
Dynastat, Inc. using the ITU-T P.835 methodology designed
to evaluate the speech quality along three dimensions: sig-
nal distortion, noise distortion and overall quality. This paper
reports the results of the subjective tests.
1. INTRODUCTION
Over the past three decades, various speech enhancement al-
gorithms have been proposed to improve the performance of
modern communication devices in noisy environments. Yet,
it still remains unclear as to which speech enhancement algo-
rithm performs well in real-world listening situations where
the background noise level and characteristics are constantly
changing. Reliable and fair comparison between algorithms
has been elusive for several reasons, including lack of com-
mon speech database for evaluation of new algorithms, differ-
ences in the types of noise used and differences in the testing
methodology. Subjective evaluation of speech enhancement
algorithms is further complicated by the fact that the quality
of enhanced speech has both signal and noise distortion com-
ponents, and it is not clear as to whether listeners base their
quality judgments on the signal distortion, noise distortion or
both. Without having access to a common speech database, it
is nearly impossible for researchers to compare at very least
the objective performance of their algorithms with that of oth-
ers.
In this paper, we report on the development of a noisy
speech corpus (NOIZEUS) suitable for evaluation of speech
enhancement algorithms. This corpus is subsequently used in
a comprehensive subjective evaluation of 13 speech enhance-
ment algorithms encompassing four different classes of algo-
rithms: spectral subtractive, subspace, statistical-model based
∗Research supported in part by NIDCD/NIH.
and Wiener algorithms. The enhanced speech files were sent
to Dynastat, Inc (Austin, TX) for subjective evaluation using
the recently standardized methodology for evaluating noise
suppression algorithms based on ITU-T P.835 [1].
2. NOIZEUS: A NOISY SPEECH CORPUS FOR
EVALUATION OF ENHANCEMENT ALGORITHMS
NOIZEUS1
is a noisy speech corpus recorded in our lab to fa-
cilitate comparison of speech enhancement algorithms among
research groups. The noisy database contains 30 IEEE sen-
tences [2] produced by three male and three female speak-
ers, and was corrupted by eight different real-world noises
at different SNRs. Thirty sentences from the IEEE sentence
database were recorded in a sound-proof booth using Tucker
Davis Technologies (TDT) recording equipment. The sen-
tences were produced by three male and three female speak-
ers (5 sentences/speaker). The IEEE database was used as it
contains phonetically-balanced sentences with relatively low
word-co-ntext predictability. The thirty sentences were se-
lected from the IEEE database so as to include all phonemes
in the American English language. The sentences were orig-
inally sampled at 25 kHz and downsampled to 8 kHz. To
simulate the receiving frequency characteristics of telephone
handsets, the speech and noise signals were filtered by the
modified Intermediate Reference System (IRS) filters used in
ITU-T P.862 for evaluation of the PESQ measure.
Noise was artificially added to the speech signal as fol-
lows. The IRS filter was independently applied to the clean
and noise signals. The active speech level of the filtered clean
speech signal was first determined using the method B of
ITU-T P.56. A noise segment of the same length as the speech
signal was randomly cut out of the noise recordings, appropri-
ately scaled to reach the desired SNR level and finally added
to the filtered clean speech signal. Noise signals were taken
from the AURORA database [3] and included the following
recordings from different places: babble (crowd of people),
car, exhibition hall, restaurant, street, airport, train station,
and train. The noise signals were added to the speech signals
1Available at: http://www.utdallas.edu/˜loizou/speech/noizeus/.
Algorithm Equation/parameters Ref
KLT Eq. 14,48 [8]
pKLT Eq. 34, ν=0.08 [9]
MMSE-SPU Eq. 7,51, q=0.3 [10]
logMMSE Eq. 20 [11]
logMMSE-ne Eq. 20 [11]
logMMSE-SPU Eq. 2,8,10,16 [12]
pMMSE Eq. 12 [13]
RDC Eq. 6,7,10,14,15 [14]
RDC-ne Eq. 6,7,10,14,15 [14]
MB Eq. 4-7 [15]
WavThr Eq. 11,25 [16]
Wiener as Eq. 3-7 [4]
AudSup Eq. 26,38, νb(i)=1,2 iterations [17]
Table 1. List of 13 speech enhancement algorithms evaluated.
SPU=speech presence uncertainty, ne=noise estimation.
at SNRs of 0dB, 5dB, 10dB and 15dB.
3. ALGORITHMS EVALUATED
A total of 13 different speech enhancement methods were
evaluated based on our own implementation. Representative
algorithms from four different cla-sses of enhancement al-
gorithms were chosen: three spectral subtractive algorithms,
two subspace algorithms, three Wiener algorithms2
and five
statistical-model based algorithms. A subset of those algo-
rithms were evaluated with and without noise-estimation al-
gorithms. The parameters used in the implementation of these
algorithms were the same as those published unless stated
otherwise3
. Table 1 shows the list of algorithms evaluated
with the associated parameters and Equations given in the
references. The decision-directed approach was used for es-
timating the a priori SNR in the statistical methods and the
Wiener as method [4] with a=0.98.
The majority of the algorithms utilized a voice activity
detector [5] to update the noise spectrum during the speech-
absent periods. The subspace methods used a different VAD
method [6] with threshold value set to 1.2. To assess the merit
of noise-estimation algorithms [7], two speech-enhancement
algorithms were implemented with both VAD and noise esti-
mation algorithms. These algorithms are indicated in Table 1
with the suffix ‘-ne’.
2The Wiener-type algorithms were grouped separately since these algo-
rithms estimate the complex spectrum while the statistical-model algorithms
estimate the magnitude spectrum in the mean square sense.
3No adjustments were made for algorithms (e.g., [12]) originally designed
for 16 kHz.
4. SUBJECTIVE EVALUATION
To reduce the length and cost of the subjective evaluations,
only a subset of the NOIZEUS corpus was processed by the
13 algorithms and submitted to Dynastat, Inc. for formal sub-
jective evaluation. A total of 20 sentences corrupted in four
background noise environments (car, street, babble and train)
at two levels of SNR (5dB and 10dB) were processed and pre-
sented to 32 listeners for evaluation. These sentences were
spoken by two male speakers and two female speakers.
The subjective tests were designed according to ITU-T
recommendation P.835. The P.835 methodology was designed
to reduce the listener’s uncertainty in a subjective test as to
which component(s) of a noisy speech signal, i.e., the speech
signal, the background noise, or both, should form the basis of
their ratings of overall quality. This method instructs the lis-
tener to successively attend to and rate the enhanced speech
signal on: a) the speech signal alone using a scale of signal
distortion (SIG) - [1= very unnatural, 5=very natural], b) the
background noise alone using a scale of background conspic-
uous/intrusiveness (BAK) - [1=very conspicuous, very intru-
sive, 5=not noticeable], c) the overall effect using the scale of
the Mean Opinion Score (OVRL) - [1=bad, 5=excellent].
The process of rating the signal and background of noisy
speech was designed to lead the listener to integrate the ef-
fects of both the signal and the background in making their
ratings of overall quality. Each trial in a P.835 test involves a
triad of speech samples – three samples of the test condition
where each sample is a short segment of speech recorded in
background noise, e.g., a single sentence. For each sample
within the triad, listeners successively used one of the three
five-point rating scales, SIG, BAK, and OVRL, to register
their judgments of the quality of the test condition. In addi-
tion to the experimental conditions, each experiment included
a number of reference conditions designed to independently
vary the listener’s SIG, BAK, and OVRL ratings over the en-
tire five-point range of the rating scales. More details about
the testing methodology can be found in [18]. The figures
show the mean scores for SIG, BAK, and OVRL scales for the
13 methods evaluated. The mean scores for the noisy speech
(unprocessed) files are also shown for reference.
5. DISCUSSION AND CONCLUSIONS
Of the two subspace algorithms examined, the generalized
subspace approach [8] performed consistently better in OVRL
scale across all SNR conditions and four types of noise. The
performance of these two methods was distinctively different
in +5dB car noise. Lower signal distortion (i.e., higher SIG
scores) were observed with the generalized subspace method
in most conditions. Of the five statistical-model based algo-
rithms examined, the log-MMSE and the perceptually moti-
vated MMSE (pM-MSE) algorithms performed the best. Per-
formance of the pMMSE algorithm was comparable to that
of the MMSE algorithm which incorporated speech-presence
uncertainty (the pMMSE algorithm did not). Lower noise
distortion (i.e., high-er BAK scores) was obtained with the
pMMSE method in several conditions (5dB train, 5dB car,
10dB street). It was surprising to see that the noise-estimation
algorithm [7] did not provide significant improvements to the
performance of the log-MMSE algorithm (small improvements
were noted only in street noise). Incorporating speech-presence
uncertainty as per [12] did not improve the performance of the
log-MMSE algorithm. In fact, it degraded performance. Of
the two spectral-subtractive algorithms tested, the multi-band
spectral subtraction algorithm [15] performed consistently the
best across all conditions. Incorporating a noise-estimation
algorithm did not improve the performance of the reduced-
delay spectral subtraction algorithm. One possible explana-
tion for that is that the speech files were too brief in duration
to observe the real benefit of noise-estimation algorithms. Fi-
nally, of the three Wiener filtering type of algorithms, the
method proposed in [4] based on a priori SNR, performed
the best. This method also produced consistently the low-
est signal distortion comparable to the statistical-model based
methods. It did, however, suffer from high noise distortion.
Overall, the statistical-model based methods performed
the best across all conditions, followed by the multi-band spec-
tral subtraction method [15].
6. REFERENCES
[1] ITU-T P.835, Subjective test methodology for evalu-
ating speech communication systems that include noise
suppression algorithms, ITU-T Recommendation P.835,
2003.
[2] IEEE Subcommittee, “IEEE recommended practice for
speech quality measurements,” IEEE Trans. Audio and
Electroacoustics, pp. 225–246, 1969.
[3] H. Hirsch and D. Pearce, “The AURORA experimen-
tal framework for the performance evaluation of speech
recognition systems under noisy conditions,” in ISCA
ITRW ASR2000, Sept. 2000, Paris, France.
[4] P. Scalart and J. Filho, “Speech enhancement based
on a priori signal to noise estimation,” in Proc. IEEE
Int. Conf. Acoust., Speech, Signal Processing, 1996, pp.
629–632.
[5] J. Sohn, N. Kim, and W. Sung, “A statistical model-
based voice activity detection,” IEEE Signal Processing
Letters, pp. 1–3, Jan. 1999.
[6] U. Mittal and N. Phamdo, “Signal/noise KLT based
approach for enhancing speech degraded by colored
noise,” IEEE Trans. Speech Audio Proc., vol. 8, pp.
159–167, Mar. 2000.
[7] S. Rangachari and P. C. Loizou, “A noise estima-
tion algorithm for highly non-stationary environments,”
Speech Communication, vol. 28, pp. 220–231, Feb.
2006.
[8] Y. Hu and P. C. Loizou, “A generalized subspace
approach for enhancing speech corrupted by colored
noise,” IEEE Trans. Speech Audio Proc., pp. 334–341,
July 2003.
[9] F. Jabloun and B. Champagne, “Incorporating the hu-
man hearing properties in the signal subspace approach
for speech enhancement,” IEEE Trans. Speech Audio
Proc., vol. 11, pp. 700–708, 2003.
[10] Y. Ephraim and D. Malah, “Speech enhancement using
a minimum mean-square error short-time spectral am-
plitude estimator,” IEEE Trans. Acoustics, Speech and
Signal Processing, vol. ASSP-32, pp. 1109–1121, 1984.
[11] Y. Ephraim and D. Malah, “Speech enhancement using a
minimum mean-square error log-spectral amplitude es-
timator,” IEEE Trans. Acoustics, Speech and Signal Pro-
cessing, vol. ASSP-33, pp. 443–445, 1985.
[12] I. Cohen, “Optimal speech enhancement under signal
presence uncertainty using log-spectral amplitude esti-
mator,” IEEE Signal Processing Letters, vol. 9, pp. 113–
116, Apr. 2002.
[13] P. C. Loizou, “Speech enhancement based on percep-
tually motivated Bayesian estimators of the magnitude
spectrum,” IEEE Trans. Speech Audio Proc., pp. 857–
869, Sept. 2005.
[14] H. Gustafsson, S. Nordholm, and I. Claesson, “Spectral
subtraction using reduced delay convolution and adap-
tive averaging,” IEEE Trans. Speech Audio Proc., pp.
799–807, 2001.
[15] S. Kamath and P. C. Loizou, “A multi-band spectral
subtraction method for enhancing speech corrupted by
colored noise,” in Proc. IEEE Int. Conf. Acoust., Speech,
Signal Processing, 2002.
[16] Y. Hu and P. C. Loizou, “Speech enhancement based
on wavelet thresholding the multitaper spectrum,” IEEE
Trans. Speech Audio Proc., pp. 59–67, Jan. 2004.
[17] D. E. Tsoukalas, J. N. Mourjopoulos, and G. Kokki-
nakis, “Speech enhancement based on audible noise
suppression,” IEEE Trans. Speech Audio Proc., vol. 5,
pp. 479–514, Nov. 1997.
[18] Y. Hu and P. C. Loizou, “Subjective evaluation and com-
parison of speech enhancement algorithms,” submitted
to Speech Communication.
5dB babble noise
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
KLT
pKLT
MMSE-SPU
logMMSE
logMMSE-ne
logMMSE-SPU
pMMSE
RDC
RDC_ne
MB
WavThr
Wiener_as
AudSup
noisy
SIG
BAK
OVRL
Subspace
algorithms
Statistical-
model based
Spectral
subtractive
Wiener-type
algorithms
10dB babble
noise
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
KLT
pKLT
MMSE-SPU
logMMSE
logMMSE-ne
logMMSE-SPU
pMMSE
RDC
RDC_ne
MB
WavThr
Wiener_as
AudSup
noisy
SIG
BAK
OVRL
Subspace
algorithms
Statistical-
model based
Spectral
subtractive
Wiener-type
algorithms
5dB car noise
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
KLT
pKLT
MMSE-SPU
logMMSE
logMMSE-ne
logMMSE-SPU
pMMSE
RDC
RDC_ne
MB
WavThr
Wiener_as
AudSup
noisy
SIG
BAK
OVRL
Subspace
algorithms
Statistical-
model based
Spectral
subtractive
Wiener-type
algorithms
10dB car noise
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
KLT
pKLT
MMSE-SPU
logMMSE
logMMSE-ne
logMMSE-SPU
pMMSE
RDC
RDC_ne
MB
WavThr
Wiener_as
AudSup
noisy
SIG
BAK
OVRL
Subspace
algorithms
Statistical-
model based
Spectral
subtractive
Wiener-type
algorithms
5dB street noise
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
KLT
pKLT
MMSE-SPU
logMMSE
logMMSE-ne
logMMSE-SPU
pMMSE
RDC
RDC_ne
MB
WavThr
Wiener_as
AudSup
noisy
SIG
BAK
OVRL
Subspace
algorithms
Statistical-
model based
Spectral
subtractive
Wiener-type
algorithms
10dB street noise
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
KLT
pKLT
MMSE-SPU
logMMSE
logMMSE-ne
logMMSE-SPU
pMMSE
RDC
RDC_ne
MB
WavThr
Wiener_as
AudSup
noisy
SIG
BAK
OVRL
Subspace
algorithms
Statistical-
model based
Spectral
subtractive
Wiener-type
algorithms
5 dB Train
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
KLT
pKLT
MMSE-SPU
logMMSE
logMMSE-ne
logMMSE-SPU
pMMSE
RDC
RDC_ne
MB
WavThr
Wiener_as
AudSup
noisy
SIG
BAK
OVRL
Subspace
algorithms
Statistical-
model based
Spectral
subtractive
Wiener-type
algorithms
10 dB Train
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
KLT
pKLT
MMSE-SPU
logMMSE
logMMSE-ne
logMMSE-SPU
pMMSE
RDC
RDC_ne
MB
WavThr
Wiener_as
AudSup
noisy
SIG
BAK
OVRL
Subspace
algorithms
Statistical-
model based
Spectral
subtractive
Wiener-type
algorithms
View publication stats
View publication stats

More Related Content

What's hot

PERFORMANCE ANALYIS OF LMS ADAPTIVE FIR FILTER AND RLS ADAPTIVE FIR FILTER FO...
PERFORMANCE ANALYIS OF LMS ADAPTIVE FIR FILTER AND RLS ADAPTIVE FIR FILTER FO...PERFORMANCE ANALYIS OF LMS ADAPTIVE FIR FILTER AND RLS ADAPTIVE FIR FILTER FO...
PERFORMANCE ANALYIS OF LMS ADAPTIVE FIR FILTER AND RLS ADAPTIVE FIR FILTER FO...sipij
 
Acoustic echo cancellation using nlms adaptive algorithm ranbeer
Acoustic echo cancellation using nlms adaptive algorithm ranbeerAcoustic echo cancellation using nlms adaptive algorithm ranbeer
Acoustic echo cancellation using nlms adaptive algorithm ranbeerRanbeer Tyagi
 
Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...
Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...
Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...iosrjce
 
Adaptive Noise Cancellation
Adaptive Noise CancellationAdaptive Noise Cancellation
Adaptive Noise Cancellationtazim68
 
Acoustic echo cancellation
Acoustic echo cancellationAcoustic echo cancellation
Acoustic echo cancellationchintanajoshi
 
Low power vlsi implementation adaptive noise cancellor based on least means s...
Low power vlsi implementation adaptive noise cancellor based on least means s...Low power vlsi implementation adaptive noise cancellor based on least means s...
Low power vlsi implementation adaptive noise cancellor based on least means s...shaik chand basha
 
Implementation Adaptive Noise Canceler
Implementation Adaptive Noise Canceler Implementation Adaptive Noise Canceler
Implementation Adaptive Noise Canceler Akshatha suresh
 
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...Raj Kumar Thenua
 
Echo Cancellation Algorithms using Adaptive Filters: A Comparative Study
Echo Cancellation Algorithms using Adaptive Filters: A Comparative StudyEcho Cancellation Algorithms using Adaptive Filters: A Comparative Study
Echo Cancellation Algorithms using Adaptive Filters: A Comparative Studyidescitation
 
FPGA IMPLEMENTATION OF NOISE CANCELLATION USING ADAPTIVE ALGORITHMS
FPGA IMPLEMENTATION OF NOISE CANCELLATION USING ADAPTIVE ALGORITHMSFPGA IMPLEMENTATION OF NOISE CANCELLATION USING ADAPTIVE ALGORITHMS
FPGA IMPLEMENTATION OF NOISE CANCELLATION USING ADAPTIVE ALGORITHMSEditor IJMTER
 
Basics of Digital Filters
Basics of Digital FiltersBasics of Digital Filters
Basics of Digital Filtersop205
 
Adaptive equalization
Adaptive equalizationAdaptive equalization
Adaptive equalizationKamal Bhatt
 

What's hot (20)

PERFORMANCE ANALYIS OF LMS ADAPTIVE FIR FILTER AND RLS ADAPTIVE FIR FILTER FO...
PERFORMANCE ANALYIS OF LMS ADAPTIVE FIR FILTER AND RLS ADAPTIVE FIR FILTER FO...PERFORMANCE ANALYIS OF LMS ADAPTIVE FIR FILTER AND RLS ADAPTIVE FIR FILTER FO...
PERFORMANCE ANALYIS OF LMS ADAPTIVE FIR FILTER AND RLS ADAPTIVE FIR FILTER FO...
 
Adaptive filter
Adaptive filterAdaptive filter
Adaptive filter
 
Introduction to Adaptive filters
Introduction to Adaptive filtersIntroduction to Adaptive filters
Introduction to Adaptive filters
 
Acoustic echo cancellation using nlms adaptive algorithm ranbeer
Acoustic echo cancellation using nlms adaptive algorithm ranbeerAcoustic echo cancellation using nlms adaptive algorithm ranbeer
Acoustic echo cancellation using nlms adaptive algorithm ranbeer
 
Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...
Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...
Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...
 
Adaptive filters
Adaptive filtersAdaptive filters
Adaptive filters
 
Adaptive Noise Cancellation
Adaptive Noise CancellationAdaptive Noise Cancellation
Adaptive Noise Cancellation
 
Acoustic echo cancellation
Acoustic echo cancellationAcoustic echo cancellation
Acoustic echo cancellation
 
Low power vlsi implementation adaptive noise cancellor based on least means s...
Low power vlsi implementation adaptive noise cancellor based on least means s...Low power vlsi implementation adaptive noise cancellor based on least means s...
Low power vlsi implementation adaptive noise cancellor based on least means s...
 
Av 738 - Adaptive Filtering Lecture 1 - Introduction
Av 738 - Adaptive Filtering Lecture 1 - IntroductionAv 738 - Adaptive Filtering Lecture 1 - Introduction
Av 738 - Adaptive Filtering Lecture 1 - Introduction
 
Implementation Adaptive Noise Canceler
Implementation Adaptive Noise Canceler Implementation Adaptive Noise Canceler
Implementation Adaptive Noise Canceler
 
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
Simulation and hardware implementation of Adaptive algorithms on tms320 c6713...
 
Dsp ppt madhuri.anudeep
Dsp ppt madhuri.anudeepDsp ppt madhuri.anudeep
Dsp ppt madhuri.anudeep
 
ADAPTIVE NOISE CANCELLATION
ADAPTIVE NOISE CANCELLATIONADAPTIVE NOISE CANCELLATION
ADAPTIVE NOISE CANCELLATION
 
Adaptive equalization
Adaptive equalizationAdaptive equalization
Adaptive equalization
 
Echo Cancellation Algorithms using Adaptive Filters: A Comparative Study
Echo Cancellation Algorithms using Adaptive Filters: A Comparative StudyEcho Cancellation Algorithms using Adaptive Filters: A Comparative Study
Echo Cancellation Algorithms using Adaptive Filters: A Comparative Study
 
FPGA IMPLEMENTATION OF NOISE CANCELLATION USING ADAPTIVE ALGORITHMS
FPGA IMPLEMENTATION OF NOISE CANCELLATION USING ADAPTIVE ALGORITHMSFPGA IMPLEMENTATION OF NOISE CANCELLATION USING ADAPTIVE ALGORITHMS
FPGA IMPLEMENTATION OF NOISE CANCELLATION USING ADAPTIVE ALGORITHMS
 
Basics of Digital Filters
Basics of Digital FiltersBasics of Digital Filters
Basics of Digital Filters
 
Adaptive equalization
Adaptive equalizationAdaptive equalization
Adaptive equalization
 
Multrate dsp
Multrate dspMultrate dsp
Multrate dsp
 

Similar to Subjective comparison of_speech_enhancement_algori (1)

A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...IRJET Journal
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Reviewijiert bestjournal
 
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing IJECEIAES
 
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...IJERA Editor
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement techniqueeSAT Publishing House
 
Adaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancementAdaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancementHarshal Ladhe
 
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...IJMER
 
Text independent speaker identification system using average pitch and forman...
Text independent speaker identification system using average pitch and forman...Text independent speaker identification system using average pitch and forman...
Text independent speaker identification system using average pitch and forman...ijitjournal
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniqueseSAT Journals
 
3. speech processing algorithms for perception improvement of hearing impaire...
3. speech processing algorithms for perception improvement of hearing impaire...3. speech processing algorithms for perception improvement of hearing impaire...
3. speech processing algorithms for perception improvement of hearing impaire...k srikanth
 
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundWavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundTELKOMNIKA JOURNAL
 
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...April Smith
 
Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...karthik annam
 
Ieee transactions on 2018 TOPICS with Abstract in audio, speech, and language...
Ieee transactions on 2018 TOPICS with Abstract in audio, speech, and language...Ieee transactions on 2018 TOPICS with Abstract in audio, speech, and language...
Ieee transactions on 2018 TOPICS with Abstract in audio, speech, and language...tsysglobalsolutions
 
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...sipij
 

Similar to Subjective comparison of_speech_enhancement_algori (1) (20)

A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Review
 
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
 
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement technique
 
Adaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancementAdaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancement
 
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
 
Text independent speaker identification system using average pitch and forman...
Text independent speaker identification system using average pitch and forman...Text independent speaker identification system using average pitch and forman...
Text independent speaker identification system using average pitch and forman...
 
19 ijcse-01227
19 ijcse-0122719 ijcse-01227
19 ijcse-01227
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniques
 
3. speech processing algorithms for perception improvement of hearing impaire...
3. speech processing algorithms for perception improvement of hearing impaire...3. speech processing algorithms for perception improvement of hearing impaire...
3. speech processing algorithms for perception improvement of hearing impaire...
 
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundWavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
 
Bz33462466
Bz33462466Bz33462466
Bz33462466
 
Bz33462466
Bz33462466Bz33462466
Bz33462466
 
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...
 
Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...
 
Ieee transactions on 2018 TOPICS with Abstract in audio, speech, and language...
Ieee transactions on 2018 TOPICS with Abstract in audio, speech, and language...Ieee transactions on 2018 TOPICS with Abstract in audio, speech, and language...
Ieee transactions on 2018 TOPICS with Abstract in audio, speech, and language...
 
1801 1805
1801 18051801 1805
1801 1805
 
1801 1805
1801 18051801 1805
1801 1805
 
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
 

Recently uploaded

Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 

Recently uploaded (20)

Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 

Subjective comparison of_speech_enhancement_algori (1)

  • 1. See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/224640875 Subjective Comparison of Speech Enhancement Algorithms Conference Paper  in  Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on · June 2006 DOI: 10.1109/ICASSP.2006.1659980 · Source: IEEE Xplore CITATIONS 201 READS 1,607 2 authors, including: Yi Hu University of Wisconsin - Milwaukee 47 PUBLICATIONS   4,812 CITATIONS    SEE PROFILE All content following this page was uploaded by Yi Hu on 21 May 2014. The user has requested enhancement of the downloaded file.
  • 2. SUBJECTIVE COMPARISON OF SPEECH ENHANCEMENT ALGORITHMS Yi Hu and Philipos C. Loizou ∗ Department of Electrical Engineering University of Texas at Dallas Richardson, Texas 75083-0688 {yihuyxy, loizou}@utdallas.edu ABSTRACT We report on the development of a noisy speech corpus suit- able for evaluation of speech enhancement algorithms. This corpus is used for the subjective evaluation of 13 speech en- hancement methods encompassing four classes of algorithms: spectral subtractive, subspace, statistical-model based and Wi- ener algorithms. The subjective evaluation was performed by Dynastat, Inc. using the ITU-T P.835 methodology designed to evaluate the speech quality along three dimensions: sig- nal distortion, noise distortion and overall quality. This paper reports the results of the subjective tests. 1. INTRODUCTION Over the past three decades, various speech enhancement al- gorithms have been proposed to improve the performance of modern communication devices in noisy environments. Yet, it still remains unclear as to which speech enhancement algo- rithm performs well in real-world listening situations where the background noise level and characteristics are constantly changing. Reliable and fair comparison between algorithms has been elusive for several reasons, including lack of com- mon speech database for evaluation of new algorithms, differ- ences in the types of noise used and differences in the testing methodology. Subjective evaluation of speech enhancement algorithms is further complicated by the fact that the quality of enhanced speech has both signal and noise distortion com- ponents, and it is not clear as to whether listeners base their quality judgments on the signal distortion, noise distortion or both. Without having access to a common speech database, it is nearly impossible for researchers to compare at very least the objective performance of their algorithms with that of oth- ers. In this paper, we report on the development of a noisy speech corpus (NOIZEUS) suitable for evaluation of speech enhancement algorithms. This corpus is subsequently used in a comprehensive subjective evaluation of 13 speech enhance- ment algorithms encompassing four different classes of algo- rithms: spectral subtractive, subspace, statistical-model based ∗Research supported in part by NIDCD/NIH. and Wiener algorithms. The enhanced speech files were sent to Dynastat, Inc (Austin, TX) for subjective evaluation using the recently standardized methodology for evaluating noise suppression algorithms based on ITU-T P.835 [1]. 2. NOIZEUS: A NOISY SPEECH CORPUS FOR EVALUATION OF ENHANCEMENT ALGORITHMS NOIZEUS1 is a noisy speech corpus recorded in our lab to fa- cilitate comparison of speech enhancement algorithms among research groups. The noisy database contains 30 IEEE sen- tences [2] produced by three male and three female speak- ers, and was corrupted by eight different real-world noises at different SNRs. Thirty sentences from the IEEE sentence database were recorded in a sound-proof booth using Tucker Davis Technologies (TDT) recording equipment. The sen- tences were produced by three male and three female speak- ers (5 sentences/speaker). The IEEE database was used as it contains phonetically-balanced sentences with relatively low word-co-ntext predictability. The thirty sentences were se- lected from the IEEE database so as to include all phonemes in the American English language. The sentences were orig- inally sampled at 25 kHz and downsampled to 8 kHz. To simulate the receiving frequency characteristics of telephone handsets, the speech and noise signals were filtered by the modified Intermediate Reference System (IRS) filters used in ITU-T P.862 for evaluation of the PESQ measure. Noise was artificially added to the speech signal as fol- lows. The IRS filter was independently applied to the clean and noise signals. The active speech level of the filtered clean speech signal was first determined using the method B of ITU-T P.56. A noise segment of the same length as the speech signal was randomly cut out of the noise recordings, appropri- ately scaled to reach the desired SNR level and finally added to the filtered clean speech signal. Noise signals were taken from the AURORA database [3] and included the following recordings from different places: babble (crowd of people), car, exhibition hall, restaurant, street, airport, train station, and train. The noise signals were added to the speech signals 1Available at: http://www.utdallas.edu/˜loizou/speech/noizeus/.
  • 3. Algorithm Equation/parameters Ref KLT Eq. 14,48 [8] pKLT Eq. 34, ν=0.08 [9] MMSE-SPU Eq. 7,51, q=0.3 [10] logMMSE Eq. 20 [11] logMMSE-ne Eq. 20 [11] logMMSE-SPU Eq. 2,8,10,16 [12] pMMSE Eq. 12 [13] RDC Eq. 6,7,10,14,15 [14] RDC-ne Eq. 6,7,10,14,15 [14] MB Eq. 4-7 [15] WavThr Eq. 11,25 [16] Wiener as Eq. 3-7 [4] AudSup Eq. 26,38, νb(i)=1,2 iterations [17] Table 1. List of 13 speech enhancement algorithms evaluated. SPU=speech presence uncertainty, ne=noise estimation. at SNRs of 0dB, 5dB, 10dB and 15dB. 3. ALGORITHMS EVALUATED A total of 13 different speech enhancement methods were evaluated based on our own implementation. Representative algorithms from four different cla-sses of enhancement al- gorithms were chosen: three spectral subtractive algorithms, two subspace algorithms, three Wiener algorithms2 and five statistical-model based algorithms. A subset of those algo- rithms were evaluated with and without noise-estimation al- gorithms. The parameters used in the implementation of these algorithms were the same as those published unless stated otherwise3 . Table 1 shows the list of algorithms evaluated with the associated parameters and Equations given in the references. The decision-directed approach was used for es- timating the a priori SNR in the statistical methods and the Wiener as method [4] with a=0.98. The majority of the algorithms utilized a voice activity detector [5] to update the noise spectrum during the speech- absent periods. The subspace methods used a different VAD method [6] with threshold value set to 1.2. To assess the merit of noise-estimation algorithms [7], two speech-enhancement algorithms were implemented with both VAD and noise esti- mation algorithms. These algorithms are indicated in Table 1 with the suffix ‘-ne’. 2The Wiener-type algorithms were grouped separately since these algo- rithms estimate the complex spectrum while the statistical-model algorithms estimate the magnitude spectrum in the mean square sense. 3No adjustments were made for algorithms (e.g., [12]) originally designed for 16 kHz. 4. SUBJECTIVE EVALUATION To reduce the length and cost of the subjective evaluations, only a subset of the NOIZEUS corpus was processed by the 13 algorithms and submitted to Dynastat, Inc. for formal sub- jective evaluation. A total of 20 sentences corrupted in four background noise environments (car, street, babble and train) at two levels of SNR (5dB and 10dB) were processed and pre- sented to 32 listeners for evaluation. These sentences were spoken by two male speakers and two female speakers. The subjective tests were designed according to ITU-T recommendation P.835. The P.835 methodology was designed to reduce the listener’s uncertainty in a subjective test as to which component(s) of a noisy speech signal, i.e., the speech signal, the background noise, or both, should form the basis of their ratings of overall quality. This method instructs the lis- tener to successively attend to and rate the enhanced speech signal on: a) the speech signal alone using a scale of signal distortion (SIG) - [1= very unnatural, 5=very natural], b) the background noise alone using a scale of background conspic- uous/intrusiveness (BAK) - [1=very conspicuous, very intru- sive, 5=not noticeable], c) the overall effect using the scale of the Mean Opinion Score (OVRL) - [1=bad, 5=excellent]. The process of rating the signal and background of noisy speech was designed to lead the listener to integrate the ef- fects of both the signal and the background in making their ratings of overall quality. Each trial in a P.835 test involves a triad of speech samples – three samples of the test condition where each sample is a short segment of speech recorded in background noise, e.g., a single sentence. For each sample within the triad, listeners successively used one of the three five-point rating scales, SIG, BAK, and OVRL, to register their judgments of the quality of the test condition. In addi- tion to the experimental conditions, each experiment included a number of reference conditions designed to independently vary the listener’s SIG, BAK, and OVRL ratings over the en- tire five-point range of the rating scales. More details about the testing methodology can be found in [18]. The figures show the mean scores for SIG, BAK, and OVRL scales for the 13 methods evaluated. The mean scores for the noisy speech (unprocessed) files are also shown for reference. 5. DISCUSSION AND CONCLUSIONS Of the two subspace algorithms examined, the generalized subspace approach [8] performed consistently better in OVRL scale across all SNR conditions and four types of noise. The performance of these two methods was distinctively different in +5dB car noise. Lower signal distortion (i.e., higher SIG scores) were observed with the generalized subspace method in most conditions. Of the five statistical-model based algo- rithms examined, the log-MMSE and the perceptually moti- vated MMSE (pM-MSE) algorithms performed the best. Per- formance of the pMMSE algorithm was comparable to that
  • 4. of the MMSE algorithm which incorporated speech-presence uncertainty (the pMMSE algorithm did not). Lower noise distortion (i.e., high-er BAK scores) was obtained with the pMMSE method in several conditions (5dB train, 5dB car, 10dB street). It was surprising to see that the noise-estimation algorithm [7] did not provide significant improvements to the performance of the log-MMSE algorithm (small improvements were noted only in street noise). Incorporating speech-presence uncertainty as per [12] did not improve the performance of the log-MMSE algorithm. In fact, it degraded performance. Of the two spectral-subtractive algorithms tested, the multi-band spectral subtraction algorithm [15] performed consistently the best across all conditions. Incorporating a noise-estimation algorithm did not improve the performance of the reduced- delay spectral subtraction algorithm. One possible explana- tion for that is that the speech files were too brief in duration to observe the real benefit of noise-estimation algorithms. Fi- nally, of the three Wiener filtering type of algorithms, the method proposed in [4] based on a priori SNR, performed the best. This method also produced consistently the low- est signal distortion comparable to the statistical-model based methods. It did, however, suffer from high noise distortion. Overall, the statistical-model based methods performed the best across all conditions, followed by the multi-band spec- tral subtraction method [15]. 6. REFERENCES [1] ITU-T P.835, Subjective test methodology for evalu- ating speech communication systems that include noise suppression algorithms, ITU-T Recommendation P.835, 2003. [2] IEEE Subcommittee, “IEEE recommended practice for speech quality measurements,” IEEE Trans. Audio and Electroacoustics, pp. 225–246, 1969. [3] H. Hirsch and D. Pearce, “The AURORA experimen- tal framework for the performance evaluation of speech recognition systems under noisy conditions,” in ISCA ITRW ASR2000, Sept. 2000, Paris, France. [4] P. Scalart and J. Filho, “Speech enhancement based on a priori signal to noise estimation,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1996, pp. 629–632. [5] J. Sohn, N. Kim, and W. Sung, “A statistical model- based voice activity detection,” IEEE Signal Processing Letters, pp. 1–3, Jan. 1999. [6] U. Mittal and N. Phamdo, “Signal/noise KLT based approach for enhancing speech degraded by colored noise,” IEEE Trans. Speech Audio Proc., vol. 8, pp. 159–167, Mar. 2000. [7] S. Rangachari and P. C. Loizou, “A noise estima- tion algorithm for highly non-stationary environments,” Speech Communication, vol. 28, pp. 220–231, Feb. 2006. [8] Y. Hu and P. C. Loizou, “A generalized subspace approach for enhancing speech corrupted by colored noise,” IEEE Trans. Speech Audio Proc., pp. 334–341, July 2003. [9] F. Jabloun and B. Champagne, “Incorporating the hu- man hearing properties in the signal subspace approach for speech enhancement,” IEEE Trans. Speech Audio Proc., vol. 11, pp. 700–708, 2003. [10] Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error short-time spectral am- plitude estimator,” IEEE Trans. Acoustics, Speech and Signal Processing, vol. ASSP-32, pp. 1109–1121, 1984. [11] Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error log-spectral amplitude es- timator,” IEEE Trans. Acoustics, Speech and Signal Pro- cessing, vol. ASSP-33, pp. 443–445, 1985. [12] I. Cohen, “Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude esti- mator,” IEEE Signal Processing Letters, vol. 9, pp. 113– 116, Apr. 2002. [13] P. C. Loizou, “Speech enhancement based on percep- tually motivated Bayesian estimators of the magnitude spectrum,” IEEE Trans. Speech Audio Proc., pp. 857– 869, Sept. 2005. [14] H. Gustafsson, S. Nordholm, and I. Claesson, “Spectral subtraction using reduced delay convolution and adap- tive averaging,” IEEE Trans. Speech Audio Proc., pp. 799–807, 2001. [15] S. Kamath and P. C. Loizou, “A multi-band spectral subtraction method for enhancing speech corrupted by colored noise,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 2002. [16] Y. Hu and P. C. Loizou, “Speech enhancement based on wavelet thresholding the multitaper spectrum,” IEEE Trans. Speech Audio Proc., pp. 59–67, Jan. 2004. [17] D. E. Tsoukalas, J. N. Mourjopoulos, and G. Kokki- nakis, “Speech enhancement based on audible noise suppression,” IEEE Trans. Speech Audio Proc., vol. 5, pp. 479–514, Nov. 1997. [18] Y. Hu and P. C. Loizou, “Subjective evaluation and com- parison of speech enhancement algorithms,” submitted to Speech Communication.
  • 5. 5dB babble noise 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 KLT pKLT MMSE-SPU logMMSE logMMSE-ne logMMSE-SPU pMMSE RDC RDC_ne MB WavThr Wiener_as AudSup noisy SIG BAK OVRL Subspace algorithms Statistical- model based Spectral subtractive Wiener-type algorithms 10dB babble noise 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 KLT pKLT MMSE-SPU logMMSE logMMSE-ne logMMSE-SPU pMMSE RDC RDC_ne MB WavThr Wiener_as AudSup noisy SIG BAK OVRL Subspace algorithms Statistical- model based Spectral subtractive Wiener-type algorithms 5dB car noise 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 KLT pKLT MMSE-SPU logMMSE logMMSE-ne logMMSE-SPU pMMSE RDC RDC_ne MB WavThr Wiener_as AudSup noisy SIG BAK OVRL Subspace algorithms Statistical- model based Spectral subtractive Wiener-type algorithms 10dB car noise 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 KLT pKLT MMSE-SPU logMMSE logMMSE-ne logMMSE-SPU pMMSE RDC RDC_ne MB WavThr Wiener_as AudSup noisy SIG BAK OVRL Subspace algorithms Statistical- model based Spectral subtractive Wiener-type algorithms 5dB street noise 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 KLT pKLT MMSE-SPU logMMSE logMMSE-ne logMMSE-SPU pMMSE RDC RDC_ne MB WavThr Wiener_as AudSup noisy SIG BAK OVRL Subspace algorithms Statistical- model based Spectral subtractive Wiener-type algorithms 10dB street noise 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 KLT pKLT MMSE-SPU logMMSE logMMSE-ne logMMSE-SPU pMMSE RDC RDC_ne MB WavThr Wiener_as AudSup noisy SIG BAK OVRL Subspace algorithms Statistical- model based Spectral subtractive Wiener-type algorithms 5 dB Train 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 KLT pKLT MMSE-SPU logMMSE logMMSE-ne logMMSE-SPU pMMSE RDC RDC_ne MB WavThr Wiener_as AudSup noisy SIG BAK OVRL Subspace algorithms Statistical- model based Spectral subtractive Wiener-type algorithms 10 dB Train 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 KLT pKLT MMSE-SPU logMMSE logMMSE-ne logMMSE-SPU pMMSE RDC RDC_ne MB WavThr Wiener_as AudSup noisy SIG BAK OVRL Subspace algorithms Statistical- model based Spectral subtractive Wiener-type algorithms View publication stats View publication stats