SlideShare a Scribd company logo
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 361
AN EFFECTIVE EVALUATION STUDY OF OBJECTIVE MEASURES
USING SPECTRAL SUBTRACTIVE ENHANCED SIGNAL
Sukhvinder Kaur1
, Anil Garg2
1
M.Tech Student MMEC, Mullana, M.M. University, Mullana-Ambala-Haryana, India
2
Associate Prof. in ECE department MMEC, Mullana, M.M. University, Mullana-Ambala-Haryana, India
Abstract
Unwanted noises have a negative influence over communication because it disturbs the conversation and make the
communication impossible. Speech enhancement algorithms are used for improving the quality and intelligibility or to reduce
listener fatigues. Assessment of speech quality can be done by using either subjective listening test or objective quality measure.
Evaluation of several objective measures with the speech processed by enhancement algorithms has been performed but these
having limitations to assess original speech signal. This paper represents the study of speech quality measures and compute the
values used for regression analyses of the objective measures evaluation study using spectral subtraction algorithm based
enhanced speech signal.
Keywords: MOS, ITU-T (P.835), SNRseg, log- likelihood ratio and itakura-saito.
---------------------------------------------------------------------***--------------------------------------------------------------------
1. INTRODUCTION
Speech Communication is an integral part of daily life.
Communication systems include various applications such
as, mobile phones, speech coding and compression, medical
devices such as hearing aids etc. All these are associated
with different types of noise or sounds. Unwanted sounds
are commonly referred as disturbances. It can be both noise
or interferences and which are not acceptable in the
communication process. Therefore, unwanted sounds are
having a negative influence over the communication
because they may disturb the conversation and make it
impossible to communicate at all. Signal Processing
methods are the effective way for aiding the speech
communication by reducing the disturbance levels with
respect to the level of the speech. Such speech processing
methods are known as speech enhancement methods.
Speech enhancement algorithms improve the quality and
intelligibility of speech or reduce listener fatigues.
Speech enhancement has several real world applications
which include: Telecommunications, Electronic hearing aids
and Automatic speaker recognition software. These all are
of important quality and intelligibility of speech which
vastly improves the user’s listening experience. Two criteria
are which generally used for measuring the performance of
speech: quality and intelligibility. It is quite hard to satisfy
both of them at the same time. Quality is the subjective
measure which indicates the amount of effort needed to
understand the speech material. Whereas Intelligibility, is an
objective measure which signifies the amount of speech
material correctly understood.
This paper is represented in such a manner that section 2
provides brief review of the different speech enhancement
techniques and spectral subtraction algorithm. The speech
quality measures are discussed in section 3. In the Section 4
evaluated results are shown and the last section 5 represents
the conclusion part. References are given in the last section
of the paper.
2. SPEECH ENHANCEMENT TECHNIQUES
Speech enhancement methods can be classified in two ways:
(1) Single channel enhancement technique (2) Multi channel
enhancement technique. Single channel enhancement
techniques are applied to situations in which only one
acquisition channel is available. Multi channel
enhancement techniques are employed in microphone arrays
and take advantage of availability of multiple signal
inputs to system, by making possible use of phase
alignment to reject the undesired noise components. Single
channel enhancement techniques are further divided again
into different categories: Short Time Spectral Amplitude
Estimation (i.e. Spectral subtraction and Non Causal Weiner
Filtering), Periodicity of Voiced Speech based, Adaptive
Comb Filtering based, Harmonic Selection based, adaptive
noise canceling techniques based, Speech model based (i.e.
Kalman Filtering Hidden Markov Modelling(HMM)).
Spectral-subtractive algorithms: Is the simplest speech
enhancement algorithms based on the principle of estimating
the clean signal spectrum by subtracting an estimated noise
spectrum from the noisy speech spectrum. Statistical model
based algorithm: are based on the estimation of linear and
non linear transform coefficients of clean signal by
measuring the Fourier transform coefficients of the noisy
signal. E.g., Wiener algorithm and minimum mean square
error (MMSE) algorithms. Subspace Algorithms: are based
on the principle that the clean signals estimation is done by
simply nulling the component of the noisy vector residing in
the noise subspace.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 362
2.1 Spectral-Subtractive Algorithms
The spectral subtraction algorithm is historically one of the
first algorithms proposed for noise reduction based on the
simple principle of assuming additive noise. Estimated clean
signal spectrum can be obtained by subtracting an average
noise spectrum from the noisy speech spectrum.
The noise spectrum can be estimated, and updated, during
the periods when the signal is absent or when only noise is
present. Subtraction process needs to be done carefully to
avoid speech distortions. If too much subtraction is done,
then some speech information might be removed while if the
subtraction is too little then much of the interfering noise
remains.
3. SPEECH QUALITY MEASURES
Speech processing algorithms usage is rapidly increasing,
which raises the need of speech quality evaluation. The
main evaluation method of speech enhancement system
affects the intelligibility of the speech signal and improves
the overall quality of the signal. The assessment of speech
quality can be done by either using subjective listening tests
or objective quality measures i.e. Subjective measures and
Objective measures.
3.1 Subjective Measures
Subjective evaluation involves comparisons of original and
processed speech signals by a group of listeners who are
asked to rate the quality of speech along a pre-determined
scale. This evaluation method requires the judgment of
human listeners. These can be further divided into two
categories: (1) Relative Preference Methods - multiple
signals are compared. (2)Absolute Category Rating Methods
- a single stimulus is tested.
Table 1.1 MOS rating scale
Rating Speech
quality
Level of distortion
5 Excellent Imperceptible
4 Good Just perceptible, but not annoying
3 Fair Perceptible and slightly annoying
2 Poor Annoying , but not objectionable
1 Bad Very annoying and objectionable
MOS (Mean Opinion score) is the most widely used
subjective evaluation method [8]. The MOS test is based on
five category rating scale of the speech quality given in
Table 1.1. ITU-T standard (P.835) is designed for
integrating the effect of signal and background distortions.
Subjective evaluation is the reliable method for assessment
of speech quality but these are time consuming and
expensive. This evaluation method is stringent in nature as it
needed a set of trained listeners for determining the quality
of speech. Therefore need of objective evaluation arises.
3.2 Objective Measures
Objective evaluation involves a mathematical comparison of
original and processed speech. In this evaluation method the
judgment is predicted with some analysis of the system. In
this method quality is quantified by measuring the numerical
distance between the original and processed signals. There
are different types of objective quality measures which are
further categorized as: (1) Time and frequency signal to
noise ratio measures, (2) Spectral distance measure based on
LPC, (3) Perceptually Motivated measures and (4)
Composite measures.
The objective measures accesses the quality of the processed
speech without accessing the original speech signal.
Objective measures of speech quality are implemented by
first segmenting the speech signal into 10-30 ms frames, and
then computing a distortion measure between the original
and processed signals. Distortion measure computation can
either be done in the time domain (e.g. signal to noise ratio
measures) or in the frequency domain (LPC spectral
distance measures). In the frequency domain measure
distortions and differences detected in the magnitude spectra
are correlated with speech quality.
In Time and Frequency Signal to Noise Ratio Measures, the
segmental signal to noise ratio can be evaluated either in
terms of time or frequency domain. The original and
processed signals both are aligned either in time domain or
in frequency domain for the calculation. The main
advantage of using frequency based segmental SNR over the
time domain (SNRseg) is the flexibility to place different
weights for different frequency bands of the spectrum.
In the Spectral Distance Measure Based on LPC several
LPC based objective measures are proposed on the basis of
all pole models of the clean and enhanced speech signals.
Three different LPC based objective measures include: The
LLR (log- likelihood ratio) measure, the IS ( itakura-saito)
measure and the Cepstrum distance measure. Cepstral
distance measure is derived from the LPC coefficients.
Perceptually Motivated Measures are used for the modeling
of auditory processes like normal listening frequency
selectivity and the perceived loudness. Weighted spectral
slope (WSS), Bark distortion measures (BSD) and
Perceptual evaluation of speech quality (PESQ) measures
are the perceptually motivated measures.
Composite Measures are formed by combining multiple
objective measures. Regression analysis is used to compute
the optimum combination of objective measure for
maximum correlation. The selection of objective measures
for composite measures depends basically on the
experimental evidence and intuition. MARS technique can
be used for composite measure evaluation.
Figure of Merits are obtained by statistical analysis. This
analysis is necessary for the validity of evaluation method of
objective measures. Statistical analysis is used to assess the
correlation between subjective score and the values of the
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 363
objective measures. Two types of figure of merit are
calculated in statistical analysis: (1) Pearson’s correlation
coefficient. (2) Standard error of the estimate.
The first figure of merit Pearson’s correlation coefficient is
used to obtain the correlation between subjective listening
scores and objective measures. Second figure of merit is an
estimate of the standard deviation of the error obtained by
objective measures to predict subjective listening scores.
3.2.1 Time and Frequency Signal to Noise Ratio
Measures
The segmental signal to noise ratio is evaluated either in
terms of time or frequency domain. The time domain
measure is the simplest objective measure which is used to
evaluate speech enhancement and speech coding algorithms.
In this measure original and processed signal are aligned in
time and phase errors are present. The segmental signal to
noise ratio is given by eqn. (3.1)
(3.1)
Where is the original signal (clean), is the
enhanced signal, is the frame length and is the number
of frames in the signal. The segmental SNR in terms of
frequency domain produce the frequency weighted
segmental SNR is given in eqn. (3.2)
(3.2)
Where is the weight placed on the jth frequency band,
is the number of bands, is the total number of frames in
the signal. is the filter bank amplitude (excitation
spectrum) of the clean signal and is the filter bank
amplitude of enhanced signal in the same bank.
3.2.2 LPC based Objective Measures
LPC based objective measures including the log likelihood
ratio (LLR), Itakura Saito distance measure (IS), cepstrum
distance measure (CEP).The two most commonly used
measures are LLR and IS. Cepstral distance measure is
derived from the LPC coefficients.The log likelihood ratio
(LLR) measure is defined as in eqn.
(3.3)
Where the coefficients of the clean signal are, are
the coefficients of enhanced signal and is the
autocorrelation matrix of clean signal.The Itakura Saito(IS)
measure is defined as in eqn.(3.4)
(3.4)
Where and are the all pole gains of the clean and
enhanced signals.The cepstrum distance measure (CEP) is
defined as in eqn. (3.5)
(3.5)
Where and are the cepstrum coefficients of the
clean and enhanced signals.
4. EVALUATION RESULTS
Results evaluated for the Statistical Analysis of spectral
subtractive enhanced signal is discussed in this section. In
this paper, the clean and processed signal is used for
statistical analysis. Speech signal is first segmented into
frames and then standard deviation and mean values are
computed for the noisy and enhanced signals shown in table
1.2. The original time waveform for clean noise, noisy and
enhanced signals are shown in fig 1. These are used as the
primary tool for the result analysis.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 364
Fig 1: Time waveform representation of clean, noise, noisy and spectrally subtractive enhanced signal
The spectrogram of clean, noise, noisy and enhanced signal
shown in fig 2. Spectrogram is utilized for detecting the
differences in the magnitude spectra of waveform
information. The quality of the processed speech is
determined by without needing access to the original speech
signal.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 365
Fig 2: Spectrogram representation of clean, noise, noisy and spectrally subtractive enhanced signal
Computation of the LPC measure is done by estimating the
correlation coefficient by using the mean value and standard
deviation values. Table 1.2 shows the respective values of
mean and standard deviation for different objective
measures such as SNR, IS, LR, LLR, WSM. These values
are the LPC coefficients that can be used for computing the
objective measures.
Table 1.2 Table for mean and standard deviation values of
objective measure
Objective
measure
Mean value Standard
deviation
SNR 1.388178 1.727957
IS 30.170003 21.972701
LR 0.205720 0.189852
LLR 0.176183 0.139966
WSM 3.686912 2.256240
SNR plot of enhanced signal with respect to the noisy signal
is shown in fig 3. The plot shows the smaller values of the
signal energy during the interval of silence in the speech
signal results the larger negative value of (SNR).
Fig 3: SNR plot with respect to the noisy signal spectrum
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 366
Itakura saito ratio (IS) plot of enhanced signal with respect
to the noisy signal is shown in fig 4. The plot shows the
smaller values of the signal energy during the interval when
the speech in the processed signal is of highest amplitude
i.e., speech is clearly understandable and distractive and
larger values of IS shown for the noise present in processed
signal.
Fig 4: IS (itakura- saito ratio) plot with respect to the noisy signal spectrum
LLR (log likelihood ratio) plot of enhanced signal with
respect to the noisy signal is shown in fig 5. The plot shows
the lower values of the signal energy during the interval of
speech signal and results the larger peak value of (LLR) for
the noise in the processed signal spectrum.
Fig 5: LLR (log likelihood ratio) plot with respect to the noisy signal spectrum.
5. CONCLUSION
Different speech enhancement algorithms have been
proposed to improve the performance of modern
communication devices in noisy environments. The key
conclusion drawn from this paper is that speech quality
assessment is done through subjective and objective
evaluation measures. In this paper study of objective
measure and computed values of the standard deviation and
mean for noisy and enhanced signal are given.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 367
It is very difficult to find reliable and fair comparison
among different algorithms due to lack of common speech
database used in algorithms evaluation, different types of
noise and differences in the testing methodology. The most
accurate method for evaluating speech quality is subjective
listening tests but these are time consuming and expensive.
This evaluation method is stringent in nature as it needed a
set of trained listeners for determining the quality of speech.
Therefore need of objective evaluation arises. As the frame
length increases SNR value decreases monotonically.
REFERENCES
[1]. Loizou, P., 2007. Speech Enhancement: Theory and
Practice. CRC Press, Boca Raton, FL.
[2]. Y. Hu and P. C. Loizou, “Subjective evaluation and
comparison of speech enhancement algorithms,” submitted
to Speech Communication.
[3]. Yi Hu and Philipos C. Loizou, “Evaluation of Objective
Quality Measures for Speech Enhancement”, IEEE Trans.
on Audio, Speech, and Language Processing, Vol. 16, No. 1,
January 2008.
[4]. P. Chu and D. Messerschmitt, “A frequency weighted
itakura-saito spectral distance measure”, IEEE Trans.
Acoustics, Speech and Signal Processing, vol. 30, pp. 545—
560, Aug. 1982.
[5]. Anuradha R. Fukane, “Enhancement of Noisy Speech
Signals for Hearing Aids”, 2011 International Conference
on Communication Systems and Network Technologies,
978-0-7695-4437-3/11© 2011 IEEE
[6]. Sunil D. Kamath,” A Multi-Band Spectral Subtraction
Method for Enhancing Speech Corrupted By Colored
Noise”, 2002
[7]. Jianfen Ma, Yi Hu and Philipos C. Loizou, “Objective
measures for predicting speech intelligibility in noisy
conditions based on new band-importance functions”, J.
Acoust. Soc. Am., Vol. 125, No. 5, May 2009.
[8]. Kuldip Paliwal, “Single-channel speech enhancement
using spectral subtraction in the short-time modulation
domain”, Elsevier, Speech Communication 52 450–475,
2010.

More Related Content

What's hot

Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...eSAT Publishing House
 
Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...eSAT Journals
 
Comparative Study of Different Techniques in Speaker Recognition: Review
Comparative Study of Different Techniques in Speaker Recognition: ReviewComparative Study of Different Techniques in Speaker Recognition: Review
Comparative Study of Different Techniques in Speaker Recognition: ReviewIJAEMSJORNAL
 
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMESEFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMESkevig
 
Human Emotion Recognition From Speech
Human Emotion Recognition From SpeechHuman Emotion Recognition From Speech
Human Emotion Recognition From SpeechIJERA Editor
 
IRJET- Study of Effect of PCA on Speech Emotion Recognition
IRJET- Study of Effect of PCA on Speech Emotion RecognitionIRJET- Study of Effect of PCA on Speech Emotion Recognition
IRJET- Study of Effect of PCA on Speech Emotion RecognitionIRJET Journal
 
M sc thesis_presentation_
M sc thesis_presentation_M sc thesis_presentation_
M sc thesis_presentation_Dia Abdulkerim
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
E0502 01 2327
E0502 01 2327E0502 01 2327
E0502 01 2327IJMER
 
05 comparative study of voice print based acoustic features mfcc and lpcc
05 comparative study of voice print based acoustic features mfcc and lpcc05 comparative study of voice print based acoustic features mfcc and lpcc
05 comparative study of voice print based acoustic features mfcc and lpccIJAEMSJORNAL
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionCSCJournals
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...ijceronline
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueCSCJournals
 
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based ModelReal-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Modeladil raja
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing IJECEIAES
 
Dynamic Audio-Visual Client Recognition modelling
Dynamic Audio-Visual Client Recognition modellingDynamic Audio-Visual Client Recognition modelling
Dynamic Audio-Visual Client Recognition modellingCSCJournals
 
IRJET- Emotion recognition using Speech Signal: A Review
IRJET-  	  Emotion recognition using Speech Signal: A ReviewIRJET-  	  Emotion recognition using Speech Signal: A Review
IRJET- Emotion recognition using Speech Signal: A ReviewIRJET Journal
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Reviewijiert bestjournal
 

What's hot (20)

Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...
 
Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...
 
Comparative Study of Different Techniques in Speaker Recognition: Review
Comparative Study of Different Techniques in Speaker Recognition: ReviewComparative Study of Different Techniques in Speaker Recognition: Review
Comparative Study of Different Techniques in Speaker Recognition: Review
 
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMESEFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
 
H42045359
H42045359H42045359
H42045359
 
Human Emotion Recognition From Speech
Human Emotion Recognition From SpeechHuman Emotion Recognition From Speech
Human Emotion Recognition From Speech
 
IRJET- Study of Effect of PCA on Speech Emotion Recognition
IRJET- Study of Effect of PCA on Speech Emotion RecognitionIRJET- Study of Effect of PCA on Speech Emotion Recognition
IRJET- Study of Effect of PCA on Speech Emotion Recognition
 
M sc thesis_presentation_
M sc thesis_presentation_M sc thesis_presentation_
M sc thesis_presentation_
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
E0502 01 2327
E0502 01 2327E0502 01 2327
E0502 01 2327
 
05 comparative study of voice print based acoustic features mfcc and lpcc
05 comparative study of voice print based acoustic features mfcc and lpcc05 comparative study of voice print based acoustic features mfcc and lpcc
05 comparative study of voice print based acoustic features mfcc and lpcc
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker Recognition
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
 
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based ModelReal-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
 
Dynamic Audio-Visual Client Recognition modelling
Dynamic Audio-Visual Client Recognition modellingDynamic Audio-Visual Client Recognition modelling
Dynamic Audio-Visual Client Recognition modelling
 
IRJET- Emotion recognition using Speech Signal: A Review
IRJET-  	  Emotion recognition using Speech Signal: A ReviewIRJET-  	  Emotion recognition using Speech Signal: A Review
IRJET- Emotion recognition using Speech Signal: A Review
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Review
 

Similar to An effective evaluation study of objective measures using spectral subtractive enhanced signal

IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET Journal
 
Review Paper on Noise Reduction Using Different Techniques
Review Paper on Noise Reduction Using Different TechniquesReview Paper on Noise Reduction Using Different Techniques
Review Paper on Noise Reduction Using Different TechniquesIRJET Journal
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement techniqueeSAT Publishing House
 
Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...eSAT Journals
 
Effect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal AlignmentsEffect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal Alignmentskevig
 
01 8445 speech enhancement
01 8445 speech enhancement01 8445 speech enhancement
01 8445 speech enhancementIAESIJEECS
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniqueseSAT Journals
 
Optimized audio classification and segmentation algorithm by using ensemble m...
Optimized audio classification and segmentation algorithm by using ensemble m...Optimized audio classification and segmentation algorithm by using ensemble m...
Optimized audio classification and segmentation algorithm by using ensemble m...Venkat Projects
 
Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate Katia Raichlin-Levi
 
Mfcc based enlargement of the training set for emotion recognition in speech
Mfcc based enlargement of the training set for emotion recognition in speechMfcc based enlargement of the training set for emotion recognition in speech
Mfcc based enlargement of the training set for emotion recognition in speechsipij
 
Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...
Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...
Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...IRJET Journal
 
Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...eSAT Journals
 
Sentiment analysis by deep learning approaches
Sentiment analysis by deep learning approachesSentiment analysis by deep learning approaches
Sentiment analysis by deep learning approachesTELKOMNIKA JOURNAL
 

Similar to An effective evaluation study of objective measures using spectral subtractive enhanced signal (20)

IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
 
Review Paper on Noise Reduction Using Different Techniques
Review Paper on Noise Reduction Using Different TechniquesReview Paper on Noise Reduction Using Different Techniques
Review Paper on Noise Reduction Using Different Techniques
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement technique
 
Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...
 
Kc3517481754
Kc3517481754Kc3517481754
Kc3517481754
 
Effect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal AlignmentsEffect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal Alignments
 
01 8445 speech enhancement
01 8445 speech enhancement01 8445 speech enhancement
01 8445 speech enhancement
 
Nd2421622165
Nd2421622165Nd2421622165
Nd2421622165
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniques
 
Optimized audio classification and segmentation algorithm by using ensemble m...
Optimized audio classification and segmentation algorithm by using ensemble m...Optimized audio classification and segmentation algorithm by using ensemble m...
Optimized audio classification and segmentation algorithm by using ensemble m...
 
Ijetcas14 426
Ijetcas14 426Ijetcas14 426
Ijetcas14 426
 
Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
 
Mfcc based enlargement of the training set for emotion recognition in speech
Mfcc based enlargement of the training set for emotion recognition in speechMfcc based enlargement of the training set for emotion recognition in speech
Mfcc based enlargement of the training set for emotion recognition in speech
 
Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...
Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...
Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...
 
Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...
 
Ijetcas14 390
Ijetcas14 390Ijetcas14 390
Ijetcas14 390
 
Dy36749754
Dy36749754Dy36749754
Dy36749754
 
Sentiment analysis by deep learning approaches
Sentiment analysis by deep learning approachesSentiment analysis by deep learning approaches
Sentiment analysis by deep learning approaches
 

More from eSAT Journals

Mechanical properties of hybrid fiber reinforced concrete for pavements
Mechanical properties of hybrid fiber reinforced concrete for pavementsMechanical properties of hybrid fiber reinforced concrete for pavements
Mechanical properties of hybrid fiber reinforced concrete for pavementseSAT Journals
 
Material management in construction – a case study
Material management in construction – a case studyMaterial management in construction – a case study
Material management in construction – a case studyeSAT Journals
 
Managing drought short term strategies in semi arid regions a case study
Managing drought    short term strategies in semi arid regions  a case studyManaging drought    short term strategies in semi arid regions  a case study
Managing drought short term strategies in semi arid regions a case studyeSAT Journals
 
Life cycle cost analysis of overlay for an urban road in bangalore
Life cycle cost analysis of overlay for an urban road in bangaloreLife cycle cost analysis of overlay for an urban road in bangalore
Life cycle cost analysis of overlay for an urban road in bangaloreeSAT Journals
 
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materialsLaboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materialseSAT Journals
 
Laboratory investigation of expansive soil stabilized with natural inorganic ...
Laboratory investigation of expansive soil stabilized with natural inorganic ...Laboratory investigation of expansive soil stabilized with natural inorganic ...
Laboratory investigation of expansive soil stabilized with natural inorganic ...eSAT Journals
 
Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of reinforcement on the behavior of hollow concrete block masonry p...Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of reinforcement on the behavior of hollow concrete block masonry p...eSAT Journals
 
Influence of compaction energy on soil stabilized with chemical stabilizer
Influence of compaction energy on soil stabilized with chemical stabilizerInfluence of compaction energy on soil stabilized with chemical stabilizer
Influence of compaction energy on soil stabilized with chemical stabilizereSAT Journals
 
Geographical information system (gis) for water resources management
Geographical information system (gis) for water resources managementGeographical information system (gis) for water resources management
Geographical information system (gis) for water resources managementeSAT Journals
 
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Forest type mapping of bidar forest division, karnataka using geoinformatics ...Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Forest type mapping of bidar forest division, karnataka using geoinformatics ...eSAT Journals
 
Factors influencing compressive strength of geopolymer concrete
Factors influencing compressive strength of geopolymer concreteFactors influencing compressive strength of geopolymer concrete
Factors influencing compressive strength of geopolymer concreteeSAT Journals
 
Experimental investigation on circular hollow steel columns in filled with li...
Experimental investigation on circular hollow steel columns in filled with li...Experimental investigation on circular hollow steel columns in filled with li...
Experimental investigation on circular hollow steel columns in filled with li...eSAT Journals
 
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...eSAT Journals
 
Evaluation of punching shear in flat slabs
Evaluation of punching shear in flat slabsEvaluation of punching shear in flat slabs
Evaluation of punching shear in flat slabseSAT Journals
 
Evaluation of performance of intake tower dam for recent earthquake in india
Evaluation of performance of intake tower dam for recent earthquake in indiaEvaluation of performance of intake tower dam for recent earthquake in india
Evaluation of performance of intake tower dam for recent earthquake in indiaeSAT Journals
 
Evaluation of operational efficiency of urban road network using travel time ...
Evaluation of operational efficiency of urban road network using travel time ...Evaluation of operational efficiency of urban road network using travel time ...
Evaluation of operational efficiency of urban road network using travel time ...eSAT Journals
 
Estimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn methodEstimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn methodeSAT Journals
 
Estimation of morphometric parameters and runoff using rs & gis techniques
Estimation of morphometric parameters and runoff using rs & gis techniquesEstimation of morphometric parameters and runoff using rs & gis techniques
Estimation of morphometric parameters and runoff using rs & gis techniqueseSAT Journals
 
Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...eSAT Journals
 
Effect of use of recycled materials on indirect tensile strength of asphalt c...
Effect of use of recycled materials on indirect tensile strength of asphalt c...Effect of use of recycled materials on indirect tensile strength of asphalt c...
Effect of use of recycled materials on indirect tensile strength of asphalt c...eSAT Journals
 

More from eSAT Journals (20)

Mechanical properties of hybrid fiber reinforced concrete for pavements
Mechanical properties of hybrid fiber reinforced concrete for pavementsMechanical properties of hybrid fiber reinforced concrete for pavements
Mechanical properties of hybrid fiber reinforced concrete for pavements
 
Material management in construction – a case study
Material management in construction – a case studyMaterial management in construction – a case study
Material management in construction – a case study
 
Managing drought short term strategies in semi arid regions a case study
Managing drought    short term strategies in semi arid regions  a case studyManaging drought    short term strategies in semi arid regions  a case study
Managing drought short term strategies in semi arid regions a case study
 
Life cycle cost analysis of overlay for an urban road in bangalore
Life cycle cost analysis of overlay for an urban road in bangaloreLife cycle cost analysis of overlay for an urban road in bangalore
Life cycle cost analysis of overlay for an urban road in bangalore
 
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materialsLaboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
 
Laboratory investigation of expansive soil stabilized with natural inorganic ...
Laboratory investigation of expansive soil stabilized with natural inorganic ...Laboratory investigation of expansive soil stabilized with natural inorganic ...
Laboratory investigation of expansive soil stabilized with natural inorganic ...
 
Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of reinforcement on the behavior of hollow concrete block masonry p...Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of reinforcement on the behavior of hollow concrete block masonry p...
 
Influence of compaction energy on soil stabilized with chemical stabilizer
Influence of compaction energy on soil stabilized with chemical stabilizerInfluence of compaction energy on soil stabilized with chemical stabilizer
Influence of compaction energy on soil stabilized with chemical stabilizer
 
Geographical information system (gis) for water resources management
Geographical information system (gis) for water resources managementGeographical information system (gis) for water resources management
Geographical information system (gis) for water resources management
 
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Forest type mapping of bidar forest division, karnataka using geoinformatics ...Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
 
Factors influencing compressive strength of geopolymer concrete
Factors influencing compressive strength of geopolymer concreteFactors influencing compressive strength of geopolymer concrete
Factors influencing compressive strength of geopolymer concrete
 
Experimental investigation on circular hollow steel columns in filled with li...
Experimental investigation on circular hollow steel columns in filled with li...Experimental investigation on circular hollow steel columns in filled with li...
Experimental investigation on circular hollow steel columns in filled with li...
 
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
 
Evaluation of punching shear in flat slabs
Evaluation of punching shear in flat slabsEvaluation of punching shear in flat slabs
Evaluation of punching shear in flat slabs
 
Evaluation of performance of intake tower dam for recent earthquake in india
Evaluation of performance of intake tower dam for recent earthquake in indiaEvaluation of performance of intake tower dam for recent earthquake in india
Evaluation of performance of intake tower dam for recent earthquake in india
 
Evaluation of operational efficiency of urban road network using travel time ...
Evaluation of operational efficiency of urban road network using travel time ...Evaluation of operational efficiency of urban road network using travel time ...
Evaluation of operational efficiency of urban road network using travel time ...
 
Estimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn methodEstimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn method
 
Estimation of morphometric parameters and runoff using rs & gis techniques
Estimation of morphometric parameters and runoff using rs & gis techniquesEstimation of morphometric parameters and runoff using rs & gis techniques
Estimation of morphometric parameters and runoff using rs & gis techniques
 
Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...
 
Effect of use of recycled materials on indirect tensile strength of asphalt c...
Effect of use of recycled materials on indirect tensile strength of asphalt c...Effect of use of recycled materials on indirect tensile strength of asphalt c...
Effect of use of recycled materials on indirect tensile strength of asphalt c...
 

Recently uploaded

A case study of cinema management system project report..pdf
A case study of cinema management system project report..pdfA case study of cinema management system project report..pdf
A case study of cinema management system project report..pdfKamal Acharya
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdfKamal Acharya
 
The Ultimate Guide to External Floating Roofs for Oil Storage Tanks.docx
The Ultimate Guide to External Floating Roofs for Oil Storage Tanks.docxThe Ultimate Guide to External Floating Roofs for Oil Storage Tanks.docx
The Ultimate Guide to External Floating Roofs for Oil Storage Tanks.docxCenterEnamel
 
2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edgePaco Orozco
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacksgerogepatton
 
Digital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdfDigital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdfAbrahamGadissa
 
ENERGY STORAGE DEVICES INTRODUCTION UNIT-I
ENERGY STORAGE DEVICES  INTRODUCTION UNIT-IENERGY STORAGE DEVICES  INTRODUCTION UNIT-I
ENERGY STORAGE DEVICES INTRODUCTION UNIT-IVigneshvaranMech
 
fluid mechanics gate notes . gate all pyqs answer
fluid mechanics gate notes . gate all pyqs answerfluid mechanics gate notes . gate all pyqs answer
fluid mechanics gate notes . gate all pyqs answerapareshmondalnita
 
fundamentals of drawing and isometric and orthographic projection
fundamentals of drawing and isometric and orthographic projectionfundamentals of drawing and isometric and orthographic projection
fundamentals of drawing and isometric and orthographic projectionjeevanprasad8
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxR&R Consult
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdfKamal Acharya
 
LIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.pptLIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.pptssuser9bd3ba
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxViniHema
 
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptxCloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptxMd. Shahidul Islam Prodhan
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationRobbie Edward Sayers
 
Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.PrashantGoswami42
 
Introduction to Casting Processes in Manufacturing
Introduction to Casting Processes in ManufacturingIntroduction to Casting Processes in Manufacturing
Introduction to Casting Processes in Manufacturingssuser0811ec
 
Danfoss NeoCharge Technology -A Revolution in 2024.pdf
Danfoss NeoCharge Technology -A Revolution in 2024.pdfDanfoss NeoCharge Technology -A Revolution in 2024.pdf
Danfoss NeoCharge Technology -A Revolution in 2024.pdfNurvisNavarroSanchez
 
Fruit shop management system project report.pdf
Fruit shop management system project report.pdfFruit shop management system project report.pdf
Fruit shop management system project report.pdfKamal Acharya
 

Recently uploaded (20)

A case study of cinema management system project report..pdf
A case study of cinema management system project report..pdfA case study of cinema management system project report..pdf
A case study of cinema management system project report..pdf
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
The Ultimate Guide to External Floating Roofs for Oil Storage Tanks.docx
The Ultimate Guide to External Floating Roofs for Oil Storage Tanks.docxThe Ultimate Guide to External Floating Roofs for Oil Storage Tanks.docx
The Ultimate Guide to External Floating Roofs for Oil Storage Tanks.docx
 
2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
Digital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdfDigital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdf
 
ENERGY STORAGE DEVICES INTRODUCTION UNIT-I
ENERGY STORAGE DEVICES  INTRODUCTION UNIT-IENERGY STORAGE DEVICES  INTRODUCTION UNIT-I
ENERGY STORAGE DEVICES INTRODUCTION UNIT-I
 
fluid mechanics gate notes . gate all pyqs answer
fluid mechanics gate notes . gate all pyqs answerfluid mechanics gate notes . gate all pyqs answer
fluid mechanics gate notes . gate all pyqs answer
 
fundamentals of drawing and isometric and orthographic projection
fundamentals of drawing and isometric and orthographic projectionfundamentals of drawing and isometric and orthographic projection
fundamentals of drawing and isometric and orthographic projection
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdf
 
LIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.pptLIGA(E)11111111111111111111111111111111111111111.ppt
LIGA(E)11111111111111111111111111111111111111111.ppt
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
 
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptxCloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
 
Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
Introduction to Casting Processes in Manufacturing
Introduction to Casting Processes in ManufacturingIntroduction to Casting Processes in Manufacturing
Introduction to Casting Processes in Manufacturing
 
Danfoss NeoCharge Technology -A Revolution in 2024.pdf
Danfoss NeoCharge Technology -A Revolution in 2024.pdfDanfoss NeoCharge Technology -A Revolution in 2024.pdf
Danfoss NeoCharge Technology -A Revolution in 2024.pdf
 
Fruit shop management system project report.pdf
Fruit shop management system project report.pdfFruit shop management system project report.pdf
Fruit shop management system project report.pdf
 

An effective evaluation study of objective measures using spectral subtractive enhanced signal

  • 1. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 361 AN EFFECTIVE EVALUATION STUDY OF OBJECTIVE MEASURES USING SPECTRAL SUBTRACTIVE ENHANCED SIGNAL Sukhvinder Kaur1 , Anil Garg2 1 M.Tech Student MMEC, Mullana, M.M. University, Mullana-Ambala-Haryana, India 2 Associate Prof. in ECE department MMEC, Mullana, M.M. University, Mullana-Ambala-Haryana, India Abstract Unwanted noises have a negative influence over communication because it disturbs the conversation and make the communication impossible. Speech enhancement algorithms are used for improving the quality and intelligibility or to reduce listener fatigues. Assessment of speech quality can be done by using either subjective listening test or objective quality measure. Evaluation of several objective measures with the speech processed by enhancement algorithms has been performed but these having limitations to assess original speech signal. This paper represents the study of speech quality measures and compute the values used for regression analyses of the objective measures evaluation study using spectral subtraction algorithm based enhanced speech signal. Keywords: MOS, ITU-T (P.835), SNRseg, log- likelihood ratio and itakura-saito. ---------------------------------------------------------------------***-------------------------------------------------------------------- 1. INTRODUCTION Speech Communication is an integral part of daily life. Communication systems include various applications such as, mobile phones, speech coding and compression, medical devices such as hearing aids etc. All these are associated with different types of noise or sounds. Unwanted sounds are commonly referred as disturbances. It can be both noise or interferences and which are not acceptable in the communication process. Therefore, unwanted sounds are having a negative influence over the communication because they may disturb the conversation and make it impossible to communicate at all. Signal Processing methods are the effective way for aiding the speech communication by reducing the disturbance levels with respect to the level of the speech. Such speech processing methods are known as speech enhancement methods. Speech enhancement algorithms improve the quality and intelligibility of speech or reduce listener fatigues. Speech enhancement has several real world applications which include: Telecommunications, Electronic hearing aids and Automatic speaker recognition software. These all are of important quality and intelligibility of speech which vastly improves the user’s listening experience. Two criteria are which generally used for measuring the performance of speech: quality and intelligibility. It is quite hard to satisfy both of them at the same time. Quality is the subjective measure which indicates the amount of effort needed to understand the speech material. Whereas Intelligibility, is an objective measure which signifies the amount of speech material correctly understood. This paper is represented in such a manner that section 2 provides brief review of the different speech enhancement techniques and spectral subtraction algorithm. The speech quality measures are discussed in section 3. In the Section 4 evaluated results are shown and the last section 5 represents the conclusion part. References are given in the last section of the paper. 2. SPEECH ENHANCEMENT TECHNIQUES Speech enhancement methods can be classified in two ways: (1) Single channel enhancement technique (2) Multi channel enhancement technique. Single channel enhancement techniques are applied to situations in which only one acquisition channel is available. Multi channel enhancement techniques are employed in microphone arrays and take advantage of availability of multiple signal inputs to system, by making possible use of phase alignment to reject the undesired noise components. Single channel enhancement techniques are further divided again into different categories: Short Time Spectral Amplitude Estimation (i.e. Spectral subtraction and Non Causal Weiner Filtering), Periodicity of Voiced Speech based, Adaptive Comb Filtering based, Harmonic Selection based, adaptive noise canceling techniques based, Speech model based (i.e. Kalman Filtering Hidden Markov Modelling(HMM)). Spectral-subtractive algorithms: Is the simplest speech enhancement algorithms based on the principle of estimating the clean signal spectrum by subtracting an estimated noise spectrum from the noisy speech spectrum. Statistical model based algorithm: are based on the estimation of linear and non linear transform coefficients of clean signal by measuring the Fourier transform coefficients of the noisy signal. E.g., Wiener algorithm and minimum mean square error (MMSE) algorithms. Subspace Algorithms: are based on the principle that the clean signals estimation is done by simply nulling the component of the noisy vector residing in the noise subspace.
  • 2. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 362 2.1 Spectral-Subtractive Algorithms The spectral subtraction algorithm is historically one of the first algorithms proposed for noise reduction based on the simple principle of assuming additive noise. Estimated clean signal spectrum can be obtained by subtracting an average noise spectrum from the noisy speech spectrum. The noise spectrum can be estimated, and updated, during the periods when the signal is absent or when only noise is present. Subtraction process needs to be done carefully to avoid speech distortions. If too much subtraction is done, then some speech information might be removed while if the subtraction is too little then much of the interfering noise remains. 3. SPEECH QUALITY MEASURES Speech processing algorithms usage is rapidly increasing, which raises the need of speech quality evaluation. The main evaluation method of speech enhancement system affects the intelligibility of the speech signal and improves the overall quality of the signal. The assessment of speech quality can be done by either using subjective listening tests or objective quality measures i.e. Subjective measures and Objective measures. 3.1 Subjective Measures Subjective evaluation involves comparisons of original and processed speech signals by a group of listeners who are asked to rate the quality of speech along a pre-determined scale. This evaluation method requires the judgment of human listeners. These can be further divided into two categories: (1) Relative Preference Methods - multiple signals are compared. (2)Absolute Category Rating Methods - a single stimulus is tested. Table 1.1 MOS rating scale Rating Speech quality Level of distortion 5 Excellent Imperceptible 4 Good Just perceptible, but not annoying 3 Fair Perceptible and slightly annoying 2 Poor Annoying , but not objectionable 1 Bad Very annoying and objectionable MOS (Mean Opinion score) is the most widely used subjective evaluation method [8]. The MOS test is based on five category rating scale of the speech quality given in Table 1.1. ITU-T standard (P.835) is designed for integrating the effect of signal and background distortions. Subjective evaluation is the reliable method for assessment of speech quality but these are time consuming and expensive. This evaluation method is stringent in nature as it needed a set of trained listeners for determining the quality of speech. Therefore need of objective evaluation arises. 3.2 Objective Measures Objective evaluation involves a mathematical comparison of original and processed speech. In this evaluation method the judgment is predicted with some analysis of the system. In this method quality is quantified by measuring the numerical distance between the original and processed signals. There are different types of objective quality measures which are further categorized as: (1) Time and frequency signal to noise ratio measures, (2) Spectral distance measure based on LPC, (3) Perceptually Motivated measures and (4) Composite measures. The objective measures accesses the quality of the processed speech without accessing the original speech signal. Objective measures of speech quality are implemented by first segmenting the speech signal into 10-30 ms frames, and then computing a distortion measure between the original and processed signals. Distortion measure computation can either be done in the time domain (e.g. signal to noise ratio measures) or in the frequency domain (LPC spectral distance measures). In the frequency domain measure distortions and differences detected in the magnitude spectra are correlated with speech quality. In Time and Frequency Signal to Noise Ratio Measures, the segmental signal to noise ratio can be evaluated either in terms of time or frequency domain. The original and processed signals both are aligned either in time domain or in frequency domain for the calculation. The main advantage of using frequency based segmental SNR over the time domain (SNRseg) is the flexibility to place different weights for different frequency bands of the spectrum. In the Spectral Distance Measure Based on LPC several LPC based objective measures are proposed on the basis of all pole models of the clean and enhanced speech signals. Three different LPC based objective measures include: The LLR (log- likelihood ratio) measure, the IS ( itakura-saito) measure and the Cepstrum distance measure. Cepstral distance measure is derived from the LPC coefficients. Perceptually Motivated Measures are used for the modeling of auditory processes like normal listening frequency selectivity and the perceived loudness. Weighted spectral slope (WSS), Bark distortion measures (BSD) and Perceptual evaluation of speech quality (PESQ) measures are the perceptually motivated measures. Composite Measures are formed by combining multiple objective measures. Regression analysis is used to compute the optimum combination of objective measure for maximum correlation. The selection of objective measures for composite measures depends basically on the experimental evidence and intuition. MARS technique can be used for composite measure evaluation. Figure of Merits are obtained by statistical analysis. This analysis is necessary for the validity of evaluation method of objective measures. Statistical analysis is used to assess the correlation between subjective score and the values of the
  • 3. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 363 objective measures. Two types of figure of merit are calculated in statistical analysis: (1) Pearson’s correlation coefficient. (2) Standard error of the estimate. The first figure of merit Pearson’s correlation coefficient is used to obtain the correlation between subjective listening scores and objective measures. Second figure of merit is an estimate of the standard deviation of the error obtained by objective measures to predict subjective listening scores. 3.2.1 Time and Frequency Signal to Noise Ratio Measures The segmental signal to noise ratio is evaluated either in terms of time or frequency domain. The time domain measure is the simplest objective measure which is used to evaluate speech enhancement and speech coding algorithms. In this measure original and processed signal are aligned in time and phase errors are present. The segmental signal to noise ratio is given by eqn. (3.1) (3.1) Where is the original signal (clean), is the enhanced signal, is the frame length and is the number of frames in the signal. The segmental SNR in terms of frequency domain produce the frequency weighted segmental SNR is given in eqn. (3.2) (3.2) Where is the weight placed on the jth frequency band, is the number of bands, is the total number of frames in the signal. is the filter bank amplitude (excitation spectrum) of the clean signal and is the filter bank amplitude of enhanced signal in the same bank. 3.2.2 LPC based Objective Measures LPC based objective measures including the log likelihood ratio (LLR), Itakura Saito distance measure (IS), cepstrum distance measure (CEP).The two most commonly used measures are LLR and IS. Cepstral distance measure is derived from the LPC coefficients.The log likelihood ratio (LLR) measure is defined as in eqn. (3.3) Where the coefficients of the clean signal are, are the coefficients of enhanced signal and is the autocorrelation matrix of clean signal.The Itakura Saito(IS) measure is defined as in eqn.(3.4) (3.4) Where and are the all pole gains of the clean and enhanced signals.The cepstrum distance measure (CEP) is defined as in eqn. (3.5) (3.5) Where and are the cepstrum coefficients of the clean and enhanced signals. 4. EVALUATION RESULTS Results evaluated for the Statistical Analysis of spectral subtractive enhanced signal is discussed in this section. In this paper, the clean and processed signal is used for statistical analysis. Speech signal is first segmented into frames and then standard deviation and mean values are computed for the noisy and enhanced signals shown in table 1.2. The original time waveform for clean noise, noisy and enhanced signals are shown in fig 1. These are used as the primary tool for the result analysis.
  • 4. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 364 Fig 1: Time waveform representation of clean, noise, noisy and spectrally subtractive enhanced signal The spectrogram of clean, noise, noisy and enhanced signal shown in fig 2. Spectrogram is utilized for detecting the differences in the magnitude spectra of waveform information. The quality of the processed speech is determined by without needing access to the original speech signal.
  • 5. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 365 Fig 2: Spectrogram representation of clean, noise, noisy and spectrally subtractive enhanced signal Computation of the LPC measure is done by estimating the correlation coefficient by using the mean value and standard deviation values. Table 1.2 shows the respective values of mean and standard deviation for different objective measures such as SNR, IS, LR, LLR, WSM. These values are the LPC coefficients that can be used for computing the objective measures. Table 1.2 Table for mean and standard deviation values of objective measure Objective measure Mean value Standard deviation SNR 1.388178 1.727957 IS 30.170003 21.972701 LR 0.205720 0.189852 LLR 0.176183 0.139966 WSM 3.686912 2.256240 SNR plot of enhanced signal with respect to the noisy signal is shown in fig 3. The plot shows the smaller values of the signal energy during the interval of silence in the speech signal results the larger negative value of (SNR). Fig 3: SNR plot with respect to the noisy signal spectrum
  • 6. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 366 Itakura saito ratio (IS) plot of enhanced signal with respect to the noisy signal is shown in fig 4. The plot shows the smaller values of the signal energy during the interval when the speech in the processed signal is of highest amplitude i.e., speech is clearly understandable and distractive and larger values of IS shown for the noise present in processed signal. Fig 4: IS (itakura- saito ratio) plot with respect to the noisy signal spectrum LLR (log likelihood ratio) plot of enhanced signal with respect to the noisy signal is shown in fig 5. The plot shows the lower values of the signal energy during the interval of speech signal and results the larger peak value of (LLR) for the noise in the processed signal spectrum. Fig 5: LLR (log likelihood ratio) plot with respect to the noisy signal spectrum. 5. CONCLUSION Different speech enhancement algorithms have been proposed to improve the performance of modern communication devices in noisy environments. The key conclusion drawn from this paper is that speech quality assessment is done through subjective and objective evaluation measures. In this paper study of objective measure and computed values of the standard deviation and mean for noisy and enhanced signal are given.
  • 7. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 367 It is very difficult to find reliable and fair comparison among different algorithms due to lack of common speech database used in algorithms evaluation, different types of noise and differences in the testing methodology. The most accurate method for evaluating speech quality is subjective listening tests but these are time consuming and expensive. This evaluation method is stringent in nature as it needed a set of trained listeners for determining the quality of speech. Therefore need of objective evaluation arises. As the frame length increases SNR value decreases monotonically. REFERENCES [1]. Loizou, P., 2007. Speech Enhancement: Theory and Practice. CRC Press, Boca Raton, FL. [2]. Y. Hu and P. C. Loizou, “Subjective evaluation and comparison of speech enhancement algorithms,” submitted to Speech Communication. [3]. Yi Hu and Philipos C. Loizou, “Evaluation of Objective Quality Measures for Speech Enhancement”, IEEE Trans. on Audio, Speech, and Language Processing, Vol. 16, No. 1, January 2008. [4]. P. Chu and D. Messerschmitt, “A frequency weighted itakura-saito spectral distance measure”, IEEE Trans. Acoustics, Speech and Signal Processing, vol. 30, pp. 545— 560, Aug. 1982. [5]. Anuradha R. Fukane, “Enhancement of Noisy Speech Signals for Hearing Aids”, 2011 International Conference on Communication Systems and Network Technologies, 978-0-7695-4437-3/11© 2011 IEEE [6]. Sunil D. Kamath,” A Multi-Band Spectral Subtraction Method for Enhancing Speech Corrupted By Colored Noise”, 2002 [7]. Jianfen Ma, Yi Hu and Philipos C. Loizou, “Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions”, J. Acoust. Soc. Am., Vol. 125, No. 5, May 2009. [8]. Kuldip Paliwal, “Single-channel speech enhancement using spectral subtraction in the short-time modulation domain”, Elsevier, Speech Communication 52 450–475, 2010.