Compressive speech enhancement is based on the compressive sensing (CS) sampling theory and utilizes the sparsity of the signal for its enhancement. To improve the performance of the discrete wavelet transform (DWT) basisfunction based compressive speech enhancement algorithm, this study presents a semi-soft thresholding approach suggesting improved threshold estimation and threshold rescaling parameters. The semi-soft thresholding approach utilizes two thresholds, one threshold value is an improved universal threshold and the other is calculated based on the initial-silenceregion of the signal. This study suggests that thresholding should be applied to both detail coefficients and approximation coefficients to remove noise effectively. The performances of the hard, soft, garrote and semi-soft thresholding approaches are compared based on objective quality and speech intelligibility measures. The normalized covariance measure is introduced as an effective intelligibility measure as it has a strong correlation with the intelligibility of the speech signal. A visual inspection of the output signal is used to verify the results. Experiments were conducted on the noisy speech corpus (NOIZEUS) speech database. The experimental results indicate that the proposed method of semi-soft thresholding using improved threshold estimation provides better enhancement compared to the other thresholding approaches.
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...IJERA Editor
ย
This paper presents compressive sensing technique used for speech reconstruction using linear predictive coding because the
speech is more sparse in LPC. DCT of a speech is taken and the DCT points of sparse speech are thrown away arbitrarily.
This is achieved by making some point in DCT domain to be zero by multiplying with mask functions. From the incomplete
points in DCT domain, the original speech is reconstructed using compressive sensing and the tool used is Gradient
Projection for Sparse Reconstruction. The performance of the result is compared with direct IDCT subjectively. The
experiment is done and it is observed that the performance is better for compressive sensing than the DCT.
Improvement of minimum tracking in Minimum Statistics noise estimation methodCSCJournals
ย
Noise spectrum estimation is a fundamental component of speech enhancement and speech recognition systems. In this paper we propose a new method for minimum tracking in Minimum Statistics (MS) noise estimation method. This noise estimation algorithm is proposed for highly nonstationary noise environments. This was confirmed with formal listening tests which indicated that the proposed noise estimation algorithm when integrated in speech enhancement was preferred over other noise estimation algorithms.
Many algorithms have been developed to find sparse representation over redundant dictionaries or
transform. This paper presents a novel method on compressive sensing (CS)-based image compression
using sparse basis on CDF9/7 wavelet transform. The measurement matrix is applied to the three levels of
wavelet transform coefficients of the input image for compressive sampling. We have used three different
measurement matrix as Gaussian matrix, Bernoulli measurement matrix and random orthogonal matrix.
The orthogonal matching pursuit (OMP) and Basis Pursuit (BP) are applied to reconstruct each level of
wavelet transform separately. Experimental results demonstrate that the proposed method given better
quality of compressed image than existing methods in terms of proposed image quality evaluation indexes
and other objective (PSNR/UIQI/SSIM) measurements.
Comparative study to realize an automatic speaker recognition system IJECEIAES
ย
In this research, we present an automatic speaker recognition system based on adaptive orthogonal transformations. To obtain the informative features with a minimum dimension from the input signals, we created an adaptive operator, which helped to identify the speakerโs voice in a fast and efficient manner. We test the efficiency and the performance of our method by comparing it with another approach, mel-frequency cepstral coefficients (MFCCs), which is widely used by researchers as their feature extraction method. The experimental results show the importance of creating the adaptive operator, which gives added value to the proposed approach. The performance of the system achieved 96.8% accuracy using Fourier transform as a compression method and 98.1% using Correlation as a compression method.
A New Approach for Speech Enhancement Based On Eigenvalue Spectral SubtractionCSCJournals
ย
In this paper, a phase space reconstruction-based method is proposed for speech enhancement. The method embeds the noisy signal into a high dimensional reconstructed phase space and uses Spectral Subtraction idea. The advantages of the proposed method are fast performance, high SNR and good MOS. In order to evaluate the proposed method, ten signals of TIMIT database mixed with the white additive Gaussian noise and then the method was implemented. The efficiency of the proposed method was evaluated by using qualitative and quantitative criteria.
Comparative Study of Compressive Sensing Techniques For Image EnhancementCSCJournals
ย
Compressive Sensing is a new way of sampling signals at a sub-Nyquist rate. For many signals, this revolutionary technology strongly relies on the sparsity of the signal and incoherency between sensing basis and representation basis. In this work, compressed sensing method is proposed to reduce the noise of the image signal. Noise reduction and image reconstruction are formulated in the theoretical framework of compressed sensing using Basis Pursuit de-noising (BPDN) and Compressive Sampling Matching Pursuit (CoSaMP) algorithm when random measurement matrix is utilized to acquire the data. Ultimately, it is demonstrated that the proposed methods can't perfectly recover the image signal. Therefore, we have used a complementary approach for enhancing the performance of CS recovery with non-sparse signals. In this work, we have used a new designed CS recovery framework, called De-noising-based Approximate Message Passing (D-AMP). This method uses a de-noising algorithm to recover signals from compressive measurements. For de-noising purpose the Non-Local Means (NLM), Bayesian Least Squares Gaussian Scale Mixtures (BLS-GSM) and Block Matching 3D collaborative have been used. Also, in this work, we have evaluated the performance of our proposed image enhancement methods using the quality measure peak signal-to-noise ratio (PSNR).
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...IJERA Editor
ย
This paper presents compressive sensing technique used for speech reconstruction using linear predictive coding because the
speech is more sparse in LPC. DCT of a speech is taken and the DCT points of sparse speech are thrown away arbitrarily.
This is achieved by making some point in DCT domain to be zero by multiplying with mask functions. From the incomplete
points in DCT domain, the original speech is reconstructed using compressive sensing and the tool used is Gradient
Projection for Sparse Reconstruction. The performance of the result is compared with direct IDCT subjectively. The
experiment is done and it is observed that the performance is better for compressive sensing than the DCT.
Improvement of minimum tracking in Minimum Statistics noise estimation methodCSCJournals
ย
Noise spectrum estimation is a fundamental component of speech enhancement and speech recognition systems. In this paper we propose a new method for minimum tracking in Minimum Statistics (MS) noise estimation method. This noise estimation algorithm is proposed for highly nonstationary noise environments. This was confirmed with formal listening tests which indicated that the proposed noise estimation algorithm when integrated in speech enhancement was preferred over other noise estimation algorithms.
Many algorithms have been developed to find sparse representation over redundant dictionaries or
transform. This paper presents a novel method on compressive sensing (CS)-based image compression
using sparse basis on CDF9/7 wavelet transform. The measurement matrix is applied to the three levels of
wavelet transform coefficients of the input image for compressive sampling. We have used three different
measurement matrix as Gaussian matrix, Bernoulli measurement matrix and random orthogonal matrix.
The orthogonal matching pursuit (OMP) and Basis Pursuit (BP) are applied to reconstruct each level of
wavelet transform separately. Experimental results demonstrate that the proposed method given better
quality of compressed image than existing methods in terms of proposed image quality evaluation indexes
and other objective (PSNR/UIQI/SSIM) measurements.
Comparative study to realize an automatic speaker recognition system IJECEIAES
ย
In this research, we present an automatic speaker recognition system based on adaptive orthogonal transformations. To obtain the informative features with a minimum dimension from the input signals, we created an adaptive operator, which helped to identify the speakerโs voice in a fast and efficient manner. We test the efficiency and the performance of our method by comparing it with another approach, mel-frequency cepstral coefficients (MFCCs), which is widely used by researchers as their feature extraction method. The experimental results show the importance of creating the adaptive operator, which gives added value to the proposed approach. The performance of the system achieved 96.8% accuracy using Fourier transform as a compression method and 98.1% using Correlation as a compression method.
A New Approach for Speech Enhancement Based On Eigenvalue Spectral SubtractionCSCJournals
ย
In this paper, a phase space reconstruction-based method is proposed for speech enhancement. The method embeds the noisy signal into a high dimensional reconstructed phase space and uses Spectral Subtraction idea. The advantages of the proposed method are fast performance, high SNR and good MOS. In order to evaluate the proposed method, ten signals of TIMIT database mixed with the white additive Gaussian noise and then the method was implemented. The efficiency of the proposed method was evaluated by using qualitative and quantitative criteria.
Comparative Study of Compressive Sensing Techniques For Image EnhancementCSCJournals
ย
Compressive Sensing is a new way of sampling signals at a sub-Nyquist rate. For many signals, this revolutionary technology strongly relies on the sparsity of the signal and incoherency between sensing basis and representation basis. In this work, compressed sensing method is proposed to reduce the noise of the image signal. Noise reduction and image reconstruction are formulated in the theoretical framework of compressed sensing using Basis Pursuit de-noising (BPDN) and Compressive Sampling Matching Pursuit (CoSaMP) algorithm when random measurement matrix is utilized to acquire the data. Ultimately, it is demonstrated that the proposed methods can't perfectly recover the image signal. Therefore, we have used a complementary approach for enhancing the performance of CS recovery with non-sparse signals. In this work, we have used a new designed CS recovery framework, called De-noising-based Approximate Message Passing (D-AMP). This method uses a de-noising algorithm to recover signals from compressive measurements. For de-noising purpose the Non-Local Means (NLM), Bayesian Least Squares Gaussian Scale Mixtures (BLS-GSM) and Block Matching 3D collaborative have been used. Also, in this work, we have evaluated the performance of our proposed image enhancement methods using the quality measure peak signal-to-noise ratio (PSNR).
Performance analysis of compressive sensing recovery algorithms for image pr...IJECEIAES
ย
The modern digital world comprises of transmitting media files like image, audio, and video which leads to usage of large memory storage, high data transmission rate, and a lot of sensory devices. Compressive sensing (CS) is a sampling theory that compresses the signal at the time of acquiring it. Compressive sensing samples the signal efficiently below the Nyquist rate to minimize storage and recoveries back the signal significantly minimizing the data rate and few sensors. The proposed paper proceeds with three phases. The first phase describes various measurement matrices like Gaussian matrix, circulant matrix, and special random matrices which are the basic foundation of compressive sensing technique that finds its application in various fields like wireless sensors networks (WSN), internet of things (IoT), video processing, biomedical applications, and many. Finally, the paper analyses the performance of the various reconstruction algorithms of compressive sensing like basis pursuit (BP), compressive sampling matching pursuit (CoSaMP), iteratively reweighted least square (IRLS), iterative hard thresholding (IHT), block processing-based basis pursuit (BP-BP) based on mean square error (MSE), and peak signal to noise ratio (PSNR) and then concludes with future works.
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALSijiert bestjournal
ย
This paper presents a review of LPC methods for enhancement of speech signals. The purpose of all method of speech enhancement is to impr ove quality of speech signal by minimizing the background noise . This paper especially comments on Power Spectral Subtraction,Multiband Spectral Subtraction,Non - Linear Spectral Subtraction Method,MMSE Spectral Subtraction Method and Spectral Subtraction based on perceptual properties. As the spectral subtraction produces the Residual noise,Musical noise,Linear Predictive Analysis method is used to enhance the Speech.
Performance of Matching Algorithmsfor Signal Approximationiosrjce
ย
IOSR Journal of Electronics and Communication Engineering(IOSR-JECE) is a double blind peer reviewed International Journal that provides rapid publication (within a month) of articles in all areas of electronics and communication engineering and its applications. The journal welcomes publications of high quality papers on theoretical developments and practical applications in electronics and communication engineering. Original research papers, state-of-the-art reviews, and high quality technical notes are invited for publications.
Improved nonlocal means based on pre classification and invariant block matchingIAEME Publication
ย
One of the most popular image denoising methods based on self-similarity is called nonlocal
means (NLM). Though it can achieve remarkable performance, this method has a few shortcomings,
e.g., the computationally expensive calculation of the similarity measure, and the lack of reliable
candidates for some non repetitive patches. In this paper, we propose to improve NLM by integrating
Gaussian blur, clustering, and row image weighted averaging into the NLM framework.
Experimental results show that the proposed technique can perform denoising better than the original
NLM both quantitatively and visually, especially when the noise level is high.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
IMPROVEMENT OF BM3D ALGORITHM AND EMPLOYMENT TO SATELLITE AND CFA IMAGES DENO...ijistjournal
ย
This paper proposes a new procedure in order to improve the performance of block matching and 3-D filtering (BM3D) image denoising algorithm. It is demonstrated that it is possible to achieve a better performance than that of BM3D algorithm in a variety of noise levels. This method changes BM3D algorithm parameter values according to noise level, removes prefiltering, which is used in high noise level; therefore Peak Signal-to-Noise Ratio (PSNR) and visual quality get improved, and BM3D complexities and processing time are reduced. This improved BM3D algorithm is extended and used to denoise satellite and color filter array (CFA) images. Output results show that the performance has upgraded in comparison with current methods of denoising satellite and CFA images. In this regard this algorithm is compared with Adaptive PCA algorithm, that has led to superior performance for denoising CFA images, on the subject of PSNR and visual quality. Also the processing time has decreased significantly.
IMPROVEMENT OF BM3D ALGORITHM AND EMPLOYMENT TO SATELLITE AND CFA IMAGES DENO...ijistjournal
ย
This paper proposes a new procedure in order to improve the performance of block matching and 3-D filtering (BM3D) image denoising algorithm. It is demonstrated that it is possible to achieve a better performance than that of BM3D algorithm in a variety of noise levels. This method changes BM3D algorithm parameter values according to noise level, removes prefiltering, which is used in high noise level; therefore Peak Signal-to-Noise Ratio (PSNR) and visual quality get improved, and BM3D complexities and processing time are reduced. This improved BM3D algorithm is extended and used to denoise satellite and color filter array (CFA) images. Output results show that the performance has upgraded in comparison with current methods of denoising satellite and CFA images. In this regard this algorithm is compared with Adaptive PCA algorithm, that has led to superior performance for denoising CFA images, on the subject of PSNR and visual quality. Also the processing time has decreased significantly.
Suppression of noise in noisy speech signal is required in many speech enhancement applications like signal recording and transmission from one place to other. In this paper a novel single line noise cancellation system is proposed using derivative of normalized least mean spare algorithm. The proposed system has two phases. The first phase is generation of secondary reference signal from incoming primary signal itself at initial silence period and pause between two words, which is essential while adaptive filter using as noise canceller. Second phase is noise cancellation using proposed modified error data normalized step size (EDNSS) algorithm. The performance of the proposed algorithm is compared with normalized least mean square (NLMS) algorithm and original EDNSS algorithm using standard IEEE sentence (SP23) of Noizeus data base with different types of real-world noise at different level of signal to noise ratio (SNR). The output of proposed, NLMS and EDNSS algorithm are measured with output SNR, excessive mean square error (EMSE) and misadjustment (M). The results clearly illustrates that the proposed algorithm gives improved result over conventional NLMS and EDNSS algorithm. The speed of convergence is also maintained as same conventional NLMS algorithm.
Comparative performance analysis of channel normalization techniqueseSAT Journals
ย
Abstract A major part of the interaction between humans takes place via speech communication. The speech signal carries both useful and unwanted information. Processing of such signals involve enhancing the useful information. The intelligibility of speech signals is significantly reduced due to the presence of unwanted information such as noise. Channel normalization algorithms suppress such additive noise introduced in the speech signals by transmission channel or by recording environment conditions. Enhancing the quality and intelligibility of speech signals improve the performance of speech systems such as Automatic speech recognition (ASR) , voice communication and hearing aids to name the few. Based on the experimental results the comparative analysis of channel normalization techniques have been presented in this paper to find out the most suitable algorithm for enhancing the speech signals. Keywords: Cepstral Mean Normalization, Spectral Subtraction, Weiner filter, Signal to Noise Ratio
Comparitive analysis of doa and beamforming algorithms for smart antenna systemseSAT Journals
ย
Abstract This paper revolves around the implementation of Direction of arrival and Adaptive beam-forming algorithms for Smart Antenna Systems. This paper also investigates the implementation of algorithms on various planner array geometries viz. circular and rectangular. Music algorithm is primarily finds the possible location of desired user and adaptive beam-forming algorithms such as LMS, RLS and CMA algorithms adapts the weights of the array. DOA estimation gives the maximum peak of spectrum with respect to angle of arrival where the desired user is supposed to exist. After DOA estimation weights of array antenna are changed with the changing received signal. This methodology is called as Spectral estimation, which allows the antenna pattern to steer in desired direction estimated by DOA and simultaneously null out the interfering signals. Rate of convergence is the major criterion for comparison for adaptive beam-forming algorithms. Keywords: DOA, MUSIC, LMS, RLS, CMA, SAS.
Bibliometric analysis highlighting the role of women in addressing climate ch...IJECEIAES
ย
Fossil fuel consumption increased quickly, contributing to climate change
that is evident in unusual flooding and draughts, and global warming. Over
the past ten years, women's involvement in society has grown dramatically,
and they succeeded in playing a noticeable role in reducing climate change.
A bibliometric analysis of data from the last ten years has been carried out to
examine the role of women in addressing the climate change. The analysis's
findings discussed the relevant to the sustainable development goals (SDGs),
particularly SDG 7 and SDG 13. The results considered contributions made
by women in the various sectors while taking geographic dispersion into
account. The bibliometric analysis delves into topics including women's
leadership in environmental groups, their involvement in policymaking, their
contributions to sustainable development projects, and the influence of
gender diversity on attempts to mitigate climate change. This study's results
highlight how women have influenced policies and actions related to climate
change, point out areas of research deficiency and recommendations on how
to increase role of the women in addressing the climate change and
achieving sustainability. To achieve more successful results, this initiative
aims to highlight the significance of gender equality and encourage
inclusivity in climate change decision-making processes.
Voltage and frequency control of microgrid in presence of micro-turbine inter...IJECEIAES
ย
The active and reactive load changes have a significant impact on voltage
and frequency. In this paper, in order to stabilize the microgrid (MG) against
load variations in islanding mode, the active and reactive power of all
distributed generators (DGs), including energy storage (battery), diesel
generator, and micro-turbine, are controlled. The micro-turbine generator is
connected to MG through a three-phase to three-phase matrix converter, and
the droop control method is applied for controlling the voltage and
frequency of MG. In addition, a method is introduced for voltage and
frequency control of micro-turbines in the transition state from gridconnected mode to islanding mode. A novel switching strategy of the matrix
converter is used for converting the high-frequency output voltage of the
micro-turbine to the grid-side frequency of the utility system. Moreover,
using the switching strategy, the low-order harmonics in the output current
and voltage are not produced, and consequently, the size of the output filter
would be reduced. In fact, the suggested control strategy is load-independent
and has no frequency conversion restrictions. The proposed approach for
voltage and frequency regulation demonstrates exceptional performance and
favorable response across various load alteration scenarios. The suggested
strategy is examined in several scenarios in the MG test systems, and the
simulation results are addressed.
More Related Content
Similar to Compressive speech enhancement using semi-soft thresholding and improved threshold estimation
Performance analysis of compressive sensing recovery algorithms for image pr...IJECEIAES
ย
The modern digital world comprises of transmitting media files like image, audio, and video which leads to usage of large memory storage, high data transmission rate, and a lot of sensory devices. Compressive sensing (CS) is a sampling theory that compresses the signal at the time of acquiring it. Compressive sensing samples the signal efficiently below the Nyquist rate to minimize storage and recoveries back the signal significantly minimizing the data rate and few sensors. The proposed paper proceeds with three phases. The first phase describes various measurement matrices like Gaussian matrix, circulant matrix, and special random matrices which are the basic foundation of compressive sensing technique that finds its application in various fields like wireless sensors networks (WSN), internet of things (IoT), video processing, biomedical applications, and many. Finally, the paper analyses the performance of the various reconstruction algorithms of compressive sensing like basis pursuit (BP), compressive sampling matching pursuit (CoSaMP), iteratively reweighted least square (IRLS), iterative hard thresholding (IHT), block processing-based basis pursuit (BP-BP) based on mean square error (MSE), and peak signal to noise ratio (PSNR) and then concludes with future works.
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALSijiert bestjournal
ย
This paper presents a review of LPC methods for enhancement of speech signals. The purpose of all method of speech enhancement is to impr ove quality of speech signal by minimizing the background noise . This paper especially comments on Power Spectral Subtraction,Multiband Spectral Subtraction,Non - Linear Spectral Subtraction Method,MMSE Spectral Subtraction Method and Spectral Subtraction based on perceptual properties. As the spectral subtraction produces the Residual noise,Musical noise,Linear Predictive Analysis method is used to enhance the Speech.
Performance of Matching Algorithmsfor Signal Approximationiosrjce
ย
IOSR Journal of Electronics and Communication Engineering(IOSR-JECE) is a double blind peer reviewed International Journal that provides rapid publication (within a month) of articles in all areas of electronics and communication engineering and its applications. The journal welcomes publications of high quality papers on theoretical developments and practical applications in electronics and communication engineering. Original research papers, state-of-the-art reviews, and high quality technical notes are invited for publications.
Improved nonlocal means based on pre classification and invariant block matchingIAEME Publication
ย
One of the most popular image denoising methods based on self-similarity is called nonlocal
means (NLM). Though it can achieve remarkable performance, this method has a few shortcomings,
e.g., the computationally expensive calculation of the similarity measure, and the lack of reliable
candidates for some non repetitive patches. In this paper, we propose to improve NLM by integrating
Gaussian blur, clustering, and row image weighted averaging into the NLM framework.
Experimental results show that the proposed technique can perform denoising better than the original
NLM both quantitatively and visually, especially when the noise level is high.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
IMPROVEMENT OF BM3D ALGORITHM AND EMPLOYMENT TO SATELLITE AND CFA IMAGES DENO...ijistjournal
ย
This paper proposes a new procedure in order to improve the performance of block matching and 3-D filtering (BM3D) image denoising algorithm. It is demonstrated that it is possible to achieve a better performance than that of BM3D algorithm in a variety of noise levels. This method changes BM3D algorithm parameter values according to noise level, removes prefiltering, which is used in high noise level; therefore Peak Signal-to-Noise Ratio (PSNR) and visual quality get improved, and BM3D complexities and processing time are reduced. This improved BM3D algorithm is extended and used to denoise satellite and color filter array (CFA) images. Output results show that the performance has upgraded in comparison with current methods of denoising satellite and CFA images. In this regard this algorithm is compared with Adaptive PCA algorithm, that has led to superior performance for denoising CFA images, on the subject of PSNR and visual quality. Also the processing time has decreased significantly.
IMPROVEMENT OF BM3D ALGORITHM AND EMPLOYMENT TO SATELLITE AND CFA IMAGES DENO...ijistjournal
ย
This paper proposes a new procedure in order to improve the performance of block matching and 3-D filtering (BM3D) image denoising algorithm. It is demonstrated that it is possible to achieve a better performance than that of BM3D algorithm in a variety of noise levels. This method changes BM3D algorithm parameter values according to noise level, removes prefiltering, which is used in high noise level; therefore Peak Signal-to-Noise Ratio (PSNR) and visual quality get improved, and BM3D complexities and processing time are reduced. This improved BM3D algorithm is extended and used to denoise satellite and color filter array (CFA) images. Output results show that the performance has upgraded in comparison with current methods of denoising satellite and CFA images. In this regard this algorithm is compared with Adaptive PCA algorithm, that has led to superior performance for denoising CFA images, on the subject of PSNR and visual quality. Also the processing time has decreased significantly.
Suppression of noise in noisy speech signal is required in many speech enhancement applications like signal recording and transmission from one place to other. In this paper a novel single line noise cancellation system is proposed using derivative of normalized least mean spare algorithm. The proposed system has two phases. The first phase is generation of secondary reference signal from incoming primary signal itself at initial silence period and pause between two words, which is essential while adaptive filter using as noise canceller. Second phase is noise cancellation using proposed modified error data normalized step size (EDNSS) algorithm. The performance of the proposed algorithm is compared with normalized least mean square (NLMS) algorithm and original EDNSS algorithm using standard IEEE sentence (SP23) of Noizeus data base with different types of real-world noise at different level of signal to noise ratio (SNR). The output of proposed, NLMS and EDNSS algorithm are measured with output SNR, excessive mean square error (EMSE) and misadjustment (M). The results clearly illustrates that the proposed algorithm gives improved result over conventional NLMS and EDNSS algorithm. The speed of convergence is also maintained as same conventional NLMS algorithm.
Comparative performance analysis of channel normalization techniqueseSAT Journals
ย
Abstract A major part of the interaction between humans takes place via speech communication. The speech signal carries both useful and unwanted information. Processing of such signals involve enhancing the useful information. The intelligibility of speech signals is significantly reduced due to the presence of unwanted information such as noise. Channel normalization algorithms suppress such additive noise introduced in the speech signals by transmission channel or by recording environment conditions. Enhancing the quality and intelligibility of speech signals improve the performance of speech systems such as Automatic speech recognition (ASR) , voice communication and hearing aids to name the few. Based on the experimental results the comparative analysis of channel normalization techniques have been presented in this paper to find out the most suitable algorithm for enhancing the speech signals. Keywords: Cepstral Mean Normalization, Spectral Subtraction, Weiner filter, Signal to Noise Ratio
Comparitive analysis of doa and beamforming algorithms for smart antenna systemseSAT Journals
ย
Abstract This paper revolves around the implementation of Direction of arrival and Adaptive beam-forming algorithms for Smart Antenna Systems. This paper also investigates the implementation of algorithms on various planner array geometries viz. circular and rectangular. Music algorithm is primarily finds the possible location of desired user and adaptive beam-forming algorithms such as LMS, RLS and CMA algorithms adapts the weights of the array. DOA estimation gives the maximum peak of spectrum with respect to angle of arrival where the desired user is supposed to exist. After DOA estimation weights of array antenna are changed with the changing received signal. This methodology is called as Spectral estimation, which allows the antenna pattern to steer in desired direction estimated by DOA and simultaneously null out the interfering signals. Rate of convergence is the major criterion for comparison for adaptive beam-forming algorithms. Keywords: DOA, MUSIC, LMS, RLS, CMA, SAS.
Similar to Compressive speech enhancement using semi-soft thresholding and improved threshold estimation (20)
Bibliometric analysis highlighting the role of women in addressing climate ch...IJECEIAES
ย
Fossil fuel consumption increased quickly, contributing to climate change
that is evident in unusual flooding and draughts, and global warming. Over
the past ten years, women's involvement in society has grown dramatically,
and they succeeded in playing a noticeable role in reducing climate change.
A bibliometric analysis of data from the last ten years has been carried out to
examine the role of women in addressing the climate change. The analysis's
findings discussed the relevant to the sustainable development goals (SDGs),
particularly SDG 7 and SDG 13. The results considered contributions made
by women in the various sectors while taking geographic dispersion into
account. The bibliometric analysis delves into topics including women's
leadership in environmental groups, their involvement in policymaking, their
contributions to sustainable development projects, and the influence of
gender diversity on attempts to mitigate climate change. This study's results
highlight how women have influenced policies and actions related to climate
change, point out areas of research deficiency and recommendations on how
to increase role of the women in addressing the climate change and
achieving sustainability. To achieve more successful results, this initiative
aims to highlight the significance of gender equality and encourage
inclusivity in climate change decision-making processes.
Voltage and frequency control of microgrid in presence of micro-turbine inter...IJECEIAES
ย
The active and reactive load changes have a significant impact on voltage
and frequency. In this paper, in order to stabilize the microgrid (MG) against
load variations in islanding mode, the active and reactive power of all
distributed generators (DGs), including energy storage (battery), diesel
generator, and micro-turbine, are controlled. The micro-turbine generator is
connected to MG through a three-phase to three-phase matrix converter, and
the droop control method is applied for controlling the voltage and
frequency of MG. In addition, a method is introduced for voltage and
frequency control of micro-turbines in the transition state from gridconnected mode to islanding mode. A novel switching strategy of the matrix
converter is used for converting the high-frequency output voltage of the
micro-turbine to the grid-side frequency of the utility system. Moreover,
using the switching strategy, the low-order harmonics in the output current
and voltage are not produced, and consequently, the size of the output filter
would be reduced. In fact, the suggested control strategy is load-independent
and has no frequency conversion restrictions. The proposed approach for
voltage and frequency regulation demonstrates exceptional performance and
favorable response across various load alteration scenarios. The suggested
strategy is examined in several scenarios in the MG test systems, and the
simulation results are addressed.
Enhancing battery system identification: nonlinear autoregressive modeling fo...IJECEIAES
ย
Precisely characterizing Li-ion batteries is essential for optimizing their
performance, enhancing safety, and prolonging their lifespan across various
applications, such as electric vehicles and renewable energy systems. This
article introduces an innovative nonlinear methodology for system
identification of a Li-ion battery, employing a nonlinear autoregressive with
exogenous inputs (NARX) model. The proposed approach integrates the
benefits of nonlinear modeling with the adaptability of the NARX structure,
facilitating a more comprehensive representation of the intricate
electrochemical processes within the battery. Experimental data collected
from a Li-ion battery operating under diverse scenarios are employed to
validate the effectiveness of the proposed methodology. The identified
NARX model exhibits superior accuracy in predicting the battery's behavior
compared to traditional linear models. This study underscores the
importance of accounting for nonlinearities in battery modeling, providing
insights into the intricate relationships between state-of-charge, voltage, and
current under dynamic conditions.
Smart grid deployment: from a bibliometric analysis to a surveyIJECEIAES
ย
Smart grids are one of the last decades' innovations in electrical energy.
They bring relevant advantages compared to the traditional grid and
significant interest from the research community. Assessing the field's
evolution is essential to propose guidelines for facing new and future smart
grid challenges. In addition, knowing the main technologies involved in the
deployment of smart grids (SGs) is important to highlight possible
shortcomings that can be mitigated by developing new tools. This paper
contributes to the research trends mentioned above by focusing on two
objectives. First, a bibliometric analysis is presented to give an overview of
the current research level about smart grid deployment. Second, a survey of
the main technological approaches used for smart grid implementation and
their contributions are highlighted. To that effect, we searched the Web of
Science (WoS), and the Scopus databases. We obtained 5,663 documents
from WoS and 7,215 from Scopus on smart grid implementation or
deployment. With the extraction limitation in the Scopus database, 5,872 of
the 7,215 documents were extracted using a multi-step process. These two
datasets have been analyzed using a bibliometric tool called bibliometrix.
The main outputs are presented with some recommendations for future
research.
Use of analytical hierarchy process for selecting and prioritizing islanding ...IJECEIAES
ย
One of the problems that are associated to power systems is islanding
condition, which must be rapidly and properly detected to prevent any
negative consequences on the system's protection, stability, and security.
This paper offers a thorough overview of several islanding detection
strategies, which are divided into two categories: classic approaches,
including local and remote approaches, and modern techniques, including
techniques based on signal processing and computational intelligence.
Additionally, each approach is compared and assessed based on several
factors, including implementation costs, non-detected zones, declining
power quality, and response times using the analytical hierarchy process
(AHP). The multi-criteria decision-making analysis shows that the overall
weight of passive methods (24.7%), active methods (7.8%), hybrid methods
(5.6%), remote methods (14.5%), signal processing-based methods (26.6%),
and computational intelligent-based methods (20.8%) based on the
comparison of all criteria together. Thus, it can be seen from the total weight
that hybrid approaches are the least suitable to be chosen, while signal
processing-based methods are the most appropriate islanding detection
method to be selected and implemented in power system with respect to the
aforementioned factors. Using Expert Choice software, the proposed
hierarchy model is studied and examined.
Enhancing of single-stage grid-connected photovoltaic system using fuzzy logi...IJECEIAES
ย
The power generated by photovoltaic (PV) systems is influenced by
environmental factors. This variability hampers the control and utilization of
solar cells' peak output. In this study, a single-stage grid-connected PV
system is designed to enhance power quality. Our approach employs fuzzy
logic in the direct power control (DPC) of a three-phase voltage source
inverter (VSI), enabling seamless integration of the PV connected to the
grid. Additionally, a fuzzy logic-based maximum power point tracking
(MPPT) controller is adopted, which outperforms traditional methods like
incremental conductance (INC) in enhancing solar cell efficiency and
minimizing the response time. Moreover, the inverter's real-time active and
reactive power is directly managed to achieve a unity power factor (UPF).
The system's performance is assessed through MATLAB/Simulink
implementation, showing marked improvement over conventional methods,
particularly in steady-state and varying weather conditions. For solar
irradiances of 500 and 1,000 W/m2
, the results show that the proposed
method reduces the total harmonic distortion (THD) of the injected current
to the grid by approximately 46% and 38% compared to conventional
methods, respectively. Furthermore, we compare the simulation results with
IEEE standards to evaluate the system's grid compatibility.
Enhancing photovoltaic system maximum power point tracking with fuzzy logic-b...IJECEIAES
ย
Photovoltaic systems have emerged as a promising energy resource that
caters to the future needs of society, owing to their renewable, inexhaustible,
and cost-free nature. The power output of these systems relies on solar cell
radiation and temperature. In order to mitigate the dependence on
atmospheric conditions and enhance power tracking, a conventional
approach has been improved by integrating various methods. To optimize
the generation of electricity from solar systems, the maximum power point
tracking (MPPT) technique is employed. To overcome limitations such as
steady-state voltage oscillations and improve transient response, two
traditional MPPT methods, namely fuzzy logic controller (FLC) and perturb
and observe (P&O), have been modified. This research paper aims to
simulate and validate the step size of the proposed modified P&O and FLC
techniques within the MPPT algorithm using MATLAB/Simulink for
efficient power tracking in photovoltaic systems.
Adaptive synchronous sliding control for a robot manipulator based on neural ...IJECEIAES
ย
Robot manipulators have become important equipment in production lines, medical fields, and transportation. Improving the quality of trajectory tracking for
robot hands is always an attractive topic in the research community. This is a
challenging problem because robot manipulators are complex nonlinear systems
and are often subject to fluctuations in loads and external disturbances. This
article proposes an adaptive synchronous sliding control scheme to improve trajectory tracking performance for a robot manipulator. The proposed controller
ensures that the positions of the joints track the desired trajectory, synchronize
the errors, and significantly reduces chattering. First, the synchronous tracking
errors and synchronous sliding surfaces are presented. Second, the synchronous
tracking error dynamics are determined. Third, a robust adaptive control law is
designed,the unknown components of the model are estimated online by the neural network, and the parameters of the switching elements are selected by fuzzy
logic. The built algorithm ensures that the tracking and approximation errors
are ultimately uniformly bounded (UUB). Finally, the effectiveness of the constructed algorithm is demonstrated through simulation and experimental results.
Simulation and experimental results show that the proposed controller is effective with small synchronous tracking errors, and the chattering phenomenon is
significantly reduced.
Remote field-programmable gate array laboratory for signal acquisition and de...IJECEIAES
ย
A remote laboratory utilizing field-programmable gate array (FPGA) technologies enhances studentsโ learning experience anywhere and anytime in embedded system design. Existing remote laboratories prioritize hardware access and visual feedback for observing board behavior after programming, neglecting comprehensive debugging tools to resolve errors that require internal signal acquisition. This paper proposes a novel remote embeddedsystem design approach targeting FPGA technologies that are fully interactive via a web-based platform. Our solution provides FPGA board access and debugging capabilities beyond the visual feedback provided by existing remote laboratories. We implemented a lab module that allows users to seamlessly incorporate into their FPGA design. The module minimizes hardware resource utilization while enabling the acquisition of a large number of data samples from the signal during the experiments by adaptively compressing the signal prior to data transmission. The results demonstrate an average compression ratio of 2.90 across three benchmark signals, indicating efficient signal acquisition and effective debugging and analysis. This method allows users to acquire more data samples than conventional methods. The proposed lab allows students to remotely test and debug their designs, bridging the gap between theory and practice in embedded system design.
Detecting and resolving feature envy through automated machine learning and m...IJECEIAES
ย
Efficiently identifying and resolving code smells enhances software project quality. This paper presents a novel solution, utilizing automated machine learning (AutoML) techniques, to detect code smells and apply move method refactoring. By evaluating code metrics before and after refactoring, we assessed its impact on coupling, complexity, and cohesion. Key contributions of this research include a unique dataset for code smell classification and the development of models using AutoGluon for optimal performance. Furthermore, the study identifies the top 20 influential features in classifying feature envy, a well-known code smell, stemming from excessive reliance on external classes. We also explored how move method refactoring addresses feature envy, revealing reduced coupling and complexity, and improved cohesion, ultimately enhancing code quality. In summary, this research offers an empirical, data-driven approach, integrating AutoML and move method refactoring to optimize software project quality. Insights gained shed light on the benefits of refactoring on code quality and the significance of specific features in detecting feature envy. Future research can expand to explore additional refactoring techniques and a broader range of code metrics, advancing software engineering practices and standards.
Smart monitoring technique for solar cell systems using internet of things ba...IJECEIAES
ย
Rapidly and remotely monitoring and receiving the solar cell systems status parameters, solar irradiance, temperature, and humidity, are critical issues in enhancement their efficiency. Hence, in the present article an improved smart prototype of internet of things (IoT) technique based on embedded system through NodeMCU ESP8266 (ESP-12E) was carried out experimentally. Three different regions at Egypt; Luxor, Cairo, and El-Beheira cities were chosen to study their solar irradiance profile, temperature, and humidity by the proposed IoT system. The monitoring data of solar irradiance, temperature, and humidity were live visualized directly by Ubidots through hypertext transfer protocol (HTTP) protocol. The measured solar power radiation in Luxor, Cairo, and El-Beheira ranged between 216-1000, 245-958, and 187-692 W/m 2 respectively during the solar day. The accuracy and rapidity of obtaining monitoring results using the proposed IoT system made it a strong candidate for application in monitoring solar cell systems. On the other hand, the obtained solar power radiation results of the three considered regions strongly candidate Luxor and Cairo as suitable places to build up a solar cells system station rather than El-Beheira.
An efficient security framework for intrusion detection and prevention in int...IJECEIAES
ย
Over the past few years, the internet of things (IoT) has advanced to connect billions of smart devices to improve quality of life. However, anomalies or malicious intrusions pose several security loopholes, leading to performance degradation and threat to data security in IoT operations. Thereby, IoT security systems must keep an eye on and restrict unwanted events from occurring in the IoT network. Recently, various technical solutions based on machine learning (ML) models have been derived towards identifying and restricting unwanted events in IoT. However, most ML-based approaches are prone to miss-classification due to inappropriate feature selection. Additionally, most ML approaches applied to intrusion detection and prevention consider supervised learning, which requires a large amount of labeled data to be trained. Consequently, such complex datasets are impossible to source in a large network like IoT. To address this problem, this proposed study introduces an efficient learning mechanism to strengthen the IoT security aspects. The proposed algorithm incorporates supervised and unsupervised approaches to improve the learning models for intrusion detection and mitigation. Compared with the related works, the experimental outcome shows that the model performs well in a benchmark dataset. It accomplishes an improved detection accuracy of approximately 99.21%.
Developing a smart system for infant incubators using the internet of things ...IJECEIAES
ย
This research is developing an incubator system that integrates the internet of things and artificial intelligence to improve care for premature babies. The system workflow starts with sensors that collect data from the incubator. Then, the data is sent in real-time to the internet of things (IoT) broker eclipse mosquito using the message queue telemetry transport (MQTT) protocol version 5.0. After that, the data is stored in a database for analysis using the long short-term memory network (LSTM) method and displayed in a web application using an application programming interface (API) service. Furthermore, the experimental results produce as many as 2,880 rows of data stored in the database. The correlation coefficient between the target attribute and other attributes ranges from 0.23 to 0.48. Next, several experiments were conducted to evaluate the model-predicted value on the test data. The best results are obtained using a two-layer LSTM configuration model, each with 60 neurons and a lookback setting 6. This model produces an R 2 value of 0.934, with a root mean square error (RMSE) value of 0.015 and a mean absolute error (MAE) of 0.008. In addition, the R 2 value was also evaluated for each attribute used as input, with a result of values between 0.590 and 0.845.
A review on internet of things-based stingless bee's honey production with im...IJECEIAES
ย
Honey is produced exclusively by honeybees and stingless bees which both are well adapted to tropical and subtropical regions such as Malaysia. Stingless bees are known for producing small amounts of honey and are known for having a unique flavor profile. Problem identified that many stingless bees collapsed due to weather, temperature and environment. It is critical to understand the relationship between the production of stingless bee honey and environmental conditions to improve honey production. Thus, this paper presents a review on stingless bee's honey production and prediction modeling. About 54 previous research has been analyzed and compared in identifying the research gaps. A framework on modeling the prediction of stingless bee honey is derived. The result presents the comparison and analysis on the internet of things (IoT) monitoring systems, honey production estimation, convolution neural networks (CNNs), and automatic identification methods on bee species. It is identified based on image detection method the top best three efficiency presents CNN is at 98.67%, densely connected convolutional networks with YOLO v3 is 97.7%, and DenseNet201 convolutional networks 99.81%. This study is significant to assist the researcher in developing a model for predicting stingless honey produced by bee's output, which is important for a stable economy and food security.
A trust based secure access control using authentication mechanism for intero...IJECEIAES
ย
The internet of things (IoT) is a revolutionary innovation in many aspects of our society including interactions, financial activity, and global security such as the military and battlefield internet. Due to the limited energy and processing capacity of network devices, security, energy consumption, compatibility, and device heterogeneity are the long-term IoT problems. As a result, energy and security are critical for data transmission across edge and IoT networks. Existing IoT interoperability techniques need more computation time, have unreliable authentication mechanisms that break easily, lose data easily, and have low confidentiality. In this paper, a key agreement protocol-based authentication mechanism for IoT devices is offered as a solution to this issue. This system makes use of information exchange, which must be secured to prevent access by unauthorized users. Using a compact contiki/cooja simulator, the performance and design of the suggested framework are validated. The simulation findings are evaluated based on detection of malicious nodes after 60 minutes of simulation. The suggested trust method, which is based on privacy access control, reduced packet loss ratio to 0.32%, consumed 0.39% power, and had the greatest average residual energy of 0.99 mJoules at 10 nodes.
Fuzzy linear programming with the intuitionistic polygonal fuzzy numbersIJECEIAES
ย
In real world applications, data are subject to ambiguity due to several factors; fuzzy sets and fuzzy numbers propose a great tool to model such ambiguity. In case of hesitation, the complement of a membership value in fuzzy numbers can be different from the non-membership value, in which case we can model using intuitionistic fuzzy numbers as they provide flexibility by defining both a membership and a non-membership functions. In this article, we consider the intuitionistic fuzzy linear programming problem with intuitionistic polygonal fuzzy numbers, which is a generalization of the previous polygonal fuzzy numbers found in the literature. We present a modification of the simplex method that can be used to solve any general intuitionistic fuzzy linear programming problem after approximating the problem by an intuitionistic polygonal fuzzy number with n edges. This method is given in a simple tableau formulation, and then applied on numerical examples for clarity.
The performance of artificial intelligence in prostate magnetic resonance im...IJECEIAES
ย
Prostate cancer is the predominant form of cancer observed in men worldwide. The application of magnetic resonance imaging (MRI) as a guidance tool for conducting biopsies has been established as a reliable and well-established approach in the diagnosis of prostate cancer. The diagnostic performance of MRI-guided prostate cancer diagnosis exhibits significant heterogeneity due to the intricate and multi-step nature of the diagnostic pathway. The development of artificial intelligence (AI) models, specifically through the utilization of machine learning techniques such as deep learning, is assuming an increasingly significant role in the field of radiology. In the realm of prostate MRI, a considerable body of literature has been dedicated to the development of various AI algorithms. These algorithms have been specifically designed for tasks such as prostate segmentation, lesion identification, and classification. The overarching objective of these endeavors is to enhance diagnostic performance and foster greater agreement among different observers within MRI scans for the prostate. This review article aims to provide a concise overview of the application of AI in the field of radiology, with a specific focus on its utilization in prostate MRI.
Seizure stage detection of epileptic seizure using convolutional neural networksIJECEIAES
ย
According to the World Health Organization (WHO), seventy million individuals worldwide suffer from epilepsy, a neurological disorder. While electroencephalography (EEG) is crucial for diagnosing epilepsy and monitoring the brain activity of epilepsy patients, it requires a specialist to examine all EEG recordings to find epileptic behavior. This procedure needs an experienced doctor, and a precise epilepsy diagnosis is crucial for appropriate treatment. To identify epileptic seizures, this study employed a convolutional neural network (CNN) based on raw scalp EEG signals to discriminate between preictal, ictal, postictal, and interictal segments. The possibility of these characteristics is explored by examining how well timedomain signals work in the detection of epileptic signals using intracranial Freiburg Hospital (FH), scalp Children's Hospital Boston-Massachusetts Institute of Technology (CHB-MIT) databases, and Temple University Hospital (TUH) EEG. To test the viability of this approach, two types of experiments were carried out. Firstly, binary class classification (preictal, ictal, postictal each versus interictal) and four-class classification (interictal versus preictal versus ictal versus postictal). The average accuracy for stage detection using CHB-MIT database was 84.4%, while the Freiburg database's time-domain signals had an accuracy of 79.7% and the highest accuracy of 94.02% for classification in the TUH EEG database when comparing interictal stage to preictal stage.
Analysis of driving style using self-organizing maps to analyze driver behaviorIJECEIAES
ย
Modern life is strongly associated with the use of cars, but the increase in acceleration speeds and their maneuverability leads to a dangerous driving style for some drivers. In these conditions, the development of a method that allows you to track the behavior of the driver is relevant. The article provides an overview of existing methods and models for assessing the functioning of motor vehicles and driver behavior. Based on this, a combined algorithm for recognizing driving style is proposed. To do this, a set of input data was formed, including 20 descriptive features: About the environment, the driver's behavior and the characteristics of the functioning of the car, collected using OBD II. The generated data set is sent to the Kohonen network, where clustering is performed according to driving style and degree of danger. Getting the driving characteristics into a particular cluster allows you to switch to the private indicators of an individual driver and considering individual driving characteristics. The application of the method allows you to identify potentially dangerous driving styles that can prevent accidents.
Hyperspectral object classification using hybrid spectral-spatial fusion and ...IJECEIAES
ย
Because of its spectral-spatial and temporal resolution of greater areas, hyperspectral imaging (HSI) has found widespread application in the field of object classification. The HSI is typically used to accurately determine an object's physical characteristics as well as to locate related objects with appropriate spectral fingerprints. As a result, the HSI has been extensively applied to object identification in several fields, including surveillance, agricultural monitoring, environmental research, and precision agriculture. However, because of their enormous size, objects require a lot of time to classify; for this reason, both spectral and spatial feature fusion have been completed. The existing classification strategy leads to increased misclassification, and the feature fusion method is unable to preserve semantic object inherent features; This study addresses the research difficulties by introducing a hybrid spectral-spatial fusion (HSSF) technique to minimize feature size while maintaining object intrinsic qualities; Lastly, a soft-margins kernel is proposed for multi-layer deep support vector machine (MLDSVM) to reduce misclassification. The standard Indian pines dataset is used for the experiment, and the outcome demonstrates that the HSSF-MLDSVM model performs substantially better in terms of accuracy and Kappa coefficient.
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...Amil Baba Dawood bangali
ย
Contact with Dawood Bhai Just call on +92322-6382012 and we'll help you. We'll solve all your problems within 12 to 24 hours and with 101% guarantee and with astrology systematic. If you want to take any personal or professional advice then also you can call us on +92322-6382012 , ONLINE LOVE PROBLEM & Other all types of Daily Life Problem's.Then CALL or WHATSAPP us on +92322-6382012 and Get all these problems solutions here by Amil Baba DAWOOD BANGALI
#vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore#blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #blackmagicforlove #blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #Amilbabainuk #amilbabainspain #amilbabaindubai #Amilbabainnorway #amilbabainkrachi #amilbabainlahore #amilbabaingujranwalan #amilbabainislamabad
Saudi Arabia stands as a titan in the global energy landscape, renowned for its abundant oil and gas resources. It's the largest exporter of petroleum and holds some of the world's most significant reserves. Let's delve into the top 10 oil and gas projects shaping Saudi Arabia's energy future in 2024.
Welcome to WIPAC Monthly the magazine brought to you by the LinkedIn Group Water Industry Process Automation & Control.
In this month's edition, along with this month's industry news to celebrate the 13 years since the group was created we have articles including
A case study of the used of Advanced Process Control at the Wastewater Treatment works at Lleida in Spain
A look back on an article on smart wastewater networks in order to see how the industry has measured up in the interim around the adoption of Digital Transformation in the Water Industry.
Literature Review Basics and Understanding Reference Management.pptxDr Ramhari Poudyal
ย
Three-day training on academic research focuses on analytical tools at United Technical College, supported by the University Grant Commission, Nepal. 24-26 May 2024
Water billing management system project report.pdfKamal Acharya
ย
Our project entitled โWater Billing Management Systemโ aims is to generate Water bill with all the charges and penalty. Manual system that is employed is extremely laborious and quite inadequate. It only makes the process more difficult and hard.
The aim of our project is to develop a system that is meant to partially computerize the work performed in the Water Board like generating monthly Water bill, record of consuming unit of water, store record of the customer and previous unpaid record.
We used HTML/PHP as front end and MYSQL as back end for developing our project. HTML is primarily a visual design environment. We can create a android application by designing the form and that make up the user interface. Adding android application code to the form and the objects such as buttons and text boxes on them and adding any required support code in additional modular.
MySQL is free open source database that facilitates the effective management of the databases by connecting them to the software. It is a stable ,reliable and the powerful solution with the advanced features and advantages which are as follows: Data Security.MySQL is free open source database that facilitates the effective management of the databases by connecting them to the software.
Student information management system project report ii.pdfKamal Acharya
ย
Our project explains about the student management. This project mainly explains the various actions related to student details. This project shows some ease in adding, editing and deleting the student details. It also provides a less time consuming process for viewing, adding, editing and deleting the marks of the students.
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)MdTanvirMahtab2
ย
This presentation is about the working procedure of Shahjalal Fertilizer Company Limited (SFCL). A Govt. owned Company of Bangladesh Chemical Industries Corporation under Ministry of Industries.
Cosmetic shop management system project report.pdfKamal Acharya
ย
Buying new cosmetic products is difficult. It can even be scary for those who have sensitive skin and are prone to skin trouble. The information needed to alleviate this problem is on the back of each product, but it's thought to interpret those ingredient lists unless you have a background in chemistry.
Instead of buying and hoping for the best, we can use data science to help us predict which products may be good fits for us. It includes various function programs to do the above mentioned tasks.
Data file handling has been effectively used in the program.
The automated cosmetic shop management system should deal with the automation of general workflow and administration process of the shop. The main processes of the system focus on customer's request where the system is able to search the most appropriate products and deliver it to the customers. It should help the employees to quickly identify the list of cosmetic product that have reached the minimum quantity and also keep a track of expired date for each cosmetic product. It should help the employees to find the rack number in which the product is placed.It is also Faster and more efficient way.
Final project report on grocery store management system..pdfKamal Acharya
ย
In todayโs fast-changing business environment, itโs extremely important to be able to respond to client needs in the most effective and timely manner. If your customers wish to see your business online and have instant access to your products or services.
Online Grocery Store is an e-commerce website, which retails various grocery products. This project allows viewing various products available enables registered users to purchase desired products instantly using Paytm, UPI payment processor (Instant Pay) and also can place order by using Cash on Delivery (Pay Later) option. This project provides an easy access to Administrators and Managers to view orders placed using Pay Later and Instant Pay options.
In order to develop an e-commerce website, a number of Technologies must be studied and understood. These include multi-tiered architecture, server and client-side scripting techniques, implementation technologies, programming language (such as PHP, HTML, CSS, JavaScript) and MySQL relational databases. This is a project with the objective to develop a basic website where a consumer is provided with a shopping cart website and also to know about the technologies used to develop such a website.
This document will discuss each of the underlying technologies to create and implement an e- commerce website.
Final project report on grocery store management system..pdf
ย
Compressive speech enhancement using semi-soft thresholding and improved threshold estimation
1. International Journal of Electrical and Computer Engineering (IJECE)
Vol. 13, No. 3, June 2023, pp. 2788~2800
ISSN: 2088-8708, DOI: 10.11591/ijece.v13i3.pp2788-2800 ๏ฒ 2788
Journal homepage: http://ijece.iaescore.com
Compressive speech enhancement using semi-soft thresholding
and improved threshold estimation
Smriti Sahu, Neela Rayavarapu
Electronics and Telecommunication Department, Symbiosis Institute of Technology, Symbiosis International (Deemed University),
Pune, India
Article Info ABSTRACT
Article history:
Received Jun 12, 2022
Revised Jul 19, 2022
Accepted Aug 18, 2022
Compressive speech enhancement is based on the compressive sensing (CS)
sampling theory and utilizes the sparsity of the signal for its enhancement.
To improve the performance of the discrete wavelet transform (DWT) basis-
function based compressive speech enhancement algorithm, this study
presents a semi-soft thresholding approach suggesting improved threshold
estimation and threshold rescaling parameters. The semi-soft thresholding
approach utilizes two thresholds, one threshold value is an improved
universal threshold and the other is calculated based on the initial-silence-
region of the signal. This study suggests that thresholding should be applied
to both detail coefficients and approximation coefficients to remove noise
effectively. The performances of the hard, soft, garrote and semi-soft
thresholding approaches are compared based on objective quality and speech
intelligibility measures. The normalized covariance measure is introduced as
an effective intelligibility measure as it has a strong correlation with the
intelligibility of the speech signal. A visual inspection of the output signal is
used to verify the results. Experiments were conducted on the noisy speech
corpus (NOIZEUS) speech database. The experimental results indicate that
the proposed method of semi-soft thresholding using improved threshold
estimation provides better enhancement compared to the other thresholding
approaches.
Keywords:
Compressive sensing
Discrete wavelet transform
Normalized covariance
measure
Thresholding
This is an open access article under the CC BY-SA license.
Corresponding Author:
Smriti Sahu
Electronics and Telecommunication Department, Symbiosis Institute of Technology, Symbiosis International
(Deemed University)
Lavale, Pune, 412115, India
Email: smritisahu13@gmail.com
1. INTRODUCTION
The quality and intelligibility of a speech signal are severely impaired in noisy environments and
may increase listener fatigue. In this study, the focus is on enhancing the quality of a speech signal that is
corrupted by common additive background noises coming from noisy environments such as streets, airports,
and restaurants. Compressive sensing (CS) is a new sampling approach, which enables us to reconstruct the
signal using significantly less samples compared to the Nyquist sampling process [1]โ[4]. In this way, CS
saves memory as well as processing time. The three important aspects of CS are sparsity [5], incoherence [6]
and restricted isometry property [7]. Low et al. [8] suggests that CS enhances the speech signal by utilizing
its sparse nature and the non-sparse nature of the noise, in the time-frequency domain. Thus, compressive
speech enhancement allows us to utilize resources optimally [9]. Future communication systems would be
required to sense signals rapidly. In this way, CS will be the revolution in the field of next-generation
communication systems [10]โ[14].
2. Int J Elec & Comp Eng ISSN: 2088-8708 ๏ฒ
Compressive speech enhancement using semi-soft thresholding and improved โฆ (Smriti Sahu)
2789
The effectiveness of compressive speech enhancement lies in the correct choice of sensing matrices,
the transform domain to ensure signal sparsity and the recovery algorithm [15]โ[17]. Gaussian random
matrix was selected as the sensing matrix because it satisfies the restricted isometry property of CS, in the
most probabilistic sense [18]. Pilastri et al. [19] presents the effectiveness of basis pursuit (๐1 minimization)
reconstruction algorithm compared to the orthogonal matching pursuit-based algorithms in CS, as applied to
images. Thus, ๐1 minimization is chosen as the reconstruction algorithm [20], [21]. Our previous work has
shown the effectiveness of the discrete wavelet transform (DWT) based basis function for compressive
speech enhancement [22]โ[24]. This study concluded that the Fejรฉr-Korovkin wavelet, with a vanishing
moment N=22 (fk22), was the optimal basis function, for compressive speech enhancement.
This work applies different threshold functions and analyzes the results for compressive speech
enhancement using DWT as the basis function. Donoho [25] suggested two threshold functions; hard and
soft. The hard threshold function creates signal discontinuity near the threshold points. The soft threshold
function solves the signal discontinuity issue, but it causes the constant difference between the input signal
and the output signal. To overcome the shortcomings of the hard and soft threshold functions, Gao [26]
proposed the garrote threshold function. The denoising effect of the garrote threshold is better, but it does not
resolve the constant deviation issue. Thus, the threshold functions should be improved to achieve a better
denoising effect. Jing-Yi et al. [27] presents the basic principles and structure of wavelet threshold functions
and their application for denoising. Jain and Tiwari [28] proposed an adaptive nonlinear mid-threshold
function along with a new threshold estimation method for wavelet-based denoising of phonocardiogram
signals.
In this paper, a semi-soft thresholding approach is proposed for compressive speech enhancement,
which is based on the nonlinear mid-threshold function. In addition, improved threshold estimation and
threshold rescaling parameters to achieve a better enhancement have been proposed. The effectiveness of the
proposed semi-soft threshold function is compared with the hard, soft and garrote threshold function based on
five performance measures: signal-to-noise ratio (SNR), segmental signal-to-noise ratio (SegSNR), root
mean square error (RMSE), perceptual evaluation of speech quality (PESQ) and normalized covariance
metrics measure (NCM) [29]. The obtained results show that, with the proposed method, better enhancement
has been achieved compared to the other three threshold functions, when applied for compressive speech
enhancement.
The remaining paper has been organized as: section 2 provides a brief description of the
compressive speech enhancement process using the DWT basis function. Section 3 presents the wavelet
threshold functions. Section 4 presents the semi-soft thresholding approach and the proposed methods.
Section 5 describes the experimental settings, performance evaluation indices, results, and discussions.
Section 6 summarizes the conclusions of this study.
2. COMPRESSIVE SPEECH ENHANCEMENT USING DWT BASIS FUNCTION
The compressive sensing approach involves four main steps: sparse representation, measurement of
the signal using a sensing matrix, sparse recovery, and reverse sparsity [30]โ[33]. The input signal is
sparsified using the basis function. The sparse output is then measured into a small set of samples using the
sensing matrices. Finally, the signal is reconstructed by reversing the sparsity followed by the sparse
recovery algorithms.
Figure 1 presents the step-by-step process of compressive sensing. In the first step, namely โsparse
representationโ, the signal is projected onto a suitable basis function. During the second step โmeasurementโ,
the sparse signal ๐ฅ โ ๐ ๐
is multiplied with the sensing matrix ๐ โ ๐ ๐ร๐
. Thus, only M number of
measurements ๐ฆ โ ๐ ๐
(M โช ๐) are taken from the sparse signal ๐ฅ. The set of under-sampled measurements
๐ฆ is called the observation vector. The third step, โsparse recoveryโ, is a problem of an underdetermined
system of linear equations. But sparse recovery is possible as the signal is sufficiently sparse, and the sensing
matrix complies with the restricted isometry property (RIP) [7]. The sparsest solution amongst all the
possible solutions is to be found using ๐1 minimization. In the last step, the clean signal is recovered by
reversing the sparsity [34].
Previous work by the authors suggested that the DWT basis function is the optimal basis function
for compressive speech enhancement [23]. Orthogonal wavelets support signal denoising and Fejรฉr-Korovkin
(fk22) wavelet was chosen as the optimal basis function. Earlier studies in the DWT basis-function based
compressive speech enhancement process, suggest that a one-level wavelet decomposition be applied to the
signal. Then the detail coefficients are processed using the CS approach and the approximation coefficients
are used at the reconstruction stage.
3. ๏ฒ ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 3, June 2023: 2788-2800
2790
Figure 1. Compressive sensing process
3. WAVELET THRESHOLD FUNCTIONS
Wavelet denoising approaches diminish the noise by thresholding the wavelet decomposition
coefficients according to the threshold functions. The two important factors for wavelet denoising are the
threshold value and the threshold function, based on which the wavelet decomposition coefficients are shrunk
or scaled. Donoho [35] suggested a universal threshold function as given in (1).
๐โ = ๐โ2 log ๐ (1)
where ๐ denotes the noise variance and ๐ is the signal length. This threshold helps to reduce noise to a large
extent and preserves the information in the input signal. ๐ is calculated as shown in (2).
๐ = ๐๐ด๐ท (|๐ฅ|)/0.6745 (2)
where ๐๐ด๐ท is the median absolute deviation and is calculated as shown in (3):
๐๐ด๐ท = ๐๐๐๐๐๐(|๐ฅ โ ๐๐๐๐๐๐(๐ฅ)|) (3)
The two most common threshold functions are the ones proposed by Donoho [25] and they are the hard
threshold and soft threshold.
Hard threshold: this approach sets the value of the coefficients that are below the threshold value to
zero, and the coefficients that are above the threshold value remain the same. The hard threshold function is
given by (4).
๐๐ป๐ ๐ป(๐ฅ, ๐โ) = {
๐ฅ, |๐ฅ| โฅ ๐โ
0, |๐ฅ| < ๐โ
(4)
where ๐ฅ is the input signal of length ๐ and ๐โ denotes the threshold value. Hard thresholding does not affect
the signal energy significantly, but it causes signal discontinuities.
Soft Threshold: This approach sets the value below the thresholds to zero and performs amplitude
subtraction of the coefficients above the threshold [25]. The soft threshold function is given by (5).
๐๐ป๐ ๐(๐ฅ, ๐โ) = {
๐๐๐(๐ฅ)(|๐ฅ| โ ๐โ), |๐ฅ| โฅ ๐โ
0, |๐ฅ| < ๐โ
(5)
where ๐ฅ is the input signal of length ๐, ๐โ denotes the threshold value and ๐๐๐(. ) represents the signum
function and it gives the sign of ๐ฅ. The soft thresholding approach attenuates the high-frequency coefficients
of the signal, which makes the signal very smooth. Due to this, high-frequency information is lost and there
is a constant difference between the input signal and output signal.
4. Int J Elec & Comp Eng ISSN: 2088-8708 ๏ฒ
Compressive speech enhancement using semi-soft thresholding and improved โฆ (Smriti Sahu)
2791
Garrote Threshold: The garrote threshold is an intermediate threshold function between the hard and
soft threshold functions as it performs hard thresholding of large data values and soft thresholding of small
data values using the Garrotte shrinkage function given by (6).
๐๐ป๐ ๐บ(๐ฅ, ๐โ) = {
(๐ฅ โ ๐โ2
/๐ฅ), |๐ฅ| โฅ ๐โ
0, |๐ฅ| < ๐โ
(6)
where ๐ฅ is the input signal of length ๐ and ๐โ denotes the threshold value. Garrote threshold also attenuates
the signal and causes constant deviation in the output signal with respect to the input signal [36].
4. SEMI-SOFT THRESHOLDING AND PROPOSED METHODS
4.1. Semi-soft threshold function
In order to resolve the issues caused by the hard, soft and Garrotte thresholds, this work explored the
non-linear mid function suggested by Jain and Tiwari [28]. A semi-soft thresholding approach, for
compressive speech enhancement using the DWT basis function, is being proposed here. This approach
performs thresholding in three parts: i) retaining the high-frequency coefficients, ii) setting small value
coefficients to zero, and iii) shrinking the moderate value coefficients according to the nonlinear shrinkage
function given by (7).
๐๐ป๐ ๐๐(๐ฅ, ๐โ๐ฟ, ๐โ๐) = {
๐ฅ, |๐ฅ| > ๐โ๐
๐ฅ3
/๐โ๐
2
๐โ๐ฟ โค |๐ฅ| โค ๐โ๐
0, |๐ฅ| < ๐โ๐ฟ
(7)
where, ๐โ๐ฟ and ๐โ๐ are lower and upper threshold values respectively. The output response for the given
threshold functions, for a linear test input, was generated. The threshold values selected were: ๐โ๐ฟ = 4 and
๐โ๐ = 4.5.
The performance of the threshold functions can be seen in Figure 2. As evident from Figure 2(a),
there is a sharp discontinuity at the threshold point for the hard threshold function. Figure 2(b) shows that
there is a constant deviation between the input and the output response in the case of the soft threshold
function. Figure 2(c) shows that the garrote threshold tries to solve the discontinuity problem of the hard
threshold as well as to decrease the difference between the output and input response. Figure 2(d) displays
that the proposed semi-soft threshold function does not show sharp discontinuities and the output response
gradually reaches zero at the threshold points. Thus, the semi-soft threshold function overcomes the signal
discontinuity issue of the hard threshold function as well as the constant deviation issue of the soft and
garrote threshold functions.
4.2. Wavelet decomposition and proposed method of thresholding the coefficients
The wavelet decomposition process decomposes the signal into low-frequency coefficients
(approximation coefficients) and high-frequency coefficients (detail coefficients) [37], [38]. In the case of
wavelet denoising multilevel wavelet decomposition, wavelet transform is applied to the noisy input, which
generates the noisy wavelet coefficients to the level ๐. The detail coefficients are thresholded for each level
from level 1 to ๐, while the approximation coefficients are used at the wavelet reconstruction stage [39].
Previous work was related to compressive speech enhancement and suggested the use of only one
level wavelet decomposition. Thus, the detail, as well as the approximation coefficients, were quite noisy. If
thresholding is only applied to detail coefficients, the noise present in the low-frequency region will not be
reduced and a noisy output is obtained. Hence, this work suggests the application of the proposed semi-soft
thresholding to both the detail and approximation coefficients, for compressive speech enhancement using
the DWT basis function.
4.3. Threshold estimation
As thresholding is being applied to both the detail and the approximation coefficients, the
effectiveness of the semi-soft thresholding is solely dependent on the selection of the threshold value. A large
threshold value results in a noisy output and a low threshold value will not be effective for noise suppression.
In this work, we propose two threshold estimation approaches based on the universal threshold suggested by
Donoho and Johnstone [40]. The first one is an improved universal threshold ๐โ1 calculated according to (8).
๐โ1 = ๐ โ2 log ๐ (8)
5. ๏ฒ ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 3, June 2023: 2788-2800
2792
where ๐ represents the standard deviation of the detail coefficients and ๐ represents the length of the detail
coefficients. The reason for this is that the standard deviation is useful for describing the variability of the
coefficients and it remains on the same scale as the input coefficients. In the case of the universal threshold
(๐โ = ๐โ2 log ๐ ) suggested in [40], ๐ is the noise variance of the signal.
(a) (b)
(c) (d)
Figure 2. Output responses resulting from (a) hard, (b) soft, (c) garrote, and (d) semi-soft threshold functions,
for linear test input
The second threshold is an initial-silence-region based universal threshold ๐โ2. When the speech
signal is recorded or analysed, there is a period of silence before the utterance of a syllable. This region is
called the initial-silence region, as the speaker takes some time to speak before uttering the first letter or
word. The initial-silence region shows the signal having very little energy, which is equivalent to low noise.
Thus, we can estimate the variance of the noise using this region. The initial-silence-region based universal
threshold ๐โ2 is calculated as shown in (9).
๐โ2 = ๐๐โ2 log ๐๐ (9)
where, ๐๐ and ๐๐ are noise variance and length of the initial-silence-region respectively. Noise variance ๐๐ is
calculated as:
๐๐ = ๐๐ด๐ท (|๐ฅ๐|)/0.6745 (10)
where ๐๐ด๐ท is the median absolute deviation and is calculated as given in (11):
๐๐ด๐ท = ๐๐๐๐๐๐(|๐ฅ๐ โ ๐๐๐๐๐๐(๐ฅ๐)|) (11)
6. Int J Elec & Comp Eng ISSN: 2088-8708 ๏ฒ
Compressive speech enhancement using semi-soft thresholding and improved โฆ (Smriti Sahu)
2793
4.4. Threshold rescaling
Threshold rescaling refers to the optimization of threshold values as they affect the operating range
of the threshold functions. It was found that if we applied the threshold directly, there was some noise residue
present in the enhanced output signal. Thus, threshold rescaling is an important aspect to be considered for
improving the performance of the semi-soft thresholding approach. The rescaled threshold values are given
as (12) and (13):
๐1 = ๐ผ ร ๐โ1 (12)
๐2 = ๐ฝ ร ๐โ2 (13)
ฮฑ and ฮฒ are rescaling constants ranging from 1 to 1.5 and ฮฑ > ฮฒ. Based on the performance comparison for
different values of ฮฑ and ฮฒ, it was found that the value of ฮฑ should be 1.5 so that the rescaled threshold
allows the removal of a large range of noise residue. The optimum value of rescaling constant ฮฒ was found to
be 1.3. By rescaling the threshold values, the operating range of the non-linear mid function is increased. The
higher of the two threshold values is considered as the upper threshold ๐โU and the lower is considered as the
lower threshold ๐โL.
4.5. Method
The experimental steps for the DWT basis-function based compressive speech enhancement are as:
- The noisy speech signal ๐ฅ is taken as the input.
- Framing is essential as we do not process the entire signal in one go when using compressive sensing. The
frame size should not be very small as the frames are further decomposed using DWT. The frame size
should not be very large also as it may increase processing time. Thus, the input speech signal is divided
into non-overlapping frames of 1024 samples each.
- Gaussian random matrix is defined as sensing matrix ๐.
- One-level DWT ๐ is applied to the framed input. We will get the high-frequency coefficients vector ๐๐ท
and low-frequency coefficients vector ๐๐ด.
- The improved universal threshold ๐โ1 and initial-silence region-based threshold ๐โ2 are calculated. After
threshold rescaling, the values of ๐โU and ๐โL are determined.
- Semi-soft thresholding is implemented on the detail as well as the approximation coefficients using the
upper and lower thresholds, ๐โU ๐๐๐ ๐โL. The thresholded coefficients ๐๐ท๐ and ๐๐ด๐ are obtained.
- The observation vector obtained is given in (14).
๐ฆ = ๐ โ ๐๐ท๐ (14)
- The reconstruction algorithm is applied to ๐ฆ to get ๐๐ท๐
โฒ
.
- One-level IDWT is applied to the reconstructed high-frequency coefficients vector ๐๐ท๐
โฒ
and the
thresholded low-frequency coefficients vector ๐๐ด๐ to reverse sparsity.
- The processed frames are merged to form the enhanced speech signal ๐ฅโฒ
.
5. EXPERIMENTAL RESULTS
sp04.wav (male), sp07.wav (male), sp15.wav (female) and sp30.wav (female) were taken from the
noisy speech corpus (NOIZEUS) speech database [41]. These signals were distorted by babble, street, and
airport noises; at 5 dB, 10 dB and 15 dB input SNR. The sampling frequency was 8 kHz. The input signal
which is the original speech signal distorted by one of the above-mentioned noises was framed into non-
overlapping frames of 1,024 samples for compressive speech enhancement process. For the initial-silence-
region based threshold calculation an initial, 500 samples were taken. The compression ratio was chosen to
be 50%. A Gaussian random matrix was selected for the sensing matrix. ๐1 minimization was applied as a
recovery algorithm, for sparse recovery. Objective quality measures, as well as speech intelligibility
measures, were used for the performance assessment of the reconstructed signals.
5.1. Performance evaluation indices
Performance evaluation indices are the parameters which are used to test the effectiveness of the
algorithms. Threshold function performance was assessed with the aid of three objective quality measures:
SNR, SegSNR, RMSE. Two speech intelligibility measures: PESQ and NCM [29], [42].
7. ๏ฒ ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 3, June 2023: 2788-2800
2794
SNR is the ratio between the power of a signal and the power of the background noise. SNR is
expressed in decibels. Higher SNR represents that signal is more than the noise. It is expressed in (15),
where x(n) represents the input signal, while x^' (n) represents the enhanced signal.
๐๐๐ = 10 ๐๐๐10
โ ๐ฅ2(๐)
โ(๐ฅ(๐)โ๐ฅโฒ(๐))
2 (15)
where ๐ฅ(๐) represents the input signal, while ๐ฅโฒ
(๐) represents the enhanced signal.
Segmental SNR is the mean of SNRs of all frames of the speech signal. SegSNR is calculated
using (16).
๐๐๐๐๐๐ =
10
๐
โ log10
โ ๐ฅ2(๐)
๐๐+๐โ1
๐=๐๐
โ (๐ฅ(๐)โ๐ฅโฒ((๐))2
๐๐+๐โ1
๐=๐๐
๐โ1
๐=0 (16)
where ๐ฅ(๐) represents the input signal, ๐ฅโฒ
(๐) represents the enhanced signal, ๐ gives the length of the frame
and ๐ gives the number of frames in the signal.
RMSE is the most commonly used objective measure, which indicates the difference between the
clean signal and the enhanced signal. It is calculated using (17).
๐ ๐๐๐ธ = โ
1
๐
โ (๐ฅ(๐) โ ๐ฅโฒ(๐))2
๐
๐=1 (17)
where N is signal length, ๐ฅ(๐) represents the input signal and ๐ฅโฒ(๐) represents the enhanced signal.
The ITU-T P.862 standard defines the perceptual evaluation of speech quality (PESQ), which
denotes the overall speech quality score. PESQ models the mean opinion score (MOS); it covers a signal
quality scale from 1 (bad) to 5 (excellent) [43]โ[45]. The NCM measure relies on the covariance between the
envelope of the input and output signals [46]. It calculates the correlation coefficient equation between the
input and output signal envelopes and maps it to the NCM index. This value of this parameter indicates the
intelligibility of the speech signal. A higher value indicates higher intelligibility.
5.2. Results and discussions
In the case of the wavelet threshold denoising approach, the reconstructed signal quality is also
related to the number of decomposition levels apart from the selection of threshold value and threshold
function. More decomposition levels are required to achieve better denoising, which will increase the
computation as well as the processing time. At each decomposition level, thresholding is applied to quantify
the detail coefficients. This may lead to the reconstruction error being too large and may reduce signal
quality. While in the case of the proposed compressive speech enhancement process, only one level of
wavelet decomposition is required, and thresholding is applied to both the detail as well as approximation
coefficients, which reduces the complexity of the process as well as provides better enhancement.
In the CS based speech enhancement process, threshold functions were applied to the sparsified
signal, obtained using the DWT basis function, Experiments were performed on two male speech signals
(sp04.wav, sp07.wav) and two female speech signals (sp15.wav, sp30.wav). The performance was observed
for three input SNRs 5, 10, and 15 dB, for three background noises (babble, street, and airport). The
effectiveness of the proposed algorithm has been presented here for one male speech signal (sp04.wav) and
one female speech wave (sp15.wav) distorted by two noises babble and street.
5.2.1. Performance analysis of the threshold functions for compressive speech enhancement of signal
โsp04.wavโ distorted by babble and street noise
Table 1 shows the comparative assessment of the threshold functions for the CS based enhancement
of speech signal sp04.wav (male) corrupted by babble noise and street noise. It may be noted from the table
that the performance of the proposed method of semi-soft thresholding approach is consistently better than
that of the hard, soft and garrote threshold functions in terms of all five performance measures, regardless of
the input SNR and noise type.
Figures 3(a)-(f) to 5(a)-(f) show the comparative examination of threshold functions for compressive
speech enhancement of signal sp04.wav distorted by babble noise and street noise at 5, 10, and 15 dB
respectively. Figures 3(a), 4(a) and 5(a) shows the clean input signal sp04.wav, while the Figures 3(b), 4(b)
and 5(b) shows the noisy speech signal. Figures 3(c), 4(c) and 5(c) shows the compressive speech enhanced
signal using the hard threshold function. Similarly, subfigures (d), (e) and (f) shows the compressive speech
enhanced signal for soft, garrote and semi-soft threshold functions respectively. By visual inspection of the
figures, it is seen that signals were clipped off near the transition regions in the case of the hard threshold. For
8. Int J Elec & Comp Eng ISSN: 2088-8708 ๏ฒ
Compressive speech enhancement using semi-soft thresholding and improved โฆ (Smriti Sahu)
2795
the soft and garrote threshold, there is a constant deviation compared to the input signal. This makes the
signal very smooth, due to which high-frequency information is lost. In the case of semi-soft thresholding,
some noise residue was observed for low input SNR, but results were good in terms of performance
parameter values and there was little or no signal clipping. Thus, the signal intelligibility is good and the
same is suggested by the values obtained for parameters PESQ and NCM.
Table 1. Comparison of threshold functions for compressive speech enhancement of sp04.wav distorted by
babble noise and street noise
Threshold Functions Performance Measures Babble Noise Street Noise
Input SNR (dB) Input SNR (dB)
5 10 15 5 10 15
Hard SNR (dB) 6.9491 8.1201 9.1320 7.5599 8.1063 9.0085
SegSNR(dB) 4.1646 3.7549 3.8061 4.8692 3.9199 3.8125
RMSE 0.0233 0.0186 0.0160 0.0217 0.0186 0.0163
PESQ 1.6769 0.9745 1.2795 1.2590 1.5589 1.7168
NCM 0.7414 0.7786 0.8092 0.8249 0.8201 0.8130
Soft SNR (dB) 3.9215 4.6263 5.0013 4.2819 4.7131 5.0352
SegSNR(dB) 1.8617 1.7906 1.8104 2.2255 1.8954 1.8463
RMSE 0.0330 0.0278 0.0258 0.0316 0.0275 0.0257
PESQ 1.2387 1.0403 1.3169 1.5810 1.6334 1.7556
NCM 0.6556 0.6889 0.7063 0.7408 0.7239 0.7180
Garrote SNR (dB) 5.3171 6.3278 6.9272 5.7635 6.3590 6.9219
SegSNR(dB) 2.7757 2.6424 2.6665 3.2921 2.7703 2.7041
RMSE 0.0281 0.0229 0.0207 0.0267 0.0228 0.0207
PESQ 1.3195 1.0564 1.3403 1.6056 1.6528 1.7632
NCM 0.6771 0.7137 0.7351 0.7719 0.7549 0.7487
Semi-soft SNR (dB) 8.7336 12.2407 16.4597 8.6220 11.0462 15.4204
SegSNR(dB) 5.8638 7.3360 9.4517 5.8721 6.3690 8.5403
RMSE 0.0190 0.0116 0.0069 0.0192 0.0133 0.0078
PESQ 2.5864 2.8241 3.0337 1.8169 2.5004 2.8296
NCM 0.8030 0.9036 0.9488 0.8543 0.9012 0.9452
(a) (a)
(b) (b)
(c) (c)
(d) (d)
(e) (e)
(f) (f)
Figure 3. Compressive speech enhancement results using different threshold functions for sp04.wav distorted
by babble and street noise at 5 dB input SNR (a) clean input signal (sp04.wav), (b) noisy signal
(5 dB input SNR), (c) hard (d) soft, (e) garrote, and (f) semi-soft threshold function
9. ๏ฒ ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 3, June 2023: 2788-2800
2796
Figure 4. Compressive speech enhancement results using different threshold functions for sp04.wav distorted
by babble and street noise at 10 dB input SNR (a) clean input signal (sp04.wav), (b) noisy signal
(10 dB input SNR), (c) hard (d) soft, (e) garrote, and (f) semi-soft threshold function
Figure 5. Compressive speech enhancement results using different threshold functions for sp04.wav distorted
by babble and street noise at 15 dB input SNR (a) clean input signal (sp04.wav), (b) noisy signal
(15 dB input SNR), (c) hard (d) soft, (e) garrote, and (f) semi-soft threshold function
5.2.2. Performance analysis of threshold functions for compressive speech enhancement of signal
โsp15.wavโ distorted by babble and street noise
Table 2 shows the comparative assessment of the threshold functions for the CS based enhancement
of the speech signal, sp15.wav (female), corrupted by babble noise and street noise. It can be observed from
the table that the proposed method of semi-soft thresholding, using the proposed threshold estimation, shows
quantitative effectiveness in terms of the higher values of the SNR, SegSNR, PESQ and NCM parameters
and lower MSE value.
10. Int J Elec & Comp Eng ISSN: 2088-8708 ๏ฒ
Compressive speech enhancement using semi-soft thresholding and improved โฆ (Smriti Sahu)
2797
Table 2. Comparison of threshold functions for compressive speech enhancement of sp15.wav distorted
by babble noise and street noise
Threshold
Functions
Performance
Measures
Babble Noise Street Noise
Input SNR (dB) Input SNR (dB)
5 10 15 5 10 15
Hard SNR (dB) 7.8635 8.8808 9.7073 9.4812 9.3760 9.8832
SegSNR(dB) 5.6624 5.3251 5.2789 7.1728 5.7113 5.4088
RMSE 0.0208 0.0169 0.0148 0.0172 0.0159 0.0145
PESQ 2.2321 1.3001 1.4766 1.7410 1.6008 1.6650
NCM 0.6825 0.7080 0.7525 0.8082 0.7917 0.7925
Soft SNR (dB) 4.3485 5.1686 5.6220 5.3405 5.5184 5.7358
SegSNR(dB) 2.4894 2.4740 2.4992 3.2411 2.6910 2.5653
RMSE 0.0311 0.0259 0.0238 0.0277 0.0248 0.0234
PESQ 1.7105 1.2487 1.3984 1.8705 1.6022 1.6415
NCM 0.5756 0.6121 0.6460 0.6909 0.6844 0.6800
Garrote SNR (dB) 5.9068 6.9653 7.6281 7.2461 7.4180 7.7768
SegSNR(dB) 3.7312 3.6766 3.7040 4.8633 3.9898 3.7943
RMSE 0.0260 0.0210 0.0189 0.0222 0.0199 0.0185
PESQ 1.8389 1.2812 1.4363 1.9168 1.6350 1.6638
NCM 0.6071 0.6426 0.6795 0.7292 0.7226 0.7175
Semi-soft SNR (dB) 9.5422 12.8414 17.1266 10.1959 11.4555 15.5869
SegSNR(dB) 7.3188 9.1442 12.0443 7.8726 7.6667 10.4260
RMSE 0.0171 0.0107 0.0063 0.0158 0.0125 0.0075
PESQ 2.7847 3.0169 3.1815 1.9917 2.2616 2.7882
NCM 0.7667 0.8579 0.9198 0.8292 0.8690 0.9244
Figures 6(a)-(f) to 8(a)-(f) show the comparative results of threshold functions for compressive
speech enhancement of the signal sp15.wav distorted due to babble noise and street noise at 5, 10, and 15 dB
respectively. Figures 6(a), 7(a) and 8(a) shows the clean input signal sp15.wav. Figures 6(b), 7(b) and 8(b)
shows the noisy speech signal. Figures 6(c), 7(c) and 8(c) shows the compressive speech enhanced signal
using the hard threshold function. Similarly, Subfigures (d), (e) and (f) shows the compressive speech
enhanced signal for soft, garrote and semi-soft threshold functions respectively. It can be observed from the
figures that the proposed method of semi-soft thresholding, performs effective noise suppression without any
significant signal loss. Thus, the proposed approach overcomes the signal discontinuity issue caused by hard
thresholding and constant deviation issues caused by soft and garrote thresholding.
Figure 6. Compressive speech enhancement results using different threshold functions for sp15.wav distorted
by babble and street noise at 5 dB input SNR (a) clean input signal (sp15.wav), (b) noisy signal
(5 dB input SNR), (c) hard (d) soft, (e) garrote, and (f) semi-soft threshold function
11. ๏ฒ ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 3, June 2023: 2788-2800
2798
Figure 7. Compressive speech enhancement results using different threshold functions for sp15.wav distorted
by babble and street noise at 10 dB input SNR (a) clean input signal (sp15.wav), (b) noisy signal
(10 dB input SNR), (c) hard (d) soft, (e) garrote, and (f) semi-soft threshold function
Figure 8. Compressive speech enhancement results using different threshold functions for sp15.wav distorted
by babble and street noise at 15 dB input SNR (a) clean input signal (sp15.wav), (b) noisy signal
(15 dB input SNR), (c) hard (d) soft, (e) garrote, and (f) semi-soft threshold function
6. CONCLUSION
To improve the performance of the compressive speech enhancement process, this paper analyzes
the conventional threshold functions and introduces semi-soft thresholding with improved threshold
estimation. The key contributions of the paper are as: i) this study highlights the problem associated with the
traditional wavelet threshold functions and suggests a semi-soft threshold function, which is premised on the
nonlinear mid-threshold function and utilizes two threshold values; ii) improved threshold estimation and
12. Int J Elec & Comp Eng ISSN: 2088-8708 ๏ฒ
Compressive speech enhancement using semi-soft thresholding and improved โฆ (Smriti Sahu)
2799
threshold rescaling parameters have also been proposed here. One threshold is an improved universal
threshold, and the other threshold is estimated using the initial-silence-region of the signal; iii) this study
suggests that thresholding should be applied to both the decomposition coefficients, to achieve effective
noise reduction, in the case of the one-level wavelet decomposition; and iv) the proposed method of semi-soft
thresholding with improved threshold estimation resolves the signal discontinuity issue of the hard threshold
function by using two levels of thresholds. It also retains the large coefficients, thus reducing high-frequency
information loss and constant signal deviation problems created by soft and garrote thresholding.
Experimental results and the visual analysis show that the proposed method improves signal quality
and intelligibility. Thus, it is concluded that the proposed method gives a more accurate threshold estimation
to achieve better denoising and is more effective and feasible when compared with the conventional
threshold functions. A visual inspection of the enhanced speech signal indicates the presence of noise residue
at low SNR, but the parametric analysis suggests that the signal intelligibility is quite good compared to the
other thresholding approaches. Future work should be directed towards improving the algorithm to obtain
effective noise reduction even at low SNRs.
REFERENCES
[1] E. J. Candes, J. Romberg, and T. Tao, โRobust uncertainty principles: exact signal reconstruction from highly incomplete
frequency information,โ IEEE Transactions on Information Theory, vol. 52, no. 2, pp. 489โ509, Feb. 2006, doi:
10.1109/TIT.2005.862083.
[2] D. L. Donoho, M. Elad, and V. N. Temlyakov, โStable recovery of sparse overcomplete representations in the presence of noise,โ
IEEE Transactions on Information Theory, vol. 52, no. 1, pp. 6โ18, Jan. 2006, doi: 10.1109/TIT.2005.860430.
[3] G. Kutyniok, โTheory and applications of compressed sensing,โ GAMM-Mitteilungen, vol. 36, no. 1, pp. 79โ101, Aug. 2013, doi:
10.1002/gamm.201310005.
[4] T. Strohmer, โMeasure what should be measured: progress and challenges in compressive sensing,โ IEEE Signal Processing
Letters, vol. 19, no. 12, pp. 887โ893, Dec. 2012, doi: 10.1109/LSP.2012.2224518.
[5] I. Selesnick, โIntroduction to sparsity in signal processing,โ A tutorial on sparsity-based methods in signal processing, 2012.
https://eeweb.engineering.nyu.edu/iselesni/teaching/lecture_notes/sparsity_intro/sparse_SP_intro.pdf (accessed Aug. 01, 2021).
[6] E. Candรจs and J. Romberg, โSparsity and incoherence in compressive sampling,โ Inverse Problems, vol. 23, no. 3, pp. 969โ985,
Jun. 2007, doi: 10.1088/0266-5611/23/3/008.
[7] E. J. Candรจs, โThe restricted isometry property and its implications for compressed sensing,โ Comptes Rendus Mathematique,
vol. 346, no. 9โ10, pp. 589โ592, May 2008, doi: 10.1016/j.crma.2008.03.014.
[8] S. Y. Low, D. S. Pham, and S. Venkatesh, โCompressive speech enhancement,โ Speech Communication, vol. 55, no. 6,
pp. 757โ768, Jul. 2013, doi: 10.1016/j.specom.2013.03.003.
[9] H. Haneche, B. Boudraa, and A. Ouahabi, โSpeech enhancement using compressed sensing-based method,โ in 2018 International
Conference on Electrical Sciences and Technologies in Maghreb (CISTEM), Oct. 2018, pp. 1โ6, doi:
10.1109/CISTEM.2018.8613609.
[10] M. M. Abo-Zahhad, A. I. Hussein, and A. M. Mohamed, โCompressive sensing algorithms for signal processing applications: a
survey,โ International Journal of Communications, Network and System Sciences, vol. 8, no. 5, pp. 197โ216, 2015, doi:
10.4236/ijcns.2015.85021.
[11] M. G. Christensen, J. Ostergaard, and S. H. Jensen, โOn compressed sensing and its application to speech and audio signals,โ in
2009 Conference Record of the Forty-Third Asilomar Conference on Signals, Systems and Computers, 2009, pp. 356โ360, doi:
10.1109/ACSSC.2009.5469828.
[12] M. A. Davenport, P. T. Boufounos, M. B. Wakin, and R. G. Baraniuk, โSignal processing with compressive measurements,โ IEEE
Journal of Selected Topics in Signal Processing, vol. 4, no. 2, pp. 445โ460, Apr. 2010, doi: 10.1109/JSTSP.2009.2039178.
[13] M. A. Davenport, M. F. Duarte, Y. C. Eldar, and G. Kutyniok, โIntroduction to compressed sensing,โ in Compressed Sensing,
Cambridge University Press, 2012, pp. 1โ64.
[14] O. Holtz, โCompressive sensing: a paradigm shift in signal processing,โ Arxiv.org/abs/0812.3137, Dec. 2008.
[15] T. Savic and R. Albijanic, โCS reconstruction of the speech and musical signals,โ in 2015 4th Mediterranean Conference on
Embedded Computing (MECO), Jun. 2015, pp. 299โ302, doi: 10.1109/MECO.2015.7181927.
[16] T. V Sreenivas and W. B. Kleijn, โCompressive sensing for sparsely excited speech signals,โ in 2009 IEEE International
Conference on Acoustics, Speech and Signal Processing, Apr. 2009, pp. 4125โ4128, doi: 10.1109/ICASSP.2009.4960536.
[17] โCompressive sensing resources,โ Digital Signal Processing at Rice University. http://dsp.rice.edu/cs/ (accessed Oct. 10, 2021).
[18] R. Baraniuk, M. Davenport, R. DeVore, and M. Wakin, โA simple proof of the restricted isometry property for random matrices,โ
Constructive Approximation, vol. 28, no. 3, pp. 253โ263, Dec. 2008, doi: 10.1007/s00365-007-9003-x.
[19] J. M. R. S. Pilastri, Andrรฉ Luiz and Tavares, โReconstruction algorithms in compressive sensing: an overview,โ in 11th edition of
the Doctoral Symposium in Informatics Engineering (DSIE-16), 2016, pp. 127โ137.
[20] E. Candes, E. Candes, J. Romberg, and J. Romberg, โl1-magic: Recovery of sparse signals via convex programming,โ pp. 1โ19,
2005.
[21] D. L. Donoho, โFor most large underdetermined systems of linear equations the minimal 1-norm solution is also the sparsest
solution,โ Communications on Pure and Applied Mathematics, vol. 59, no. 6, pp. 797โ829, Jun. 2006, doi: 10.1002/cpa.20132.
[22] Y. V. Parkale and S. L. Nalbalwar, โApplication of 1-D discrete wavelet transform based compressed sensing matrices for speech
compression,โ SpringerPlus, vol. 5, no. 1, Dec. 2016, doi: 10.1186/s40064-016-3740-x.
[23] S. Sahu and N. Rayavarapu, โPerformance comparison of sparsifying basis functions for compressive speech enhancement,โ
International Journal of Speech Technology, vol. 22, no. 3, pp. 769โ783, Sep. 2019, doi: 10.1007/s10772-019-09622-9.
[24] S.-F. Xu and X.-B. Chen, โSpeech signal acquisition methods based on compressive sensing,โ in Systems and Computer
Technology, CRC Press, 2015, pp. 125โ130.
[25] D. L. Donoho, โDe-noising by soft-thresholding,โ IEEE Transactions on Information Theory, vol. 41, no. 3, pp. 613โ627, May
1995, doi: 10.1109/18.382009.
[26] H.-Y. Gao, โWavelet shrinkage denoising using the non-negative garrote,โ Journal of Computational and Graphical Statistics,
vol. 7, no. 4, pp. 469โ488, Dec. 1998, doi: 10.1080/10618600.1998.10474789.
13. ๏ฒ ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 3, June 2023: 2788-2800
2800
[27] L. Jing-yi, L. Hong, Y. Dong, and Z. Yan-sheng, โA new wavelet threshold function and denoising application,โ Mathematical
Problems in Engineering, vol. 2016, pp. 1โ8, 2016, doi: 10.1155/2016/3195492.
[28] P. K. Jain and A. K. Tiwari, โAn adaptive thresholding method for the wavelet based denoising of phonocardiogram signal,โ
Biomedical Signal Processing and Control, vol. 38, pp. 388โ399, Sep. 2017, doi: 10.1016/j.bspc.2017.07.002.
[29] V. Abrol, P. Sharma, and S. Budhiraja, โEvaluating performance of compressed sensing for speech signals,โ in 2013 3rd IEEE
International Advance Computing Conference (IACC), Feb. 2013, pp. 1159โ1164, doi: 10.1109/IAdCC.2013.6514391.
[30] R. Baraniuk, โCompressive sensing,โ IEEE Signal Processing Magazine, vol. 24, no. 4, pp. 118โ121, Jul. 2007, doi:
10.1109/MSP.2007.4286571.
[31] E. J. Candes and M. B. Wakin, โAn introduction to compressive sampling,โ IEEE Signal Processing Magazine, vol. 25, no. 2,
pp. 21โ30, Mar. 2008, doi: 10.1109/MSP.2007.914731.
[32] D. L. Donoho, โCompressed sensing,โ IEEE Transactions on Information Theory, vol. 52, no. 4, pp. 1289โ1306, Apr. 2006, doi:
10.1109/TIT.2006.871582.
[33] S. Foucart and H. Rauhut, โA mathematical introduction to compressive sensing,โ Applied and Numerical Harmonic Analysis,
no. 9780817649470, pp. 1โ615, 2013.
[34] F. F. Firouzeh, S. Ghorshi, and S. Salsabili, โCompressed sensing based speech enhancement,โ in 2014 8th International
Conference on Signal Processing and Communication Systems (ICSPCS), Dec. 2014, pp. 1โ6, doi:
10.1109/ICSPCS.2014.7021068.
[35] D. L. Donoho, โProgress in wavelet analysis and WVD: a ten-minute tour,โ Progress in Wavelet Analysis and Applications,
pp. 109โ128, 1993.
[36] L. Breiman, โBetter subset regression using the nonnegative garrote,โ Technometrics, vol. 37, no. 4, pp. 373โ384, Nov. 1995, doi:
10.2307/1269730.
[37] A. Graps, โAn introduction to wavelets,โ IEEE Computational Science and Engineering, vol. 2, no. 2, pp. 50โ61, 1995, doi:
10.1109/99.388960.
[38] M. Stรฉphane, A wavelet tour of signal processing. Elsevier, 2009.
[39] M. Misiti, Y. Misiti, G. Oppenheim, and J. M. Poggi, Wavelet toolboxTM 4 userโs guide. The MathWorks, 2009.
[40] D. Donoho and I. Johnstone, โIdeal spatial adaptation via wavelet shringkage,โ Biometrica, pp. 425โ455, 1994.
[41] P. Loizou, โNOIZEUS: A noisy speech corpus for evaluation of speech enhancement algorithms,โ University of Texas at Dallas.
http://ecs.utdallas.edu/loizou/speech/noizeus/ (accessed Sep. 15, 2021).
[42] Z. Lin, L. Zhou, and X. Qiu, โA composite objective measure on subjective evaluation of speech enhancement algorithms,โ
Applied Acoustics, vol. 145, pp. 144โ148, Feb. 2019, doi: 10.1016/j.apacoust.2018.10.002.
[43] Y. Hu and P. C. Loizou, โEvaluation of objective quality measures for speech enhancement,โ IEEE Transactions on Audio,
Speech, and Language Processing, vol. 16, no. 1, pp. 229โ238, Jan. 2008, doi: 10.1109/TASL.2007.911054.
[44] A. W. Rix, J. G. Beerends, M. P. Hollier, and A. P. Hekstra, โPerceptual evaluation of speech quality (PESQ)-a new method for
speech quality assessment of telephone networks and codecs,โ in IEEE International Conference on Acoustics, Speech, and Signal
Processing. Proceedings (Cat. No.01CH37221), 2001, vol. 2, pp. 749โ752, doi: 10.1109/ICASSP.2001.941023.
[45] P. C. Loizou and Philipos C. Loizou, Speech enhancement: theory and practice. Taylor and Francis, 2013.
[46] J. Ma, Y. Hu, and P. C. Loizou, โObjective measures for predicting speech intelligibility in noisy conditions based on new band-
importance functions,โ The Journal of the Acoustical Society of America, vol. 125, no. 5, pp. 3387โ3405, 2009, doi:
10.1121/1.3097493.
BIOGRAPHIES OF AUTHORS
Smriti Sahu received her B.E. degree in Electronics and Telecommunication
Engineering, from Chhattisgarh Swami Vivekanand Technical University (C.S.V.T.U.),
Bhilai, Chhattisgarh in 2009 and M.E.(Communication) from C.S.V.T.U., Bhilai (C.G.) in
2014. She worked as a lecturer at Shri Shankaracharya College of Engineering and
Technology (SSCET), Bhilai for 4 years in the Department of Electronics and
Telecommunication Engineering. She received Research Fellowship from Symbiosis
International (Deemed University) and pursuing PhD at Symbiosis Institute of Technology,
Lavale, Pune. She has a research interest in digital signal processing, image processing and its
applications. She can be contacted at email: smritisahu13@gmail.com.
Neela Rayavarapu received her B.E. degree in Electrical Engineering from
Bangalore University, Bangalore, India in 1984. She received her MS Degree in Electrical and
Computer Engineering from Rutgers, the State University of New Jersey, USA, in 1987, and a
PhD degree in Electronics and Communication Engineering in 2012 from Panjab University,
Chandigarh. She has been involved in teaching and research in Electrical, Electronics, and
Communication Engineering since 1987. She worked as a professor at Symbiosis Institute of
Technology, Pune for 9 years. Her areas of interest are digital signal processing and its
applications and control systems. She can be contacted at email: neela.raya27@gmail.com.