SlideShare a Scribd company logo
1 of 7
Download to read offline
Jan Zizka (Eds) : CCSIT, SIPP, AISC, PDCTA - 2013
pp. 315–321, 2013. © CS & IT-CSCP 2013 DOI : 10.5121/csit.2013.3636
AN EFFICIENT PEAK VALLEY
DETECTION BASED VAD ALGORITHM
FOR ROBUST DETECTION OF SPEECH
AUDITORY BRAINSTEM RESPONSES
Ranganadh Narayanam
Department of Electronics & Communications Engineering
Bharat Institute of Engineering & Technology
Mangalpally (V), Ibrahimpatnam (M)
Hyderabad, A.P., India – 501 510
rnara100@uottawa.ca, rnara100@gmail.com
ABSTRACT
Voice Activity Detection (VAD) problem considers detecting the presence of speech in a noisy
signal. The speech/non-speech classification task is not as trivial as it appears, and most of the
VAD algorithms fail when the level of background noise increases. In this research we are
presenting a new technique for Voice Activity Detection (VAD) in EEG collected brain stem
speech evoked potentials data [7, 8, 9]. This one is spectral subtraction method in which we
have developed our own mathematical formula for the peak valley detection (PVD) of the
frequency spectra to detect the voice activity [1]. The purpose of this research is to compare the
performance of this SNR based PVD (SNRPVD) method over Zero-Crossing rate detector [5]
and statistical analysis based algorithms [10]. We have put into application of these three
algorithms on these particular data sets of this experiment [7, 8, 9] and VAD is verified and
compared the results of these three. MATLAB routines were developed on these particular
methodologies. Finally we concluded that the method of SNRPVD surely performing better than
the ZCR and statistical algorithms.
KEYWORDS
Zero Crossing Rate; Peak valley detection; Voice Activity Detection; stastical anlysis method.
1. INTRODUCTION
In this paper we are presenting three algorithms for the purpose of the Voice activity detection in
EEG collected brain stem speech evoked potentials. In our experiment [7,8,9] EEG Eelectrode
recordings are made from brain stem in response to complex, speech-like, sound stimuli. There is
increasing interest in recording auditory brainstem responses to speech stimuli (speech ABR) as
there is evidence that they are useful in the diagnosis of central auditory processing disorders [7].
However, the frequency content of natural speech is neither concentrated in frequency or in time,
316 Computer Science & Information Technology (CS & IT)
the recording of speech ABR of sufficient quality may require tens of minutes [6]. Even with a
synthetic consonant-vowel stimulus, a recording time of several minutes was required [8]. Speech
ABR is believed to originate in neural activity that is phase-locked to the envelope or harmonics
of the stimulus. As a result, the recorded responses are remarkably speech-like. In fact, speech
ABR is quite intelligible if played back as a sound [9]. As a result, methods used for Voice
Activity Detection (VAD) can be useful for the detection of speech ABR. Once the response is
detected, then other noise suppression algorithms could in principle be applied to improve the
signal-to-noise ratio (SNR).
We implemented three algorithms for the purpose of the VAD for our experiment of brain stem
speech evoked potentials: (a) Linear-interpolation zero-crossing rate algorithm [5] explained in
the section 2 A, specifically for our application. (b) A new proposed VAD algorithm that is based
on a binary weighting of the spectral components of the signal under test [1]. This algorithm,
explained in the section 2 B, is based on the property that vowels have distinctive spectral peaks.
These are likely to remain higher than their surroundings even after severe corruption. Therefore,
by developing a method of detecting the spectral peaks of vowel sounds in corrupted signal voice
activity can be detected as well even in low signal-to-noise ratio (SNR) conditions. (c) Two more
statistical algorithms [10] are also implemented, based on a statistical approach that has become
the standard for detecting harmonic components in a related evoked response, the auditory
steady-state response (ASSR).
We provided the results in the sections of 3 and 4. Finally we found the peak valley detection
based SNRPVD algorithm perorming better than the remaining two.
2. ALGORITHMS
A. Zero Crossing Rate usage for the purpose of Voice activity detection[5]
First, in this paper we have implemented the Voice activity detection for the collected EEG brain
stem speech evoked potentials using the “linear interpolation Zero crossing rate algorithm”. In
this algorithm the shape of the signal is very close to the straight line near the zero crossing. Near
the zero crossing the samples are described as points in straight line defined by the angular
parameter a, and a linear parameter b. The two consecutive samples on the X-axis can be
expressed as
xn = a × n + b
x୬ାଵ = a × ሺn + 1ሻ + b (1)
The parameters a and b can be expressed in terms of the samples.
a = xn+1 - xn
b = (n+1) × xn – n × xn+1 (2)
The raising zero crossing must be in between two consecutive samples which must satisfy the
condition.
x୬ ≤ 0 < x୬ାଵ (3)
Computer Science & Information Technology (CS & IT) 317
Sample displacement for interpolation parameter d, is defined when interpolated sample n+d is
given by
x୬ାୢ = a × ሺn + dሻ + b (4)
By making the interpolated value equal to zero we can calculate the desired instant n+d of the
zero crossing.
ܾ
−ܽ
n + d = (b/-a) = [ (n×xn+1)-(n+1)xn]/[(xn+1)-(xn)] (5)
The displacement d is given by
d =
ି୶౤
ሺ୶౤శభ ሻିሺ୶౤ሻ
(6)
d is a fractional number as of the equation 3, which is the zero crossing instant in sample numbers
n+d. The samples xn and xn+1 are trailing and leading samples from the zero cross instant.
B. Peak valley detection algorithm for the Voice activity detection: Signal to Noise
ratio Peak valley detection ratio[1]
This method uses spectral peaks of vowel sounds to detect Voice activity in this particular
experiment of EEG collected brain stem speech evoked potentials. Using this method we reduce
the problem of detecting the voice activity to the problem of detecting the presence of vowels. In
this the assumption is that the vowel sounds are nearly unique to speech. Why is this vowel sound
detection is the reason that consonant sounds are always accompanied by vowel sounds and their
duration is short so presence of vowel sounds can be directly related to the presence of speech.
Vowel sounds are having distinctive spectral peaks of energy at specific spectral bands. Even
though there is severe noise corruption the peaks remain higher than their surroundings.
We assume that the positions of major spectral peaks are the most important factor in recognizing
the vowel sounds rather than the relative sizes of peaks or the shapes in spectral valleys, which are
vulnerable to noise. Using this concept we propose the Signal to noise Ratio peak valley difference
(SNRPVD) which calculates the similarity between the peak signature vector S of a registered
vowel sound and the spectrum X of the input signal. In this by using one conventional existing
peak valley detection formula [1] and applying it on several of our data sets and for our application
we have concluded and modified the formula to this following formula.
S Vector: The peak signature vector S contains the peak position information for a vowel sound.
It is a binary vector designed by us for this type of data collection. We already know that the
locations of the places where these vowel sounds peaks occur i.e.> for example at 400 Hz, 500
Hz, 700 Hz, 800 Hz, 1000 Hz. So after we locate which frequencies of vowel peaks we need to
select then we have to use the following formula for the detection of the locations of the
frequencies in the given vector size of 1024 and for the frequency sampling rate of 3202 Hz. Then
put “1” in those calculated locations and put zeros in the remaining locations which gives the “S”
318 Computer Science & Information Technology (CS & IT)
vector for that particular data set. Then we can apply the above SNRPVD formula for the voice
activity detection. There are several ways to design this S vector depending on the application and
the data collection.
S vector frequency location calculation formula.
Frequency location = [(Sampling Frequency / Number of Samples) (frequency for which we
need to find the location)] + 2. (12)
3. RESULTS
In this section the results of the three algorithms are presented in the form of bar graphs which are
useful for the analysis of the performance of the three algorithms. In this we observed the
SNRPVD [1] algorithm is far better than the ZCR [5, 9] and Statistical analysis algorithm [10].
The results are discussed in the discussion section, which are given in the tabular form Table 1.
We have 10 different subjects for analysis but we have presented here graphical results of one
subject for example purposes for all the three algorithms in Fig 1,2,3,4.
Subject 2
Figure 1. ZCR algorithm.
0
50
100
150
200
1 5 9 13 17 21 25 29 33 37 41 45 49
ZCR
SNR (+10 to -40 db)
Computer Science & Information Technology (CS & IT) 319
Figure 2. SNRPVD algorithm.
Figure 3. Statistical algorithm: for fundamental frequency of 100 Hz.
Figure 4. Statistical algorithm: For the frequency tone 100 Hz + 200 Hz.
For figures (1), (2), (3) & (4) on the X-axis SNR values in db from 10 dB to -40 dB in the steps of
1, and on the Y-axis ZCR, SNRPVD, Statistical algorithm: For fundamental frequency of 100 Hz,
Statistical algorithm: For the frequency tone 100 Hz + 200 Hz respectively.
4. DISCUSSIONS
The results are given in the Tabular form Table 1 for all the three algorithms under evaluation. In
this we have taken into consideration the EEG pure noise data which we have collected during the
data collection. The ZCR [5, 8] and SNRPVD [1] values as reference for evaluation which are 120
and 1.8596 respectively for this EEG noise data. On these for precise evaluation purpose we have
taken some additional percentage of 5% more on these values which are 126 and 1.95258
respectively to form as a threshold. So this is the 100% surety reference threshold from the ZCR
0
2
4
6
8
10
1 5 9 13 17 21 25 29 33 37 41 45 49
SNRPVD
SNR (+10 to -40 db)
0
0.05
0.1
0.15
0.2
1 5 9 13 17 21 25 29 33 37 41 45 49
p-valuewithF-ratiotest
atF0
SNR (+10 to -40)
0
0.1
0.2
0.3
0.4
1 5 9 13 17 21 25 29 33 37 41 45 49
p-valuewithF-ratiotest
atF0and2F0
SNR (+10 to -40)
320 Computer Science & Information Technology (CS & IT)
[8] and SNRPVD [1]. So for the statistical analysis purposes we will take as low p-value as
possible for the reference threshold value to make it as close as possible for the 100% surety. In
our case we have taken 0.01 is the threshold p-value we have taken. Form this we have 1% error
i.e. 99% surety. For ZCR [8] and SNRPVD [1] values of all 10 subjects if the values are more than
the taken threshold then it is considered as voice detection. For statistical analysis algorithm if it is
less than the given threshold then it is considered as the voice detection. So in this case we started
the experiment as adding the noise from 10 db to -40 db and also 1000db to the EEG pure noise
signal and then adding this noised signal to the original data signal and then we did put into
application this ZCR and SNRPVD and also statistical algorithmic procedures.
Table 1. The results for the three algorithms for the subjects 1-10.
Subject
number
SNRPVD SNR
cutoff (db)
ZCR SNR
cutoff (db)
P-values
Column 2 SNR
cutoff (db)
P-values
column 4 SNR
cutoff (db)
1 -18 -2 nothing -6
2 -26 -20 -14 -13
3 -32 1000 -19 -16
4 -23 -21 -12 -11
5 -24 -11 -15 -12
6 -26 -21 -12 -11
7 -23 -2 -16 -15
8 -36 -15 -13 -16
9 -30 -18 2 -15
10 -32 -11 -14 -12
(Note: column 2 is for: For fundamental frequency of 100 Hz. Column 4 is for: For the frequency tone 100
Hz + 200 Hz.)
5. CONCLUSION AND FURTHER RESEARCH
After As of the observed results from the table I it is clear that we can do better in the case of
Signal to Noise Ratio Peak Valley Detection (SNRPVD) algorithm than Zero Cross Rating and
Statistical analysis algorithms. As a further research part we are implementing few more VAD
algorithms existing and about to compare them with our SNRPVD technique for further
verification of its better performance over them, and would like to standardize it & then to apply in
real time applications.
REFERENCES
[1] In-Chul Yoo and Dongsuk Yook, “Robust voice activity detection using the spectral spectral peaks of
vowel sounds”. ETRI Journal,Volume 31,Number 4,August 2009.
[2] R. Nicole, J. Sohn, N.S. Kim, and W. Sung, “A statistical Model Based Voice Activity Detection,”
IEEE Signal Process. Lett., vol. 6, 1999, pp. 1-3.
[3] Javier Ramirez,Jos C segura, Carmen Benitez,Angel de la torre, Antonio Rubio, “Efficient voice
activity detection algorithms using long-term speech information”, J. Ram_ırez et al. / Speech
Communication 42 (2004) 271–287.
[4] I. Krekule, “zero crosssing detection of the presence of evoked responses”, Electroencephalography
and clinical neurophysiology, Elsevier publishing company, Amsterdam – Printed in the netherlands.
[5] GBrson Eduardo Mog, Eduardo Parente kbeiro, “Zero Crossing determination by linear interpolation
of sampled sinusoidal signal.
Computer Science & Information Technology (CS & IT) 321
[6] Dajani, R.H., Purcell, D., Wong, W., Kunov, H., Picton, T.W. 2005. Recording Human Evoked
Potentials That Follow the Pitch Contour of a Natural Vowel. IEEE Transactions on Biomedical
Engineering 52, 1614-1618.
[7] Johnson, K.L., Nicol, G.T., Kraus, N. 2005. Brain Stem Response to Speech: A Biological Marker of
Auditory Processing. Ear & Hearing 26, 424-434.
[8] Russo, N., Nicol, T., Musacchia, G., Kraus, N. 2004. Brainstem responses to speech syllables. Clinical
Neurophysiology 115, 2021-2030.
[9] Galbraith GC, Arbagey PW, Branski R. Intelligible speech encoded in the human brain stem
frequency-following response. NeuroReport 1995; 6: 2363-2367.
[10] M.S. John, T.W. Picton, MASTER: a Windows program for recording multipleauditory steady-state
response. Computer Methods and Programs in Biomedicine 61 (2000) 125–150, Elsevier.
Author’s Biography
Mr. Ranganadh Narayanam is an Assistant professor in the department of Electronics & Communications
Engineering in Bharat Institute of Engineering & Technology(BIET).This research is continuation of the
research done in univeristy of ottawa under the guidance of Dr. Hilmi Dajani. This current research was
partly funded by BIET. Mr.Narayanam, a research student in the area of “Brain Stem Speech Evoked
Potentials” under the guidance of Dr. Hilmi Dajnai of University of Ottawa,Canada. He was also a research
student in The University of Texas at San Antonio under Dr. Parimal A Patel,Dr. Artyom M. Grigoryan, Dr
Sos Again, Dr. CJ Qian,. in the areas of signal processing and digital systems, control systems. He worked
in the area of Brian Imaging in University of California Berkeley. Mr. Narayanam is having around 5+2
years of full time teaching & research experiences respectively, and more than 5 years of entry level
research experience and more than 10 publications. Mr. Narayanam’s research interests include
neurological Signal & Image processing, DSP software & Hardware design and implementations,
neurotechnologies.

More Related Content

What's hot

A New Approach for Speech Enhancement Based On Eigenvalue Spectral Subtraction
A New Approach for Speech Enhancement Based On Eigenvalue Spectral SubtractionA New Approach for Speech Enhancement Based On Eigenvalue Spectral Subtraction
A New Approach for Speech Enhancement Based On Eigenvalue Spectral SubtractionCSCJournals
 
Real-time neural text-to-speech with sequence-to-sequence acoustic model and ...
Real-time neural text-to-speech with sequence-to-sequence acoustic model and ...Real-time neural text-to-speech with sequence-to-sequence acoustic model and ...
Real-time neural text-to-speech with sequence-to-sequence acoustic model and ...Takuma_OKAMOTO
 
Spatial Fourier transform-based localized sound zone generation with loudspea...
Spatial Fourier transform-based localized sound zone generation with loudspea...Spatial Fourier transform-based localized sound zone generation with loudspea...
Spatial Fourier transform-based localized sound zone generation with loudspea...Takuma_OKAMOTO
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniqueseSAT Journals
 
Plane wave decomposition and beamforming for directional spatial sound locali...
Plane wave decomposition and beamforming for directional spatial sound locali...Plane wave decomposition and beamforming for directional spatial sound locali...
Plane wave decomposition and beamforming for directional spatial sound locali...Muhammad Imran
 
Reduced Ordering Based Approach to Impulsive Noise Suppression in Color Images
Reduced Ordering Based Approach to Impulsive Noise Suppression in Color ImagesReduced Ordering Based Approach to Impulsive Noise Suppression in Color Images
Reduced Ordering Based Approach to Impulsive Noise Suppression in Color ImagesIDES Editor
 
Digital signal processing through speech, hearing, and Python
Digital signal processing through speech, hearing, and PythonDigital signal processing through speech, hearing, and Python
Digital signal processing through speech, hearing, and PythonMel Chua
 
Voice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringVoice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringTejus Adiga M
 
2015_Reduced-Complexity Super-Resolution DOA Estimation with Unknown Number o...
2015_Reduced-Complexity Super-Resolution DOA Estimation with Unknown Number o...2015_Reduced-Complexity Super-Resolution DOA Estimation with Unknown Number o...
2015_Reduced-Complexity Super-Resolution DOA Estimation with Unknown Number o...Mohamed Mubeen S
 
Pre emphasis on data for an adaptive fingerprint image enhancement
Pre emphasis on data for an adaptive fingerprint image enhancementPre emphasis on data for an adaptive fingerprint image enhancement
Pre emphasis on data for an adaptive fingerprint image enhancementIAEME Publication
 
Dominant CIR Tap Index Identification for Wideband Channels
Dominant CIR Tap Index Identification for Wideband ChannelsDominant CIR Tap Index Identification for Wideband Channels
Dominant CIR Tap Index Identification for Wideband ChannelsT. E. BOGALE
 

What's hot (18)

Ky2418521856
Ky2418521856Ky2418521856
Ky2418521856
 
A New Approach for Speech Enhancement Based On Eigenvalue Spectral Subtraction
A New Approach for Speech Enhancement Based On Eigenvalue Spectral SubtractionA New Approach for Speech Enhancement Based On Eigenvalue Spectral Subtraction
A New Approach for Speech Enhancement Based On Eigenvalue Spectral Subtraction
 
Real-time neural text-to-speech with sequence-to-sequence acoustic model and ...
Real-time neural text-to-speech with sequence-to-sequence acoustic model and ...Real-time neural text-to-speech with sequence-to-sequence acoustic model and ...
Real-time neural text-to-speech with sequence-to-sequence acoustic model and ...
 
Spatial Fourier transform-based localized sound zone generation with loudspea...
Spatial Fourier transform-based localized sound zone generation with loudspea...Spatial Fourier transform-based localized sound zone generation with loudspea...
Spatial Fourier transform-based localized sound zone generation with loudspea...
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniques
 
Plane wave decomposition and beamforming for directional spatial sound locali...
Plane wave decomposition and beamforming for directional spatial sound locali...Plane wave decomposition and beamforming for directional spatial sound locali...
Plane wave decomposition and beamforming for directional spatial sound locali...
 
Closed loop DPCM
Closed loop DPCMClosed loop DPCM
Closed loop DPCM
 
Reduced Ordering Based Approach to Impulsive Noise Suppression in Color Images
Reduced Ordering Based Approach to Impulsive Noise Suppression in Color ImagesReduced Ordering Based Approach to Impulsive Noise Suppression in Color Images
Reduced Ordering Based Approach to Impulsive Noise Suppression in Color Images
 
Matlab: Speech Signal Analysis
Matlab: Speech Signal AnalysisMatlab: Speech Signal Analysis
Matlab: Speech Signal Analysis
 
Digital signal processing through speech, hearing, and Python
Digital signal processing through speech, hearing, and PythonDigital signal processing through speech, hearing, and Python
Digital signal processing through speech, hearing, and Python
 
Voice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringVoice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency Filtering
 
Ijecet 06 09_006
Ijecet 06 09_006Ijecet 06 09_006
Ijecet 06 09_006
 
W4101139143
W4101139143W4101139143
W4101139143
 
2015_Reduced-Complexity Super-Resolution DOA Estimation with Unknown Number o...
2015_Reduced-Complexity Super-Resolution DOA Estimation with Unknown Number o...2015_Reduced-Complexity Super-Resolution DOA Estimation with Unknown Number o...
2015_Reduced-Complexity Super-Resolution DOA Estimation with Unknown Number o...
 
Emb
EmbEmb
Emb
 
Pre emphasis on data for an adaptive fingerprint image enhancement
Pre emphasis on data for an adaptive fingerprint image enhancementPre emphasis on data for an adaptive fingerprint image enhancement
Pre emphasis on data for an adaptive fingerprint image enhancement
 
Wavelet
WaveletWavelet
Wavelet
 
Dominant CIR Tap Index Identification for Wideband Channels
Dominant CIR Tap Index Identification for Wideband ChannelsDominant CIR Tap Index Identification for Wideband Channels
Dominant CIR Tap Index Identification for Wideband Channels
 

Viewers also liked

Comparison analysis of shunt active filter and transformerless parallel hybri...
Comparison analysis of shunt active filter and transformerless parallel hybri...Comparison analysis of shunt active filter and transformerless parallel hybri...
Comparison analysis of shunt active filter and transformerless parallel hybri...csandit
 
Hoe schrijf ik een blog
Hoe schrijf ik een blog Hoe schrijf ik een blog
Hoe schrijf ik een blog Wina Ricardo
 
RegenUrb: Newsletter Nov-2015
RegenUrb: Newsletter Nov-2015RegenUrb: Newsletter Nov-2015
RegenUrb: Newsletter Nov-2015regenurb
 
Contextual model of recommending resources on an academic networking portal
Contextual model of recommending resources on an academic networking portalContextual model of recommending resources on an academic networking portal
Contextual model of recommending resources on an academic networking portalcsandit
 
Slideshare (lesson#8)tauheed-course-(batch#5-aug-dec-2015)-shafa'at-(4-novemb...
Slideshare (lesson#8)tauheed-course-(batch#5-aug-dec-2015)-shafa'at-(4-novemb...Slideshare (lesson#8)tauheed-course-(batch#5-aug-dec-2015)-shafa'at-(4-novemb...
Slideshare (lesson#8)tauheed-course-(batch#5-aug-dec-2015)-shafa'at-(4-novemb...Zhulkeflee Ismail
 
OLICO Youth facilitators' handbook
OLICO Youth facilitators' handbookOLICO Youth facilitators' handbook
OLICO Youth facilitators' handbookNathalia von Witt
 
Ação Direta de Inconstitucionalidade contra acordos de leniência
Ação Direta de Inconstitucionalidade contra acordos de leniênciaAção Direta de Inconstitucionalidade contra acordos de leniência
Ação Direta de Inconstitucionalidade contra acordos de leniênciaRaul Jungmann
 
De Reis van de Heldin April 2015
De Reis van de Heldin April 2015De Reis van de Heldin April 2015
De Reis van de Heldin April 2015Peter de Kuster
 
JT15-003 ColorectalCare_Chatter_Jan15
JT15-003 ColorectalCare_Chatter_Jan15JT15-003 ColorectalCare_Chatter_Jan15
JT15-003 ColorectalCare_Chatter_Jan15Lauren Fuqua Lawson
 
Phrasal verbs - Luz Mary Rodriguez
Phrasal verbs - Luz Mary RodriguezPhrasal verbs - Luz Mary Rodriguez
Phrasal verbs - Luz Mary RodriguezDeyna Riascos
 
Certificate medical radio imaging technology admission, eligibility
Certificate medical radio imaging technology admission, eligibilityCertificate medical radio imaging technology admission, eligibility
Certificate medical radio imaging technology admission, eligibilitySunil Beniwan
 
Différents types de riz gluant du vietnamien
Différents types de riz gluant du vietnamienDifférents types de riz gluant du vietnamien
Différents types de riz gluant du vietnamienVietnam Original Travel
 
Special june-87-13-iii (comp. sci. & app.)
Special june-87-13-iii (comp. sci. & app.)Special june-87-13-iii (comp. sci. & app.)
Special june-87-13-iii (comp. sci. & app.)singlark
 

Viewers also liked (18)

Comparison analysis of shunt active filter and transformerless parallel hybri...
Comparison analysis of shunt active filter and transformerless parallel hybri...Comparison analysis of shunt active filter and transformerless parallel hybri...
Comparison analysis of shunt active filter and transformerless parallel hybri...
 
Hoe schrijf ik een blog
Hoe schrijf ik een blog Hoe schrijf ik een blog
Hoe schrijf ik een blog
 
Я.Ю.Якунін. Упровадження мультипрофільного навчання в середній загальноосвітн...
Я.Ю.Якунін. Упровадження мультипрофільного навчання в середній загальноосвітн...Я.Ю.Якунін. Упровадження мультипрофільного навчання в середній загальноосвітн...
Я.Ю.Якунін. Упровадження мультипрофільного навчання в середній загальноосвітн...
 
RegenUrb: Newsletter Nov-2015
RegenUrb: Newsletter Nov-2015RegenUrb: Newsletter Nov-2015
RegenUrb: Newsletter Nov-2015
 
Front-end tools
Front-end toolsFront-end tools
Front-end tools
 
Contextual model of recommending resources on an academic networking portal
Contextual model of recommending resources on an academic networking portalContextual model of recommending resources on an academic networking portal
Contextual model of recommending resources on an academic networking portal
 
Slideshare (lesson#8)tauheed-course-(batch#5-aug-dec-2015)-shafa'at-(4-novemb...
Slideshare (lesson#8)tauheed-course-(batch#5-aug-dec-2015)-shafa'at-(4-novemb...Slideshare (lesson#8)tauheed-course-(batch#5-aug-dec-2015)-shafa'at-(4-novemb...
Slideshare (lesson#8)tauheed-course-(batch#5-aug-dec-2015)-shafa'at-(4-novemb...
 
OLICO Youth facilitators' handbook
OLICO Youth facilitators' handbookOLICO Youth facilitators' handbook
OLICO Youth facilitators' handbook
 
Ação Direta de Inconstitucionalidade contra acordos de leniência
Ação Direta de Inconstitucionalidade contra acordos de leniênciaAção Direta de Inconstitucionalidade contra acordos de leniência
Ação Direta de Inconstitucionalidade contra acordos de leniência
 
Weekly Market Review - October 4, 2013
Weekly Market Review - October 4, 2013Weekly Market Review - October 4, 2013
Weekly Market Review - October 4, 2013
 
Contoh data
Contoh dataContoh data
Contoh data
 
De Reis van de Heldin April 2015
De Reis van de Heldin April 2015De Reis van de Heldin April 2015
De Reis van de Heldin April 2015
 
JT15-003 ColorectalCare_Chatter_Jan15
JT15-003 ColorectalCare_Chatter_Jan15JT15-003 ColorectalCare_Chatter_Jan15
JT15-003 ColorectalCare_Chatter_Jan15
 
Phrasal verbs - Luz Mary Rodriguez
Phrasal verbs - Luz Mary RodriguezPhrasal verbs - Luz Mary Rodriguez
Phrasal verbs - Luz Mary Rodriguez
 
Certificate medical radio imaging technology admission, eligibility
Certificate medical radio imaging technology admission, eligibilityCertificate medical radio imaging technology admission, eligibility
Certificate medical radio imaging technology admission, eligibility
 
Différents types de riz gluant du vietnamien
Différents types de riz gluant du vietnamienDifférents types de riz gluant du vietnamien
Différents types de riz gluant du vietnamien
 
Presentation1
Presentation1Presentation1
Presentation1
 
Special june-87-13-iii (comp. sci. & app.)
Special june-87-13-iii (comp. sci. & app.)Special june-87-13-iii (comp. sci. & app.)
Special june-87-13-iii (comp. sci. & app.)
 

Similar to An efficient peak valley detection based vad algorithm for robust detection of speech auditory brainstem responses

A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...IRJET Journal
 
129966863516564072[1]
129966863516564072[1]129966863516564072[1]
129966863516564072[1]威華 王
 
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...ijsrd.com
 
METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LAN
METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LANMETHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LAN
METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LANIJNSA Journal
 
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing IJECEIAES
 
Audio Signal Processing
Audio Signal Processing Audio Signal Processing
Audio Signal Processing Ahmed A. Arefin
 
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...ijcisjournal
 
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABA GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABsipij
 
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...CSCJournals
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement techniqueeSAT Publishing House
 
20575-38936-1-PB.pdf
20575-38936-1-PB.pdf20575-38936-1-PB.pdf
20575-38936-1-PB.pdfIjictTeam
 
Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
Speech Enhancement Using Spectral Flatness Measure Based Spectral SubtractionSpeech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
Speech Enhancement Using Spectral Flatness Measure Based Spectral SubtractionIOSRJVSP
 
Realization and design of a pilot assist decision making system based on spee...
Realization and design of a pilot assist decision making system based on spee...Realization and design of a pilot assist decision making system based on spee...
Realization and design of a pilot assist decision making system based on spee...csandit
 
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...Research: Applying Various DSP-Related Techniques for Robust Recognition of A...
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...Roman Atachiants
 
Robot navigation in unknown environment with obstacle recognition using laser...
Robot navigation in unknown environment with obstacle recognition using laser...Robot navigation in unknown environment with obstacle recognition using laser...
Robot navigation in unknown environment with obstacle recognition using laser...IJECEIAES
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 

Similar to An efficient peak valley detection based vad algorithm for robust detection of speech auditory brainstem responses (20)

A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
 
129966863516564072[1]
129966863516564072[1]129966863516564072[1]
129966863516564072[1]
 
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
 
METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LAN
METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LANMETHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LAN
METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LAN
 
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
 
Audio Signal Processing
Audio Signal Processing Audio Signal Processing
Audio Signal Processing
 
F010334548
F010334548F010334548
F010334548
 
Real Time Speech Enhancement in the Waveform Domain
Real Time Speech Enhancement in the Waveform DomainReal Time Speech Enhancement in the Waveform Domain
Real Time Speech Enhancement in the Waveform Domain
 
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...
 
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABA GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
 
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement technique
 
20575-38936-1-PB.pdf
20575-38936-1-PB.pdf20575-38936-1-PB.pdf
20575-38936-1-PB.pdf
 
Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
Speech Enhancement Using Spectral Flatness Measure Based Spectral SubtractionSpeech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
 
Realization and design of a pilot assist decision making system based on spee...
Realization and design of a pilot assist decision making system based on spee...Realization and design of a pilot assist decision making system based on spee...
Realization and design of a pilot assist decision making system based on spee...
 
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...Research: Applying Various DSP-Related Techniques for Robust Recognition of A...
Research: Applying Various DSP-Related Techniques for Robust Recognition of A...
 
K31074076
K31074076K31074076
K31074076
 
H0814247
H0814247H0814247
H0814247
 
Robot navigation in unknown environment with obstacle recognition using laser...
Robot navigation in unknown environment with obstacle recognition using laser...Robot navigation in unknown environment with obstacle recognition using laser...
Robot navigation in unknown environment with obstacle recognition using laser...
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 

Recently uploaded

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 

Recently uploaded (20)

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 

An efficient peak valley detection based vad algorithm for robust detection of speech auditory brainstem responses

  • 1. Jan Zizka (Eds) : CCSIT, SIPP, AISC, PDCTA - 2013 pp. 315–321, 2013. © CS & IT-CSCP 2013 DOI : 10.5121/csit.2013.3636 AN EFFICIENT PEAK VALLEY DETECTION BASED VAD ALGORITHM FOR ROBUST DETECTION OF SPEECH AUDITORY BRAINSTEM RESPONSES Ranganadh Narayanam Department of Electronics & Communications Engineering Bharat Institute of Engineering & Technology Mangalpally (V), Ibrahimpatnam (M) Hyderabad, A.P., India – 501 510 rnara100@uottawa.ca, rnara100@gmail.com ABSTRACT Voice Activity Detection (VAD) problem considers detecting the presence of speech in a noisy signal. The speech/non-speech classification task is not as trivial as it appears, and most of the VAD algorithms fail when the level of background noise increases. In this research we are presenting a new technique for Voice Activity Detection (VAD) in EEG collected brain stem speech evoked potentials data [7, 8, 9]. This one is spectral subtraction method in which we have developed our own mathematical formula for the peak valley detection (PVD) of the frequency spectra to detect the voice activity [1]. The purpose of this research is to compare the performance of this SNR based PVD (SNRPVD) method over Zero-Crossing rate detector [5] and statistical analysis based algorithms [10]. We have put into application of these three algorithms on these particular data sets of this experiment [7, 8, 9] and VAD is verified and compared the results of these three. MATLAB routines were developed on these particular methodologies. Finally we concluded that the method of SNRPVD surely performing better than the ZCR and statistical algorithms. KEYWORDS Zero Crossing Rate; Peak valley detection; Voice Activity Detection; stastical anlysis method. 1. INTRODUCTION In this paper we are presenting three algorithms for the purpose of the Voice activity detection in EEG collected brain stem speech evoked potentials. In our experiment [7,8,9] EEG Eelectrode recordings are made from brain stem in response to complex, speech-like, sound stimuli. There is increasing interest in recording auditory brainstem responses to speech stimuli (speech ABR) as there is evidence that they are useful in the diagnosis of central auditory processing disorders [7]. However, the frequency content of natural speech is neither concentrated in frequency or in time,
  • 2. 316 Computer Science & Information Technology (CS & IT) the recording of speech ABR of sufficient quality may require tens of minutes [6]. Even with a synthetic consonant-vowel stimulus, a recording time of several minutes was required [8]. Speech ABR is believed to originate in neural activity that is phase-locked to the envelope or harmonics of the stimulus. As a result, the recorded responses are remarkably speech-like. In fact, speech ABR is quite intelligible if played back as a sound [9]. As a result, methods used for Voice Activity Detection (VAD) can be useful for the detection of speech ABR. Once the response is detected, then other noise suppression algorithms could in principle be applied to improve the signal-to-noise ratio (SNR). We implemented three algorithms for the purpose of the VAD for our experiment of brain stem speech evoked potentials: (a) Linear-interpolation zero-crossing rate algorithm [5] explained in the section 2 A, specifically for our application. (b) A new proposed VAD algorithm that is based on a binary weighting of the spectral components of the signal under test [1]. This algorithm, explained in the section 2 B, is based on the property that vowels have distinctive spectral peaks. These are likely to remain higher than their surroundings even after severe corruption. Therefore, by developing a method of detecting the spectral peaks of vowel sounds in corrupted signal voice activity can be detected as well even in low signal-to-noise ratio (SNR) conditions. (c) Two more statistical algorithms [10] are also implemented, based on a statistical approach that has become the standard for detecting harmonic components in a related evoked response, the auditory steady-state response (ASSR). We provided the results in the sections of 3 and 4. Finally we found the peak valley detection based SNRPVD algorithm perorming better than the remaining two. 2. ALGORITHMS A. Zero Crossing Rate usage for the purpose of Voice activity detection[5] First, in this paper we have implemented the Voice activity detection for the collected EEG brain stem speech evoked potentials using the “linear interpolation Zero crossing rate algorithm”. In this algorithm the shape of the signal is very close to the straight line near the zero crossing. Near the zero crossing the samples are described as points in straight line defined by the angular parameter a, and a linear parameter b. The two consecutive samples on the X-axis can be expressed as xn = a × n + b x୬ାଵ = a × ሺn + 1ሻ + b (1) The parameters a and b can be expressed in terms of the samples. a = xn+1 - xn b = (n+1) × xn – n × xn+1 (2) The raising zero crossing must be in between two consecutive samples which must satisfy the condition. x୬ ≤ 0 < x୬ାଵ (3)
  • 3. Computer Science & Information Technology (CS & IT) 317 Sample displacement for interpolation parameter d, is defined when interpolated sample n+d is given by x୬ାୢ = a × ሺn + dሻ + b (4) By making the interpolated value equal to zero we can calculate the desired instant n+d of the zero crossing. ܾ −ܽ n + d = (b/-a) = [ (n×xn+1)-(n+1)xn]/[(xn+1)-(xn)] (5) The displacement d is given by d = ି୶౤ ሺ୶౤శభ ሻିሺ୶౤ሻ (6) d is a fractional number as of the equation 3, which is the zero crossing instant in sample numbers n+d. The samples xn and xn+1 are trailing and leading samples from the zero cross instant. B. Peak valley detection algorithm for the Voice activity detection: Signal to Noise ratio Peak valley detection ratio[1] This method uses spectral peaks of vowel sounds to detect Voice activity in this particular experiment of EEG collected brain stem speech evoked potentials. Using this method we reduce the problem of detecting the voice activity to the problem of detecting the presence of vowels. In this the assumption is that the vowel sounds are nearly unique to speech. Why is this vowel sound detection is the reason that consonant sounds are always accompanied by vowel sounds and their duration is short so presence of vowel sounds can be directly related to the presence of speech. Vowel sounds are having distinctive spectral peaks of energy at specific spectral bands. Even though there is severe noise corruption the peaks remain higher than their surroundings. We assume that the positions of major spectral peaks are the most important factor in recognizing the vowel sounds rather than the relative sizes of peaks or the shapes in spectral valleys, which are vulnerable to noise. Using this concept we propose the Signal to noise Ratio peak valley difference (SNRPVD) which calculates the similarity between the peak signature vector S of a registered vowel sound and the spectrum X of the input signal. In this by using one conventional existing peak valley detection formula [1] and applying it on several of our data sets and for our application we have concluded and modified the formula to this following formula. S Vector: The peak signature vector S contains the peak position information for a vowel sound. It is a binary vector designed by us for this type of data collection. We already know that the locations of the places where these vowel sounds peaks occur i.e.> for example at 400 Hz, 500 Hz, 700 Hz, 800 Hz, 1000 Hz. So after we locate which frequencies of vowel peaks we need to select then we have to use the following formula for the detection of the locations of the frequencies in the given vector size of 1024 and for the frequency sampling rate of 3202 Hz. Then put “1” in those calculated locations and put zeros in the remaining locations which gives the “S”
  • 4. 318 Computer Science & Information Technology (CS & IT) vector for that particular data set. Then we can apply the above SNRPVD formula for the voice activity detection. There are several ways to design this S vector depending on the application and the data collection. S vector frequency location calculation formula. Frequency location = [(Sampling Frequency / Number of Samples) (frequency for which we need to find the location)] + 2. (12) 3. RESULTS In this section the results of the three algorithms are presented in the form of bar graphs which are useful for the analysis of the performance of the three algorithms. In this we observed the SNRPVD [1] algorithm is far better than the ZCR [5, 9] and Statistical analysis algorithm [10]. The results are discussed in the discussion section, which are given in the tabular form Table 1. We have 10 different subjects for analysis but we have presented here graphical results of one subject for example purposes for all the three algorithms in Fig 1,2,3,4. Subject 2 Figure 1. ZCR algorithm. 0 50 100 150 200 1 5 9 13 17 21 25 29 33 37 41 45 49 ZCR SNR (+10 to -40 db)
  • 5. Computer Science & Information Technology (CS & IT) 319 Figure 2. SNRPVD algorithm. Figure 3. Statistical algorithm: for fundamental frequency of 100 Hz. Figure 4. Statistical algorithm: For the frequency tone 100 Hz + 200 Hz. For figures (1), (2), (3) & (4) on the X-axis SNR values in db from 10 dB to -40 dB in the steps of 1, and on the Y-axis ZCR, SNRPVD, Statistical algorithm: For fundamental frequency of 100 Hz, Statistical algorithm: For the frequency tone 100 Hz + 200 Hz respectively. 4. DISCUSSIONS The results are given in the Tabular form Table 1 for all the three algorithms under evaluation. In this we have taken into consideration the EEG pure noise data which we have collected during the data collection. The ZCR [5, 8] and SNRPVD [1] values as reference for evaluation which are 120 and 1.8596 respectively for this EEG noise data. On these for precise evaluation purpose we have taken some additional percentage of 5% more on these values which are 126 and 1.95258 respectively to form as a threshold. So this is the 100% surety reference threshold from the ZCR 0 2 4 6 8 10 1 5 9 13 17 21 25 29 33 37 41 45 49 SNRPVD SNR (+10 to -40 db) 0 0.05 0.1 0.15 0.2 1 5 9 13 17 21 25 29 33 37 41 45 49 p-valuewithF-ratiotest atF0 SNR (+10 to -40) 0 0.1 0.2 0.3 0.4 1 5 9 13 17 21 25 29 33 37 41 45 49 p-valuewithF-ratiotest atF0and2F0 SNR (+10 to -40)
  • 6. 320 Computer Science & Information Technology (CS & IT) [8] and SNRPVD [1]. So for the statistical analysis purposes we will take as low p-value as possible for the reference threshold value to make it as close as possible for the 100% surety. In our case we have taken 0.01 is the threshold p-value we have taken. Form this we have 1% error i.e. 99% surety. For ZCR [8] and SNRPVD [1] values of all 10 subjects if the values are more than the taken threshold then it is considered as voice detection. For statistical analysis algorithm if it is less than the given threshold then it is considered as the voice detection. So in this case we started the experiment as adding the noise from 10 db to -40 db and also 1000db to the EEG pure noise signal and then adding this noised signal to the original data signal and then we did put into application this ZCR and SNRPVD and also statistical algorithmic procedures. Table 1. The results for the three algorithms for the subjects 1-10. Subject number SNRPVD SNR cutoff (db) ZCR SNR cutoff (db) P-values Column 2 SNR cutoff (db) P-values column 4 SNR cutoff (db) 1 -18 -2 nothing -6 2 -26 -20 -14 -13 3 -32 1000 -19 -16 4 -23 -21 -12 -11 5 -24 -11 -15 -12 6 -26 -21 -12 -11 7 -23 -2 -16 -15 8 -36 -15 -13 -16 9 -30 -18 2 -15 10 -32 -11 -14 -12 (Note: column 2 is for: For fundamental frequency of 100 Hz. Column 4 is for: For the frequency tone 100 Hz + 200 Hz.) 5. CONCLUSION AND FURTHER RESEARCH After As of the observed results from the table I it is clear that we can do better in the case of Signal to Noise Ratio Peak Valley Detection (SNRPVD) algorithm than Zero Cross Rating and Statistical analysis algorithms. As a further research part we are implementing few more VAD algorithms existing and about to compare them with our SNRPVD technique for further verification of its better performance over them, and would like to standardize it & then to apply in real time applications. REFERENCES [1] In-Chul Yoo and Dongsuk Yook, “Robust voice activity detection using the spectral spectral peaks of vowel sounds”. ETRI Journal,Volume 31,Number 4,August 2009. [2] R. Nicole, J. Sohn, N.S. Kim, and W. Sung, “A statistical Model Based Voice Activity Detection,” IEEE Signal Process. Lett., vol. 6, 1999, pp. 1-3. [3] Javier Ramirez,Jos C segura, Carmen Benitez,Angel de la torre, Antonio Rubio, “Efficient voice activity detection algorithms using long-term speech information”, J. Ram_ırez et al. / Speech Communication 42 (2004) 271–287. [4] I. Krekule, “zero crosssing detection of the presence of evoked responses”, Electroencephalography and clinical neurophysiology, Elsevier publishing company, Amsterdam – Printed in the netherlands. [5] GBrson Eduardo Mog, Eduardo Parente kbeiro, “Zero Crossing determination by linear interpolation of sampled sinusoidal signal.
  • 7. Computer Science & Information Technology (CS & IT) 321 [6] Dajani, R.H., Purcell, D., Wong, W., Kunov, H., Picton, T.W. 2005. Recording Human Evoked Potentials That Follow the Pitch Contour of a Natural Vowel. IEEE Transactions on Biomedical Engineering 52, 1614-1618. [7] Johnson, K.L., Nicol, G.T., Kraus, N. 2005. Brain Stem Response to Speech: A Biological Marker of Auditory Processing. Ear & Hearing 26, 424-434. [8] Russo, N., Nicol, T., Musacchia, G., Kraus, N. 2004. Brainstem responses to speech syllables. Clinical Neurophysiology 115, 2021-2030. [9] Galbraith GC, Arbagey PW, Branski R. Intelligible speech encoded in the human brain stem frequency-following response. NeuroReport 1995; 6: 2363-2367. [10] M.S. John, T.W. Picton, MASTER: a Windows program for recording multipleauditory steady-state response. Computer Methods and Programs in Biomedicine 61 (2000) 125–150, Elsevier. Author’s Biography Mr. Ranganadh Narayanam is an Assistant professor in the department of Electronics & Communications Engineering in Bharat Institute of Engineering & Technology(BIET).This research is continuation of the research done in univeristy of ottawa under the guidance of Dr. Hilmi Dajani. This current research was partly funded by BIET. Mr.Narayanam, a research student in the area of “Brain Stem Speech Evoked Potentials” under the guidance of Dr. Hilmi Dajnai of University of Ottawa,Canada. He was also a research student in The University of Texas at San Antonio under Dr. Parimal A Patel,Dr. Artyom M. Grigoryan, Dr Sos Again, Dr. CJ Qian,. in the areas of signal processing and digital systems, control systems. He worked in the area of Brian Imaging in University of California Berkeley. Mr. Narayanam is having around 5+2 years of full time teaching & research experiences respectively, and more than 5 years of entry level research experience and more than 10 publications. Mr. Narayanam’s research interests include neurological Signal & Image processing, DSP software & Hardware design and implementations, neurotechnologies.