SlideShare a Scribd company logo
1 of 10
Download to read offline
International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012
DOI : 10.5121/ijist.2012.2603 33
VOWEL PHONEME RECOGNITION BASED ON
AVERAGE ENERGY INFORMATION IN THE
ZEROCROSSING INTERVALS AND ITS
DISTRIBUTION USING ANN
Sunil Kumar R.K1
and Lajish.V.L2
Department of Computer Science, University of Calicut, Kerala - 673635, INDIA
1
sunilcsirc@rediffmail.com
2
lajish@uoc.ac.in
ABSTRACT
Speech signal is modelled using the average energy of the signal in the zerocrossing intervals. Variation of
these energies in the zerocrossing interval of the signal is studied and the distribution of this parameter
through out the signal is evaluated. It is observed that the distribution patterns are similar for repeated
utterances of the same vowels and varies from vowel to vowel. Credibility of the proposed parameter is
verified over five Malayalam (one of the most popular Indian language) vowels using multilayer feed
forward artificial neural network based recognition system. The performance of the system using additive
white Gaussian noise corrupted speech is also studied for different SNR levels. From the experimental
results it is evident that the average energy information in the zerocrossing intervals and its distributions
can be effectively utilised for vowel phone classification and recognition.
KEYWORDS
Zerocrossing Intervals, Phoneme Recognition, ANN
1. INTRODUCTION
Parameterization of analog speech signal is the first step in the speech recognition process.
Although various aspects of phoneme recognition have been investigated by researchers, the
parameterization of analog speech and its computational complexity is still a headache for them.
We are interested in a set of processing techniques used for phone recognition that are reasonably
termed as time domain methods. By this we mean that the processing methods involve the wave
form of the speech signal directly. Some examples of representation of speech signal in terms of
time domain measurements include average zerocrossing rate, energy and autocorrelation
function. The time domain method like zerocrossing analysis technique has been applied to
several signal processing and signal analysis tasks. Some of this task include speech analysis and
speech recognition [1] [2] [3] [4], electroencephalographic (EEG), biomedical applications,
communication application, oceanographic analysis and many others [5]. The relative easy
method by which Zerocrossing information can be extracted and its low cost implementation
made this technique attractive. Zerocrossing rate is widely used in many speech analysis and
recognition purposes. In application involving speech processing and speech recognition,
additional interest in Zerocrossing analysis has gained impetus through observation of Licklider
and Pollack who shoed that clipped speech is highly intelligible [6]. As a result, numerous speech
analysis and recognition devices have been built utilizing Zerocrossing analysis techniques [7]. In
International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012
34
spite of this simplicity, the resulting representation provides a useful basis for estimating
important features of the speech signal. Phoneme recognition has major contribution in designing
practical efficient speech recognition systems. Many phoneme recognition systems are reported in
literature [8] [9] [10] [11]. In this paper we modelled the speech signal with a computationally
simple speech parameter using the average energy information in the zerocrossing intervals of the
signal and used it for phoneme recognition applications. We have used five Malayalam (one of
the most popular Indian language) vowels for the conduct of recognition experiments. The
present work also illustrates how the proposed parameters can be effectively used in neural
network based phoneme recognition systems. The paper is organized in five sections. Session 1.1
describes the representation of energy in the discrete speech signals. Session 2 describes the
speech modelling technique using average energy in the zecrocrossing interval and the Session 3
describes the speech parameterization using the average energy in the zerocrossing interval
distribution. Session 4 describes vowel recognition using Artificial Neural Network (ANN) and
the final section deals the conclusion.
1.1. Energy of the Discrete Speech Signal
In a typical speech signal we can see that its certain properties considerably changes with time.
For example, we can observe a significant variation in the peak amplitude of the signal and a
considerable variation of fundamental frequency within voiced regions in a speech signal. These
facts suggest that simple time domain processing techniques should be capable of providing
useful information of signal features, such as intensity, excitation mode, pitch, and possibly even
vocal tract parameters, such as formant frequencies. Most of the short time processing techniques
that give time domain features (Qn), can be mathematically represented as
 =
− 

∞

=−∞
where T[] is the transformation matrix which may be either linear or nonlinear, X(m) represents
the data sequence and W(n-m) represents a limited time window sequence. The energy of the
discrete time signal is defined as
 =  	



Such a quantity has little meaning or utility for speech since it gives little information about time
dependent properties of speech signal. We have observed that the amplitude of the speech signal
varies appreciably with time. In particular, the amplitude of the unvoiced segment is generally
much lower than amplitude of the voiced segment. The short time energy of the speech signal
provides a convenient representation that reflects the amplitude variation and can be defined as
 =  	
	
	 −
The major significance of En is that it provides a basis for distinguishing voiced speech segment
from unvoiced speech segment. It can be seen that the value of En for the unvoiced segments are
significantly smaller than voiced segments. The energy function can also be used to locate
approximately the time at which voiced speech become unvoiced speech and vice versa, and for
high quality speech (high signal to noise ratio) the energy can be used to distinguish speech from
International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012
35
silence. The above discussion cites the importance of energy function (En) for speech analysis
purpose.
2. SPEECH MODELLING USING AVERAGE ENERGY IN THE ZEROCROSSING
INTERVAL
The speech production model [12] suggests that the energy of the voiced speech is concentrated
below about 3 kHz, where as in the case of unvoiced speech, most of the energy is found at higher
frequencies. Since high frequency implies high zerocrossing rate and low frequency implies low
zerocrossing rate, there is strong correlation between zerocrossing rate and energy distribution
with frequency [13] [14]. This motivates us to model the speech signal using average energy in
zerocrossing interval of the signal. Consider the speech segment shown in Figure 1. The ZCk
i
shows the ith
zerocrossing and ZCk
i+1 shows the i+1th
zerocrossing of kth
observation window. The
time interval between these two points is called ith
zerocrossing interval Tk
i in the kth
observation
window
Figure 1. Speech segment in kth
observation window.
The average energy in the ith zerocrossing interval can be obtained by the expression


=
1


 2
	
+1




Ek
i is the average energy of the signal in Tk
i
th
zerocrossing interval of kth
observation window and
X(t) is the instantaneous signal amplitude. The aim of the present study is to find a robust
coefficient for speech recognition application using the average energy in the zerocrossing
interval (AEZI). An XY plot is generated by plotting index number of zero crossing interval
along X axis and Average Energy in the Zerocrossing Interval (AEZI) along Y axis. Figure 2
(a-e) represents the average energy in the zerocrossing interval vs index number of the
zerocrossing interval for the Malayalam vowels /a/, /i/, /u/, /e/, /o/ respectively.
ZCk
i ZCk
i+1
Tk
i
International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012
36
0 50 100 150 200
0.0
0.2
0.4
0.6
0.8
0 50 100 150 200
0.0
0.2
0.4
0.6
0.8
A
ve
ra
ge
en
e
rgy
in
th
e
zerocrossin
g
in
te
rva
l
Zerocrossing interval index number
Figure 2 (a) Average energy in the zerocrossing interval vs zerocrossing interval
index number of vowel /a/
Figure 2 (b) Average energy in the zerocrossing interval vs zerocrossing interval
index number of vowel /i/
Figure 2 (c) Average energy in the zerocrossing interval vs zerocrossing interval
index number of vowel /u/
0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 1 4 0
0 .0 0
0 .0 5
0 .1 0
0 .1 5
0 .2 0
0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 1 4 0
0 .0 0
0 .0 5
0 .1 0
0 .1 5
0 .2 0
Average
energy
in
the
zerocrossing
interval
Z e ro c ro s s in g in te rv a l in d e x n u m b e r
0 2 0 4 0 6 0 8 0 1 0 0
0 .0 0
0 .0 5
0 .1 0
0 .1 5
0 .2 0
0 2 0 4 0 6 0 8 0 1 0 0
0 .0 0
0 .0 5
0 .1 0
0 .1 5
0 .2 0
Z e ro c ro s s in g in te rv a l in d e x n u m b e r
Average
energy
in
the
zerocrossing
interval
International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012
37
Figure 2 (d) Average energy in the zerocrossing interval vs zerocrossing interval
index number of vowel /e/
Figure 2 (e) Average energy in the zerocrossing interval vs zerocrossing interval
index number of vowel /o/
From the Figures 2 (a-e), we can see that average energy in the consecutive zerocrossing interval
are varying and its variation is different for different vowels . So this information can be used as
a signature for identifying different vowels.
3. SPEECH MODELLING USING AVERAGE ENERGY IN THE ZEROCROSSING
INTERVAL DISTRIBUTION
Let us define a range of energy values ej = range [Emax(j) , Emin(j)], where Emax(j) is the maximum
and Emin(j ), the minimum of jth
range of energy. Now we define a parameter Qk
(ej) that gives the
distribution of average energy in the zerocrossing interval of the signal X(t) in a particular range
of energy ej of kth
observation window Wk
. The function can be mathematically written as

	!
 =  #		
$
=1
!,


0 5 0 1 0 0 1 5 0 2 0 0 2 5 0
-0 .1
0 .0
0 .1
0 .2
0 .3
0 .4
0 .5
0 .6
0 .7
0 .8
0 .9
0 5 0 1 0 0 1 5 0 2 0 0 2 5 0
-0 .1
0 .0
0 .1
0 .2
0 .3
0 .4
0 .5
0 .6
0 .7
0 .8
0 .9
Z e ro c ro s s in g in te rv a l in d e x n u m b e r
Averge
energy
in
the
zerocrossing
interval
0 2 0 4 0 6 0 8 0 1 0 0 1 2 0
0 .0
0 .1
0 .2
0 .3
0 2 0 4 0 6 0 8 0 1 0 0 1 2 0
0 .0
0 .1
0 .2
0 .3
Average
energy
in
the
zerocrossing
interval
Z e ro c ro ss in g in te rva l in d e x n u m b e r
International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012
38
where j = 1,2,…………L ; k= 1,2,3 ………….. N and η (ej , Ek
j) = 1 if Ek
j lies in between the
range specified for ej and equal to zero otherwise.
For k = 1 we get the distribution values in all the L ranges as
Q1
(e1), Q1
(e2), Q1
(e3)………………………. Q1
(eL)
For k = 2 the distribution values are
Q2
(e1), Q2
(e2), Q2
(e3)………………………. Q2
(eL)
Or, in general, for k = N the distribution values are
QN
(e1), QN
(e2), QN
(e3)………………………. QN
(eL)
For j = 1 we get the total value of the distribution function as
Qtot(e1) = Q1
(e1)+ Q2
(e1)+ Q3
(e1)……………. QN
(e1)
For j =2,
Qtot(e2) = Q1
(e2)+ Q2
(e2)+ Q3
(e2)……………. QN
(e2) etc.
Or, in general, for j = L the total value of the distribution function will be
Qtot(eL) = Q1
(eL)+ Q2
(eL)+ Q3
(eL)……………. QN
(eL)
There fore,
	!
 =  
		
'
=1
!; 		 = 1, 2,3, … ., ,
where Qtot(ej) represents the distribution of average energy in the zerocrossing intervals in the
range ej.
3.1. Pattern formation
Speech signal is low pass filtered to 4kHz and sampled at 8kHz rate and digitized using 16 bit
A/D converter. Five Malayalam vowels /a/, /i/, /u/, /e/, and /o/ uttered by a single speaker is used
in the present study. From the band limited speech data, we have extracted Average Energy in the
Zerocrossing Intervals (AEZI) using the equation


=
1


 2
	
+1




For finding the distribution of average energy in the zerocrossing interval for each vowel, first we
fix a maximum threshold of energy value as 0.8 by examining the computed energy values of all
the vowels. This upper threshold is divided into twenty uniform ranges (0-0.04 , 0.04-0.08,….
……..0.76-0.8). The distribution of the average energy of the vowels in the zerocrossing interval
among all the ranges mentioned above is obtained next. Figure 3(a-e) shows the distribution of
average energy in the zerocrossing intervals of Malayalam vowels /a/, /i/, /u/, /e/ and /o/. The
normal slim lines represent the distribution graph obtained using the repeated utterances of same
vowels by the same speaker. The bold line shows the mean distribution plot.
International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012
39
Figure 3 (a) Number of zerocrossing intervals vs Energy range number –AEZI
of the vowel /a/
Figure 3 (b) Number of zerocrossing intervals vs Energy range number –AEZI
of the vowel /i/
Figure 3 (c) Number of zerocrossing intervals vs Energy range number –AEZI
of the vowel /u/
0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2
-1 0
0
1 0
2 0
3 0
4 0
5 0
6 0
7 0
8 0
9 0
0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2
-1 0
0
1 0
2 0
3 0
4 0
5 0
6 0
7 0
8 0
9 0
M E A N
Number
of
zerocrossing
intervals
E n e rg y ra n g e n u m b e r- A E Z I
0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2
-1 0
0
1 0
2 0
3 0
4 0
5 0
6 0
7 0
8 0
9 0
1 0 0
1 1 0
1 2 0
1 3 0
0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2
- 1 0
0
1 0
2 0
3 0
4 0
5 0
6 0
7 0
8 0
9 0
1 0 0
1 1 0
1 2 0
1 3 0
M E A N
Number
of
zerocrossing
intervals
E n e rg y ra n g e n u m b e r - A E Z I
0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2
0
1 0
2 0
3 0
4 0
0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2
0
1 0
2 0
3 0
4 0
M E A N
Number
of
zerocrossing
intervals
E n e rg y ra n g e n u m b e r - A E Z I
International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012
40
Figure 3 (d) Number of zerocrossing intervals vs Energy range number –AEZI
of the vowel /e/
Figure 3 (e) Number of zerocrossing intervals vs Energy range number –AEZI
of the vowel /o/
From the distribution graphs we can see that the distribution of average energy in the zerocrossing
interval has almost similar pattern for repeated utterances of a particular vowel and varies from
vowel to vowel. So this information can be used as a parameter for vowel recognition
applications.
4. VOWEL RECOGNITION USING ANN
The application of artificial neural networks to speech recognition is the youngest and least
understood of the recognition technologies. The ANN is based on the notion that complex
“computing” operations can be implemented by massive integration of individual computing
units, each of which performs an elementary computation. Artificial neural networks have several
advantages relative to sequential machines. First, the ability to adapt is at the very center of ANN
operations. Adaptation takes the form of adjusting the connection weights in order to achieve
desired mappings. Furthermore ANN can continue to adapt and learn, which is extremely useful
in processing and recognition of speech. Second, ANN tend to move robust or fault tolerant than
Von Neumann machines because the network is composed of many interconnected neurons, all
computing in parallel, and failure of a few processing units can often be compensated for by the
redundancy in the network. Similarly ANN can often generalize from incomplete or noisy data.
Finally ANN when used as classifier does not require strong statistical characterization or
parameterization of data [15] [16] [17] [18] [19]. These are the main motivations to choose
artificial neural networks for phoneme recognition.
We used Multilayer Layer Feed Forward architecture for this experiment. The network consists of
0 5 1 0 1 5 2 0
- 2 0
0
2 0
4 0
6 0
8 0
1 0 0
1 2 0
1 4 0
1 6 0
0 5 1 0 1 5 2 0
- 2 0
0
2 0
4 0
6 0
8 0
1 0 0
1 2 0
1 4 0
1 6 0
M E A N
Number
of
zerocrossing
intervals
E n e r g y r a n g e n u m b e r - A E Z I
0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2
0
1 0
2 0
3 0
4 0
5 0
6 0
0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2
0
1 0
2 0
3 0
4 0
5 0
6 0
m e a n
Number
of
zerocrossing
intervals
E n e r g y r a n g e n u m b e r - A E Z I

More Related Content

What's hot

Novel Approach of Implementing Psychoacoustic model for MPEG-1 Audio
Novel Approach of Implementing Psychoacoustic model for MPEG-1 AudioNovel Approach of Implementing Psychoacoustic model for MPEG-1 Audio
Novel Approach of Implementing Psychoacoustic model for MPEG-1 Audioinventy
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
 
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...IJERA Editor
 
New optimization scheme for cooperative spectrum sensing taking different snr...
New optimization scheme for cooperative spectrum sensing taking different snr...New optimization scheme for cooperative spectrum sensing taking different snr...
New optimization scheme for cooperative spectrum sensing taking different snr...eSAT Publishing House
 
Realization and design of a pilot assist decision making system based on spee...
Realization and design of a pilot assist decision making system based on spee...Realization and design of a pilot assist decision making system based on spee...
Realization and design of a pilot assist decision making system based on spee...csandit
 
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundWavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundTELKOMNIKA JOURNAL
 
Speech signal analysis for linear filter banks of different orders
Speech signal analysis for linear filter banks of different ordersSpeech signal analysis for linear filter banks of different orders
Speech signal analysis for linear filter banks of different orderseSAT Journals
 
Automatic identification of silence, unvoiced and voiced chunks in speech
Automatic identification of silence, unvoiced and voiced chunks in speechAutomatic identification of silence, unvoiced and voiced chunks in speech
Automatic identification of silence, unvoiced and voiced chunks in speechcsandit
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Reviewijiert bestjournal
 
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...sipij
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement techniqueeSAT Publishing House
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkeSAT Journals
 
Survey On Speech Synthesis
Survey On Speech SynthesisSurvey On Speech Synthesis
Survey On Speech SynthesisCSCJournals
 
Adaptive wavelet thresholding with robust hybrid features for text-independe...
Adaptive wavelet thresholding with robust hybrid features  for text-independe...Adaptive wavelet thresholding with robust hybrid features  for text-independe...
Adaptive wavelet thresholding with robust hybrid features for text-independe...IJECEIAES
 

What's hot (19)

Novel Approach of Implementing Psychoacoustic model for MPEG-1 Audio
Novel Approach of Implementing Psychoacoustic model for MPEG-1 AudioNovel Approach of Implementing Psychoacoustic model for MPEG-1 Audio
Novel Approach of Implementing Psychoacoustic model for MPEG-1 Audio
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...
 
19 ijcse-01227
19 ijcse-0122719 ijcse-01227
19 ijcse-01227
 
New optimization scheme for cooperative spectrum sensing taking different snr...
New optimization scheme for cooperative spectrum sensing taking different snr...New optimization scheme for cooperative spectrum sensing taking different snr...
New optimization scheme for cooperative spectrum sensing taking different snr...
 
Realization and design of a pilot assist decision making system based on spee...
Realization and design of a pilot assist decision making system based on spee...Realization and design of a pilot assist decision making system based on spee...
Realization and design of a pilot assist decision making system based on spee...
 
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundWavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
 
speech enhancement
speech enhancementspeech enhancement
speech enhancement
 
Speech signal analysis for linear filter banks of different orders
Speech signal analysis for linear filter banks of different ordersSpeech signal analysis for linear filter banks of different orders
Speech signal analysis for linear filter banks of different orders
 
Automatic identification of silence, unvoiced and voiced chunks in speech
Automatic identification of silence, unvoiced and voiced chunks in speechAutomatic identification of silence, unvoiced and voiced chunks in speech
Automatic identification of silence, unvoiced and voiced chunks in speech
 
Animal Voice Morphing System
Animal Voice Morphing SystemAnimal Voice Morphing System
Animal Voice Morphing System
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Review
 
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
 
H42045359
H42045359H42045359
H42045359
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement technique
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural network
 
50120140501002
5012014050100250120140501002
50120140501002
 
Survey On Speech Synthesis
Survey On Speech SynthesisSurvey On Speech Synthesis
Survey On Speech Synthesis
 
Adaptive wavelet thresholding with robust hybrid features for text-independe...
Adaptive wavelet thresholding with robust hybrid features  for text-independe...Adaptive wavelet thresholding with robust hybrid features  for text-independe...
Adaptive wavelet thresholding with robust hybrid features for text-independe...
 

Similar to VOWEL PHONEME RECOGNITION BASED ON AVERAGE ENERGY INFORMATION IN THE ZEROCROSSING INTERVALS AND ITS DISTRIBUTION USING ANN

Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features ijsc
 
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...ijsrd.com
 
Cochlear implant acoustic simulation model based on critical band filters
Cochlear implant acoustic simulation model based on critical band filtersCochlear implant acoustic simulation model based on critical band filters
Cochlear implant acoustic simulation model based on critical band filtersIAEME Publication
 
Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...IRJET Journal
 
An Effective Approach for Chinese Speech Recognition on Small Size of Vocabulary
An Effective Approach for Chinese Speech Recognition on Small Size of VocabularyAn Effective Approach for Chinese Speech Recognition on Small Size of Vocabulary
An Effective Approach for Chinese Speech Recognition on Small Size of Vocabularysipij
 
SPEECH COMPRESSION TECHNIQUES: A REVIEW
SPEECH COMPRESSION TECHNIQUES: A REVIEWSPEECH COMPRESSION TECHNIQUES: A REVIEW
SPEECH COMPRESSION TECHNIQUES: A REVIEWijiert bestjournal
 
Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...eSAT Journals
 
Estimating the quality of digitally transmitted speech over satellite communi...
Estimating the quality of digitally transmitted speech over satellite communi...Estimating the quality of digitally transmitted speech over satellite communi...
Estimating the quality of digitally transmitted speech over satellite communi...Alexander Decker
 
Iciiecs1461
Iciiecs1461Iciiecs1461
Iciiecs1461phyuhsan
 
A review of analog audio scrambling methods for residual intelligibility
A review of analog audio scrambling methods for residual intelligibilityA review of analog audio scrambling methods for residual intelligibility
A review of analog audio scrambling methods for residual intelligibilityAlexander Decker
 
Speech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise EnvironmentsSpeech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise Environmentssipij
 
Analysis of PEAQ Model using Wavelet Decomposition Techniques
Analysis of PEAQ Model using Wavelet Decomposition TechniquesAnalysis of PEAQ Model using Wavelet Decomposition Techniques
Analysis of PEAQ Model using Wavelet Decomposition Techniquesidescitation
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresSivaranjan Goswami
 
SPEECH RECOGNITION USING SONOGRAM AND AANN
SPEECH RECOGNITION USING SONOGRAM AND AANNSPEECH RECOGNITION USING SONOGRAM AND AANN
SPEECH RECOGNITION USING SONOGRAM AND AANNAM Publications
 
Denoising of heart sound signal using wavelet transform
Denoising of heart sound signal using wavelet transformDenoising of heart sound signal using wavelet transform
Denoising of heart sound signal using wavelet transformeSAT Publishing House
 

Similar to VOWEL PHONEME RECOGNITION BASED ON AVERAGE ENERGY INFORMATION IN THE ZEROCROSSING INTERVALS AND ITS DISTRIBUTION USING ANN (20)

Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features
 
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
 
Cochlear implant acoustic simulation model based on critical band filters
Cochlear implant acoustic simulation model based on critical band filtersCochlear implant acoustic simulation model based on critical band filters
Cochlear implant acoustic simulation model based on critical band filters
 
Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...
 
An Effective Approach for Chinese Speech Recognition on Small Size of Vocabulary
An Effective Approach for Chinese Speech Recognition on Small Size of VocabularyAn Effective Approach for Chinese Speech Recognition on Small Size of Vocabulary
An Effective Approach for Chinese Speech Recognition on Small Size of Vocabulary
 
SPEECH COMPRESSION TECHNIQUES: A REVIEW
SPEECH COMPRESSION TECHNIQUES: A REVIEWSPEECH COMPRESSION TECHNIQUES: A REVIEW
SPEECH COMPRESSION TECHNIQUES: A REVIEW
 
Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...
 
1801 1805
1801 18051801 1805
1801 1805
 
1801 1805
1801 18051801 1805
1801 1805
 
Estimating the quality of digitally transmitted speech over satellite communi...
Estimating the quality of digitally transmitted speech over satellite communi...Estimating the quality of digitally transmitted speech over satellite communi...
Estimating the quality of digitally transmitted speech over satellite communi...
 
Iciiecs1461
Iciiecs1461Iciiecs1461
Iciiecs1461
 
A review of analog audio scrambling methods for residual intelligibility
A review of analog audio scrambling methods for residual intelligibilityA review of analog audio scrambling methods for residual intelligibility
A review of analog audio scrambling methods for residual intelligibility
 
Speech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise EnvironmentsSpeech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise Environments
 
Analysis of PEAQ Model using Wavelet Decomposition Techniques
Analysis of PEAQ Model using Wavelet Decomposition TechniquesAnalysis of PEAQ Model using Wavelet Decomposition Techniques
Analysis of PEAQ Model using Wavelet Decomposition Techniques
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech features
 
SPEECH RECOGNITION USING SONOGRAM AND AANN
SPEECH RECOGNITION USING SONOGRAM AND AANNSPEECH RECOGNITION USING SONOGRAM AND AANN
SPEECH RECOGNITION USING SONOGRAM AND AANN
 
ALXAI.pptx
ALXAI.pptxALXAI.pptx
ALXAI.pptx
 
H010234144
H010234144H010234144
H010234144
 
D011132635
D011132635D011132635
D011132635
 
Denoising of heart sound signal using wavelet transform
Denoising of heart sound signal using wavelet transformDenoising of heart sound signal using wavelet transform
Denoising of heart sound signal using wavelet transform
 

More from ijistjournal

Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...ijistjournal
 
A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVIN
A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVINA MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVIN
A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVINijistjournal
 
DECEPTION AND RACISM IN THE TUSKEGEE SYPHILIS STUDY
DECEPTION AND RACISM IN THE TUSKEGEE  SYPHILIS STUDYDECEPTION AND RACISM IN THE TUSKEGEE  SYPHILIS STUDY
DECEPTION AND RACISM IN THE TUSKEGEE SYPHILIS STUDYijistjournal
 
Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...ijistjournal
 
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...ijistjournal
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...ijistjournal
 
Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...ijistjournal
 
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSijistjournal
 
Research Articles Submission - International Journal of Information Sciences ...
Research Articles Submission - International Journal of Information Sciences ...Research Articles Submission - International Journal of Information Sciences ...
Research Articles Submission - International Journal of Information Sciences ...ijistjournal
 
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHM
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHMANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHM
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHMijistjournal
 
Call for Research Articles - 6th International Conference on Machine Learning...
Call for Research Articles - 6th International Conference on Machine Learning...Call for Research Articles - 6th International Conference on Machine Learning...
Call for Research Articles - 6th International Conference on Machine Learning...ijistjournal
 
Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...ijistjournal
 
Colour Image Steganography Based on Pixel Value Differencing in Spatial Domain
Colour Image Steganography Based on Pixel Value Differencing in Spatial DomainColour Image Steganography Based on Pixel Value Differencing in Spatial Domain
Colour Image Steganography Based on Pixel Value Differencing in Spatial Domainijistjournal
 
Call for Research Articles - 5th International conference on Advanced Natural...
Call for Research Articles - 5th International conference on Advanced Natural...Call for Research Articles - 5th International conference on Advanced Natural...
Call for Research Articles - 5th International conference on Advanced Natural...ijistjournal
 
International Journal of Information Sciences and Techniques (IJIST)
International Journal of Information Sciences and Techniques (IJIST)International Journal of Information Sciences and Techniques (IJIST)
International Journal of Information Sciences and Techniques (IJIST)ijistjournal
 
Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...ijistjournal
 
Design and Implementation of LZW Data Compression Algorithm
Design and Implementation of LZW Data Compression AlgorithmDesign and Implementation of LZW Data Compression Algorithm
Design and Implementation of LZW Data Compression Algorithmijistjournal
 
Online Paper Submission - 5th International Conference on Soft Computing, Art...
Online Paper Submission - 5th International Conference on Soft Computing, Art...Online Paper Submission - 5th International Conference on Soft Computing, Art...
Online Paper Submission - 5th International Conference on Soft Computing, Art...ijistjournal
 
Submit Your Research Articles - International Journal of Information Sciences...
Submit Your Research Articles - International Journal of Information Sciences...Submit Your Research Articles - International Journal of Information Sciences...
Submit Your Research Articles - International Journal of Information Sciences...ijistjournal
 
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKPUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKijistjournal
 

More from ijistjournal (20)

Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...
 
A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVIN
A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVINA MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVIN
A MEDIAN BASED DIRECTIONAL CASCADED WITH MASK FILTER FOR REMOVAL OF RVIN
 
DECEPTION AND RACISM IN THE TUSKEGEE SYPHILIS STUDY
DECEPTION AND RACISM IN THE TUSKEGEE  SYPHILIS STUDYDECEPTION AND RACISM IN THE TUSKEGEE  SYPHILIS STUDY
DECEPTION AND RACISM IN THE TUSKEGEE SYPHILIS STUDY
 
Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...
 
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...
A NOVEL APPROACH FOR SEGMENTATION OF SECTOR SCAN SONAR IMAGES USING ADAPTIVE ...
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...
 
Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...
 
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
 
Research Articles Submission - International Journal of Information Sciences ...
Research Articles Submission - International Journal of Information Sciences ...Research Articles Submission - International Journal of Information Sciences ...
Research Articles Submission - International Journal of Information Sciences ...
 
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHM
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHMANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHM
ANALYSIS OF IMAGE WATERMARKING USING LEAST SIGNIFICANT BIT ALGORITHM
 
Call for Research Articles - 6th International Conference on Machine Learning...
Call for Research Articles - 6th International Conference on Machine Learning...Call for Research Articles - 6th International Conference on Machine Learning...
Call for Research Articles - 6th International Conference on Machine Learning...
 
Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...Online Paper Submission - International Journal of Information Sciences and T...
Online Paper Submission - International Journal of Information Sciences and T...
 
Colour Image Steganography Based on Pixel Value Differencing in Spatial Domain
Colour Image Steganography Based on Pixel Value Differencing in Spatial DomainColour Image Steganography Based on Pixel Value Differencing in Spatial Domain
Colour Image Steganography Based on Pixel Value Differencing in Spatial Domain
 
Call for Research Articles - 5th International conference on Advanced Natural...
Call for Research Articles - 5th International conference on Advanced Natural...Call for Research Articles - 5th International conference on Advanced Natural...
Call for Research Articles - 5th International conference on Advanced Natural...
 
International Journal of Information Sciences and Techniques (IJIST)
International Journal of Information Sciences and Techniques (IJIST)International Journal of Information Sciences and Techniques (IJIST)
International Journal of Information Sciences and Techniques (IJIST)
 
Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...Research Article Submission - International Journal of Information Sciences a...
Research Article Submission - International Journal of Information Sciences a...
 
Design and Implementation of LZW Data Compression Algorithm
Design and Implementation of LZW Data Compression AlgorithmDesign and Implementation of LZW Data Compression Algorithm
Design and Implementation of LZW Data Compression Algorithm
 
Online Paper Submission - 5th International Conference on Soft Computing, Art...
Online Paper Submission - 5th International Conference on Soft Computing, Art...Online Paper Submission - 5th International Conference on Soft Computing, Art...
Online Paper Submission - 5th International Conference on Soft Computing, Art...
 
Submit Your Research Articles - International Journal of Information Sciences...
Submit Your Research Articles - International Journal of Information Sciences...Submit Your Research Articles - International Journal of Information Sciences...
Submit Your Research Articles - International Journal of Information Sciences...
 
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKPUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
 

Recently uploaded

(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and usesDevarapalliHaritha
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 

Recently uploaded (20)

(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and uses
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 

VOWEL PHONEME RECOGNITION BASED ON AVERAGE ENERGY INFORMATION IN THE ZEROCROSSING INTERVALS AND ITS DISTRIBUTION USING ANN

  • 1. International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012 DOI : 10.5121/ijist.2012.2603 33 VOWEL PHONEME RECOGNITION BASED ON AVERAGE ENERGY INFORMATION IN THE ZEROCROSSING INTERVALS AND ITS DISTRIBUTION USING ANN Sunil Kumar R.K1 and Lajish.V.L2 Department of Computer Science, University of Calicut, Kerala - 673635, INDIA 1 sunilcsirc@rediffmail.com 2 lajish@uoc.ac.in ABSTRACT Speech signal is modelled using the average energy of the signal in the zerocrossing intervals. Variation of these energies in the zerocrossing interval of the signal is studied and the distribution of this parameter through out the signal is evaluated. It is observed that the distribution patterns are similar for repeated utterances of the same vowels and varies from vowel to vowel. Credibility of the proposed parameter is verified over five Malayalam (one of the most popular Indian language) vowels using multilayer feed forward artificial neural network based recognition system. The performance of the system using additive white Gaussian noise corrupted speech is also studied for different SNR levels. From the experimental results it is evident that the average energy information in the zerocrossing intervals and its distributions can be effectively utilised for vowel phone classification and recognition. KEYWORDS Zerocrossing Intervals, Phoneme Recognition, ANN 1. INTRODUCTION Parameterization of analog speech signal is the first step in the speech recognition process. Although various aspects of phoneme recognition have been investigated by researchers, the parameterization of analog speech and its computational complexity is still a headache for them. We are interested in a set of processing techniques used for phone recognition that are reasonably termed as time domain methods. By this we mean that the processing methods involve the wave form of the speech signal directly. Some examples of representation of speech signal in terms of time domain measurements include average zerocrossing rate, energy and autocorrelation function. The time domain method like zerocrossing analysis technique has been applied to several signal processing and signal analysis tasks. Some of this task include speech analysis and speech recognition [1] [2] [3] [4], electroencephalographic (EEG), biomedical applications, communication application, oceanographic analysis and many others [5]. The relative easy method by which Zerocrossing information can be extracted and its low cost implementation made this technique attractive. Zerocrossing rate is widely used in many speech analysis and recognition purposes. In application involving speech processing and speech recognition, additional interest in Zerocrossing analysis has gained impetus through observation of Licklider and Pollack who shoed that clipped speech is highly intelligible [6]. As a result, numerous speech analysis and recognition devices have been built utilizing Zerocrossing analysis techniques [7]. In
  • 2. International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012 34 spite of this simplicity, the resulting representation provides a useful basis for estimating important features of the speech signal. Phoneme recognition has major contribution in designing practical efficient speech recognition systems. Many phoneme recognition systems are reported in literature [8] [9] [10] [11]. In this paper we modelled the speech signal with a computationally simple speech parameter using the average energy information in the zerocrossing intervals of the signal and used it for phoneme recognition applications. We have used five Malayalam (one of the most popular Indian language) vowels for the conduct of recognition experiments. The present work also illustrates how the proposed parameters can be effectively used in neural network based phoneme recognition systems. The paper is organized in five sections. Session 1.1 describes the representation of energy in the discrete speech signals. Session 2 describes the speech modelling technique using average energy in the zecrocrossing interval and the Session 3 describes the speech parameterization using the average energy in the zerocrossing interval distribution. Session 4 describes vowel recognition using Artificial Neural Network (ANN) and the final section deals the conclusion. 1.1. Energy of the Discrete Speech Signal In a typical speech signal we can see that its certain properties considerably changes with time. For example, we can observe a significant variation in the peak amplitude of the signal and a considerable variation of fundamental frequency within voiced regions in a speech signal. These facts suggest that simple time domain processing techniques should be capable of providing useful information of signal features, such as intensity, excitation mode, pitch, and possibly even vocal tract parameters, such as formant frequencies. Most of the short time processing techniques that give time domain features (Qn), can be mathematically represented as =
  • 3. − ∞ =−∞ where T[] is the transformation matrix which may be either linear or nonlinear, X(m) represents the data sequence and W(n-m) represents a limited time window sequence. The energy of the discrete time signal is defined as = Such a quantity has little meaning or utility for speech since it gives little information about time dependent properties of speech signal. We have observed that the amplitude of the speech signal varies appreciably with time. In particular, the amplitude of the unvoiced segment is generally much lower than amplitude of the voiced segment. The short time energy of the speech signal provides a convenient representation that reflects the amplitude variation and can be defined as = −
  • 4. The major significance of En is that it provides a basis for distinguishing voiced speech segment from unvoiced speech segment. It can be seen that the value of En for the unvoiced segments are significantly smaller than voiced segments. The energy function can also be used to locate approximately the time at which voiced speech become unvoiced speech and vice versa, and for high quality speech (high signal to noise ratio) the energy can be used to distinguish speech from
  • 5. International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012 35 silence. The above discussion cites the importance of energy function (En) for speech analysis purpose. 2. SPEECH MODELLING USING AVERAGE ENERGY IN THE ZEROCROSSING INTERVAL The speech production model [12] suggests that the energy of the voiced speech is concentrated below about 3 kHz, where as in the case of unvoiced speech, most of the energy is found at higher frequencies. Since high frequency implies high zerocrossing rate and low frequency implies low zerocrossing rate, there is strong correlation between zerocrossing rate and energy distribution with frequency [13] [14]. This motivates us to model the speech signal using average energy in zerocrossing interval of the signal. Consider the speech segment shown in Figure 1. The ZCk i shows the ith zerocrossing and ZCk i+1 shows the i+1th zerocrossing of kth observation window. The time interval between these two points is called ith zerocrossing interval Tk i in the kth observation window Figure 1. Speech segment in kth observation window. The average energy in the ith zerocrossing interval can be obtained by the expression = 1 2 +1 Ek i is the average energy of the signal in Tk i th zerocrossing interval of kth observation window and X(t) is the instantaneous signal amplitude. The aim of the present study is to find a robust coefficient for speech recognition application using the average energy in the zerocrossing interval (AEZI). An XY plot is generated by plotting index number of zero crossing interval along X axis and Average Energy in the Zerocrossing Interval (AEZI) along Y axis. Figure 2 (a-e) represents the average energy in the zerocrossing interval vs index number of the zerocrossing interval for the Malayalam vowels /a/, /i/, /u/, /e/, /o/ respectively. ZCk i ZCk i+1 Tk i
  • 6. International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012 36 0 50 100 150 200 0.0 0.2 0.4 0.6 0.8 0 50 100 150 200 0.0 0.2 0.4 0.6 0.8 A ve ra ge en e rgy in th e zerocrossin g in te rva l Zerocrossing interval index number Figure 2 (a) Average energy in the zerocrossing interval vs zerocrossing interval index number of vowel /a/ Figure 2 (b) Average energy in the zerocrossing interval vs zerocrossing interval index number of vowel /i/ Figure 2 (c) Average energy in the zerocrossing interval vs zerocrossing interval index number of vowel /u/ 0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 1 4 0 0 .0 0 0 .0 5 0 .1 0 0 .1 5 0 .2 0 0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 1 4 0 0 .0 0 0 .0 5 0 .1 0 0 .1 5 0 .2 0 Average energy in the zerocrossing interval Z e ro c ro s s in g in te rv a l in d e x n u m b e r 0 2 0 4 0 6 0 8 0 1 0 0 0 .0 0 0 .0 5 0 .1 0 0 .1 5 0 .2 0 0 2 0 4 0 6 0 8 0 1 0 0 0 .0 0 0 .0 5 0 .1 0 0 .1 5 0 .2 0 Z e ro c ro s s in g in te rv a l in d e x n u m b e r Average energy in the zerocrossing interval
  • 7. International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012 37 Figure 2 (d) Average energy in the zerocrossing interval vs zerocrossing interval index number of vowel /e/ Figure 2 (e) Average energy in the zerocrossing interval vs zerocrossing interval index number of vowel /o/ From the Figures 2 (a-e), we can see that average energy in the consecutive zerocrossing interval are varying and its variation is different for different vowels . So this information can be used as a signature for identifying different vowels. 3. SPEECH MODELLING USING AVERAGE ENERGY IN THE ZEROCROSSING INTERVAL DISTRIBUTION Let us define a range of energy values ej = range [Emax(j) , Emin(j)], where Emax(j) is the maximum and Emin(j ), the minimum of jth range of energy. Now we define a parameter Qk (ej) that gives the distribution of average energy in the zerocrossing interval of the signal X(t) in a particular range of energy ej of kth observation window Wk . The function can be mathematically written as ! = # $ =1 !, 0 5 0 1 0 0 1 5 0 2 0 0 2 5 0 -0 .1 0 .0 0 .1 0 .2 0 .3 0 .4 0 .5 0 .6 0 .7 0 .8 0 .9 0 5 0 1 0 0 1 5 0 2 0 0 2 5 0 -0 .1 0 .0 0 .1 0 .2 0 .3 0 .4 0 .5 0 .6 0 .7 0 .8 0 .9 Z e ro c ro s s in g in te rv a l in d e x n u m b e r Averge energy in the zerocrossing interval 0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 0 .0 0 .1 0 .2 0 .3 0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 0 .0 0 .1 0 .2 0 .3 Average energy in the zerocrossing interval Z e ro c ro ss in g in te rva l in d e x n u m b e r
  • 8. International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012 38 where j = 1,2,…………L ; k= 1,2,3 ………….. N and η (ej , Ek j) = 1 if Ek j lies in between the range specified for ej and equal to zero otherwise. For k = 1 we get the distribution values in all the L ranges as Q1 (e1), Q1 (e2), Q1 (e3)………………………. Q1 (eL) For k = 2 the distribution values are Q2 (e1), Q2 (e2), Q2 (e3)………………………. Q2 (eL) Or, in general, for k = N the distribution values are QN (e1), QN (e2), QN (e3)………………………. QN (eL) For j = 1 we get the total value of the distribution function as Qtot(e1) = Q1 (e1)+ Q2 (e1)+ Q3 (e1)……………. QN (e1) For j =2, Qtot(e2) = Q1 (e2)+ Q2 (e2)+ Q3 (e2)……………. QN (e2) etc. Or, in general, for j = L the total value of the distribution function will be Qtot(eL) = Q1 (eL)+ Q2 (eL)+ Q3 (eL)……………. QN (eL) There fore, ! = ' =1 !; = 1, 2,3, … ., , where Qtot(ej) represents the distribution of average energy in the zerocrossing intervals in the range ej. 3.1. Pattern formation Speech signal is low pass filtered to 4kHz and sampled at 8kHz rate and digitized using 16 bit A/D converter. Five Malayalam vowels /a/, /i/, /u/, /e/, and /o/ uttered by a single speaker is used in the present study. From the band limited speech data, we have extracted Average Energy in the Zerocrossing Intervals (AEZI) using the equation = 1 2 +1 For finding the distribution of average energy in the zerocrossing interval for each vowel, first we fix a maximum threshold of energy value as 0.8 by examining the computed energy values of all the vowels. This upper threshold is divided into twenty uniform ranges (0-0.04 , 0.04-0.08,…. ……..0.76-0.8). The distribution of the average energy of the vowels in the zerocrossing interval among all the ranges mentioned above is obtained next. Figure 3(a-e) shows the distribution of average energy in the zerocrossing intervals of Malayalam vowels /a/, /i/, /u/, /e/ and /o/. The normal slim lines represent the distribution graph obtained using the repeated utterances of same vowels by the same speaker. The bold line shows the mean distribution plot.
  • 9. International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012 39 Figure 3 (a) Number of zerocrossing intervals vs Energy range number –AEZI of the vowel /a/ Figure 3 (b) Number of zerocrossing intervals vs Energy range number –AEZI of the vowel /i/ Figure 3 (c) Number of zerocrossing intervals vs Energy range number –AEZI of the vowel /u/ 0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2 -1 0 0 1 0 2 0 3 0 4 0 5 0 6 0 7 0 8 0 9 0 0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2 -1 0 0 1 0 2 0 3 0 4 0 5 0 6 0 7 0 8 0 9 0 M E A N Number of zerocrossing intervals E n e rg y ra n g e n u m b e r- A E Z I 0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2 -1 0 0 1 0 2 0 3 0 4 0 5 0 6 0 7 0 8 0 9 0 1 0 0 1 1 0 1 2 0 1 3 0 0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2 - 1 0 0 1 0 2 0 3 0 4 0 5 0 6 0 7 0 8 0 9 0 1 0 0 1 1 0 1 2 0 1 3 0 M E A N Number of zerocrossing intervals E n e rg y ra n g e n u m b e r - A E Z I 0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2 0 1 0 2 0 3 0 4 0 0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2 0 1 0 2 0 3 0 4 0 M E A N Number of zerocrossing intervals E n e rg y ra n g e n u m b e r - A E Z I
  • 10. International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012 40 Figure 3 (d) Number of zerocrossing intervals vs Energy range number –AEZI of the vowel /e/ Figure 3 (e) Number of zerocrossing intervals vs Energy range number –AEZI of the vowel /o/ From the distribution graphs we can see that the distribution of average energy in the zerocrossing interval has almost similar pattern for repeated utterances of a particular vowel and varies from vowel to vowel. So this information can be used as a parameter for vowel recognition applications. 4. VOWEL RECOGNITION USING ANN The application of artificial neural networks to speech recognition is the youngest and least understood of the recognition technologies. The ANN is based on the notion that complex “computing” operations can be implemented by massive integration of individual computing units, each of which performs an elementary computation. Artificial neural networks have several advantages relative to sequential machines. First, the ability to adapt is at the very center of ANN operations. Adaptation takes the form of adjusting the connection weights in order to achieve desired mappings. Furthermore ANN can continue to adapt and learn, which is extremely useful in processing and recognition of speech. Second, ANN tend to move robust or fault tolerant than Von Neumann machines because the network is composed of many interconnected neurons, all computing in parallel, and failure of a few processing units can often be compensated for by the redundancy in the network. Similarly ANN can often generalize from incomplete or noisy data. Finally ANN when used as classifier does not require strong statistical characterization or parameterization of data [15] [16] [17] [18] [19]. These are the main motivations to choose artificial neural networks for phoneme recognition. We used Multilayer Layer Feed Forward architecture for this experiment. The network consists of 0 5 1 0 1 5 2 0 - 2 0 0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 1 4 0 1 6 0 0 5 1 0 1 5 2 0 - 2 0 0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 1 4 0 1 6 0 M E A N Number of zerocrossing intervals E n e r g y r a n g e n u m b e r - A E Z I 0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2 0 1 0 2 0 3 0 4 0 5 0 6 0 0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0 2 2 0 1 0 2 0 3 0 4 0 5 0 6 0 m e a n Number of zerocrossing intervals E n e r g y r a n g e n u m b e r - A E Z I
  • 11. International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012 41 20 input nodes and five output nodes, representing the five vowels. The network is trained using the energy information of the signal mentioned in Section 3. The experiment is repeated by changing the number of hidden layers and adding Additive White Gaussian Noise (AWGN) to the signal with different Signal to Noise Ratio (SNR). We used the database of 150 samples of each vowel spoken by a single male speaker for this purpose. A disjoint set of training and test patterns are formed by taking the 50% of the total patterns for each set. The error tolerance (Emax) is fixed as 0.001 and learning parameter (η) is chosen between 0.01 and 1.0. The training of the network is done using error back propagation learning method. The recognition results are tabulated in Table 1 for neural network with three hidden layers. The recognition results indicates that the network shows poor recognition accuracy when the signal is added with Additive White Gaussian Noise of 0dB, 3dB and 10dB SNR. Above 20dB SNR, it shows better results. The average vowel recognition accuracy obtained for five Malayalam vowels in this experiment is 87.94%. This percentage is comparable with the phoneme recognition using spectral method based on ANN. Table 1. Recognition accuracy for five Malayalam vowels with additive white Gaussian noise of different dB levels. 5. CONCLUSIONS The speech signal is modelled using the average energy in the zerocrossing interval of the signal. Variation of these energies in the zerocrossing interval of the signal is studied and the distribution of this parameters through out the signal is evaluated. We found that the distribution patterns are almost similar for repeated utterances of the same vowels and varies from vowel to vowel. This distribution pattern is used for recognizing five Malayalam vowels using multilayer feed forward artificial neural network. The performance of this system using additive white Gaussian noise corrupted speech (vowel) is also studied for different SNR levels. It is found that recognition accuracy of the vowels is good when the signal is corrupted with low noise levels. The recognition accuracy obtained for five Malayalam vowels is 87.94 %. The present work can be extended, with the proposed neural network model and training algorithm, using vowels samples uttered by different male and female speakers under different age groups for analyzing the speaker independency of the proposed parameters. REFERENCES [1] Arai T Yoshida Y. (1990) “Study on Zerocrossing of speech signals by means of analytic signal”. Journal of Acoustical Society if Japan.; VolxxV+692, pp.242-246 Lee, S.hyun. Kim Mi Na, (2008) “This is my paper”, ABC Transactions on ECE, Vol. 10, No. 5, pp120-122. [2] Russell J.Niederjohn,, Michal W. Krutz Bruce M . Brown, (1987) “An experimental investigation of perceptual effects of altering the Zerocrossing of a speech signal”, IEEE Trans. Acoust. , speech and signal processing, Vol. ASSP-35, No 5,pp.618-625.
  • 12. International Journal of Information Sciences and Techniques (IJIST) Vol.2, No.6, November 2012 42 [3] Erdol N, CastellucciaC, Zilouchian A, (1993) “Recovery of missing speech packet using the short time energy and Zerocrossing measurements”, IEEE Trans. Acoust. , speech and audio processing, Vol.1.1, no3,pp.295-303. [4] Sreenivas T.V and Russell J.Niederjohn, (1992), “Zerocrossing based spectral analysis and SVD spectral analysis for formant frequency estimation in noise”, IEEE trans. on signal processing, Vol.40 No2. [5] Russell J.Niederjohn, (1975), “A mathematical formulation and comparison of Zerocrossing Analysis Techniques which have been applied to automatic speech recognition” , IEEE Trans. Accust. , speech and signal processing, Vol. ASSP-23, No.4, pp373-380. [6] Licklider J.C.R and Pollack I, (1948), “Effects of differentiation , Integration and infinite pack clipping upon intelligibility of of speech”, J. Accoust. Soc., Amer, Vol-20, p42. [7] Doh-Suk Kim, Jae Hoon Jeong, Jae Weon Kim Soo Young Lee, (1996), “Feature extraction based on Zerocrossing with peak amplitudes for for robust speech recognition in noisy environments”, IEEE Trans. On Audio Electro acoust., Vol AU 17, pp.61-64 [8] Chandrasekhar.C and Yegnanarayana .B, (1996), “Recognition of Stop – Consonant – Vowel (SCV) segments in continuous speech using neural network models”, Journal of Institution of Electronics and Telecommunication Engineers(IETE), Vol 42,pp.269-280 [9] Ki-Seok-Kim; Hee-Yeung-Hwang, (1991), “A study on the speech recognition of Korean phonemes using recurrent neural network models”, Transactions of the Korean Institute of Electrical Engineers. Vol.40,no.8,p782-91. [10] Rabiner L. R. Juang.B.H, (1993), Fundamentals of Speech Recognition , Prentice Hall [11] Sunilkumar R.K and Lajish.V.L, (2012), “Phone recognition using Zerocrossing Interval Distribution of Speech Patters and ANN”, International Journal of Speech Technology, (IJST), Springer, pp.1-7. [12] Flangen J.L, (1972), Speech Synthesis, analysis and perception, New York, Springer Verlag [13] Rabiner, L. R., and Schafer, R. W., (1978), Digital Processing of Speech Signals, Prentice-Hall, Englewood Cliffs, NJ. [14] L. R. Rabiner and M. R. Sambur, (1975), “An algorithm for determining the endpoints of isolated utterances”, Bell System Technical Journal, vol. 54, no. 2, pp. 297–315. [15] Robert Hecht-Nielsen, (1990), Neurocomputing, Addison-Wesley. [16] Sada Siva Varma A, Strube H.W, Rajesh Varma and Agarwal SS, ( 1996), “Recognition of Hindi Consonant Using Time Delay Neural Network”, Journal of the Acoustical Society Of India, Vol.24,pp III-10.1-10.6. [17] Richard P. Lippmann, (1989), “Review of Neural Networks for Speech Recognition”, Neural Computation, Spring, Vol. 1, no. 1, pp. 1-38. [18] W.Y. Huang and RP. Lippmann,(1987), Neural Net and Traditional Classifiers, Proc. IEEE Corif'. on Neural Information Processing Systems - Natural and Synthetic, IEEE, New York. [19] Simon King, Paul Taylor, (2000), Detection of phonological features in continuous speech using neural networks, Computer Speech Language, Elsevier, Vol.14, no.4, pp.333-353. Authors Dr Sunilkumar R.K earned his Ph.D in Speech Signal Processing from University of Calicut, Kerala, India in 2004. He has published several research papers in National and International levels in the area of signal processing and artificial neural networks. His research interest includes speech signal processing, neural networks and active noise cancellation. Dr.Lajish.V.L has been associated with University of Calicut, Kerala, INDIA as Head of the Department of Computer Science. He has worked as Scientist RD in TCS Innovation Labs, Tata Consultancy Services Ltd. Mumbai, prior to joining the University. His prime areas of research include Digital speech and image processing, Pattern recognition algorithms and Indian language script technology solutions for mobile devices. He has more than thirty research publications in journals and peer- reviewed National and International conferences to his credit. After his masters in Computer Science from Vellore Institute of Technology, Dr.Lajish earned his Ph.D in Computer Science from University of Calicut in 2007. He is a senior life member of International Association of Computer Science and Information Technology.