IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163
__________________________________________________________________________________________
Volume: 02 Issue: 04 | Apr-2013, Available @ http://www.ijret.org 590
PERFORMANCE OF DIFFERENT CLASSIFIERS IN SPEECH
RECOGNITION
Sonia Sunny1, David Peter S2, K. Poulose Jacob3
1 Associate Professor, Dept. of Computer Science, Prajyoti Niketan College, Kerala, India,
2, 3 Professor, Dept. of Computer Science, Cochin University of Science & Technology, Kerala, India,
sonia.deepak@yahoo.co.in, davidpeter@cusat.ac.in, kpj@cusat.ac.in
Abstract
Speech is the most natural means of communication among human beings, and speech processing and recognition have been intensive areas
of research for the last five decades. Since speech recognition is a pattern recognition problem, classification is an important part of
any speech recognition system. In this work, a speech recognition system is developed for recognizing speaker-independent spoken
digits in Malayalam. Voice signals are sampled directly from the microphone. The proposed method is implemented for 1000 speakers
uttering 10 digits each. Since the speech signals are affected by background noise, the noise is removed from them using a wavelet
denoising method based on soft thresholding. The features are extracted from the signals using Discrete Wavelet Transforms (DWT),
which are well suited to processing non-stationary signals like speech because of their multi-resolution, multi-scale analysis
characteristics. Speech recognition is a multiclass classification problem, so the resulting feature vectors are classified using three
classifiers capable of handling multiple classes: Artificial Neural Networks (ANN), Support Vector Machines (SVM) and Naive
Bayes classifiers. During the classification stage, each classifier is trained on feature vectors of known patterns and then tested
on the test data set. The performance of the classifiers is evaluated based on recognition accuracy. All three methods produced good
recognition accuracy: DWT with ANN produced 89%, DWT with SVM produced 86.6% and DWT with Naive Bayes produced 83.5%.
ANN is found to be the best of the three methods.
Index Terms: Speech Recognition, Soft Thresholding, Discrete Wavelet Transforms, Artificial Neural Networks, Support
Vector Machines and Naive Bayes Classifier.
-----------------------------------------------------------------------***-----------------------------------------------------------------------
1. INTRODUCTION
Automatic Speech recognition (ASR) is one of the intensive
areas of research since it helps people to communicate in a
more natural and effective way[1]. Speech recognition
systems can be characterized by many parameters. The
commonly used method to measure the performance of a
speech recognition system is the recognition accuracy. Many
parameters affect the accuracy of the speech recognition
system. The accuracy and acceptance of speech recognition
have improved considerably in the last few years. Although
automatic speech recognition systems are now extensively
used, their accuracy continues to lag behind human
performance, particularly in adverse conditions [2], and a
considerable gap remains between human and machine
performance [3]. Speech is a multi-component signal with time varying
frequency and amplitude. Due to this variability, transitions
may occur at different times in different frequency bands.
Automatic recognition of spoken digits is one of the
challenging tasks in the field of speech recognition [4]. A
spoken digit recognition process is needed in many
applications that need numbers as input such as automated
banking system, airline reservations, voice dialing telephone,
automatic data entry, command and control etc [5]. ASR is
basically a pattern recognition problem which also involves a
number of technologies and research areas like Signal
Processing, Natural Language Processing, Statistics etc [6].
Speech signals are non-stationary in nature. Speech
recognition is a complex task due to differences in gender,
emotional state, accent, pronunciation, articulation, nasality,
pitch, volume, and speed in the way people speak [7].
The presence of background noise and other types of
disturbances also affects the performance of a speech
recognition system. Speaker independence is difficult to
achieve because these models must recognize the speech
patterns of a large group of people. The paper is organized as follows. Section 2 gives a
brief description of the problem definition. The methodology
used in the design is explained in section 3. The creation of the
digits database is illustrated in section 4. Section 5 describes
the method used for preprocessing. Section 6 elaborates the
feature extraction technique used in this work followed by the
classification techniques in section 7. Section 8 presents a
detailed analysis of the experiments done and the results
obtained. Conclusion is given in the last section.
2. PROBLEM DEFINITION
The main objective of a speech recognition system is to
recognize speech with maximum accuracy. Here, digits in
Malayalam, one of the four major Dravidian languages of
southern India and the official language of Kerala, are
chosen for recognition. Speech recognition research in
Malayalam is in its infancy and only a few works have been
reported. The performance of the overall speech
recognition system depends on the techniques used for pre-
processing, feature extraction technique selected and the
classifiers used. In this work, wavelet denoising method based
on soft thresholding is used for pre-processing the captured
signals [8]. Wavelet based feature extraction method namely
DWT is selected for feature extraction due to the good time
and frequency resolution properties of the wavelets. For
classification, three classifiers are selected namely ANN,
SVM and Naive Bayes classifier. The performance of these
classification methods in classifying the digits is evaluated
along with DWT.
3. ARCHITECTURE OF THE SYSTEM
In this work, the speech recognition system is divided into 4
modules. First the spoken digits database is created since there
is no built-in standard database available in Malayalam. The
speech samples in the database are often corrupted by additive
noises like background noise. So these speech signals are pre-
processed using wavelet denoising techniques to suppress the
noise in it. For processing speech, the speech signal has to be
represented in the form of parameters. So, after denoising the
signals, they are presented to feature extraction stage where
the features are extracted using a wavelet based feature
extraction technique called Discrete Wavelet Transforms. The
extracted features are then classified into appropriate classes
during the classification phase. Here, three classifiers,
namely ANN, SVM and Naive Bayes, are used and their
performance is evaluated based on recognition accuracy.
The four stages of this work are explained below.
4. CREATION OF THE SPEECH DATABASE
In Malayalam, since there is no standard database available, a
spoken digits database is created using 200 speakers of age
between 6 and 70 uttering 10 Malayalam digits. We have used
80 male speakers, 80 female speakers and 40 children for
creating the database. Male and female speech differ in pitch,
frequency, phonetics and many other factors due to the
difference in physiological as well as psychological factors.
The samples stored in the database are recorded by using a
high quality studio-recording microphone at a sampling rate of
8 kHz (4 kHz band limited). Recognition has been made on
the ten Malayalam digits from 0 to 9 under the same
configuration. Our database consists of a total of 2000
utterances of the digits. The spoken digits are preprocessed,
numbered and stored in the appropriate classes in the database.
The number digit, spoken digits in Malayalam, their
International Phonetic Alphabet (IPA) format and English
translation are shown in Table 1.
Table -1: Malayalam digits database
Number digit   IPA format              English translation
0              /puː/ /d͡ʒya/ /ṁ/        Zero
1              /o/ /nn/                One
2              /ɾa/ /ṇṭ/               Two
3              /muː/ /nn/              Three
4              /n̪aː/ /l/               Four
5              /a/ /ñc/                Five
6              /aː/ /r/                Six
7              /eː/ /ɻ/                Seven
8              /e/ /ṭṭ/                Eight
9              /o/ /na/ /pa/ /t̪/       Nine
5. PRE-PROCESSING SIGNALS
Background noise degrades the speech signals, so the noise
is removed from the signals before the features are
extracted. A number of techniques are available for speech
enhancement. Since we use wavelets for feature extraction,
wavelet denoising algorithms are used to reduce the noise in
the signal. The two popular thresholding functions used in
wavelet denoising are the hard and the soft thresholding
functions [9]. In both methods, a threshold value is
selected. In hard thresholding, elements whose absolute
value is lower than the threshold are set to zero. Soft
thresholding is an extension of hard thresholding: the
elements whose absolute values are lower than the threshold
are first set to zero, and the remaining nonzero elements are
then shrunk towards 0. Hard and soft thresholding can be
expressed as

$$ \mathrm{Hard}(X) = \begin{cases} X & \text{if } |X| \ge \iota \\ 0 & \text{if } |X| < \iota \end{cases} \tag{1} $$

$$ \mathrm{Soft}(X) = \begin{cases} \operatorname{sign}(X)\,(|X| - |\iota|) & \text{if } |X| \ge \iota \\ 0 & \text{if } |X| < \iota \end{cases} \tag{2} $$

where X represents the wavelet coefficients and ι is the
threshold value. In this work, the soft thresholding technique
is used. There are different standard threshold values. We
have used the universal threshold derived by Donoho and
Johnstone [10] for white Gaussian noise under a mean square
error criterion, which is defined as

$$ \iota = \sigma \sqrt{2 \ln N} \tag{3} $$

where σ is the standard deviation and N is the length of the
signal. The standard deviation can be estimated as σ =
MAD/0.6745, where MAD is the median of the absolute values
of the wavelet coefficients. The denoising algorithm consists
of 3 main steps.
• Apply the wavelet transform to the noisy signal to
produce the noisy wavelet coefficients up to 8 levels.
• Shrink the detail wavelet coefficients using the
soft thresholding technique with an appropriate
threshold limit.
• Compute the inverse discrete wavelet transform of the
thresholded wavelet coefficients, which produces the
denoised signal.
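The three steps above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the Haar wavelet is assumed because the text does not name the mother wavelet, the noise level is estimated from the finest-scale detail coefficients (a common convention that the text does not confirm), and the function names are hypothetical. The signal length must be divisible by 2 raised to the number of levels.

```python
import math
import statistics

SQRT2 = math.sqrt(2.0)


def haar_step(signal):
    """One DWT level with the Haar wavelet (assumed): (approximation, detail)."""
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) / SQRT2 for i in pairs]
    detail = [(signal[i] - signal[i + 1]) / SQRT2 for i in pairs]
    return approx, detail


def haar_inverse_step(approx, detail):
    """Invert one Haar level, interleaving the reconstructed sample pairs."""
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) / SQRT2)
        out.append((a - d) / SQRT2)
    return out


def soft_threshold(x, thr):
    """Zero small coefficients and shrink the rest towards 0 by thr."""
    if abs(x) < thr:
        return 0.0
    return math.copysign(abs(x) - thr, x)


def universal_threshold(detail, n):
    """sigma * sqrt(2 ln N), with sigma estimated as MAD / 0.6745."""
    mad = statistics.median(abs(d) for d in detail)
    sigma = mad / 0.6745
    return sigma * math.sqrt(2.0 * math.log(n))


def denoise(signal, levels):
    """Decompose, soft-threshold the detail coefficients, reconstruct."""
    n = len(signal)
    approx = list(signal)
    details = []
    for _ in range(levels):                      # step 1: forward transform
        approx, detail = haar_step(approx)
        details.append(detail)
    thr = universal_threshold(details[0], n)     # finest scale (assumption)
    details = [[soft_threshold(d, thr) for d in level] for level in details]
    for detail in reversed(details):             # step 3: inverse transform
        approx = haar_inverse_step(approx, detail)
    return approx


# A step signal with a small alternating disturbance, decomposed to 2 levels.
noisy = [0.1, -0.1, 0.1, -0.1, 4.1, 3.9, 4.1, 3.9]
denoised = denoise(noisy, 2)
```

Only the detail coefficients are thresholded; the coarsest approximation is left untouched so that the overall shape of the signal is preserved.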
6. FEATURE EXTRACTION
Feature Extraction is a major part of the speech recognition
system since it plays an important role to separate one speech
from other and this has been an important area of research for
many years. Selection of the feature extraction technique plays
an important role in the recognition accuracy, which is the
main criterion for a good speech recognition system. Here,
DWT is used for extracting features and a brief description of
this is given below.
6.1 Discrete Wavelet Transforms
DWT is a relatively recent and computationally efficient
technique for extracting information from non-stationary
signals like audio. The main advantage of the wavelet
transforms is that it has a varying window size, being broad at
low frequencies and narrow at high frequencies, thus leading
to an optimal time–frequency resolution in all frequency
ranges [11]. DWT uses digital filtering techniques to obtain a
time-scale representation of the signals. DWT is defined by
$$ W(j, K) = \sum_{j} \sum_{k} X(k)\, 2^{-j/2}\, \Psi(2^{-j} n - k) \tag{4} $$
Where Ψ (t) is the basic analyzing function called the mother
wavelet. In DWT, the original signal passes through a low-
pass filter and a high-pass filter and emerges as two signals,
called approximation coefficients and detail coefficients [12].
In speech signals, low frequency components h[n] are of
greater importance than high frequency components g[n] as
the low frequency components characterize a signal more than
its high frequency components [13]. The successive high pass
and low pass filtering of the signal is given by
$$ Y_{high}[k] = \sum_{n} x[n]\, g[2k - n] \tag{5} $$
$$ Y_{low}[k] = \sum_{n} x[n]\, h[2k - n] \tag{6} $$
Where Yhigh (detail coefficients) and Ylow (approximation
coefficients) are the outputs of the high pass and low pass
filters obtained by sub sampling by 2. The filtering process is
continued until the desired level is reached according to Mallat
algorithm [14]. The discrete wavelet decomposition tree is
shown in figure 1.
Fig -1: DWT Decomposition over 3 levels
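The successive filtering and sub-sampling described in this section can be sketched as follows. The sketch evaluates the sums of Equations (5) and (6) directly and then repeats the step on the approximation output, as in the Mallat decomposition tree. The Haar filter taps, the periodic handling of the signal boundary and the function names are assumptions made for illustration; the paper specifies none of them.

```python
import math

# Haar analysis filters (assumed; the source does not name the wavelet).
H = [1 / math.sqrt(2), 1 / math.sqrt(2)]     # low-pass taps h[n]
G = [1 / math.sqrt(2), -1 / math.sqrt(2)]    # high-pass taps g[n]


def analyze(x, taps):
    """Compute y[k] = sum_n x[n] * taps[2k - n] for k = 0 .. len(x)/2 - 1.

    Substituting m = 2k - n, the sum runs over the tap index m and reads
    x[2k - m]; indices outside the signal wrap around (periodic extension,
    an assumption of this sketch).
    """
    n_samples = len(x)
    out = []
    for k in range(n_samples // 2):
        acc = 0.0
        for m, tap in enumerate(taps):
            acc += x[(2 * k - m) % n_samples] * tap
        out.append(acc)
    return out


def dwt_tree(x, levels):
    """Repeatedly split the approximation into approximation and detail."""
    approx = list(x)
    details = []
    for _ in range(levels):
        details.append(analyze(approx, G))   # Y_high: detail coefficients
        approx = analyze(approx, H)          # Y_low: approximation coefficients
    return approx, details


signal = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
approx, details = dwt_tree(signal, 3)
```

Because each level keeps every second filter output, the number of coefficients halves at every level, which is why deep decompositions yield short feature vectors.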
7. CLASSIFICATION
Classification is another important part of a speech recognition
system since the patterns are classified into different classes
during this stage. Pattern recognition is becoming increasingly
important in the age of automation and information handling
and retrieval. During classification stage, decisions are made
based on the similarity measures from training patterns using
information relating to known patterns. Then they are tested
using the unknown patterns. Since there are a number of
different classes in a speech recognition problem, a multiclass
classification technique is needed.
In almost all classification methods, the data is separated into
training and test sets. Each instance in the training set contains
a target value which represents the corresponding class and a
set of attributes. The test data do not contain a target value.
The objective of the classifier is to produce a model from the
training data which predicts the target values of the test data.
A brief description of the classifiers used in this work,
namely ANN, SVM and Naive Bayes, is given below.
7.1 Artificial Neural Networks
Nowadays, ANNs are utilized in many applications due to
their parallel distributed processing, distributed memories,
error stability, and pattern learning and distinguishing ability.
ANN is an information processing paradigm consisting of a
number of simple processing units or nodes called neurons.
Each neuron accepts a weighted set of inputs and produces an
output [15]. Algorithms based on ANN are well suited to
speech recognition tasks. Inspired by the human brain,
neural network models exhibit characteristics such as
learning, generalization, adaptivity and fault tolerance
[16].
In this work, we are using the architecture of the MLP
network, which consists of an input layer, one or more hidden
layers, and an output layer. The algorithm used is the back
propagation training algorithm. In this type of network, the
input is presented to the network and moves through the
weights and nonlinear activation functions towards the output
layer, and the error is corrected in a backward direction using
the well-known error back propagation correction algorithm.
After extensive training, the network eventually establishes
the input-output relationships through the adjusted weights on
the network. After training the network, it is tested with the
dataset used for testing.
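The forward pass and the backward error correction described above can be illustrated with a small multilayer perceptron written in plain Python. This is a sketch under assumptions the paper does not state: sigmoid activations, a squared-error loss, four hidden units, a learning rate of 0.5 and a three-class toy data set standing in for the DWT feature vectors. All names are hypothetical.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class MLP:
    """One hidden layer, sigmoid units, trained by error back propagation."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
                   for _ in range(n_out)]
        self.b2 = [0.0] * n_out

    def forward(self, x):
        hidden = [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
                  for row, b in zip(self.w1, self.b1)]
        output = [sigmoid(sum(w * h for w, h in zip(row, hidden)) + b)
                  for row, b in zip(self.w2, self.b2)]
        return hidden, output

    def train_step(self, x, target, lr):
        hidden, output = self.forward(x)
        # Output error terms: loss gradient times the sigmoid derivative.
        delta_out = [(o - t) * o * (1.0 - o) for o, t in zip(output, target)]
        # Propagate the error backwards through the output weights.
        delta_hid = [h * (1.0 - h) *
                     sum(d * self.w2[k][j] for k, d in enumerate(delta_out))
                     for j, h in enumerate(hidden)]
        for k, d in enumerate(delta_out):
            for j, h in enumerate(hidden):
                self.w2[k][j] -= lr * d * h
            self.b2[k] -= lr * d
        for j, d in enumerate(delta_hid):
            for i, xi in enumerate(x):
                self.w1[j][i] -= lr * d * xi
            self.b1[j] -= lr * d

    def predict(self, x):
        _, output = self.forward(x)
        return max(range(len(output)), key=output.__getitem__)


def total_loss(net, samples):
    return sum(sum((o - t) ** 2 for o, t in zip(net.forward(x)[1], target))
               for x, target in samples)


# Toy data: three well-separated classes with one-hot targets.
samples = [([0.0, 0.0], [1, 0, 0]), ([0.1, 0.1], [1, 0, 0]),
           ([1.0, 0.0], [0, 1, 0]), ([0.9, 0.1], [0, 1, 0]),
           ([0.0, 1.0], [0, 0, 1]), ([0.1, 0.9], [0, 0, 1])]
net = MLP(n_in=2, n_hidden=4, n_out=3)
before = total_loss(net, samples)
for _ in range(2000):
    for x, target in samples:
        net.train_step(x, target, lr=0.5)
after = total_loss(net, samples)
```

For the digit task, the output layer would have one unit per digit class and the input layer one unit per extracted feature.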
7.2 Support Vector Machines
SVM is a widely used classification technique. Based on
statistical learning theory, it performs classification by
constructing hyperplanes in a multidimensional space that
separate the different class labels [17][18]. Though SVM is
inherently a binary nonlinear classifier, it can be extended
to multiclass classification, which is needed here since
ASR is a multiclass problem. There are two major strategies
for multiclass classification, namely One-against-All [17]
and One-against-One or pairwise classification [19]. The
conventional way is to decompose the M-class problem into a
series of two-class problems and construct several binary
classifiers. In this work, we have used the One-against-One
method, in which there is one binary SVM for each pair of
classes to separate members of one class from members of the
other. This method allows us to train the whole system with
a maximum number of different samples for each class, using
limited computer memory [20].
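The One-against-One scheme can be sketched independently of the binary classifier that is plugged into it. In the illustration below, a nearest-centroid rule stands in for each binary SVM purely to keep the example short and self-contained; it is not an SVM, and the data and function names are hypothetical. What the sketch does show is the pairwise decomposition and the majority vote over all pairs.

```python
from collections import Counter
from itertools import combinations


def centroid(points):
    """Mean vector of a list of equal-length feature vectors."""
    dims = len(points[0])
    return [sum(p[d] for p in points) / len(points) for d in range(dims)]


def sq_dist(a, b):
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def train_pairwise(data):
    """Build one binary model per pair of classes.

    data maps a class label to its training vectors. Each model here is a
    pair of centroids (a stand-in for a trained binary SVM).
    """
    models = {}
    for a, b in combinations(sorted(data), 2):
        models[(a, b)] = (centroid(data[a]), centroid(data[b]))
    return models


def predict_pairwise(models, x):
    """Let every pairwise model vote and return the class with most votes."""
    votes = Counter()
    for (a, b), (centre_a, centre_b) in models.items():
        winner = a if sq_dist(x, centre_a) <= sq_dist(x, centre_b) else b
        votes[winner] += 1
    return votes.most_common(1)[0][0]


# Ten toy classes, one per digit, each with two 2-D feature vectors.
data = {d: [[float(d), d + 0.1], [d + 0.1, float(d)]] for d in range(10)}
models = train_pairwise(data)
n_models = len(models)
label = predict_pairwise(models, [3.0, 3.1])
```

With C classes the scheme trains C(C-1)/2 binary models, so the ten digit classes require 45 of them.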
7.3 Naive Bayes Classifier
Since speech recognition is a multiclass classification
problem and Naive Bayes classifiers can handle multiclass
problems, a Naive Bayes classifier is also used here for
classifying the digits. The Naive Bayes classifier is a
simple and effective probabilistic classification method
based on Bayes' theorem. It is a supervised classification
technique. For each class value, it estimates the
probability that a given instance belongs to that class
[16]. The attribute values within a class are assumed to be
independent of one another, an assumption called class
conditional independence [17]. The Naive Bayes classifier
needs only a small amount of training data to estimate the
parameters for classification. The classifier is stated as
P(A|B) = P (B|A) * P (A)/P(B) (7)
where P(A) is the prior or marginal probability of A,
P(A|B) is the conditional probability of A given B, called
the posterior probability, P(B|A) is the conditional
probability of B given A, and P(B) is the prior or marginal
probability of B, which acts as a normalizing constant. The
probability value of the winning class dominates over that
of the others [18].
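A minimal sketch of how Equation (7) turns into a classifier is given below. It assumes a Gaussian likelihood for each feature, which the paper does not specify, and it drops the normalizing constant P(B) because it is the same for every class and does not change which class wins. Function names and the toy data are hypothetical.

```python
import math
from collections import defaultdict


def fit_naive_bayes(features, labels):
    """Estimate a prior and per-feature mean and variance for each class."""
    by_class = defaultdict(list)
    for x, y in zip(features, labels):
        by_class[y].append(x)
    model = {}
    total = len(labels)
    for y, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # A small constant keeps the variance strictly positive.
        variances = [sum((v - m) ** 2 for v in col) / n + 1e-9
                     for col, m in zip(columns, means)]
        model[y] = (n / total, means, variances)
    return model


def log_scores(model, x):
    """log P(class) + sum of log P(feature | class) for every class.

    The sum over features is where class conditional independence is used.
    """
    scores = {}
    for y, (prior, means, variances) in model.items():
        score = math.log(prior)
        for v, m, var in zip(x, means, variances):
            score += -0.5 * math.log(2 * math.pi * var)
            score -= (v - m) ** 2 / (2 * var)
        scores[y] = score
    return scores


def predict_nb(model, x):
    scores = log_scores(model, x)
    return max(scores, key=scores.get)


features = [[1.0, 2.0], [1.2, 1.8], [0.8, 2.2],
            [5.0, 6.0], [5.2, 5.8], [4.8, 6.2]]
labels = [0, 0, 0, 1, 1, 1]
nb_model = fit_naive_bayes(features, labels)
```

Working with log probabilities avoids numerical underflow when many features are multiplied together.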
8. EXPERIMENTS
Using DWT, the signals are decomposed into approximation
and detail coefficients. The low frequency components are
decomposed up to the 10th level. The number of features
obtained for a digit at the 10th level is 16. The
decomposition of the digits zero to nine using DWT at the
10th level is shown below.
Fig -2: Decomposition of digit zero at 10th level (panels: Original Signal; Word-zero Approximation coefficients Level 8; Detail coefficients Level 8)
Fig- 3: Decomposition of digit one at 10th level
Fig- 4: Decomposition of digit two at 10th level
Fig- 5: Decomposition of digit three at 10th level
Fig- 6: Decomposition of digit five at 10th level
Fig- 7: Decomposition of digit four at 10th level
Fig- 8: Decomposition of digit six at 10th level
(Figs 3 to 8 each contain three panels: Original Signal, Approximation coefficients Level 8 and Detail coefficients Level 8.)
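Fig- 9: Decomposition of digit seven at 10th level
Fig- 10: Decomposition of digit eight at 10th level
Fig- 11: Decomposition of digit nine at 10th level
(Figs 9 to 11 each contain three panels: Original Signal, Approximation coefficients Level 8 and Detail coefficients Level 8.)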
For classification using the ANN, SVM and Naive Bayes
classifiers, the feature set is divided into training,
validation and test sets: 70% of the data is used for
training, 15% for validation and 15% for testing. That is,
7000 samples are used for training, 1500 for validation and
1500 for testing.
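The 70/15/15 split can be reproduced with a short helper. This is a sketch only: the paper does not say whether the data were shuffled or how the subsets were drawn, so the random shuffle, the fixed seed and the function name are assumptions.

```python
import random


def split_dataset(samples, train_pct=70, val_pct=15, seed=0):
    """Shuffle the samples and cut them into training, validation and test sets."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_train = len(shuffled) * train_pct // 100
    n_val = len(shuffled) * val_pct // 100
    train = shuffled[:n_train]
    validation = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]          # whatever remains
    return train, validation, test


# 10000 sample indices stand in for the feature vectors.
train, validation, test = split_dataset(range(10000))
sizes = (len(train), len(validation), len(test))
```

Integer percentages are used so that the subset sizes are computed without floating-point rounding.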
8.1 Performance Evaluation of Classifiers
For classification using ANN, one input layer, one hidden
layer and one output layer are used. The back propagation
algorithm is used for error correction.
For classification using SVM, since one-against-one SVM
classification is used, we construct a binary SVM for each pair
of classes. The number of binary SVMs built is defined as C*
(C-1)/2, where C is the number of classes. So a total of
10*(10-1)/2 = 45 binary SVMs are built. The table given
below shows the comparison of various classifiers.
Table -2: Performance Evaluation of Classifiers
Classifier     Recognition Accuracy (%)
ANN            89
SVM            86.6
Naive Bayes    83.5
From the results it is clear that all these classifiers are good in
recognizing speech. But better results are obtained using
ANN.
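The paper does not define recognition accuracy explicitly. The sketch below assumes the usual definition, the percentage of test utterances whose predicted digit matches the spoken digit; the function name is hypothetical.

```python
def recognition_accuracy(predicted, actual):
    """Percentage of predictions that match the true labels."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("label lists must be non-empty and of equal length")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return 100.0 * correct / len(actual)


# Three of the four toy predictions match the true digits.
score = recognition_accuracy([0, 1, 2, 3], [0, 1, 2, 4])
```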
CONCLUSIONS
In this work, a speech recognition system is designed for
speaker independent spoken digits in Malayalam. The speech
samples in the database are applied to wavelet denoising using
soft thresholding for removing noise. DWT is used for feature
extraction. Three classifiers namely ANN, SVM and Naïve
Bayes classifier are used for classifying the digits into proper
classes. A comparative study of these three classifiers is
performed. The performance of all these techniques are tested
and evaluated. All techniques are found to be efficient in
recognizing speech. The accuracy rate obtained by using DWT
and ANN combination is 89% which is found to be better
than that of the other two methods. The experiment results
show that this hybrid architecture using discrete wavelet
transforms and neural networks could effectively extract the
features from the speech signal and classify it for efficient
recognition of digits. For future work, other feature extraction
techniques like Linear Predictive Coding (LPC), Mel
Frequency Cepstral Coefficients (MFCC), Wavelet packet
Decomposition (WPD) can also be used and classified using
these classifiers.
ACKNOWLEDGEMENTS
I wish to express my sincere gratitude to all who have
contributed throughout the course of this work.
REFERENCES:
[1]. L. Rabiner, B. H. Juang, “Fundamentals of Speech
Recognition”, Prentice-Hall, Englewood Cliffs, NJ, 1993.
[2]. Kenneth Thomas Schutte, “Parts-based Models and Local
Features for Automatic Speech recognition”, PhD Thesis in
Electrical Engineering and Computer Science, 2009.
[3]. Scharenborg, “Reaching Over the Gap: A Review of
Efforts to Link Human and Automatic Speech Recognition
Research”, Speech Communication, 49:336–347, 2007.
[4]. Y. Ajami Alotaibi, Investigating Spoken Arabic Digits in
Speech Recognition Setting, Information Sciences, Vol 173,
pp. 115-139, 2005.
[5]. C. Kurian, K. Balakrishnan, Speech Recognition of
Malayalam Numbers, World Congress on Nature and
Biologically Inspired Computing, pp. 1475-1479, 2009.
[6]. B. Gold, N. Morgan, Speech and Audio Signal Processing,
New York, John Wiley and Sons, 2002.
[7]. recognition. http://www.learnartificialneuralnetworks.com/speechrecognition.html
[8]. Daniel M. Rasetshwane, J. Robert Boston, Ching-Chung
Li, Identification of Speech Transients Using Variable Frame
Rate Analysis and Wavelet Packets, Proc.of the 28th IEEE
EMBS Annual International Conference, USA, 2006.
[9]. Yasser Ghanbari, Mohammad Reza Karami, A new
Approach for Speech Enhancement based on the Adaptive
Thresholding of the Wavelet Packets, Speech
Communication, Vol. 48 (8), pp. 927–940, 2006.
[10]. D.L. Donoho., De-noising by Soft Thresholding, IEEE
transactions on Information Theory, vol. 41, No. 3, pp. 613-
627, 1995.
[11]. Elif Derya Übeyli, Combined Neural Network Model
Employing Wavelet Coefficients for ECG Signals
Classification, Digital Signal Processing, Vol 19, pp. 297-308,
2009.
[12]. S. Chan Woo, C. Peng Lin, R. Osman, Development of
a Speaker Recognition System using Wavelets and Artificial
Neural Networks, Proc. of Int. Symposium on Intelligent
Multimedia, Video and Speech processing, pp. 413-416,
2001.
[13]. S. Kadambe, P. Srinivasan., Application of Adaptive
Wavelets for Speech, Optical Engineering , Vol 33(7), pp.
2204-2211, 1994.
[14]. S .G. Mallat., A Theory for Multiresolution Signal
Decomposition: The Wavelet Representation, IEEE
Transactions on Pattern Analysis And Machine Intelligence,
Vol.11, 674-693, 1989.
[15]. Freeman J. A, Skapura D. M., 2006. Neural Networks
Algorithm, Application and Programming Techniques,
Pearson Education.
[16]. Economou K., Lymberopoulos D., 1999. A New
Perspective in Learning Pattern Generation for Teaching
Neural Networks, Volume 12, Issue 4-5, 767-775.
[17]. V.N. Vapnik., Statistical Learning Theory, J. Wiley,
N.Y., 1998.
[18]. N. Cristianini, J. Shawe-Taylor., An introduction to
Support Vector Machines, Cambridge University Press,
Cambridge, U.K., 2000.
[19]. Ulrich H.-G. Kreßel, Pairwise Classification and
Support Vector Machines, Advances in Kernel Methods
Support Vector Machine Learning, Cambridge, MA, MIT
press, pp. 255-268, 1999.
[20]. C.W. Hsu, C.J. Lin, A Comparison of Methods for
Multi-class Support Vector Machines. IEEE Transactions on
Neural Networks, 13(2), pp. 415–425, 2002.
[21]. Li Dan, Liu Lihua, Zhang Zhaoxin, Research of Text
Categorization on WEKA, proc. Of Third International
Conference on Intelligent System Design and Engineering
Applications, pp. 1129-1131, 2013
[22]. J. Han and M. Kamber, Data Mining Concepts and
Techniques, Morgan Kaufmann Publishers, 2000.
[23]. Laszlo Toth, Andras Kocsor, Janos Csirik, On Naive
Bayes In Speech Recognition, Int. J. Appl. Math. Comput.
Sci., Vol. 15, No. 2, pp. 287–294, 2005.
BIOGRAPHIES:
Sonia Sunny is working as Associate
professor in the department of Computer
Science, Prajyoti Niketan College,
Pudukad, Thrissur, Kerala State, India.
Currently she is doing research in the
area of Speech Recognition at Cochin
University of Science and Technology,
Cochin, Kerala State, India. In 1995, she
received her M.Sc in Computer Science
from Bharathidasan University, Thiruchirappalli and M.Phil in
Computer Science from Bharathiar University, Coimbatore in
2008. Her research interest includes Speech Processing,
Artificial Intelligence and Pattern Recognition.
David Peter S is presently working as
professor in Computer Science at School
of Engineering, Cochin University of
Science and Technology, Cochin,
Kerala, India. He did his Post-
graduation in Computer Science at IIT
Madras and PhD at Cochin University.
He has presented research papers in
several International Conferences and
has published many articles in International Journals. His
areas of interest include Artificial Intelligence, Natural
Language Processing, Information Systems, Engineering, etc.
K. Poulose Jacob, Professor of Computer
Science at Cochin University of Science
and Technology (CUSAT) since 1994, is
currently Director of the School of
Computer Science Studies. He holds
additional charge as the honorary Director
of CUSAT Planning & Development. He
has presented research papers in several
International Conferences in Europe, USA, UK, Australia and
other countries. He has delivered invited talks at several
national and international events. Dr. Jacob is a Professional
member of the ACM (Association for Computing Machinery)
and a Life Member of the Computer Society of India. Till
now twelve candidates have obtained PhD degrees in
Computer Science & Engineering under his supervision. He
has been PhD Theses examiner for several Universities. He
has more than 75 research publications to his credit. His
research interests are in Information Systems Engineering,
Intelligent Architectures and Networks.

More Related Content

What's hot

An optimized approach to voice translation on mobile phones
An optimized approach to voice translation on mobile phonesAn optimized approach to voice translation on mobile phones
An optimized approach to voice translation on mobile phoneseSAT Journals
 
An Improved Approach for Word Ambiguity Removal
An Improved Approach for Word Ambiguity RemovalAn Improved Approach for Word Ambiguity Removal
An Improved Approach for Word Ambiguity RemovalWaqas Tariq
 
SPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURES
SPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURESSPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURES
SPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURESacijjournal
 
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Waqas Tariq
 
5215ijcseit01
5215ijcseit015215ijcseit01
5215ijcseit01ijcsit
 
Development of text to speech system for yoruba language
Development of text to speech system for yoruba languageDevelopment of text to speech system for yoruba language
Development of text to speech system for yoruba languageAlexander Decker
 
On Developing an Automatic Speech Recognition System for Commonly used Englis...
On Developing an Automatic Speech Recognition System for Commonly used Englis...On Developing an Automatic Speech Recognition System for Commonly used Englis...
On Developing an Automatic Speech Recognition System for Commonly used Englis...rahulmonikasharma
 
B tech project_report
B tech project_reportB tech project_report
B tech project_reportabhiuaikey
 

Recently uploaded

History of Indian Railways - the story of Growth & Modernization
History of Indian Railways - the story of Growth & ModernizationHistory of Indian Railways - the story of Growth & Modernization
History of Indian Railways - the story of Growth & ModernizationEmaan Sharma
 
Diploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfDiploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfJNTUA
 
Software Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdfSoftware Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdfssuser5c9d4b1
 
Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...IJECEIAES
 
Insurance management system project report.pdf
Insurance management system project report.pdfInsurance management system project report.pdf
Insurance management system project report.pdfKamal Acharya
 
Intro to Design (for Engineers) at Sydney Uni
Intro to Design (for Engineers) at Sydney UniIntro to Design (for Engineers) at Sydney Uni
Intro to Design (for Engineers) at Sydney UniR. Sosa
 
Circuit Breakers for Engineering Students
Circuit Breakers for Engineering StudentsCircuit Breakers for Engineering Students
Circuit Breakers for Engineering Studentskannan348865
 
Adsorption (mass transfer operations 2) ppt
Adsorption (mass transfer operations 2) pptAdsorption (mass transfer operations 2) ppt
Adsorption (mass transfer operations 2) pptjigup7320
 
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas SachpazisSeismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas SachpazisDr.Costas Sachpazis
 
21P35A0312 Internship eccccccReport.docx
21P35A0312 Internship eccccccReport.docx21P35A0312 Internship eccccccReport.docx
21P35A0312 Internship eccccccReport.docxrahulmanepalli02
 
Working Principle of Echo Sounder and Doppler Effect.pdf
Working Principle of Echo Sounder and Doppler Effect.pdfWorking Principle of Echo Sounder and Doppler Effect.pdf
Working Principle of Echo Sounder and Doppler Effect.pdfSkNahidulIslamShrabo
 
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...drjose256
 
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxSLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxCHAIRMAN M
 
engineering chemistry power point presentation
engineering chemistry  power point presentationengineering chemistry  power point presentation
engineering chemistry power point presentationsj9399037128
 
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...josephjonse
 
handbook on reinforce concrete and detailing
handbook on reinforce concrete and detailinghandbook on reinforce concrete and detailing
handbook on reinforce concrete and detailingAshishSingh1301
 
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024EMMANUELLEFRANCEHELI
 
analog-vs-digital-communication (concept of analog and digital).pptx
analog-vs-digital-communication (concept of analog and digital).pptxanalog-vs-digital-communication (concept of analog and digital).pptx
analog-vs-digital-communication (concept of analog and digital).pptxKarpagam Institute of Teechnology
 
Artificial Intelligence in due diligence
Artificial Intelligence in due diligenceArtificial Intelligence in due diligence
Artificial Intelligence in due diligencemahaffeycheryld
 
NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...
NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...
NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...Amil baba
 

Recently uploaded (20)

History of Indian Railways - the story of Growth & Modernization
History of Indian Railways - the story of Growth & ModernizationHistory of Indian Railways - the story of Growth & Modernization
History of Indian Railways - the story of Growth & Modernization
 
Diploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfDiploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdf
 
Software Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdfSoftware Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdf
 
Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...
 
Insurance management system project report.pdf
Insurance management system project report.pdfInsurance management system project report.pdf
Insurance management system project report.pdf
 
Intro to Design (for Engineers) at Sydney Uni
Intro to Design (for Engineers) at Sydney UniIntro to Design (for Engineers) at Sydney Uni
Intro to Design (for Engineers) at Sydney Uni
 
Circuit Breakers for Engineering Students
Circuit Breakers for Engineering StudentsCircuit Breakers for Engineering Students
Circuit Breakers for Engineering Students
 
Adsorption (mass transfer operations 2) ppt
Adsorption (mass transfer operations 2) pptAdsorption (mass transfer operations 2) ppt
Adsorption (mass transfer operations 2) ppt
 
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas SachpazisSeismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
 
21P35A0312 Internship eccccccReport.docx
21P35A0312 Internship eccccccReport.docx21P35A0312 Internship eccccccReport.docx
21P35A0312 Internship eccccccReport.docx
 
Working Principle of Echo Sounder and Doppler Effect.pdf
Working Principle of Echo Sounder and Doppler Effect.pdfWorking Principle of Echo Sounder and Doppler Effect.pdf
Working Principle of Echo Sounder and Doppler Effect.pdf
 
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
 
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxSLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
 
engineering chemistry power point presentation
engineering chemistry  power point presentationengineering chemistry  power point presentation
engineering chemistry power point presentation
 
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
 
handbook on reinforce concrete and detailing
handbook on reinforce concrete and detailinghandbook on reinforce concrete and detailing
handbook on reinforce concrete and detailing
 
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
 
analog-vs-digital-communication (concept of analog and digital).pptx
analog-vs-digital-communication (concept of analog and digital).pptxanalog-vs-digital-communication (concept of analog and digital).pptx
analog-vs-digital-communication (concept of analog and digital).pptx
 
Artificial Intelligence in due diligence
Artificial Intelligence in due diligenceArtificial Intelligence in due diligence
Artificial Intelligence in due diligence
 
NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...
NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...
NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...
 

Performance of different classifiers in speech recognition

  • 1. IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163
Volume: 02 Issue: 04 | Apr-2013, Available @ http://www.ijret.org 590
PERFORMANCE OF DIFFERENT CLASSIFIERS IN SPEECH RECOGNITION
Sonia Suuny1, David Peter S2, K. Poulose Jacob3
1 Associate Professor, Dept. of Computer Science, Prajyoti Niketan College, Kerala, India
2, 3 Professor, Dept. of Computer Science, Cochin University of Science & Technology, Kerala, India
sonia.deepak@yahoo.co.in, davidpeter@cusat.ac.in, kpj@cusat.ac.in
Abstract
Speech is the most natural means of communication among human beings, and speech processing and recognition have been intensive areas of research for the last five decades. Since speech recognition is a pattern recognition problem, classification is an important part of any speech recognition system. In this work, a speech recognition system is developed for recognizing speaker-independent spoken digits in Malayalam. Voice signals are sampled directly from the microphone. The proposed method is implemented for 1000 speakers uttering 10 digits each. Since the speech signals are affected by background noise, the noise is removed using a wavelet denoising method based on soft thresholding. The features are extracted from the signals using Discrete Wavelet Transforms (DWT), which are well suited to processing non-stationary signals like speech because of their multi-resolution, multi-scale analysis characteristics. Speech recognition is a multiclass classification problem, so the resulting feature vectors are classified using three classifiers that can handle multiple classes: Artificial Neural Networks (ANN), Support Vector Machines (SVM) and Naive Bayes classifiers. 
During the classification stage, each classifier is trained on feature vectors with known class labels and then tested on the test data set. The performance of the classifiers is evaluated based on recognition accuracy. All three methods produced good recognition accuracy: DWT with ANN achieved 89%, DWT with SVM achieved 86.6%, and DWT with Naive Bayes achieved 83.5%. ANN is found to be the best of the three methods.
Index Terms: Speech Recognition, Soft Thresholding, Discrete Wavelet Transforms, Artificial Neural Networks, Support Vector Machines and Naive Bayes Classifier.
1. INTRODUCTION
Automatic Speech Recognition (ASR) is an intensive area of research since it helps people to communicate in a more natural and effective way [1]. Speech recognition systems can be characterized by many parameters. The most commonly used measure of the performance of a speech recognition system is recognition accuracy, which is affected by many parameters. The accuracy and acceptance of speech recognition have improved considerably in the last few years. Although ASR systems are now extensively used, their accuracy continues to lag behind human performance, particularly in adverse conditions [2]. In spite of the advances in ASR, there is still a considerable gap between human and machine performance [3]. Speech is a multi-component signal with time-varying frequency and amplitude. Due to this variability, transitions may occur at different times in different frequency bands. Automatic recognition of spoken digits is one of the challenging tasks in the field of speech recognition [4]. 
A spoken digit recognition process is needed in many applications that take numbers as input, such as automated banking systems, airline reservations, voice dialing telephones, automatic data entry, and command and control [5]. ASR is basically a pattern recognition problem that also involves a number of technologies and research areas such as Signal Processing, Natural Language Processing and Statistics [6]. Speech signals are non-stationary in nature. Speech recognition is a complex task because speakers differ in gender, emotional state, accent, pronunciation, articulation, nasality, pitch, volume and speaking rate [7]. Background noise and other types of disturbance also affect the performance of a speech recognition system. Speaker independence is difficult to achieve because speaker-independent models must recognize the speech patterns of a large group of people. The paper is organized as follows. Section 2 gives a brief description of the problem definition. The methodology used in the design is explained in Section 3. The creation of the digits database is illustrated in Section 4. Section 5 describes the method used for pre-processing. Section 6 elaborates the feature extraction technique used in this work, followed by the classification techniques in Section 7. Section 8 presents a
  • 2. detailed analysis of the experiments done and the results obtained. The conclusion is given in the last section.
2. PROBLEM DEFINITION
The main objective of a speech recognition system is to recognize speech with maximum accuracy. Here, digits in Malayalam, one of the four major Dravidian languages of southern India and the official language of the people of Kerala, are chosen for recognition. Speech recognition research in Malayalam is in its infancy, and only a few works have been reported. The performance of the overall speech recognition system depends on the pre-processing technique, the feature extraction technique and the classifiers used. In this work, a wavelet denoising method based on soft thresholding is used for pre-processing the captured signals [8]. The wavelet-based feature extraction method DWT is selected because of the good time and frequency resolution properties of wavelets. For classification, three classifiers are selected: ANN, SVM and Naive Bayes. The performance of these classifiers in classifying the digits is evaluated in combination with DWT.
3. ARCHITECTURE OF THE SYSTEM
In this work, the speech recognition system is divided into 4 modules. First, the spoken digits database is created, since no standard database is available in Malayalam. The speech samples in the database are often corrupted by additive noise such as background noise, so the speech signals are pre-processed using wavelet denoising techniques to suppress the noise. To process speech, the signal has to be represented in the form of parameters. 
So, after denoising, the signals are passed to the feature extraction stage, where the features are extracted using a wavelet-based technique called the Discrete Wavelet Transform. The extracted features are then assigned to the appropriate classes during the classification phase. Here, three classifiers (ANN, SVM and Naive Bayes) are used, and their performance is evaluated based on recognition accuracy. The four stages of this work are explained below.
4. CREATION OF THE SPEECH DATABASE
Since there is no standard database available in Malayalam, a spoken digits database is created using 200 speakers aged between 6 and 70, each uttering 10 Malayalam digits. We have used 80 male speakers, 80 female speakers and 40 children for creating the database. Male and female speech differ in pitch, frequency, phonetics and many other respects because of physiological as well as psychological factors. The samples stored in the database are recorded using a high-quality studio-recording microphone at a sampling rate of 8 kHz (band-limited to 4 kHz). Recognition is performed on the ten Malayalam digits from 0 to 9 under the same configuration. Our database consists of a total of 2000 utterances of the digits. The spoken digits are pre-processed, numbered and stored in the appropriate classes in the database. The number digit, the spoken digits in Malayalam, their International Phonetic Alphabet (IPA) format and the English translation are shown in Table 1.
Table -1: Malayalam digits database
Number digit | IPA format | English translation
0 | /puː//d͡ʒya/ /ṁ/ | Zero
1 | /o/ /nn/ | One
2 | /ɾa/ /ṇṭ/ | Two
3 | /muː/ /nn/ | Three
4 | /n̪aː/ /l/ | Four
5 | /a/ /ñc/ | Five
6 | /aː/ /r/ | Six
7 | /eː/ /ɻ/ | Seven
8 | /e/ /ṭṭ/ | Eight
9 | /o//na//pa/ /t̪/ | Nine
5. PRE-PROCESSING SIGNALS
Background noise degrades the speech signals, so the noise is removed from the signals before the features are extracted. 
A number of techniques are available for speech enhancement. Since we are using wavelets for feature extraction, wavelet denoising algorithms are used for reducing the noise in the signal. The two popular thresholding functions used in wavelet denoising are the hard and the soft thresholding functions [9]. In both methods, a threshold value is selected. In hard thresholding, elements whose absolute value is lower than the threshold are set to zero. Soft thresholding is an extension of hard thresholding: the elements whose absolute values are lower than the threshold are first set to zero, and the remaining nonzero elements are then shrunk towards 0. Hard and soft thresholding can be expressed as
  • 3. Where X represents the wavelet coefficients and ι is the threshold value. In this work, the soft thresholding technique is used. There are different standard threshold values. We have used the universal threshold derived by Donoho and Johnstone [10] for white Gaussian noise under a mean square error criterion, which is defined as $\iota = \sigma \sqrt{2 \ln N}$ (3), where σ is the standard deviation and N is the length of the signal. The standard deviation can be calculated as σ = MAD/0.6745, where MAD is the median of the absolute values of the wavelet coefficients. The denoising algorithm consists of 3 main steps:
- Apply the wavelet transform to the noisy signal to produce the noisy wavelet coefficients up to 8 levels.
- Shrink the detail wavelet coefficients using the soft thresholding technique with an appropriate threshold limit.
- Compute the inverse discrete wavelet transform of the thresholded wavelet coefficients, which produces the denoised signal.
6. FEATURE EXTRACTION
Feature extraction is a major part of a speech recognition system, since it plays an important role in separating one speech sound from another, and it has been an important area of research for many years. The choice of feature extraction technique strongly affects the recognition accuracy, which is the main criterion for a good speech recognition system. Here, DWT is used for extracting features, and a brief description of it is given below.
6.1 Discrete Wavelet Transforms
DWT is a relatively recent and computationally efficient technique for extracting information from non-stationary signals like audio. 
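As an aside on the denoising stage of Section 5, the two thresholding rules and the universal threshold can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the function names are hypothetical, the coefficients are assumed to be a plain list of detail coefficients from some wavelet transform that is not shown here, and the threshold uses the standard Donoho-Johnstone form σ√(2 ln N) with σ = MAD/0.6745.

```python
import math
from statistics import median


def hard_threshold(coeffs, thr):
    """Keep coefficients with |x| >= thr and set the rest to zero."""
    return [x if abs(x) >= thr else 0.0 for x in coeffs]


def soft_threshold(coeffs, thr):
    """Zero the small coefficients, then shrink the survivors toward 0 by thr."""
    return [
        math.copysign(abs(x) - thr, x) if abs(x) >= thr else 0.0
        for x in coeffs
    ]


def universal_threshold(detail_coeffs, n):
    """Universal threshold sigma * sqrt(2 ln n), with sigma = MAD / 0.6745.

    MAD is taken, as in the text, to be the median of the absolute values
    of the wavelet coefficients; n is the length of the signal.
    """
    mad = median(abs(x) for x in detail_coeffs)
    sigma = mad / 0.6745
    return sigma * math.sqrt(2.0 * math.log(n))


if __name__ == "__main__":
    # Hypothetical detail coefficients, chosen only for illustration.
    coeffs = [3.0, -0.5, -2.0, 0.2]
    print(hard_threshold(coeffs, 1.0))
    print(soft_threshold(coeffs, 1.0))
    print(universal_threshold(coeffs, n=1024))
```

In a full denoising pipeline the thresholded detail coefficients would then be passed to an inverse wavelet transform; that step depends on the wavelet library used and is left out of this sketch.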
The main advantage of the wavelet transform is that it has a varying window size, broad at low frequencies and narrow at high frequencies, which leads to an optimal time–frequency resolution in all frequency ranges [11]. DWT uses digital filtering techniques to obtain a time-scale representation of the signals. DWT is defined by Eq. (4), where Ψ(t) is the basic analyzing function called the mother wavelet. In DWT, the original signal passes through a low-pass filter and a high-pass filter and emerges as two signals, called the approximation coefficients and the detail coefficients [12]. In speech signals, the low frequency components h[n] are of greater importance than the high frequency components g[n], as the low frequency components characterize a signal more than its high frequency components [13]. The successive high-pass and low-pass filtering of the signal is given by Eqs. (5) and (6), where Yhigh (detail coefficients) and Ylow (approximation coefficients) are the outputs of the high-pass and low-pass filters obtained by sub-sampling by 2. The filtering process is continued until the desired level is reached, according to the Mallat algorithm [14]. The discrete wavelet decomposition tree is shown in Figure 1.
Fig -1: DWT Decomposition over 3 levels
7. CLASSIFICATION
Classification is another important part of a speech recognition system, since the patterns are assigned to different classes during this stage. Pattern recognition is becoming increasingly important in the age of automation and information handling and retrieval. During the classification stage, decisions are made based on similarity measures learned from training patterns with known class information, and the classifier is then tested on unknown patterns. Since there are a number of different classes in a speech recognition problem, a multiclass classification technique is needed. In almost all classification methods, the data is separated into training and test sets. 
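The filter-bank decomposition described in Section 6.1 can be illustrated with the simplest possible wavelet. The sketch below assumes the Haar wavelet, which is a choice made here for brevity and is not stated in the text; the function names are hypothetical. Each step filters the signal with a low-pass and a high-pass pair and sub-samples by 2, and only the approximation branch is decomposed further, as in the Mallat tree.

```python
import math


def haar_step(signal):
    """One analysis step: low-pass and high-pass filtering, sub-sampled by 2.

    Samples are processed in pairs; a trailing unpaired sample is dropped.
    """
    s = 1.0 / math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) * s for i in pairs]
    detail = [(signal[i] - signal[i + 1]) * s for i in pairs]
    return approx, detail


def haar_dwt(signal, levels):
    """Decompose the approximation branch repeatedly, up to `levels` times.

    Returns the final approximation coefficients and a list of detail
    coefficient lists, ordered from the first level to the last.
    """
    approx = list(signal)
    details = []
    for _ in range(levels):
        if len(approx) < 2:
            break  # nothing left to split
        approx, detail = haar_step(approx)
        details.append(detail)
    return approx, details


if __name__ == "__main__":
    # Toy signal of length 8, so three levels leave one approximation value.
    approx, details = haar_dwt([4, 4, 2, 2, 6, 6, 8, 8], levels=3)
    print(approx)
    for level, d in enumerate(details, start=1):
        print(level, d)
```

Each level halves the number of coefficients, which is why a deep decomposition of a short utterance leaves only a small feature vector.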
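Equations (1), (2), (4), (5) and (6), referred to in Sections 5 and 6.1:
$$X_{Hard} = \begin{cases} X & \text{if } |X| \ge \iota \\ 0 & \text{if } |X| < \iota \end{cases}$$ (1)
$$X_{Soft} = \begin{cases} \mathrm{sign}(X)\,(|X| - |\iota|) & \text{if } |X| \ge \iota \\ 0 & \text{if } |X| < \iota \end{cases}$$ (2)
$$W(j, K) = \sum_{j} \sum_{k} X(k)\, 2^{-j/2}\, \Psi(2^{-j} n - k)$$ (4)
$$Y_{high}[k] = \sum_{n} x[n]\, g[2k - n]$$ (5)
$$Y_{low}[k] = \sum_{n} x[n]\, h[2k - n]$$ (6)
Each instance in the training set contains a target value which represents the corresponding class and a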
  • 4. set of attributes. The test data do not contain a target value. The objective of the classifier is to produce a model from the training data which predicts the target values of the test data. A brief description of the classifiers used in this work (ANN, SVM and Naive Bayes) is given below.
7.1 Artificial Neural Networks
ANNs are now used in many applications because of their parallel distributed processing, distributed memories, error stability, and ability to learn and distinguish patterns. An ANN is an information processing paradigm consisting of a number of simple processing units, or nodes, called neurons. Each neuron accepts a weighted set of inputs and produces an output [15]. Algorithms based on ANNs are well suited to speech recognition tasks. Inspired by the human brain, neural network models exhibit characteristics such as learning, generalization, adaptivity and fault tolerance [16]. In this work, we use the MLP network architecture, which consists of an input layer, one or more hidden layers, and an output layer. The network is trained with the back propagation algorithm. In this type of network, the input is presented to the network and moves through the weights and nonlinear activation functions towards the output layer, and the error is corrected in the backward direction using the well-known error back propagation algorithm. After extensive training, the network eventually establishes the input-output relationships through its adjusted weights. After training, the network is tested with the test data set.
7.2 Support Vector Machines
SVM is a very useful technique for classification. 
It performs classification by constructing hyperplanes in a multidimensional space that separate the different class labels, based on statistical learning theory [17][18]. Though SVM is inherently a binary nonlinear classifier, it can be extended to multiclass classification, which is needed because ASR is a multiclass problem. There are two major strategies for multiclass classification, namely One-against-All [17] and One-against-One, or pairwise, classification [19]. The conventional way is to decompose the M-class problem into a series of two-class problems and construct several binary classifiers. In this work, we have used the One-against-One method, in which there is one binary SVM for each pair of classes to separate members of one class from members of the other. This method allows the whole system to be trained with a maximum number of different samples for each class using limited computer memory [20].
7.3 Naive Bayes Classifier
Since speech recognition is a multiclass classification problem and Naive Bayes classifiers can handle multiclass problems, a Naive Bayes classifier is also used here for classifying the digits. The Naive Bayes classifier is a simple and effective probabilistic classification method based on Bayes' theorem. It is a supervised classification technique. For each class value, it estimates the probability that a given instance belongs to that class [16]. Within a class, the value of each feature is assumed to be independent of the values of the other attributes, an assumption called class conditional independence [17]. The Naive Bayes classifier needs only a small training set to estimate the parameters for classification. The classifier is stated as $P(A \mid B) = \frac{P(B \mid A)\, P(A)}{P(B)}$ (7), where P(A) is the prior or marginal probability of A, P(A|B) is the conditional probability of A given B, called the posterior probability, P(B|A) is the conditional probability of B given A, and P(B) is the prior or marginal probability of B, which acts as a normalizing constant. 
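To make Eq. (7) concrete, the following sketch applies Bayes' rule to a toy two-class problem. All numbers, class names and function names are hypothetical and chosen only for illustration; they are not taken from the experiments. The class-conditional likelihood is formed as a product of per-feature probabilities, which is exactly the class conditional independence assumption.

```python
def naive_likelihood(feature_probs):
    """P(B|A) under class conditional independence: a product over features."""
    result = 1.0
    for p in feature_probs:
        result *= p
    return result


def posteriors(priors, likelihoods):
    """Apply P(A|B) = P(B|A) * P(A) / P(B) to every class A."""
    joint = {c: likelihoods[c] * priors[c] for c in priors}
    evidence = sum(joint.values())  # P(B), the normalizing constant
    return {c: joint[c] / evidence for c in joint}


if __name__ == "__main__":
    # Hypothetical priors and per-feature probabilities for two digit classes.
    priors = {"one": 0.5, "two": 0.5}
    likelihoods = {
        "one": naive_likelihood([0.8, 0.5]),
        "two": naive_likelihood([0.2, 0.5]),
    }
    post = posteriors(priors, likelihoods)
    winner = max(post, key=post.get)
    print(post, winner)
```

The predicted class is simply the one with the largest posterior, and because P(B) is the same for every class it only rescales the scores without changing the winner.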
The probability value of the winning class dominates over that of the others [18].
8. EXPERIMENTS
Using DWT, the signals are decomposed into approximation and detail coefficients. The low frequency components are decomposed up to the 10th level. The number of features obtained for a digit at the 10th level is 16. The decomposition of the digits zero to nine using DWT at the 10th level is shown below.
Fig -2: Decomposition of digit zero at 10th level (panels for the word zero: Original Signal; Approximation coefficients Level 8; Detail coefficients Level 8)
  • 5. Fig -3: Decomposition of digit one at 10th level
Fig -4: Decomposition of digit two at 10th level
Fig -5: Decomposition of digit three at 10th level
Fig -6: Decomposition of digit five at 10th level
Fig -7: Decomposition of digit four at 10th level
Fig -8: Decomposition of digit six at 10th level
(Each figure has three panels for its word: Original Signal; Approximation coefficients Level 8; Detail coefficients Level 8)
  • 6. Fig -9: Decomposition of digit seven at 10th level
Fig -10: Decomposition of digit eight at 10th level
Fig -11: Decomposition of digit nine at 10th level
For classification using ANN, SVM and the Naive Bayes classifier, the feature set is divided into training, validation and test sets: 70% of the data is used for training, 15% for validation and 15% for testing. That is, 7000 samples are used for training, 1500 for validation and 1500 for testing.
8.1 Performance Evaluation of Classifiers
For classification using ANN, one input layer, one hidden layer and one output layer are used, and the back propagation algorithm is used for error correction. For classification using SVM, since the one-against-one method is used, we construct a binary SVM for each pair of classes. The number of binary SVMs built is C(C-1)/2, where C is the number of classes, so a total of 10*(10-1)/2 = 45 binary SVMs are built. The table below compares the classifiers.
Table -2: Performance Evaluation of Classifiers
Classifier | Recognition accuracy (%)
ANN | 89
SVM | 86.6
Naive Bayes | 83.5
The results show that all of these classifiers are good at recognizing speech, with the best results obtained using ANN.
CONCLUSIONS
In this work, a speech recognition system is designed for speaker-independent spoken digits in Malayalam. The speech samples in the database are denoised using wavelet denoising with soft thresholding. DWT is used for feature extraction. Three classifiers, namely ANN, SVM and Naive Bayes, are used for classifying the digits into the proper classes, and a comparative study of the three is performed. The performance of all these techniques is tested and evaluated. 
All three techniques are found to be effective in recognizing speech. The accuracy obtained using the DWT and ANN combination is 89%, which is better than that of the other two methods. The experimental results show that this hybrid architecture using discrete wavelet transforms and neural networks can effectively extract features from the speech signal and classify them for efficient recognition of digits. In future work, other feature extraction techniques such as Linear Predictive Coding (LPC), Mel Frequency Cepstral Coefficients (MFCC) and Wavelet Packet Decomposition (WPD) can be used, with the resulting features classified using these classifiers.

ACKNOWLEDGEMENTS

I wish to express my sincere gratitude to all who have contributed throughout the course of this work.

[Figures 9-11: each plot has three panels for the spoken word (seven, eight, nine), titled "Original Signal", "Approximation coefficients Level 8" and "Detail coefficients Level 8".]
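The arithmetic behind the 70/15/15 split and the one-against-one SVM count in Section 8.1 can be checked with a short sketch. It assumes the 10,000 samples are shuffled at random before splitting, which the paper does not state. The helper names `split_indices` and `one_vs_one_pairs` are hypothetical, and no classifier is trained here.

```python
import random
from itertools import combinations


def split_indices(n_samples, train_pct=70, val_pct=15, seed=0):
    """Shuffle sample indices and cut them into train/validation/test parts."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # assumption: random split with a fixed seed
    n_train = n_samples * train_pct // 100
    n_val = n_samples * val_pct // 100
    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]  # the remainder is the test set
    return train, val, test


def one_vs_one_pairs(n_classes):
    """List every unordered pair of classes; one binary SVM is trained per pair."""
    return list(combinations(range(n_classes), 2))


train, val, test = split_indices(10_000)
pairs = one_vs_one_pairs(10)
print(len(train), len(val), len(test))  # 7000 1500 1500
print(len(pairs))  # 45
```

Enumerating the pairs makes the C(C-1)/2 formula concrete: with ten digit classes there are 45 unordered pairs, and in a one-against-one scheme each pair gets its own binary classifier.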
  • 7. REFERENCES:
[1]. L. Rabiner, B. H. Juang, Fundamentals of Speech Recognition, Prentice-Hall, Englewood Cliffs, NJ, 1993.
[2]. Kenneth Thomas Schutte, Parts-based Models and Local Features for Automatic Speech Recognition, PhD Thesis in Electrical Engineering and Computer Science, 2009.
[3]. Scharenborg, Reaching Over the Gap: A Review of Efforts to Link Human and Automatic Speech Recognition Research, Speech Communication, Vol. 49, pp. 336-347, 2007.
[4]. Y. Ajami Alotaibi, Investigating Spoken Arabic Digits in Speech Recognition Setting, Information Sciences, Vol. 173, pp. 115-139, 2005.
[5]. C. Kurian, K. Balakrishnan, Speech Recognition of Malayalam Numbers, World Congress on Nature and Biologically Inspired Computing, pp. 1475-1479, 2009.
[6]. B. Gold, N. Morgan, Speech and Audio Signal Processing, New York, John Wiley and Sons, 2002.
[7]. http://www.learnartificialneuralnetworks.com/speechrecognition.html
[8]. Daniel M. Rasetshwane, J. Robert Boston, Ching-Chung Li, Identification of Speech Transients Using Variable Frame Rate Analysis and Wavelet Packets, Proc. of the 28th IEEE EMBS Annual International Conference, USA, 2006.
[9]. Yasser Ghanbari, Mohammad Reza Karami, A New Approach for Speech Enhancement Based on the Adaptive Thresholding of the Wavelet Packets, Speech Communication, Vol. 48 (8), pp. 927-940, 2006.
[10]. D. L. Donoho, De-noising by Soft Thresholding, IEEE Transactions on Information Theory, Vol. 41, No. 3, pp. 613-627, 1995.
[11]. Elif Derya Übeyli, Combined Neural Network Model Employing Wavelet Coefficients for ECG Signals Classification, Digital Signal Processing, Vol. 19, pp. 297-308, 2009.
[12]. S. Chan Woo, C. Peng Lin, R. Osman, Development of a Speaker Recognition System using Wavelets and Artificial Neural Networks, Proc. of Int. Symposium on Intelligent Multimedia, Video and Speech Processing, pp. 413-416, 2001.
[13]. S. Kadambe, P. Srinivasan, Application of Adaptive Wavelets for Speech, Optical Engineering, Vol. 33 (7), pp. 2204-2211, 1994.
[14]. S. G. Mallat, A Theory for Multiresolution Signal Decomposition: The Wavelet Representation, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 11, pp. 674-693, 1989.
[15]. J. A. Freeman, D. M. Skapura, Neural Networks: Algorithms, Applications and Programming Techniques, Pearson Education, 2006.
[16]. K. Economou, D. Lymberopoulos, A New Perspective in Learning Pattern Generation for Teaching Neural Networks, Vol. 12, Issue 4-5, pp. 767-775, 1999.
[17]. V. N. Vapnik, Statistical Learning Theory, J. Wiley, N.Y., 1998.
[18]. N. Cristianini, J. Shawe-Taylor, An Introduction to Support Vector Machines, Cambridge University Press, Cambridge, U.K., 2000.
[19]. Ulrich H.-G. Kreßel, Pairwise Classification and Support Vector Machines, Advances in Kernel Methods: Support Vector Machine Learning, Cambridge, MA, MIT Press, pp. 255-268, 1999.
[20]. C. W. Hsu, C. J. Lin, A Comparison of Methods for Multi-class Support Vector Machines, IEEE Transactions on Neural Networks, Vol. 13 (2), pp. 415-425, 2002.
[21]. Li Dan, Liu Lihua, Zhang Zhaoxin, Research of Text Categorization on WEKA, Proc. of Third International Conference on Intelligent System Design and Engineering Applications, pp. 1129-1131, 2013.
[22]. J. Han, M. Kamber, Data Mining Concepts and Techniques, Morgan Kaufmann Publishers, 2000.
[23]. Laszlo Toth, Andras Kocsor, Janos Csirik, On Naive Bayes in Speech Recognition, Int. J. Appl. Math. Comput. Sci., Vol. 15, No. 2, pp. 287-294, 2005.

BIOGRAPHIES:
Sonia Sunny is working as Associate Professor in the Department of Computer Science, Prajyoti Niketan College, Pudukad, Thrissur, Kerala State, India.
Currently she is doing research in the area of Speech Recognition at Cochin University of Science and Technology, Cochin, Kerala State, India. She received her M.Sc in Computer Science from Bharathidasan University, Thiruchirappalli in 1995 and her M.Phil in Computer Science from Bharathiar University, Coimbatore in 2008. Her research interests include Speech Processing, Artificial Intelligence and Pattern Recognition.

David Peter S is presently working as Professor in Computer Science at the School of Engineering, Cochin University of Science and Technology, Cochin, Kerala, India. He did his post-graduation in Computer Science at IIT Madras and his PhD at Cochin University. He has presented research papers in several International Conferences and has published many articles in International Journals. His areas of interest include Artificial Intelligence, Natural Language Processing, Information Systems, Engineering, etc.

K. Poulose Jacob, Professor of Computer Science at Cochin University of Science and Technology (CUSAT) since 1994, is currently Director of the School of Computer Science Studies. He holds additional charge as the honorary Director of CUSAT Planning & Development. He has presented research papers in several
  • 8. International Conferences in Europe, USA, UK, Australia and other countries. He has delivered invited talks at several national and international events. Dr. Jacob is a Professional member of the ACM (Association for Computing Machinery) and a Life Member of the Computer Society of India. To date, twelve candidates have obtained PhD degrees in Computer Science & Engineering under his supervision. He has been a PhD thesis examiner for several universities. He has more than 75 research publications to his credit. His research interests are in Information Systems Engineering, Intelligent Architectures and Networks.