SlideShare a Scribd company logo
1 of 6
Download to read offline
M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30
www.ijera.com 25 | P a g e
Identification of Sex of the Speaker With Reference To Bodo
Vowels: A Comparative Experimental Study
M.K.Deka1
, B. Sarma2
, P.H. Talukdar3
1
Assistant Professor, Dept. of Computer Science & Technology , Bodoland University
2
Research Fellow, Dept. of Instrumentation, Gauhati University
3
Professor, Dept. of Instrumentation, Gauhati University
ABSTRACT
This work presents an application of Fundamental Frequency (Pitch), Linear Predictive Cepstral Coefficient
(LPCC) and Mel Frequency Cepstral Coefficient (MFCC) in identification of sex of the speaker in speech
recognition research. The aim of this article is to compare the performance of these three methods for
identification of sex of the speakers. A successful speech recognition system can help in non critical operations
such as presenting the driving route to the driver, dialing a phone number, light switch turn on/off, the coffee
machine on/off etc. apart from speaker verification-caste wise, community wise and locality wise including
identification of sex. Here an attempt has been made to identify the sex of Bodo speakers through vowel
utterance by following Pitch value, LPCC and MFCC techniques. It is found here that the feature vector
organization of LPCC coefficients provides a more promising way of speech-speaker recognition in case of
Bodo Language than that of Pitch and MFCC.
Keywords: Fundamental Frequency or Pitch, Linear Predictive Cepstral Coefficient (LPCC), Mel Frequency
Cepstral Coefficient (MFCC), sex of speakers, Bodo Language
I. INTRODUCTION
The language used in the present study is
Bodo language. Linguistically, the Bodo belongs to
Sino-Tibeton group of languages [1]. The Bodo
language has 22 phonemes which include 6 vowels,
14 consonants and 2 semi-vowels.
The Pitch, LPCC (Linear Predictive
Cepstral Co-efficient), and MFCC as speech
recognizer have been used by several workers [2]
in the last two decades. This paper focuses on the
identification of the speaker with reference to the
sex of the speaker. How we can identify whether
the speaker is male or female is studied in this
paper through three techniques and gives a
comparison among these techniques to find out
which one technique is more reliable. In the present
study, to reduce the volume of input data,
clustering techniques [3].
II. FEATURE EXTRACTION
In the present study, LPC-based Cepstral
Coefficients, Mel Frequency-based Cepstral
Coefficients and phonetically important parameters
are used as feature vectors.
Speech signal is first subjected to low-pass
filter to prevent the aliasing effect. The vowel
speech waveform is then sampled at 8 KHz and
quantized by a 16 bit resolution. The use of end
point detection algorithm removes the unnecessary
silence period at the beginning and at the end of the
speech signal. To bring the signal in the spectral
domain the signal is subjected to pre-emphasis
procedure through a first order digital filter whose
transfer function has been given by (1-0.95z-
1
).Consecutive speech signal is blocked after every
31.25 milliseconds. 250 samples have been allotted
to each of the 32 frames. Frames are overlapped by
20 milliseconds to produce a frame rate of 10
milliseconds. To reduce the undesired effect of
Gibbs phenomenon4
, the frames are multiplied by a
windows function (Hamming window), which is
given by,
(1)
Where N is the number of sample in a block
III. DATABASE OF BODO VOWELS
The database used in the present study
consists of isolated utterances of Bodo vowels
recorded from 20 Bodo speakers comprised of 10
males and 10 females speakers. The testing data
base has been created from 24 speakers from each
group.
IV. FUNDAMENTAL FREQUENCY
ESTIMATION
The opening and closing of the vocal folds
that occur during speaking break the air steam into
chains of pulses. The rate of repetition of these
pulses is the pitch and it defines the fundamental
frequency of the speech signal [4]. In other words,
the rate of vibrations of the vocal folds is the
RESEARCH ARTICLE OPEN ACCESS
M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30
www.ijera.com 26 | P a g e
fundamental frequency of the voice. The frequency
increases when the vocal folds are made taut.
Relative differences in the fundamental frequency
of the voice are utilized in all languages to study
the various aspects of linguistic information [5]
conveyed by it. The general problem of
fundamental frequency estimation is to take a
portion of signal and to find the dominant
frequency of repetition. Thus, the difficulties that
arises in the estimation of fundamental frequency
are (i) all signals are not periodic, (ii) those are
periodic may be changing in fundamental
frequency over the time of interest, (iii) signals
may be contaminated with noise, even with
periodic signals of other fundamental frequencies,
(iv) signals which are periodic with interval T are
also periodic with interval 2T, 3T etc., so we need
to find the smallest periodic interval or the highest
fundamental frequency, and (v) even signals of
constant fundamental frequency may be changing
in other ways over the interval of interest. In
general, the fundamental frequency of the speech
wave is estimated using autocorrelation. The
mathematical model used for estimating the
fundamental frequency is given below [6]:
A discrete short-time sequence is given by
sn[m]=s[m]w[n-m] (2)
Where w[n] is an analysis window of duration N
w
.
The short-time autocorrelation function r
n
[τ] is
defined by
sn[m]=s[m]w[n-m]
α
=∑ Sn[m]Sn[m+τ] (3)
m= -α
Where s[m] is periodic with period p, r
n
[τ] contains
peak at or near the pitch period p. For unvoiced
sound no clear peak occurs near an expected pitch
period. Location of the peak in the pitch period
range provides a measure of pitch estimation and
voicing decision. The above correlation pitch
estimator can be obtained, more formally, by
minimizing over possible pitch periods (p>0), and
the error criterion is given by
α
E[p]=∑ (Sn[m]-Sn[m+p])2
(4)
m= -α
Minimizing E[p] with respect to p yields
α
p=max ( ∑ (Sn[m]Sn[m+p] ) (5)
m= -α
Where p >є i.e., p is sufficiently far from zero.
This alternative view of autocorrelation
pitch estimation is used for detecting the pitch of
Bodo vowels. The speech waveform and
corresponding pitch spectra of six Bodo vowels
have been depicted in Fig.-1 and Fig.-2 for both
male and female informants. The estimated values
of the fundamental frequency or pitch have been
given in Table-1 for Bodo vowels.
Table-1: Pitch of six Bodo vowels
Sex Of The
Specimen
Fundamental Frequency (Hz) For Vowel
/a/ /e/ /i/ /o/ /u/ /w/
Male 129.03 380.95 148.15 153.85 125.00 145.46
Feamle 228.57 242.42 266.67 250.00 117.65 250.00
Fig.-1: Estimation of typical pitch of Bodo vowel corresponding to male informants (time domain)
M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30
www.ijera.com 27 | P a g e
Fig.-2: Estimation of typical pitch of Bodo vowel corresponding to female informants (time domain)
4.1 Results and Discussion
Typically, the pitch or fundamental
frequency ranges from 80Hz to 160Hz for male
speakers and from 140Hz to 400Hz for female
speakers [3].The estimation of pitch finds extensive
use in speech encoding, synthesis and recognition.
In adult, generally the length of vocal folds in male
is more than that of female counterpart. The more
is the vocal fold length, less is the pitch frequency.
Thus the pitch differs in male and female
informants. In our present study, as given in Table-
1, we observe that the values of pitch or
fundamental frequency for female informant are
higher than that of male informants as proposed by
Pinto-et-al [7]. Thus, the pitch or fundamental
frequency can be affectively used for sex
verification of Bodo speakers.
V. LINEAR PREDICTIVE CEPSTRAL
COEFFICIENTS (LPCC)
Speech compression/Speech coding is a
method for reducing the amount of information
needed to represent a speech signal. LPC is one of
the methods of compression that models the
process of speech production. A digital method for
encoding an analog signal in which a particular
value is predicted by a linear function of the past
values of the signal. At a particular time, t, the
speech sample s(t) is represented as a linear sum of
the p previous samples [7]. The general algorithm
for linear predictive coding involves an analysis or
encoding part and a synthesis or decoding part. In
the encoding, LPC takes the speech signal in blocks
or frames of speech and determines the input signal
and the coefficients of the filter that will be capable
of reproducing the current block of speech. The
LPC based cepstral coefficients are described as:





1
1
,
m
k
kmkmm
ac
m
k
ac pk 1 (6)





1
1
,
m
k
kmkm
ac
m
k
c pm  (7)
Where, ck is an LPC based cepstral coefficient. It
has been established that LPC based cepstral
coefficients produce better recognition results when
they are appropriately weighted. The weighting
function is given as:
,sin
2
1)( 








Q
mQ
mw

Qm 1 (8)
Where Q is the order of the cepstral coefficients.
In addition to the cepstral coefficients, their time
derivative approximations are used as feature
vectors to account for the dynamic characteristic of
speech signal. The time derivative is approximated
by a linear regression coefficient over a finite
window, which is defined as [1] –
,.)(ˆ)(ˆ Gmckmc
k
kk
kll 





 


Qml  (9)
Where )(ˆ mc l
the mth is weighted cepstral
coefficients at time and is a constant, used to
make the variances of the derivative terms, equal to
those with the original cepstral coefficients.
In the present investigation, the following typical
values are used –
n=240, p=10, Q=12, k=2 and G=0.316 [4], where p
is predictor order.
5.1 Results and Discussion
In the present study, it has been observed
that by careful selection of the feature set (Frame
no 11, Frame no 12, Frame no 15) could increase
the efficiency of the recognizer. It provides a basis
for Bodo vowels recognizer. Fig 3, Fig 4, Fig 5
have shown the prominent distinction between
male and female utterances for the Bodo vowel
/a/.From our analysis for other vowels also,
distinguishable feature vectors are found for male
and female utterances for the same set of frames.
The linear cepstral coefficients characteristics of
six Bodo vowels, have been studied and depicted in
Fig 3, Fig 4, Fig 5 for male & female informants
respectively. It is clearly observed, that, the Bodo
phonemes(vowels), show a clear distinction
between the spectral characteristics of Male &
Female while using LPCC. It is thus concluded in
the present study that LPCC measure may be a
better technique for speaker verification with
respect to sex.
M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30
www.ijera.com 28 | P a g e
0 2 4 6 8 10 12 14 16 18 20
-4
-2
0
2
4
6
8
10
Cepstral Coefficients
LogMagnitude(dB)
Frame no.:11
Male
Female
Fig. (3): Cepstral Coefficients of the Bodo Vowel /a/ for Frame 11
0 2 4 6 8 10 12 14 16 18 20
-1
-0.5
0
0.5
1
1.5
Cepstral Coefficients
LogMagnitude(dB)
Frame no.:12
Male
Female
Fig. (4) : Cepstral Coefficients of the Bodo Vowel /a/ for Frame 12
0 2 4 6 8 10 12 14 16 18 20
-3
-2
-1
0
1
2
3
4
5
6
Cepstral Coefficients
LogMagnitude(dB)
Frame no.:15
Male
Female
Fig. (5): Cepstral Coefficients of the Bodo Vowel /a/ for Frame 15
M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30
www.ijera.com 29 | P a g e
VI. MEL-CEPSTRUM
Instead of using linear prediction, another
method is often used in speech recognition, namely
the Mel-cepstrum. This method consists of two
parts: the cepstrum calculation and a method called
Mel scaling. The cepstrum method is a way of
finding the vocal tract filter H (z) with
“homomorphic processing”. Homomorphic signal
processing is generally concerned with the
transformation to linear domain of signals,
combined in a nonlinear way. In this case, the two
signals are not combined linearly, as convolution
can’t be described as a simple linear combination.
The speech signal s(n), can be seen as the result of
a convolution between u(n) and h(n):
s(n)=b0.u(n) * h(n) (10)
Where u(n) is a normalized excitation signal, b0 is
the gain of the excitation signal, and h(n) is a vocal
tract transfer characteristic.
In frequency domain, in equation (10) is described
as –
s(z)=b0.U(z)H(z) (11)
Since the term, representing excitation, U(z), and
the vocal tract function, H(z), are combined
multiplicative, it is difficult to separate them
explicitly. But, if the log operation is applied to, the
task will become additive:
Logs(z)=log(b0.U(z)H(z))=log(b0.U(z))+log(H(z))
(12)
The additive property of the log spectrum
also applies in an inverse transformation. The result
of this operation is called a cepstrum.
6.1 Measure of Mel Frequency Cepstral
Coefficients (MFCC) for BODO Phonemes
In the earlier method of feature extraction,
preferably, used the Linear Predictive Coding
(LPC) [3, 4, 7]. Mel Frequency Cepstral
Coefficient (MFCC) analysis has been widely used
in signal processing in general and speech
processing in particular [8]. This coefficient has a
great success in speech recognition
application.Fig.6 represents the algorithms
involved while calculating Mel Frequency Cepstral
Coefficients (MFCC).
Fig 6: Algorithm for calculating Mel Frequency Cepstral Coefficients
6.2 Results and Discussion
The typical Mel frequency cepstral
coefficients six Bodo vowels, corresponding to
male and female informants, have been depicted in
Fig 7, Fig 8 . As depicts in Fig.(7,8), it is clearly
observed, that, the Bodo phonemes unable to show
a clear distinction between the spectral
characteristics of Male & Female while using
MFCC i.e. MFCC technique does not provide
sufficient clues to distinguish the male and female
utterances.
Again, it is seen from the present study
that range of the fundamental frequency for male is
80Hz to 160Hz whereas that of for female is from
140Hz to 400Hz [3].It is thus difficult to determine
whether the sound is from male or female in the
range from 140Hz to 160Hz.In this particular range
the utterance may be from man or female, we
cannot say definitely whether the sound is from
man or female. In case of LPC, such type of
problem is absent. In the frame no. 11, 12, 15 there
are distinct differences between the male and
female utterances as shown in Figures 3, 4, and
5.By observing these particular -numbers of frames
we can easily and correctly differentiate the male or
female sound. So, it can be concluded that LPCC
method is more reliable than the Pitch based and
MFCC based method for the identification of the
sex of the Bodo informants.
M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30
www.ijera.com 30 | P a g e
0 2 4 6 8 10 12
-25
-20
-15
-10
-5
0
5
mel Cepstral Coefficients
LogMagnitude(dB)
Vowel/i/
Male
Female
Fig 7: Mel Frequency Cepstral Coefficients of the Bodo Vowel /i/
0 2 4 6 8 10 12
-40
-35
-30
-25
-20
-15
-10
-5
0
5
mel Cepstral Coefficients
LogMagnitude(dB)
Vowel/e/
Male
Female
Fig 8: Mel Frequency Cepstral Coefficients of the Bodo Vowel /e/
REFERENCES
[1]. Goswami,S.N.“Studies in Sino-Tibetan
Languages”,Published by M.Goswami,
1988.
[2]. Furui S,IEEE Trans. Acoustic Speech,
Signal Processing,34,1986,52.
[3]. Rabiner LR, Levinson SE, Rosenberg AE,
and Wilpon JG,IEEE Trans Acoustics,
Speech, Signal Proc ASSP-27,1979,336.
[4]. Jantzen,J.,“Neurofuzzy modeling”,
Technical Report 98-H-874,Technical
University of Denmark: Oersted-DTU,
1998.
[5]. Barman J., Kalita S., Talukdar P. H. –
Feature Extraction of Bodo
VowelsThrough LPC-Analysis,
Proceedings of Frontiers of Research on
Speech and Music(FRSM – 2004)
[6]. Rabiner Lawrence, Jwang Biing-Hwang,
Fundamentals of Speech Recognitions,
Prentice Hall, New Jersey, 1993, ISBN 0–
13 – 015157 – 2
[7]. Rabiner, L. R., Pan, K.C. and Soong, F.
K., “On the performance of isolated word
speech recognition using vector
quantization and temporal energy
contours”, AT & T Bell Lab. Tech. J. Vol.
63, pp. 1245– 1260, September, 1984.
[8]. Juang, B. H., Rabiner, L. R. and Wilpon, J.
G., “On the use of band pass filter in
speech recognition”, IEEE Trans. On
Acoustics, Speech and Signal processing,
pp. 947 – 954, 1987.

More Related Content

What's hot

ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...cscpconf
 
Phonetic distance based accent
Phonetic distance based accentPhonetic distance based accent
Phonetic distance based accentsipij
 
SPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURES
SPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURESSPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURES
SPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURESacijjournal
 
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...acijjournal
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Reviewijiert bestjournal
 
High Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech SynthesisHigh Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech Synthesissipij
 
Feature Extraction Analysis for Hidden Markov Models in Sundanese Speech Reco...
Feature Extraction Analysis for Hidden Markov Models in Sundanese Speech Reco...Feature Extraction Analysis for Hidden Markov Models in Sundanese Speech Reco...
Feature Extraction Analysis for Hidden Markov Models in Sundanese Speech Reco...TELKOMNIKA JOURNAL
 
Broad phoneme classification using signal based features
Broad phoneme classification using signal based featuresBroad phoneme classification using signal based features
Broad phoneme classification using signal based featuresijsc
 
Hindi digits recognition system on speech data collected in different natural...
Hindi digits recognition system on speech data collected in different natural...Hindi digits recognition system on speech data collected in different natural...
Hindi digits recognition system on speech data collected in different natural...csandit
 
Paper id 28201448
Paper id 28201448Paper id 28201448
Paper id 28201448IJRAT
 
Speaker identification using mel frequency
Speaker identification using mel frequency Speaker identification using mel frequency
Speaker identification using mel frequency Phan Duy
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 

What's hot (19)

F010334548
F010334548F010334548
F010334548
 
speech enhancement
speech enhancementspeech enhancement
speech enhancement
 
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
 
Phonetic distance based accent
Phonetic distance based accentPhonetic distance based accent
Phonetic distance based accent
 
SPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURES
SPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURESSPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURES
SPEAKER VERIFICATION USING ACOUSTIC AND PROSODIC FEATURES
 
G010424248
G010424248G010424248
G010424248
 
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...
A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPL...
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Review
 
High Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech SynthesisHigh Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech Synthesis
 
19 ijcse-01227
19 ijcse-0122719 ijcse-01227
19 ijcse-01227
 
Isolated English Word Recognition System: Appropriate for Bengali-accented En...
Isolated English Word Recognition System: Appropriate for Bengali-accented En...Isolated English Word Recognition System: Appropriate for Bengali-accented En...
Isolated English Word Recognition System: Appropriate for Bengali-accented En...
 
Feature Extraction Analysis for Hidden Markov Models in Sundanese Speech Reco...
Feature Extraction Analysis for Hidden Markov Models in Sundanese Speech Reco...Feature Extraction Analysis for Hidden Markov Models in Sundanese Speech Reco...
Feature Extraction Analysis for Hidden Markov Models in Sundanese Speech Reco...
 
Broad phoneme classification using signal based features
Broad phoneme classification using signal based featuresBroad phoneme classification using signal based features
Broad phoneme classification using signal based features
 
Hindi digits recognition system on speech data collected in different natural...
Hindi digits recognition system on speech data collected in different natural...Hindi digits recognition system on speech data collected in different natural...
Hindi digits recognition system on speech data collected in different natural...
 
Paper id 28201448
Paper id 28201448Paper id 28201448
Paper id 28201448
 
Speaker Recognition Using Vocal Tract Features
Speaker Recognition Using Vocal Tract FeaturesSpeaker Recognition Using Vocal Tract Features
Speaker Recognition Using Vocal Tract Features
 
Speaker identification using mel frequency
Speaker identification using mel frequency Speaker identification using mel frequency
Speaker identification using mel frequency
 
D04812125
D04812125D04812125
D04812125
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 

Viewers also liked

Fibrous Scaffold Produced By Rotary Jet Spinning Technique
Fibrous Scaffold Produced By Rotary Jet Spinning TechniqueFibrous Scaffold Produced By Rotary Jet Spinning Technique
Fibrous Scaffold Produced By Rotary Jet Spinning TechniqueIJERA Editor
 
On-state Torque Optimization for Synthesized MR Fluid
On-state Torque Optimization for Synthesized MR FluidOn-state Torque Optimization for Synthesized MR Fluid
On-state Torque Optimization for Synthesized MR FluidIJERA Editor
 
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...IJERA Editor
 
Conceptual Model of Information Technology and Its Impact on Financial Univer...
Conceptual Model of Information Technology and Its Impact on Financial Univer...Conceptual Model of Information Technology and Its Impact on Financial Univer...
Conceptual Model of Information Technology and Its Impact on Financial Univer...IJERA Editor
 
Increasing Security Level in Data Sharing Using Ring Signature in Cloud Envir...
Increasing Security Level in Data Sharing Using Ring Signature in Cloud Envir...Increasing Security Level in Data Sharing Using Ring Signature in Cloud Envir...
Increasing Security Level in Data Sharing Using Ring Signature in Cloud Envir...IJERA Editor
 
Hybrid system with multi-connected boost converter
Hybrid system with multi-connected boost converterHybrid system with multi-connected boost converter
Hybrid system with multi-connected boost converterIJERA Editor
 
Petrological and Geochemical Studies on Granitoids in BibinagarBhongir Area, ...
Petrological and Geochemical Studies on Granitoids in BibinagarBhongir Area, ...Petrological and Geochemical Studies on Granitoids in BibinagarBhongir Area, ...
Petrological and Geochemical Studies on Granitoids in BibinagarBhongir Area, ...IJERA Editor
 
Design and Implementation of Digital Chebyshev Type II Filter using XSG for N...
Design and Implementation of Digital Chebyshev Type II Filter using XSG for N...Design and Implementation of Digital Chebyshev Type II Filter using XSG for N...
Design and Implementation of Digital Chebyshev Type II Filter using XSG for N...IJERA Editor
 
Energy Storage: CFD Modeling of Phase Change Materials For Thermal Energy Sto...
Energy Storage: CFD Modeling of Phase Change Materials For Thermal Energy Sto...Energy Storage: CFD Modeling of Phase Change Materials For Thermal Energy Sto...
Energy Storage: CFD Modeling of Phase Change Materials For Thermal Energy Sto...IJERA Editor
 
Security Issues and Challenges in Internet of Things – A Review
Security Issues and Challenges in Internet of Things – A ReviewSecurity Issues and Challenges in Internet of Things – A Review
Security Issues and Challenges in Internet of Things – A ReviewIJERA Editor
 
Liquid Level Estimation in Dynamic Condition using Kalman Filter
Liquid Level Estimation in Dynamic Condition using Kalman FilterLiquid Level Estimation in Dynamic Condition using Kalman Filter
Liquid Level Estimation in Dynamic Condition using Kalman FilterIJERA Editor
 
A Study on Guided and Unguided Transmission Medias and a Proposed Idea to Ext...
A Study on Guided and Unguided Transmission Medias and a Proposed Idea to Ext...A Study on Guided and Unguided Transmission Medias and a Proposed Idea to Ext...
A Study on Guided and Unguided Transmission Medias and a Proposed Idea to Ext...IJERA Editor
 
Gas-Particulate Models of Flow through Porous Structures
Gas-Particulate Models of Flow through Porous StructuresGas-Particulate Models of Flow through Porous Structures
Gas-Particulate Models of Flow through Porous StructuresIJERA Editor
 
On The Substitution of Energy Sources: The Effect of Flex Fuel Vehicles in th...
On The Substitution of Energy Sources: The Effect of Flex Fuel Vehicles in th...On The Substitution of Energy Sources: The Effect of Flex Fuel Vehicles in th...
On The Substitution of Energy Sources: The Effect of Flex Fuel Vehicles in th...IJERA Editor
 
Big data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing PlatformsBig data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing PlatformsIJERA Editor
 
Mechanical and Chemical Properties of Bamboo/Glass Fibers Reinforced Polyeste...
Mechanical and Chemical Properties of Bamboo/Glass Fibers Reinforced Polyeste...Mechanical and Chemical Properties of Bamboo/Glass Fibers Reinforced Polyeste...
Mechanical and Chemical Properties of Bamboo/Glass Fibers Reinforced Polyeste...IJERA Editor
 
An Overview of Landslide Forecasting Using Wireless Sensor Network and Geogra...
An Overview of Landslide Forecasting Using Wireless Sensor Network and Geogra...An Overview of Landslide Forecasting Using Wireless Sensor Network and Geogra...
An Overview of Landslide Forecasting Using Wireless Sensor Network and Geogra...IJERA Editor
 
Effect of MR Fluid Damping during Milling of CFRP Laminates
Effect of MR Fluid Damping during Milling of CFRP LaminatesEffect of MR Fluid Damping during Milling of CFRP Laminates
Effect of MR Fluid Damping during Milling of CFRP LaminatesIJERA Editor
 

Viewers also liked (18)

Fibrous Scaffold Produced By Rotary Jet Spinning Technique
Fibrous Scaffold Produced By Rotary Jet Spinning TechniqueFibrous Scaffold Produced By Rotary Jet Spinning Technique
Fibrous Scaffold Produced By Rotary Jet Spinning Technique
 
On-state Torque Optimization for Synthesized MR Fluid
On-state Torque Optimization for Synthesized MR FluidOn-state Torque Optimization for Synthesized MR Fluid
On-state Torque Optimization for Synthesized MR Fluid
 
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...
 
Conceptual Model of Information Technology and Its Impact on Financial Univer...
Conceptual Model of Information Technology and Its Impact on Financial Univer...Conceptual Model of Information Technology and Its Impact on Financial Univer...
Conceptual Model of Information Technology and Its Impact on Financial Univer...
 
Increasing Security Level in Data Sharing Using Ring Signature in Cloud Envir...
Increasing Security Level in Data Sharing Using Ring Signature in Cloud Envir...Increasing Security Level in Data Sharing Using Ring Signature in Cloud Envir...
Increasing Security Level in Data Sharing Using Ring Signature in Cloud Envir...
 
Hybrid system with multi-connected boost converter
Hybrid system with multi-connected boost converterHybrid system with multi-connected boost converter
Hybrid system with multi-connected boost converter
 
Petrological and Geochemical Studies on Granitoids in BibinagarBhongir Area, ...
Petrological and Geochemical Studies on Granitoids in BibinagarBhongir Area, ...Petrological and Geochemical Studies on Granitoids in BibinagarBhongir Area, ...
Petrological and Geochemical Studies on Granitoids in BibinagarBhongir Area, ...
 
Design and Implementation of Digital Chebyshev Type II Filter using XSG for N...
Design and Implementation of Digital Chebyshev Type II Filter using XSG for N...Design and Implementation of Digital Chebyshev Type II Filter using XSG for N...
Design and Implementation of Digital Chebyshev Type II Filter using XSG for N...
 
Energy Storage: CFD Modeling of Phase Change Materials For Thermal Energy Sto...
Energy Storage: CFD Modeling of Phase Change Materials For Thermal Energy Sto...Energy Storage: CFD Modeling of Phase Change Materials For Thermal Energy Sto...
Energy Storage: CFD Modeling of Phase Change Materials For Thermal Energy Sto...
 
Security Issues and Challenges in Internet of Things – A Review
Security Issues and Challenges in Internet of Things – A ReviewSecurity Issues and Challenges in Internet of Things – A Review
Security Issues and Challenges in Internet of Things – A Review
 
Liquid Level Estimation in Dynamic Condition using Kalman Filter
Liquid Level Estimation in Dynamic Condition using Kalman FilterLiquid Level Estimation in Dynamic Condition using Kalman Filter
Liquid Level Estimation in Dynamic Condition using Kalman Filter
 
A Study on Guided and Unguided Transmission Medias and a Proposed Idea to Ext...
A Study on Guided and Unguided Transmission Medias and a Proposed Idea to Ext...A Study on Guided and Unguided Transmission Medias and a Proposed Idea to Ext...
A Study on Guided and Unguided Transmission Medias and a Proposed Idea to Ext...
 
Gas-Particulate Models of Flow through Porous Structures
Gas-Particulate Models of Flow through Porous StructuresGas-Particulate Models of Flow through Porous Structures
Gas-Particulate Models of Flow through Porous Structures
 
On The Substitution of Energy Sources: The Effect of Flex Fuel Vehicles in th...
On The Substitution of Energy Sources: The Effect of Flex Fuel Vehicles in th...On The Substitution of Energy Sources: The Effect of Flex Fuel Vehicles in th...
On The Substitution of Energy Sources: The Effect of Flex Fuel Vehicles in th...
 
Big data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing PlatformsBig data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing Platforms
 
Mechanical and Chemical Properties of Bamboo/Glass Fibers Reinforced Polyeste...
Mechanical and Chemical Properties of Bamboo/Glass Fibers Reinforced Polyeste...Mechanical and Chemical Properties of Bamboo/Glass Fibers Reinforced Polyeste...
Mechanical and Chemical Properties of Bamboo/Glass Fibers Reinforced Polyeste...
 
An Overview of Landslide Forecasting Using Wireless Sensor Network and Geogra...
An Overview of Landslide Forecasting Using Wireless Sensor Network and Geogra...An Overview of Landslide Forecasting Using Wireless Sensor Network and Geogra...
An Overview of Landslide Forecasting Using Wireless Sensor Network and Geogra...
 
Effect of MR Fluid Damping during Milling of CFRP Laminates
Effect of MR Fluid Damping during Milling of CFRP LaminatesEffect of MR Fluid Damping during Milling of CFRP Laminates
Effect of MR Fluid Damping during Milling of CFRP Laminates
 

Similar to Identification of Sex of the Speaker With Reference To Bodo Vowels: A Comparative Experimental Study

Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...ijcisjournal
 
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundWavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundTELKOMNIKA JOURNAL
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderIJTET Journal
 
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNALGENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNALIJCSEIT Journal
 
Subjective comparison of_speech_enhancement_algori (1)
Subjective comparison of_speech_enhancement_algori (1)Subjective comparison of_speech_enhancement_algori (1)
Subjective comparison of_speech_enhancement_algori (1)Priyanka Reddy
 
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...IJERA Editor
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features ijsc
 
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...April Smith
 
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...CSCJournals
 
129966864160453838[1]
129966864160453838[1]129966864160453838[1]
129966864160453838[1]威華 王
 
Investigation of-combined-use-of-mfcc-and-lpc-features-in-speech-recognition-...
Investigation of-combined-use-of-mfcc-and-lpc-features-in-speech-recognition-...Investigation of-combined-use-of-mfcc-and-lpc-features-in-speech-recognition-...
Investigation of-combined-use-of-mfcc-and-lpc-features-in-speech-recognition-...Cemal Ardil
 
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...CSCJournals
 
Implementation of Interleaving Methods on MELP 2.4 Coder to Reduce Packet Los...
Implementation of Interleaving Methods on MELP 2.4 Coder to Reduce Packet Los...Implementation of Interleaving Methods on MELP 2.4 Coder to Reduce Packet Los...
Implementation of Interleaving Methods on MELP 2.4 Coder to Reduce Packet Los...IJERA Editor
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueCSCJournals
 

Similar to Identification of Sex of the Speaker With Reference To Bodo Vowels: A Comparative Experimental Study (20)

Bz33462466
Bz33462466Bz33462466
Bz33462466
 
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...
Slope at Zero Crossings (ZC) of Speech Signal for Multi-Speaker Activity Dete...
 
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables SoundWavelet Based Feature Extraction for the Indonesian CV Syllables Sound
Wavelet Based Feature Extraction for the Indonesian CV Syllables Sound
 
Lr3620092012
Lr3620092012Lr3620092012
Lr3620092012
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNALGENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
 
Subjective comparison of_speech_enhancement_algori (1)
Subjective comparison of_speech_enhancement_algori (1)Subjective comparison of_speech_enhancement_algori (1)
Subjective comparison of_speech_enhancement_algori (1)
 
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...
Compressive Sensing in Speech from LPC using Gradient Projection for Sparse R...
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features
 
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...
A Combined Sub-Band And Reconstructed Phase Space Approach To Phoneme Classif...
 
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
 
129966864160453838[1]
129966864160453838[1]129966864160453838[1]
129966864160453838[1]
 
L046056365
L046056365L046056365
L046056365
 
Investigation of-combined-use-of-mfcc-and-lpc-features-in-speech-recognition-...
Investigation of-combined-use-of-mfcc-and-lpc-features-in-speech-recognition-...Investigation of-combined-use-of-mfcc-and-lpc-features-in-speech-recognition-...
Investigation of-combined-use-of-mfcc-and-lpc-features-in-speech-recognition-...
 
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
A Comparative Study: Gammachirp Wavelets and Auditory Filter Using Prosodic F...
 
seminar slides
 seminar slides  seminar slides
seminar slides
 
V041203124126
V041203124126V041203124126
V041203124126
 
Implementation of Interleaving Methods on MELP 2.4 Coder to Reduce Packet Los...
Implementation of Interleaving Methods on MELP 2.4 Coder to Reduce Packet Los...Implementation of Interleaving Methods on MELP 2.4 Coder to Reduce Packet Los...
Implementation of Interleaving Methods on MELP 2.4 Coder to Reduce Packet Los...
 
H0814247
H0814247H0814247
H0814247
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
 

Recently uploaded

Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and usesDevarapalliHaritha
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 

Recently uploaded (20)

Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and uses
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptx
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 

Identification of Sex of the Speaker With Reference To Bodo Vowels: A Comparative Experimental Study

  • 1. M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30 www.ijera.com 25 | P a g e Identification of Sex of the Speaker With Reference To Bodo Vowels: A Comparative Experimental Study M.K.Deka1 , B. Sarma2 , P.H. Talukdar3 1 Assistant Professor, Dept. of Computer Science & Technology , Bodoland University 2 Research Fellow, Dept. of Instrumentation, Gauhati University 3 Professor, Dept. of Instrumentation, Gauhati University ABSTRACT This work presents an application of Fundamental Frequency (Pitch), Linear Predictive Cepstral Coefficient (LPCC) and Mel Frequency Cepstral Coefficient (MFCC) in identification of sex of the speaker in speech recognition research. The aim of this article is to compare the performance of these three methods for identification of sex of the speakers. A successful speech recognition system can help in non critical operations such as presenting the driving route to the driver, dialing a phone number, light switch turn on/off, the coffee machine on/off etc. apart from speaker verification-caste wise, community wise and locality wise including identification of sex. Here an attempt has been made to identify the sex of Bodo speakers through vowel utterance by following Pitch value, LPCC and MFCC techniques. It is found here that the feature vector organization of LPCC coefficients provides a more promising way of speech-speaker recognition in case of Bodo Language than that of Pitch and MFCC. Keywords: Fundamental Frequency or Pitch, Linear Predictive Cepstral Coefficient (LPCC), Mel Frequency Cepstral Coefficient (MFCC), sex of speakers, Bodo Language I. INTRODUCTION The language used in the present study is Bodo language. Linguistically, the Bodo belongs to Sino-Tibeton group of languages [1]. The Bodo language has 22 phonemes which include 6 vowels, 14 consonants and 2 semi-vowels. The Pitch, LPCC (Linear Predictive Cepstral Co-efficient), and MFCC as speech recognizer have been used by several workers [2] in the last two decades. This paper focuses on the identification of the speaker with reference to the sex of the speaker. How we can identify whether the speaker is male or female is studied in this paper through three techniques and gives a comparison among these techniques to find out which one technique is more reliable. In the present study, to reduce the volume of input data, clustering techniques [3]. II. FEATURE EXTRACTION In the present study, LPC-based Cepstral Coefficients, Mel Frequency-based Cepstral Coefficients and phonetically important parameters are used as feature vectors. Speech signal is first subjected to low-pass filter to prevent the aliasing effect. The vowel speech waveform is then sampled at 8 KHz and quantized by a 16 bit resolution. The use of end point detection algorithm removes the unnecessary silence period at the beginning and at the end of the speech signal. To bring the signal in the spectral domain the signal is subjected to pre-emphasis procedure through a first order digital filter whose transfer function has been given by (1-0.95z- 1 ).Consecutive speech signal is blocked after every 31.25 milliseconds. 250 samples have been allotted to each of the 32 frames. Frames are overlapped by 20 milliseconds to produce a frame rate of 10 milliseconds. To reduce the undesired effect of Gibbs phenomenon4 , the frames are multiplied by a windows function (Hamming window), which is given by, (1) Where N is the number of sample in a block III. DATABASE OF BODO VOWELS The database used in the present study consists of isolated utterances of Bodo vowels recorded from 20 Bodo speakers comprised of 10 males and 10 females speakers. The testing data base has been created from 24 speakers from each group. IV. FUNDAMENTAL FREQUENCY ESTIMATION The opening and closing of the vocal folds that occur during speaking break the air steam into chains of pulses. The rate of repetition of these pulses is the pitch and it defines the fundamental frequency of the speech signal [4]. In other words, the rate of vibrations of the vocal folds is the RESEARCH ARTICLE OPEN ACCESS
  • 2. M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30 www.ijera.com 26 | P a g e fundamental frequency of the voice. The frequency increases when the vocal folds are made taut. Relative differences in the fundamental frequency of the voice are utilized in all languages to study the various aspects of linguistic information [5] conveyed by it. The general problem of fundamental frequency estimation is to take a portion of signal and to find the dominant frequency of repetition. Thus, the difficulties that arises in the estimation of fundamental frequency are (i) all signals are not periodic, (ii) those are periodic may be changing in fundamental frequency over the time of interest, (iii) signals may be contaminated with noise, even with periodic signals of other fundamental frequencies, (iv) signals which are periodic with interval T are also periodic with interval 2T, 3T etc., so we need to find the smallest periodic interval or the highest fundamental frequency, and (v) even signals of constant fundamental frequency may be changing in other ways over the interval of interest. In general, the fundamental frequency of the speech wave is estimated using autocorrelation. The mathematical model used for estimating the fundamental frequency is given below [6]: A discrete short-time sequence is given by sn[m]=s[m]w[n-m] (2) Where w[n] is an analysis window of duration N w . The short-time autocorrelation function r n [τ] is defined by sn[m]=s[m]w[n-m] α =∑ Sn[m]Sn[m+τ] (3) m= -α Where s[m] is periodic with period p, r n [τ] contains peak at or near the pitch period p. For unvoiced sound no clear peak occurs near an expected pitch period. Location of the peak in the pitch period range provides a measure of pitch estimation and voicing decision. The above correlation pitch estimator can be obtained, more formally, by minimizing over possible pitch periods (p>0), and the error criterion is given by α E[p]=∑ (Sn[m]-Sn[m+p])2 (4) m= -α Minimizing E[p] with respect to p yields α p=max ( ∑ (Sn[m]Sn[m+p] ) (5) m= -α Where p >є i.e., p is sufficiently far from zero. This alternative view of autocorrelation pitch estimation is used for detecting the pitch of Bodo vowels. The speech waveform and corresponding pitch spectra of six Bodo vowels have been depicted in Fig.-1 and Fig.-2 for both male and female informants. The estimated values of the fundamental frequency or pitch have been given in Table-1 for Bodo vowels. Table-1: Pitch of six Bodo vowels Sex Of The Specimen Fundamental Frequency (Hz) For Vowel /a/ /e/ /i/ /o/ /u/ /w/ Male 129.03 380.95 148.15 153.85 125.00 145.46 Feamle 228.57 242.42 266.67 250.00 117.65 250.00 Fig.-1: Estimation of typical pitch of Bodo vowel corresponding to male informants (time domain)
  • 3. M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30 www.ijera.com 27 | P a g e Fig.-2: Estimation of typical pitch of Bodo vowel corresponding to female informants (time domain) 4.1 Results and Discussion Typically, the pitch or fundamental frequency ranges from 80Hz to 160Hz for male speakers and from 140Hz to 400Hz for female speakers [3].The estimation of pitch finds extensive use in speech encoding, synthesis and recognition. In adult, generally the length of vocal folds in male is more than that of female counterpart. The more is the vocal fold length, less is the pitch frequency. Thus the pitch differs in male and female informants. In our present study, as given in Table- 1, we observe that the values of pitch or fundamental frequency for female informant are higher than that of male informants as proposed by Pinto-et-al [7]. Thus, the pitch or fundamental frequency can be affectively used for sex verification of Bodo speakers. V. LINEAR PREDICTIVE CEPSTRAL COEFFICIENTS (LPCC) Speech compression/Speech coding is a method for reducing the amount of information needed to represent a speech signal. LPC is one of the methods of compression that models the process of speech production. A digital method for encoding an analog signal in which a particular value is predicted by a linear function of the past values of the signal. At a particular time, t, the speech sample s(t) is represented as a linear sum of the p previous samples [7]. The general algorithm for linear predictive coding involves an analysis or encoding part and a synthesis or decoding part. In the encoding, LPC takes the speech signal in blocks or frames of speech and determines the input signal and the coefficients of the filter that will be capable of reproducing the current block of speech. The LPC based cepstral coefficients are described as:      1 1 , m k kmkmm ac m k ac pk 1 (6)      1 1 , m k kmkm ac m k c pm  (7) Where, ck is an LPC based cepstral coefficient. It has been established that LPC based cepstral coefficients produce better recognition results when they are appropriately weighted. The weighting function is given as: ,sin 2 1)(          Q mQ mw  Qm 1 (8) Where Q is the order of the cepstral coefficients. In addition to the cepstral coefficients, their time derivative approximations are used as feature vectors to account for the dynamic characteristic of speech signal. The time derivative is approximated by a linear regression coefficient over a finite window, which is defined as [1] – ,.)(ˆ)(ˆ Gmckmc k kk kll           Qml  (9) Where )(ˆ mc l the mth is weighted cepstral coefficients at time and is a constant, used to make the variances of the derivative terms, equal to those with the original cepstral coefficients. In the present investigation, the following typical values are used – n=240, p=10, Q=12, k=2 and G=0.316 [4], where p is predictor order. 5.1 Results and Discussion In the present study, it has been observed that by careful selection of the feature set (Frame no 11, Frame no 12, Frame no 15) could increase the efficiency of the recognizer. It provides a basis for Bodo vowels recognizer. Fig 3, Fig 4, Fig 5 have shown the prominent distinction between male and female utterances for the Bodo vowel /a/.From our analysis for other vowels also, distinguishable feature vectors are found for male and female utterances for the same set of frames. The linear cepstral coefficients characteristics of six Bodo vowels, have been studied and depicted in Fig 3, Fig 4, Fig 5 for male & female informants respectively. It is clearly observed, that, the Bodo phonemes(vowels), show a clear distinction between the spectral characteristics of Male & Female while using LPCC. It is thus concluded in the present study that LPCC measure may be a better technique for speaker verification with respect to sex.
  • 4. M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30 www.ijera.com 28 | P a g e 0 2 4 6 8 10 12 14 16 18 20 -4 -2 0 2 4 6 8 10 Cepstral Coefficients LogMagnitude(dB) Frame no.:11 Male Female Fig. (3): Cepstral Coefficients of the Bodo Vowel /a/ for Frame 11 0 2 4 6 8 10 12 14 16 18 20 -1 -0.5 0 0.5 1 1.5 Cepstral Coefficients LogMagnitude(dB) Frame no.:12 Male Female Fig. (4) : Cepstral Coefficients of the Bodo Vowel /a/ for Frame 12 0 2 4 6 8 10 12 14 16 18 20 -3 -2 -1 0 1 2 3 4 5 6 Cepstral Coefficients LogMagnitude(dB) Frame no.:15 Male Female Fig. (5): Cepstral Coefficients of the Bodo Vowel /a/ for Frame 15
  • 5. M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30 www.ijera.com 29 | P a g e VI. MEL-CEPSTRUM Instead of using linear prediction, another method is often used in speech recognition, namely the Mel-cepstrum. This method consists of two parts: the cepstrum calculation and a method called Mel scaling. The cepstrum method is a way of finding the vocal tract filter H (z) with “homomorphic processing”. Homomorphic signal processing is generally concerned with the transformation to linear domain of signals, combined in a nonlinear way. In this case, the two signals are not combined linearly, as convolution can’t be described as a simple linear combination. The speech signal s(n), can be seen as the result of a convolution between u(n) and h(n): s(n)=b0.u(n) * h(n) (10) Where u(n) is a normalized excitation signal, b0 is the gain of the excitation signal, and h(n) is a vocal tract transfer characteristic. In frequency domain, in equation (10) is described as – s(z)=b0.U(z)H(z) (11) Since the term, representing excitation, U(z), and the vocal tract function, H(z), are combined multiplicative, it is difficult to separate them explicitly. But, if the log operation is applied to, the task will become additive: Logs(z)=log(b0.U(z)H(z))=log(b0.U(z))+log(H(z)) (12) The additive property of the log spectrum also applies in an inverse transformation. The result of this operation is called a cepstrum. 6.1 Measure of Mel Frequency Cepstral Coefficients (MFCC) for BODO Phonemes In the earlier method of feature extraction, preferably, used the Linear Predictive Coding (LPC) [3, 4, 7]. Mel Frequency Cepstral Coefficient (MFCC) analysis has been widely used in signal processing in general and speech processing in particular [8]. This coefficient has a great success in speech recognition application.Fig.6 represents the algorithms involved while calculating Mel Frequency Cepstral Coefficients (MFCC). Fig 6: Algorithm for calculating Mel Frequency Cepstral Coefficients 6.2 Results and Discussion The typical Mel frequency cepstral coefficients six Bodo vowels, corresponding to male and female informants, have been depicted in Fig 7, Fig 8 . As depicts in Fig.(7,8), it is clearly observed, that, the Bodo phonemes unable to show a clear distinction between the spectral characteristics of Male & Female while using MFCC i.e. MFCC technique does not provide sufficient clues to distinguish the male and female utterances. Again, it is seen from the present study that range of the fundamental frequency for male is 80Hz to 160Hz whereas that of for female is from 140Hz to 400Hz [3].It is thus difficult to determine whether the sound is from male or female in the range from 140Hz to 160Hz.In this particular range the utterance may be from man or female, we cannot say definitely whether the sound is from man or female. In case of LPC, such type of problem is absent. In the frame no. 11, 12, 15 there are distinct differences between the male and female utterances as shown in Figures 3, 4, and 5.By observing these particular -numbers of frames we can easily and correctly differentiate the male or female sound. So, it can be concluded that LPCC method is more reliable than the Pitch based and MFCC based method for the identification of the sex of the Bodo informants.
  • 6. M.K.Deka.et al. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 6, ( Part -5) June 2016, pp.25-30 www.ijera.com 30 | P a g e 0 2 4 6 8 10 12 -25 -20 -15 -10 -5 0 5 mel Cepstral Coefficients LogMagnitude(dB) Vowel/i/ Male Female Fig 7: Mel Frequency Cepstral Coefficients of the Bodo Vowel /i/ 0 2 4 6 8 10 12 -40 -35 -30 -25 -20 -15 -10 -5 0 5 mel Cepstral Coefficients LogMagnitude(dB) Vowel/e/ Male Female Fig 8: Mel Frequency Cepstral Coefficients of the Bodo Vowel /e/ REFERENCES [1]. Goswami,S.N.“Studies in Sino-Tibetan Languages”,Published by M.Goswami, 1988. [2]. Furui S,IEEE Trans. Acoustic Speech, Signal Processing,34,1986,52. [3]. Rabiner LR, Levinson SE, Rosenberg AE, and Wilpon JG,IEEE Trans Acoustics, Speech, Signal Proc ASSP-27,1979,336. [4]. Jantzen,J.,“Neurofuzzy modeling”, Technical Report 98-H-874,Technical University of Denmark: Oersted-DTU, 1998. [5]. Barman J., Kalita S., Talukdar P. H. – Feature Extraction of Bodo VowelsThrough LPC-Analysis, Proceedings of Frontiers of Research on Speech and Music(FRSM – 2004) [6]. Rabiner Lawrence, Jwang Biing-Hwang, Fundamentals of Speech Recognitions, Prentice Hall, New Jersey, 1993, ISBN 0– 13 – 015157 – 2 [7]. Rabiner, L. R., Pan, K.C. and Soong, F. K., “On the performance of isolated word speech recognition using vector quantization and temporal energy contours”, AT & T Bell Lab. Tech. J. Vol. 63, pp. 1245– 1260, September, 1984. [8]. Juang, B. H., Rabiner, L. R. and Wilpon, J. G., “On the use of band pass filter in speech recognition”, IEEE Trans. On Acoustics, Speech and Signal processing, pp. 947 – 954, 1987.