IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163
__________________________________________________________________________________________
Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org 411
SPEAKER IDENTIFICATION SYSTEM USING CLOSE SET
Shweta Bansal1, Ankur Hooda2, Anima3
1, 2, 3
ECE, KIIT College of Engineering, Gurgaon, bansalshwe@gmail.com, ankur.hooda45@gmail.com,
animadahiya806@gmail.com
Abstract
This paper describes experiments conducted to evaluate the performance of speaker recognition. The
experiments, conducted using a Neural Network, show that the complexity of speaker recognition increases when the number of
speakers to be identified is large in the text-independent situation. In the first experiment, the error rate was zero for
the classification of 10 speakers and performance was good. As the number of speakers increased, the error rate increased
and classification performance became poor. Beyond 25 speakers, the error rate was very high. For the classification of 100
speakers, the MATLAB NN tool could not display the confusion matrix. To overcome this problem, a second experiment was
performed. In this experiment, a close set of 10 groups covering 100 speakers (10 speakers per group) was defined as a cell
array in MATLAB, and we observed that the best speaker identification result, 100%, was obtained with 20 continuous features
of the speaker’s voice, at the cost of increased time complexity. In the third experiment, the speakers’ dialects and regions
were also identified: classification performance was 100% at 97 epochs, validation performance was 0.0035046 at 91 epochs,
and the error rate was zero.
Index Terms: text dependent, text independent, speaker identification, Neural Network, close set, MFCC.
-----------------------------------------------------------------------***-----------------------------------------------------------------------
1. INTRODUCTION
Speaker recognition can be performed under two conditions:
text-dependent and text-independent. In addition, there are
two types of speaker identification: closed-set and open-set.
In closed-set identification, the sample of the test speaker is
compared against all the available speaker samples, and the
speaker ID of the sample with the closest match is returned.
In open-set identification, the test speaker’s sample is
compared with a large number of reference samples, which
may or may not include the test speaker. The system verifies
that a given speaker is who he claims to be: if the match
meets the set threshold, the identity claim of the user is
accepted; otherwise it is rejected. The identification strategy
is thus based on a threshold value for accepting or rejecting
the speaker [1][2][3].
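The difference between the two decision rules can be illustrated with a minimal sketch. It assumes a Euclidean distance between feature vectors and a hand-picked threshold; the speaker IDs, feature values and helper names are hypothetical and do not come from the experiments reported here.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def closed_set_identify(test, references):
    """Closed set: always return the enrolled speaker with the closest sample."""
    return min(references, key=lambda sid: euclidean(test, references[sid]))

def open_set_identify(test, references, threshold):
    """Open set: accept the closest speaker only if it is within the threshold."""
    best = closed_set_identify(test, references)
    if euclidean(test, references[best]) <= threshold:
        return best
    return None  # rejected: the test speaker may not be enrolled

# Hypothetical reference samples (assumption, for illustration only)
references = {"spk_a": [1.0, 2.0, 3.0], "spk_b": [4.0, 0.0, 1.0]}
test_sample = [1.1, 2.1, 2.9]

closed_result = closed_set_identify(test_sample, references)
open_result = open_set_identify([9.0, 9.0, 9.0], references, threshold=1.0)
```

In the closed-set rule some speaker is always returned, whereas the open-set rule may reject a sample that lies too far from every reference.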
Speaker recognition using Neural Network techniques is very
complex for a large number of speakers. In the present study,
three experiments were carried out to measure the
performance of speaker recognition against the complexity of
computation. In the first experiment, 100 speakers who
uttered 10,000 sentences were classified by a Neural
Network (NN) with two hidden layers, trained by back
propagation adaptive learning (scaled conjugate gradient).
For the classification of speakers, 70% of the voice data was
used for training, 15% for validation and 15% for testing.
Performance was better with fewer speakers, but complexity
increased as the number of speakers grew. To overcome this
problem, the second experiment grouped the 100 speakers
into close sets of 10 groups (10 speakers in each group);
MATLAB was used to conduct this experiment. This
procedure increased the performance of the speaker
identification system, but complexity was high. In the third
experiment, the dialects and regions of three non-native
speakers (South Indian, Punjabi and Haryanvi), who each
uttered 150 text-dependent Hindi sentences, were classified
by NN [4] with 100% performance.
2. VOICE DATABASE COLLECTION
For the first and second experiments, a corpus of 10,000 Hindi
sentences uttered by 100 speakers (each speaker uttered 100
text-independent sentences) was created. These sentences were
taken from a mobile communication database for the analysis of
the speakers’ voices. For the third experiment, a corpus of 450
text-dependent Hindi sentences was created from three
non-native speakers: 150 sentences uttered by a South Indian
speaker, 150 by a Punjabi speaker and 150 by a Haryanvi
speaker. All speakers were in the 18 to 60 years age group.
Voice data were recorded using a head-held moving-coil
microphone in a quiet room. The “PRAAT” software, at a
sampling rate of 16 kHz/16 bit, was used for the analysis of
the samples.
3. VOICE FEATURE EXTRACTION
The experiments are based on the average information about
the parameters of a one-second speech utterance by a speaker.
A total of 193 Mel Frequency Cepstral Coefficients (MFCC)
were extracted as voice features at a sampling rate of 100. The
total number of filters used was 40 (13 linear filters + 27 log
filters), and the linear spacing between the filters was 66. Five
prosody-related features, namely the average values of pitch
(f0), intensity, duration, RMS sound pressure and power, were
extracted using the PRAAT tool. MFCC values were calculated
in MATLAB [8].
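The filter layout described above (13 linearly spaced filters followed by 27 logarithmically spaced filters) can be sketched as a list of centre frequencies. Only the filter counts and the linear spacing of 66 are taken from the text. The unit (Hz), the lowest centre frequency and the logarithmic ratio between neighbouring filters are assumptions made for illustration and are not stated in this paper.

```python
NUM_LINEAR = 13         # from the text
NUM_LOG = 27            # from the text
LINEAR_SPACING = 66.0   # from the text; the unit (Hz) is an assumption
LOWEST_FREQ = 133.33    # assumption: not stated in the paper
LOG_RATIO = 1.0711703   # assumption: not stated in the paper

def filter_centres():
    """Return 40 centre frequencies: 13 linearly spaced, then 27 log spaced."""
    linear = [LOWEST_FREQ + i * LINEAR_SPACING for i in range(NUM_LINEAR)]
    # Each log-spaced centre is a fixed ratio above the previous one,
    # starting from the last linearly spaced centre.
    log = [linear[-1] * LOG_RATIO ** k for k in range(1, NUM_LOG + 1)]
    return linear + log

centres = filter_centres()
```

With these assumed values all 40 centres stay below the 8 kHz Nyquist limit of a 16 kHz recording.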
4. EXPERIMENT 1
4.1 Classification of 100 speakers using NN
A multilayer feed-forward neural network with one input layer,
one output layer and two hidden layers with 20 hidden neurons
was used. The training method was scaled conjugate gradient
descent with momentum, and the log-sigmoid transfer function
was used as the neuron transfer function for the hidden and
output layers. In this experiment, 70% of the voice data was
used for training and 15% each for validation and testing.
To begin with, speaker recognition was conducted with the
first 10 test speakers out of the 100. The classification was
80% and the performance was 0.082485 at 36 epochs with zero
error rate. When the number of speakers was increased (i.e. by
15), the classification was 55% and the performance was
0.063405 at 43 epochs with an 11.76% error rate. Beyond 25
speakers, it was very difficult to trace the confusion matrix for
classification and the error rate was very high. For the
classification of 100 speakers, the MATLAB NN tool could not
display the confusion matrix. The variation in the results as the
number of speakers increases is shown in Table 1.
Table 1: Variation in performance for the classification of 100
speakers
5. EXPERIMENT 2
5.1 Close Set Speaker Identification
To overcome the problem of recognising a large number of
speakers using the NN technique, this experiment grouped the
speakers into close sets. A close set of 10 groups covering 100
speakers (10 speakers per group) was defined as a cell array in
MATLAB. Each cell represents a single group of the close set.
An associative memory holding the voice features of all 100
speakers was stored in the MATLAB cell array in terms of the
close set [11].
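The grouping described above can be sketched as follows. This is a Python analogue of the cell array, not the MATLAB code used in the experiment: a list of dictionaries stands in for the cells, and the speaker IDs and placeholder feature vectors are hypothetical.

```python
def build_closed_sets(features_by_speaker, group_size=10):
    """Split the stored speakers into groups of `group_size`.

    Each group maps a speaker ID to that speaker's feature vector,
    playing the role of one cell of the cell array.
    """
    ids = sorted(features_by_speaker)
    return [
        {sid: features_by_speaker[sid] for sid in ids[start:start + group_size]}
        for start in range(0, len(ids), group_size)
    ]

# Hypothetical store: 100 speakers with short placeholder feature vectors
features_by_speaker = {f"spk{n:03d}": [float(n)] * 4 for n in range(1, 101)}
closed_sets = build_closed_sets(features_by_speaker)
```

Searching can then be restricted to one group at a time instead of scanning all 100 speakers at once.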
We followed the speaker identification algorithm shown in
Figure 1. This algorithm uses the close set of the associative
memory. All the voice features of the 100 speakers are held in
the associative memory [11][12]. The task is performed by
matching the extracted features of all speakers against the test
speaker’s voice features. If a match is found, the test speaker
is identified. Matching is performed in order of selection,
choosing 3, 5, 10, 15 or 20 continuous features of the test
speaker’s voice.
[Figure 1 flowchart: Speaker’s utterances -> MFCC + prosody
feature extraction -> 199 features of test speaker -> select 3,
5, 10, 15 or 20 continuous features -> match selected features
with stored patterns -> if match (Yes): speaker identified]
Figure 1: Function diagram of speaker recognition
In the first iteration, features (i, i+1, i+2) of the test speaker
are matched against the features of all speakers; if a match is
found, all the matching speakers are selected. Otherwise, the
algorithm moves to the next iteration (i+1, i+2, i+3), and so on
up to the nth iteration. The same rule is followed for 5, 10, 15
and 20 continuous features. The cell structure of the test
speaker is shown in Figure 2.
[Figure 2 diagram: the 1st iteration selects features i, i+1,
i+2; the 2nd iteration selects features i+1, i+2, i+3; ...]
Figure 2: Selection of 3 continuous features
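The iteration over continuous features can be sketched as a sliding window. The sketch assumes that a "match" means every feature in the window agrees with the stored value within a small tolerance; the paper does not state the matching criterion, so the tolerance, the function names and the toy feature vectors are assumptions.

```python
def window_matches(test, stored, start, width, tol):
    """True if `width` consecutive features starting at `start` agree within tol."""
    return all(abs(test[j] - stored[j]) <= tol for j in range(start, start + width))

def identify(test, memory, width=3, tol=1e-6):
    """Slide a window of `width` continuous features over the test vector.

    Returns (window start, matching speaker IDs) for the first window in
    which at least one stored speaker matches, or (None, []) if none does.
    """
    for start in range(len(test) - width + 1):
        hits = [sid for sid, stored in memory.items()
                if window_matches(test, stored, start, width, tol)]
        if hits:
            return start, hits
    return None, []

# Toy associative memory with 5 features per speaker (hypothetical values)
memory = {
    "A": [1, 2, 3, 4, 5],
    "B": [1, 2, 9, 4, 5],
    "C": [7, 7, 7, 4, 5],
}

three_feature_result = identify([1, 2, 3, 4, 5], memory, width=3)
two_feature_result = identify([1, 2, 3, 4, 5], memory, width=2)
```

In this toy memory a window of two features cannot separate speakers A and B, while a window of three can, which illustrates why a longer run of continuous features is more discriminating but requires more comparisons.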
We performed five runs of this experiment. In the first run, 3
continuous features of the test speaker were taken as input.
Identification performance was 52%, with reduced time
complexity because no further features had to be searched in
the associative memory to select the speakers.
In the second run (5 continuous features), performance was
67% with increased time complexity. In further runs, we
increased the number of continuous features of the test
speaker to 10, 15 and 20, and performance was 79%, 89% and
100% respectively. The best speaker identification result,
100%, was thus obtained with 20 continuous features of the
speaker’s input, but at increased time complexity. The
observations from all runs are shown in Table 2 and the
performance graph in Figure 3.
Table 2: Performance of speaker identification algorithm
Number of continuous features   3        5        10        15        20
Time complexity                 O(n^3)   O(n^5)   O(n^10)   O(n^15)   O(n^20)
Speaker identification          52%      67%      79%       89%       100%
[Figure 3 plot: time complexity and identification performance
(vertical axis 0 to 1) against the number of continuous
features (3, 5, 10, 15, 20)]
Figure 3: Performance graph
6. EXPERIMENT 3
6.1 Region and Dialect identification
One hundred and fifty text-dependent Hindi sentences recorded
by each of three non-native speakers, i.e. 450 utterances, were
used for this study. The 199 extracted features were taken as
inputs to the input layer, and the output layer had three units
representing the output categories, i.e. South Indian, Punjabi
and Haryanvi. Two hidden layers with 20 neurons were used.
The architecture of the multilayer feed-forward network for the
classification of the three non-native speakers is shown in
Figure 4.
Figure 4: Multilayer feed-forward network for the
classification of three non-native speakers
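The forward pass of such a network can be sketched in a few lines. The sketch assumes 20 neurons in each of the two hidden layers and log-sigmoid units throughout; the weights are random placeholders rather than trained values, and training by scaled conjugate gradient is not shown.

```python
import math
import random

def logsig(x):
    """Log-sigmoid transfer function."""
    return 1.0 / (1.0 + math.exp(-x))

def init_layer(n_in, n_out, rng):
    """Random placeholder weights and zero biases (not trained values)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases

def forward(x, layers):
    """Propagate a feature vector through every layer in turn."""
    for weights, biases in layers:
        x = [logsig(sum(w * v for w, v in zip(row, x)) + b)
             for row, b in zip(weights, biases)]
    return x

rng = random.Random(0)
# 199 inputs, two hidden layers (assumed 20 neurons each), 3 output units
sizes = [199, 20, 20, 3]
layers = [init_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]

labels = ["South Indian", "Punjabi", "Haryanvi"]
outputs = forward([0.5] * 199, layers)   # placeholder feature vector
predicted = labels[outputs.index(max(outputs))]
```

The predicted category is the output unit with the largest activation; with untrained weights the prediction itself carries no meaning.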
From the total voice database, 314 sentences were used for
training, 68 sentences for validation and 68 sentences for
testing. We observed that the classification performance was
100% at 97 epochs, the validation performance was 0.0035046
at 91 epochs and the error rate was zero. The best performance
was 100% because the number of speakers was very small, i.e.
three, and the database was text dependent. The MFCC +
prosody features differed for each non-native speaker
because of their dialects. The best validation performance for
region and dialect classification is shown in Figure 5.
Figure 5: Best validation performance for Region and Dialect
classification
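The split of 314, 68 and 68 sentences reported for this experiment is consistent with the 70%/15%/15% division used in the first experiment. The following sketch shows one rounding rule that reproduces these counts: the validation and test shares are rounded to the nearest integer and the remainder goes to training. The rule is an assumption; the paper does not state how the counts were obtained.

```python
def split_counts(total, val_pct=15, test_pct=15):
    """Return (train, validation, test) counts for a percentage split.

    Validation and test shares are rounded half up using integer
    arithmetic; training receives whatever remains (assumed rule).
    """
    val = (total * val_pct + 50) // 100
    test = (total * test_pct + 50) // 100
    train = total - val - test
    return train, val, test

counts_450 = split_counts(450)       # corpus of the third experiment
counts_10000 = split_counts(10000)   # corpus of the first two experiments
```

For the 10,000-sentence corpus the same rule gives 7,000 training, 1,500 validation and 1,500 test sentences.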
CONCLUSIONS
The first experiment shows that speaker classification using a
Neural Network gives better results with fewer speakers, and
that performance decreases as the number of speakers
increases: execution either takes too much time or the system
may not respond. For better results, experiment 2 was
performed with close sets of speakers. This produced better
results, but in a more complex manner. In the third experiment,
dialect and region identification performance was 100%,
because only 3 non-native speakers had to be classified, which
is very few, and the voice database was text dependent. Future
improvements can be made as follows:
I. Define separate close sets for male and female
speakers.
II. Define clusters based on the nearest distance of voice
features from the neurons, so that similar voices can be
placed inside the same cluster.
III. Use text-dependent sentences for better results, as
discussed in experiment 3.
With clusters defined in this way, speaker recognition will be
more accurate and performance will increase with less time
complexity, because similar voices are placed inside the same
group, which avoids further searching of speakers. In our
experiment we designed the associative memory for voice
features. In earlier work, associative memory was used for
pattern segmentation and for large vocabulary continuous
speech recognition [11][12].
ACKNOWLEDGEMENTS
We would like to thank our team members Ms. Shweta Sinha,
Shipra Arora and Minakshi Mahajan for their valuable
suggestions and support. Our thanks are due to Mr. A.K.
Dhamija for help in recording as well as maintaining the
speech data. Our thanks are also extended to the supporting
staff, Mr. Gautam Kumar, for his continuous effort in the
maintenance and development of the text and speech corpora.
Our very special thanks go to the KIIT management, Dr. Harsh
Vardhan Kamrah and Ms. Neelima Kamrah, for providing
constant encouragement, support and an environment
conducive to this work.
REFERENCES
[1] L.R. Rabiner and B.H. Juang, “Fundamentals of Speech
Recognition”, Prentice Hall, Englewood Cliffs, N.J., 1993.
[2] L.R. Rabiner and R.W. Schafer, “Digital Processing of
Speech Signals”, Prentice Hall, Englewood Cliffs, N.J., 1978.
[3] S. Furui, “An overview of speaker recognition
technology”, ESCA Workshop on Automatic Speaker
Recognition, Identification and Verification, pp. 1-9, 1994.
[4] B. Atal, “Automatic recognition of speakers from their
voices”, Proc. IEEE, vol. 64, pp. 460-475, April 1976.
[5] S.S. Agrawal, “Emotion in Hindi speech analysis,
perception and recognition”, Oriental COCOSDA 2012.
[6] Samudravijaya K, “Speech and Speaker Recognition: A
tutorial”, Proc. Int. Workshop on Tech. Development in
Indian Languages, Kolkata, Jan 22-24, 2003.
[7] Samudravijaya K and Anshu Gupta, “Text prompted
speaker verification”, J. Acoust. Soc. India, vol. 30,
pp. 214-217, 2002.
[8] Vibha Tiwari, “MFCC & its application in speaker
recognition”, International Journal on Emerging
Technologies 1(1): 19-22 (2010).
[9] Ran D. Zilca, “Text independent speaker verification
using covariance modeling”, IEEE Signal Processing
Letters, vol. 8, pp. 97-99, 2001.
[10] M. Inal and Y.S. Fatihoglu, “Self organizing map and
associative memory model hybrid classifier for speaker
recognition”, 6th Seminar on Neural Network Applications
in Electrical Engineering, Neurel 2002 714 (2002).
[11] DeLiang Wang and Joachim Buhmann, “Pattern
Segmentation in Associative Memory”, Massachusetts
Institute of Technology, Neural Computation 2, 94-106
(1990).
[12] Z.K. Kayikci, “Large vocabulary continuous speech
recognition using associative memory and hidden Markov
model”, Proceedings of SSIP'08, the 8th Conference on
Signal, Speech and Image Processing, 2008.

More Related Content

What's hot

Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
IJERA Editor
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
ijceronline
 
FYPReport
FYPReportFYPReport
FYPReport
David Ferris
 
Dy36749754
Dy36749754Dy36749754
Dy36749754
IJERA Editor
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCC
Hira Shaukat
 
A Survey on Speaker Recognition System
A Survey on Speaker Recognition SystemA Survey on Speaker Recognition System
A Survey on Speaker Recognition System
Vani011
 
Mood classification of songs based on lyrics
Mood classification of songs based on lyricsMood classification of songs based on lyrics
Mood classification of songs based on lyrics
Francesco Cucari
 
ASA 09 Poster-portlandOR-051209
ASA 09 Poster-portlandOR-051209ASA 09 Poster-portlandOR-051209
ASA 09 Poster-portlandOR-051209
Sangsook Choi
 
Investigation of Text-to-Speech based Synthetic Parallel Data for Sequence-to...
Investigation of Text-to-Speech based Synthetic Parallel Data for Sequence-to...Investigation of Text-to-Speech based Synthetic Parallel Data for Sequence-to...
Investigation of Text-to-Speech based Synthetic Parallel Data for Sequence-to...
NU_I_TODALAB
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal Processing
Murtadha Alsabbagh
 
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
TELKOMNIKA JOURNAL
 
Presentation劉思竹v4.2 10122608
Presentation劉思竹v4.2 10122608Presentation劉思竹v4.2 10122608
Presentation劉思竹v4.2 10122608
思竹 劉
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law companding
iosrjce
 
Devoxx traitement automatique du langage sur du texte en 2019
Devoxx   traitement automatique du langage sur du texte en 2019 Devoxx   traitement automatique du langage sur du texte en 2019
Devoxx traitement automatique du langage sur du texte en 2019
Alexis Agahi
 
Speaker recognition system by abhishek mahajan
Speaker recognition system by abhishek mahajanSpeaker recognition system by abhishek mahajan
Speaker recognition system by abhishek mahajan
Abhishek Mahajan
 
E0502 01 2327
E0502 01 2327E0502 01 2327
E0502 01 2327
IJMER
 
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATIONAN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
cscpconf
 
19 ijcse-01227
19 ijcse-0122719 ijcse-01227
19 ijcse-01227
Shivlal Mewada
 
The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022
NU_I_TODALAB
 
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKPUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
ijistjournal
 

What's hot (20)

Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
 
FYPReport
FYPReportFYPReport
FYPReport
 
Dy36749754
Dy36749754Dy36749754
Dy36749754
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCC
 
A Survey on Speaker Recognition System
A Survey on Speaker Recognition SystemA Survey on Speaker Recognition System
A Survey on Speaker Recognition System
 
Mood classification of songs based on lyrics
Mood classification of songs based on lyricsMood classification of songs based on lyrics
Mood classification of songs based on lyrics
 
ASA 09 Poster-portlandOR-051209
ASA 09 Poster-portlandOR-051209ASA 09 Poster-portlandOR-051209
ASA 09 Poster-portlandOR-051209
 
Investigation of Text-to-Speech based Synthetic Parallel Data for Sequence-to...
Investigation of Text-to-Speech based Synthetic Parallel Data for Sequence-to...Investigation of Text-to-Speech based Synthetic Parallel Data for Sequence-to...
Investigation of Text-to-Speech based Synthetic Parallel Data for Sequence-to...
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal Processing
 
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
Comparison of Feature Extraction MFCC and LPC in Automatic Speech Recognition...
 
Presentation劉思竹v4.2 10122608
Presentation劉思竹v4.2 10122608Presentation劉思竹v4.2 10122608
Presentation劉思竹v4.2 10122608
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law companding
 
Devoxx traitement automatique du langage sur du texte en 2019
Devoxx   traitement automatique du langage sur du texte en 2019 Devoxx   traitement automatique du langage sur du texte en 2019
Devoxx traitement automatique du langage sur du texte en 2019
 
Speaker recognition system by abhishek mahajan
Speaker recognition system by abhishek mahajanSpeaker recognition system by abhishek mahajan
Speaker recognition system by abhishek mahajan
 
E0502 01 2327
E0502 01 2327E0502 01 2327
E0502 01 2327
 
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATIONAN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
 
19 ijcse-01227
19 ijcse-0122719 ijcse-01227
19 ijcse-01227
 
The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022
 
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKPUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
 

Viewers also liked

Phytotoxicity analysis of various plants using industrial sludge
Phytotoxicity analysis of various plants using industrial sludgePhytotoxicity analysis of various plants using industrial sludge
Phytotoxicity analysis of various plants using industrial sludge
eSAT Publishing House
 
Analysis of river flow data to develop stage discharge relationship
Analysis of river flow data to develop stage discharge relationshipAnalysis of river flow data to develop stage discharge relationship
Analysis of river flow data to develop stage discharge relationship
eSAT Publishing House
 
Location updation for energy efficient geographic routing in
Location updation for energy efficient geographic routing inLocation updation for energy efficient geographic routing in
Location updation for energy efficient geographic routing in
eSAT Publishing House
 
Scour investigation around single and two piers
Scour investigation around single and two piersScour investigation around single and two piers
Scour investigation around single and two piers
eSAT Publishing House
 
Secure file sharing of dynamic audit services in cloud storage
Secure file sharing of dynamic audit services in cloud storageSecure file sharing of dynamic audit services in cloud storage
Secure file sharing of dynamic audit services in cloud storage
eSAT Publishing House
 
Pilot aided scheduling for uplink ofdma
Pilot aided scheduling for uplink ofdmaPilot aided scheduling for uplink ofdma
Pilot aided scheduling for uplink ofdma
eSAT Publishing House
 
Radiation effects on heat and mass transfer of a mhd
Radiation effects on heat and mass transfer of a mhdRadiation effects on heat and mass transfer of a mhd
Radiation effects on heat and mass transfer of a mhd
eSAT Publishing House
 
A novel token based approach towards packet loss
A novel token based approach towards packet lossA novel token based approach towards packet loss
A novel token based approach towards packet loss
eSAT Publishing House
 
Growth and physical properties of pure and manganese doped strontium tartrate...
Growth and physical properties of pure and manganese doped strontium tartrate...Growth and physical properties of pure and manganese doped strontium tartrate...
Growth and physical properties of pure and manganese doped strontium tartrate...
eSAT Publishing House
 
Response sensitivity of the structure using vibration
Response sensitivity of the structure using vibrationResponse sensitivity of the structure using vibration
Response sensitivity of the structure using vibration
eSAT Publishing House
 
Classification of appendicitis based on ultrasound image
Classification of appendicitis based on ultrasound imageClassification of appendicitis based on ultrasound image
Classification of appendicitis based on ultrasound image
eSAT Publishing House
 
Survey on sensor protocol for information via negotiation (spin) protocol
Survey on sensor protocol for information via negotiation (spin) protocolSurvey on sensor protocol for information via negotiation (spin) protocol
Survey on sensor protocol for information via negotiation (spin) protocol
eSAT Publishing House
 
Static analysis of c s short cylindrical shell under internal liquid pressure...
Static analysis of c s short cylindrical shell under internal liquid pressure...Static analysis of c s short cylindrical shell under internal liquid pressure...
Static analysis of c s short cylindrical shell under internal liquid pressure...
eSAT Publishing House
 
Partial encryption of compresed video
Partial encryption of compresed videoPartial encryption of compresed video
Partial encryption of compresed video
eSAT Publishing House
 
A novel approach for efficient skull stripping using
A novel approach for efficient skull stripping usingA novel approach for efficient skull stripping using
A novel approach for efficient skull stripping using
eSAT Publishing House
 
Seismic behavior of intake tower of dam
Seismic behavior of intake tower of damSeismic behavior of intake tower of dam
Seismic behavior of intake tower of dam
eSAT Publishing House
 
Microwave dehydrator an environmental friendly step toward improving microwav...
Microwave dehydrator an environmental friendly step toward improving microwav...Microwave dehydrator an environmental friendly step toward improving microwav...
Microwave dehydrator an environmental friendly step toward improving microwav...
eSAT Publishing House
 
Multisensor data fusion based autonomous mobile
Multisensor data fusion based autonomous mobileMultisensor data fusion based autonomous mobile
Multisensor data fusion based autonomous mobile
eSAT Publishing House
 
Study and comparative analysis of resonat frequency for microsrtip fractal an...
Study and comparative analysis of resonat frequency for microsrtip fractal an...Study and comparative analysis of resonat frequency for microsrtip fractal an...
Study and comparative analysis of resonat frequency for microsrtip fractal an...
eSAT Publishing House
 
The effect of fiber orientation and laminate stacking
The effect of fiber orientation and laminate stackingThe effect of fiber orientation and laminate stacking
The effect of fiber orientation and laminate stacking
eSAT Publishing House
 

Viewers also liked (20)

Phytotoxicity analysis of various plants using industrial sludge
Phytotoxicity analysis of various plants using industrial sludgePhytotoxicity analysis of various plants using industrial sludge
Phytotoxicity analysis of various plants using industrial sludge
 
Analysis of river flow data to develop stage discharge relationship
Analysis of river flow data to develop stage discharge relationshipAnalysis of river flow data to develop stage discharge relationship
Analysis of river flow data to develop stage discharge relationship
 
Location updation for energy efficient geographic routing in
Location updation for energy efficient geographic routing inLocation updation for energy efficient geographic routing in
Location updation for energy efficient geographic routing in
 
Scour investigation around single and two piers
Scour investigation around single and two piersScour investigation around single and two piers
Scour investigation around single and two piers
 
Secure file sharing of dynamic audit services in cloud storage
Secure file sharing of dynamic audit services in cloud storageSecure file sharing of dynamic audit services in cloud storage
Secure file sharing of dynamic audit services in cloud storage
 
Pilot aided scheduling for uplink ofdma
Pilot aided scheduling for uplink ofdmaPilot aided scheduling for uplink ofdma
Pilot aided scheduling for uplink ofdma
 
Speaker identification system using close set

  • 1. IJRET: International Journal of Research in Engineering and Technology, ISSN: 2319-1163
Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org

SPEAKER IDENTIFICATION SYSTEM USING CLOSE SET
Shweta Bansal1, Ankur Hooda2, Anima3
1, 2, 3 ECE, KIIT College of Engineering, Gurgaon, bansalshwe@gmail.com, ankur.hooda45@gmail.com, animadahiya806@gmail.com

Abstract
This paper describes experiments conducted to evaluate the performance of speaker recognition. The experiments, conducted using a Neural Network, show that the complexity of speaker recognition increases as the number of speakers to be identified grows in the text-independent situation. In the first experiment the error rate was zero for the classification of 10 speakers and performance was good. As the number of speakers increased, the error rate rose and classification performance became poor. Beyond 25 speakers the error rate was very high. For the classification of 100 speakers, the MATLAB NN tool could not display the confusion matrix. To overcome this problem a second experiment was carried out, in which a close set of 10 groups covering 100 speakers (10 speakers per group) was defined as a cell array in MATLAB. The best speaker identification result, 100%, was obtained with 20 continuous features of the speaker's voice, at the cost of increased time complexity. In the third experiment the speaker's dialect and region were also identified; classification performance was 100% at 97 epochs, validation performance was 0.0035046 at 91 epochs, and the error rate was zero.
Index Terms: text dependent, text independent, speaker identification, Neural Network, close set, MFCC

1. INTRODUCTION
Speaker recognition can be performed under text-dependent or text-independent conditions. In addition, there are two types of speaker identification: closed-set and open-set. In closed-set identification, the sample of the test speaker is compared against all available speaker samples, and the speaker ID of the sample with the closest match is returned. In open-set identification, the test speaker's sample is compared with a large number of reference samples, which may or may not contain the test sample. In this setting the system verifies that a given speaker is who he or she claims to be: if the match meets the set threshold, the identity claim is accepted; otherwise it is rejected. The identification strategy is thus based on a threshold value for accepting or rejecting the speaker [1][2][3].
Speaker recognition using Neural Network techniques is very complex for a large number of speakers. In the present study three experiments were carried out to assess speaker recognition performance against computational complexity. In the first experiment, the classification of 100 speakers who uttered 10,000 sentences was studied with a Neural Network (NN) trained by back propagation with adaptive learning (scaled conjugate gradient) and two hidden layers. For speaker classification, 70% of the voice data was used for training, 15% for validation and 15% for testing. Performance was better with fewer speakers, but complexity increased with the number of speakers. To overcome this problem, the second experiment grouped the speakers into close sets: 10 groups for 100 speakers (10 speakers in each group). MATLAB was used to conduct this experiment. This procedure increased the performance of the speaker identification system, but complexity was high.
In the third experiment, the dialect and region of three non-native speakers (South Indian, Punjabi and Haryanvi), who uttered 150 text-dependent Hindi sentences each, were classified by the NN [4] with 100% performance.

2. VOICE DATABASE COLLECTION
For the first and second experiments, a corpus of 10,000 Hindi sentences uttered by 100 speakers (each speaker uttered 100 text-independent sentences) was created. These sentences were taken from a mobile communication database for the analysis of speakers' voices. For the third experiment, a corpus of 450 text-dependent Hindi sentences was recorded from three non-native speakers: 150 sentences uttered by a South Indian speaker, 150 by a Punjabi speaker and 150 by a Haryanvi speaker. All speakers were between 18 and 60 years of age. Voice data were recorded using a head-held moving-coil microphone in a quiet room. The “PRAAT” software, at a sampling rate of 16 kHz/16 bit, was used for the analysis of the samples.
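The closed-set and open-set decisions described in the introduction can be illustrated with a minimal sketch. It assumes Euclidean distance as the matching measure (the paper does not name one); the speaker IDs, the three-dimensional feature vectors and the threshold value are all hypothetical.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def identify_closed_set(test, enrolled):
    """Closed set: always return the enrolled speaker with the closest match."""
    return min(enrolled, key=lambda sid: euclidean(test, enrolled[sid]))

def identify_open_set(test, enrolled, threshold):
    """Open set: accept the closest speaker only if it is within the threshold."""
    best = identify_closed_set(test, enrolled)
    return best if euclidean(test, enrolled[best]) <= threshold else None

# Hypothetical feature vectors for three enrolled speakers (assumption).
enrolled = {
    "spk01": [1.0, 2.0, 3.0],
    "spk02": [4.0, 0.0, 1.0],
    "spk03": [0.0, 5.0, 5.0],
}

known = [1.1, 2.1, 2.9]    # close to spk01
unknown = [9.0, 9.0, 9.0]  # far from every enrolled speaker

print(identify_closed_set(known, enrolled))       # nearest enrolled speaker
print(identify_closed_set(unknown, enrolled))     # closed set still picks someone
print(identify_open_set(unknown, enrolled, 1.0))  # open set rejects: None
```

The closed-set function has no way to say "none of the above", which is why it suits a fixed, fully enrolled population such as the 100 speakers used here.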
  • 2.
3. VOICE FEATURE EXTRACTION
The experiments are based on the average information about the parameters of a one-second speech utterance by a speaker. 193 Mel Frequency Cepstral Coefficients (MFCC) were extracted as voice features at a sampling rate of 100. A total of 40 filters were used (13 linear filters + 27 log filters), and the linear spacing between the filters was 66. Five prosody-related features, namely the average values of pitch (f0), intensity, duration, RMS sound pressure and power, were extracted using the PRAAT tool. MFCC values were calculated in MATLAB [8].

4. EXPERIMENT 1
4.1 Classification of 100 speakers using NN
A multilayer feed-forward neural network with one input layer, one output layer and two hidden layers with 20 hidden neurons was used. The training method was scaled conjugate gradient descent with momentum, and the log-sigmoid transfer function was used for both the hidden layers and the output layer. In this experiment 70% of the voice data was used for training and 15% each for validation and testing. To begin with, speaker recognition was conducted with the first 10 test speakers out of the 100. The classification was 80% and the performance was 0.082485 at 36 epochs, with zero error rate. When the number of speakers was increased by 15, the classification was 55% and the performance was 0.063405 at 43 epochs, with an 11.76% error rate. Beyond 25 speakers it was very difficult to trace the confusion matrix for classification, and the error rate was very high. For the classification of 100 speakers, the MATLAB NN tool could not display the confusion matrix.
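The network structure of Section 4.1 can be sketched as a plain forward pass. This is an illustration only, not the MATLAB NN tool: it assumes 20 neurons in each of the two hidden layers, takes the 199-feature input size quoted elsewhere in the paper, and uses random untrained weights; scaled conjugate gradient training is not shown, and all names are hypothetical.

```python
import math
import random

def logsig(x):
    """Log-sigmoid transfer function: 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))

def make_layer(n_in, n_out, rng):
    """Random weights and biases for one fully connected layer (untrained)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [rng.uniform(-0.1, 0.1) for _ in range(n_out)]
    return weights, biases

def forward(x, layers):
    """Propagate x through every layer, applying logsig after each one."""
    for weights, biases in layers:
        x = [logsig(sum(w * v for w, v in zip(row, x)) + b)
             for row, b in zip(weights, biases)]
    return x

rng = random.Random(0)
N_FEATURES = 199  # feature count quoted in the paper
N_HIDDEN = 20     # assumption: 20 neurons in each of the two hidden layers
N_SPEAKERS = 10   # the first 10 test speakers

sizes = [N_FEATURES, N_HIDDEN, N_HIDDEN, N_SPEAKERS]
layers = [make_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]

features = [rng.random() for _ in range(N_FEATURES)]  # placeholder input vector
scores = forward(features, layers)
predicted = max(range(N_SPEAKERS), key=lambda k: scores[k])
print(len(scores), predicted)
```

The output layer has one unit per speaker, so the weight count of the last layer grows linearly with the number of speakers to be separated.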
The variation in the results as the number of speakers increases is shown in Table 1.
Table 1: Variations in performance of 100 speakers' classification

5. EXPERIMENT 2
5.1 Close Set Speaker Identification
To overcome the problem of recognising a large number of speakers with the NN technique, this experiment grouped the speakers into close sets. A close set of 10 groups covering 100 speakers (10 speakers per group) was defined as a cell array in MATLAB. Each cell represents a single group of the close set. An associative memory holding the voice features of all 100 speakers was implemented as a MATLAB cell array in terms of close sets [11]. We followed the speaker identification algorithm shown in Figure 1. This algorithm uses the close set of associative memory, in which the voice features of all 100 speakers are stored [11][12]. The extracted features of all stored speakers are matched against the test speaker's voice features; if a match is found, the test speaker is identified. Matching is performed on a selection of 3, 5, 10, 15 or 20 continuous features of the test speaker's voice.

Figure 1: Function diagram of speaker recognition (speaker's utterances → MFCC + prosody feature extraction → 199 features of test speaker → select 3, 5, 10, 15 or 20 continuous features → match selected features with stored patterns → if match, speaker identified)
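The close-set storage described in Section 5.1 can be pictured with a small Python analogue of the MATLAB cell array. This is a sketch under assumptions: each cell is modelled as a dictionary of speaker ID to feature vector, the speaker IDs are invented, and the feature values are random placeholders rather than real MFCC and prosody measurements.

```python
import random

N_GROUPS = 10      # 10 close-set groups
GROUP_SIZE = 10    # 10 speakers per group
N_FEATURES = 199   # features stored per speaker

rng = random.Random(1)

def fake_features():
    """Placeholder feature vector; real values would come from MFCC + prosody."""
    return [round(rng.uniform(-1.0, 1.0), 3) for _ in range(N_FEATURES)]

# close_sets[g] plays the role of one cell: a dict of speaker ID -> features.
close_sets = [
    {f"spk{g * GROUP_SIZE + s + 1:03d}": fake_features() for s in range(GROUP_SIZE)}
    for g in range(N_GROUPS)
]

def find_group(speaker_id):
    """Return the index of the close set that holds speaker_id, or None."""
    for g, group in enumerate(close_sets):
        if speaker_id in group:
            return g
    return None

print(len(close_sets), sum(len(group) for group in close_sets))
print(find_group("spk001"), find_group("spk057"), find_group("spk100"))
```

Keeping each group in its own container means a search can be restricted to one group of 10 speakers at a time instead of scanning all 100.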
  • 3.
In the first iteration, features (i, i+1, i+2) of the test speaker are matched against the features of all speakers; if a match is found, all matching speakers are selected. Otherwise the search moves to the next iteration (i+1, i+2, i+3), and so on up to the nth iteration. The same rule applies for 5, 10, 15 and 20 continuous features. The cell structure of the test speaker is shown in Figure 2.

Figure 2: Selection of 3 continuous features (1st iteration: i, i+1, i+2; 2nd iteration: i+1, i+2, i+3; …)

We performed five experiments. In the first, 3 continuous features of the test speaker were taken as input. Identification performance was 52%, with reduced time complexity because fewer features had to be searched in the associative memory to select the speakers. In the second (5 continuous features) the performance was 67%, with increased time complexity. In the further experiments the number of continuous features of the test speaker was increased to 10, 15 and 20, and the performance was 79%, 89% and 100% respectively. The best speaker identification result, 100%, was obtained with 20 continuous features of the speaker's input, but at the cost of increased time complexity.
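The sliding selection of continuous features can be sketched as follows. The paper does not state how a "match" is decided, so this sketch assumes that two features match when they differ by no more than a tolerance `tol`; the six-feature vectors, the tolerance and the function names are hypothetical.

```python
def window_matches(test, stored, start, k, tol):
    """True if k consecutive features starting at `start` agree within tol."""
    return all(abs(test[j] - stored[j]) <= tol for j in range(start, start + k))

def identify(test, memory, k, tol=0.05):
    """Slide a window of k continuous features over the test vector.

    Returns (start, speakers) for the first window position at which at
    least one stored speaker matches, or (None, []) if no window matches.
    """
    for start in range(len(test) - k + 1):
        hits = [sid for sid, stored in memory.items()
                if window_matches(test, stored, start, k, tol)]
        if hits:
            return start, hits
    return None, []

# Hypothetical 6-feature vectors standing in for the 199 stored features.
memory = {
    "spk01": [0.10, 0.20, 0.30, 0.40, 0.50, 0.60],
    "spk02": [0.10, 0.20, 0.30, 0.90, 0.80, 0.70],
    "spk03": [0.70, 0.10, 0.90, 0.20, 0.40, 0.30],
}
test = [0.11, 0.21, 0.29, 0.41, 0.52, 0.61]  # noisy copy of spk01

print(identify(test, memory, k=3))  # (0, ['spk01', 'spk02']): still ambiguous
print(identify(test, memory, k=5))  # (0, ['spk01']): one speaker left
```

In this toy data a window of 3 features cannot separate two speakers who share their first three values, while a window of 5 can, which mirrors the reported trend of identification improving as more continuous features are compared.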
The observations from all the experiments are shown in Table 2, and the performance graph in Figure 3.

Table 2: Performance of speaker identification algorithm
Number of continuous features    3        5        10        15        20
Time complexity                  O(n^3)   O(n^5)   O(n^10)   O(n^15)   O(n^20)
Speaker identification           52%      67%      79%       89%       100%

Figure 3: Performance graph (time complexity and identification performance, on a 0 to 1 scale, against 3, 5, 10, 15 and 20 continuous features)

6. EXPERIMENT 3
6.1 Region and Dialect identification
One hundred and fifty text-dependent Hindi sentences recorded by each of three non-native speakers, i.e. 450 utterances, were used for this study. The 199 extracted features were taken as inputs to a network with one input layer and one output layer of three units representing the output categories, i.e. South Indian, Punjabi and Haryanvi. Two hidden layers with 20 neurons were used. The architecture of the multilayer feed-forward network for the classification of the three non-native speakers is shown in Figure 4.

Figure 4: Multilayer feed forward network for three non-native speakers' classification

Of the total voice database, 314 sentences were used for training, 68 for validation and 68 for testing. The classification performance was 100% at 97 epochs, the validation performance was 0.0035046 at 91 epochs, and the error rate was zero. The best performance reached 100% because the number of speakers was very small (three) and the database was text-dependent. The MFCC + prosody features differed for each non-native speaker because of their dialects.
  • 4.
The best validation performance for region and dialect classification is shown in Figure 5.

Figure 5: Best validation performance for Region and Dialect classification

CONCLUSIONS
In the first experiment, speaker classification using a Neural Network gave better results with fewer speakers, but performance decreased as the number of speakers increased: execution either takes too much time or the system may not respond. For better results, Experiment 2 was carried out with close sets of speakers. This produced better results, but in a more complex manner. In the third experiment, dialect and region identification performance was 100%, because only three non-native speakers had to be classified and the voice database was text-dependent. Future improvements could be made as follows:
I. Define separate close sets for male and female speakers.
II. Define clusters based on the nearest distance of voice features from the neurons, so that similar speakers can be placed inside the same cluster.
III. Use text-dependent sentences for better results, as discussed in Experiment 3.
With clusters defined in this way, speaker recognition should be more accurate and performance should increase with less time complexity, because similar speakers' voices are placed inside the same group, which avoids searching through more speakers. In our experiment we designed the associative memory for voice features; in earlier work, associative memory was used for pattern segmentation and for large-vocabulary continuous speech recognition [11][12].
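The clustering idea proposed under future improvements can be sketched as a two-stage search: pick the nearest cluster first, then search only the speakers inside it. This is an illustration of the proposal, not something the experiments implemented; it assumes Euclidean distance and cluster centroids computed as feature means, and the cluster names, speaker IDs and two-dimensional features are hypothetical.

```python
import math

def distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def centroid(vectors):
    """Component-wise mean of a list of feature vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def nearest(test, candidates):
    """Key of the candidate vector closest to the test vector."""
    return min(candidates, key=lambda key: distance(test, candidates[key]))

# Hypothetical clusters of similar speakers (2-dimensional features).
clusters = {
    "cluster_a": {"spk01": [1.0, 1.0], "spk02": [1.2, 0.8]},
    "cluster_b": {"spk03": [5.0, 5.0], "spk04": [5.5, 4.5]},
}
centroids = {name: centroid(list(members.values()))
             for name, members in clusters.items()}

def identify(test):
    """Choose the nearest cluster, then search only inside that cluster."""
    name = nearest(test, centroids)
    return name, nearest(test, clusters[name])

print(identify([5.4, 4.6]))
```

With 100 speakers in 10 clusters of 10, such a search would compare the test vector with 10 centroids and then 10 speakers rather than with all 100 speakers, which is the reduction in searching that the proposal aims for.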
ACKNOWLEDGEMENTS
We would like to thank our team members Ms. Shweta Sinha, Shipra Arora and Minakshi Mahajan for their valuable suggestions and support. Our thanks are due to Mr. A.K. Dhamija for help in recording and maintaining the speech data, and to the supporting staff, Mr. Gautam Kumar, for his continuous effort in the maintenance and development of the text and speech corpora. Our very special thanks go to the KIIT management, Dr. Harsh Vardhan Kamrah and Ms. Neelima Kamrah, for their constant encouragement and support and for providing an environment conducive to this work.

REFERENCES
[1] L.R. Rabiner and B.H. Juang, “Fundamentals of Speech Recognition”, Prentice Hall, Englewood Cliffs, N.J., 1993.
[2] L.R. Rabiner and R.W. Schafer, “Digital Processing of Speech Signals”, Prentice Hall, Englewood Cliffs, N.J., 1978.
[3] Furui, “An overview of speaker recognition technology”, ESCA Workshop on Automatic Speaker Recognition, Identification and Verification, pp. 1-9, 1994.
[4] B. Atal, “Automatic recognition of speakers from their voices”, Proc. IEEE, vol. 64, pp. 460-475, April 1976.
[5] S.S. Agrawal, “Emotion in Hindi speech analysis, perception and recognition”, Oriental COCOSDA 2012.
[6] Samudravijaya K, “Speech and Speaker Recognition: A tutorial”, Proc. Int. Workshop on Tech. Development in Indian Languages, Kolkata, Jan 22-24, 2003.
[7] Samudravijaya K and Anshu Gupta, “Text prompted speaker verification”, J. Acoust. Soc. India, vol. 30, pp. 214-217, 2002.
[8] Vibha Tiwari, “MFCC & its application in speaker recognition”, International Journal on Emerging Technologies 1(1): 19-22 (2010).
[9] Ran D. Zilca, “Text independent speaker verification using covariance modeling”, IEEE Signal Processing Letters, vol. 8, pp. 97-99, 2001.
[10] M. Inal and Y.S. Fatihoglu, “Self organizing map and associative memory model hybrid classifier for speaker recognition”, 6th Seminar on Neural Network Applications in Electrical Engineering, NEUREL 2002, 714 (2002).
[11] DeLiang Wang and Joachim Buhmann, “Pattern Segmentation in Associative Memory”, Massachusetts Institute of Technology, Neural Computation 2, 94-106 (1990).
[12] Z.K. Kayikci, “Large vocabulary continuous speech recognition using associative memory and hidden Markov model”, Proceedings of the 8th Conference on Signal, Speech and Image Processing (SSIP'08), 2008.