SlideShare a Scribd company logo
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE)
e-ISSN: 2278-2834,p- ISSN: 2278-8735. Volume 7, Issue 5 (Sep. - Oct. 2013), PP 22-27
www.iosrjournals.org
www.iosrjournals.org 22 | Page
Text Independent Speaker Identification Using Imfcc Integrated
With Ica
P.Suryakumari1
, N M Ramaligeswararao2
, K.Jyothi3
, Dr.V.Sailaja4
1,2,3,4,
(ECE, Godavari Institute of Engineering & Technology/ Jntuk, India)
Abstract: Over the years, more research work has been reported in literature regarding text independent
speaker identification using MFC coefficients. MFCC is one of the best methods modeled on human auditory
system. Murali et al (2011) [1] has developed a Text independent speaker identification using MFC coefficients
which follows Generalized Gaussian mixer model. MFCC, because of its filter bank structure it captures the
characteristics of information more effectively in lower frequency region than higher region, because of this,
valuable information in high frequency region may be lost. In this paper we rectify the above problem by
retrieving the information in high frequency region by inverting the Mel bank structure. The dimensionality and
dependency of above features were reduced by integrating with ICA. Here Text Independent Speaker
Identification system is developed by using Generalized Gaussian Mixer Model .By the experimentation, it was
observed that this model outperforms the earlier existing models.
Keywords: Independent Component Analysis; Generalized Gaussian Mixer Model; Inverted Mel frequency
cepstral coefficients; Bayesian classifier; EM algorithm.
I. Introduction:
From the past knowledge it is evident that physiological structure of a vocal tract is different for
different persons, each one having different features. Based on these features, we can differentiate one persons
voice from others(Ben gold (2002)[2]). The Mel frequency cepstral coefficients features was first proposed for
speech recognition by S.B.Davis and P.Mermelstein . Over the decades though it was most widely used, there
was a drawback in its filter bank structure, which is able to capture the information in low frequency regions
than higher frequency regions.As the information contained in the higher frequency regions may be missed,
proportionally the average efficiency of identification of speaker may be reduced. To overcome the this problem
Mr.S.Chakraborty (2007) [16] proposed a flipped MFCC filter bank using Inverted Mel scale by considering the
High frequency region coefficients.
Here the main objective is to capture the information which has been missed by the MFCC by using a
new filter bank structure when it is obtained by flipping or inverting the MFCC filters bank structure
(s.chakroborty(2007))[16]. In this feature extraction only few inverted Mel cepstral coefficients are considered
and the remaining coefficients are dropped as insignificant due to high dimensionality problems. Ignoring the
dependences and dropping some of the Mel cepstral coefficients may lead to falsification of model. It is needed
to reduce the dimensionality and avoid dependence among others to having a robust model for speakers. This
can be achieved by integrating the Inverted Mel frequency cepstral coefficients of each speaker’s speech spectra
with Independent component analysis (ICA).The integration of ICA with Inverted Mel frequency cepstral
coefficients will make the feature vector more robust.
The main aim of the ICA is to find out linear representation of non-Gaussian data so that the
components are statistically independent. The required assumption in applying ICA is that the observed signals
are linearly mixed. It helps in capturing some of the essential features of data in many applications
including Automatic Speaker
Recognition Systems for high dimensional multivariate analysis (Hyvarinen(2001))[3]. ICA is a
powerful tool to separate mixed signals mainly find in speech analysis results in computational and conceptual
simplicity.
Hence in this paper a text independent speaker identification method is developed and analyzed by
integrating independent component analysis with Inverted Mel frequency cepstarl coefficient and using
Generalized Gaussian Mixer model. The procedure for extracting the feature vectors of the speaker speech
spectra using inverted Mel frequency cepstral coefficients and independent component analysis is covered in
Section II. Assuming that each individual speech spectra in feature vector follows a Generalized Gaussian Mixer
Model and the estimation of the model parameters is obtained by using EM algorithm. The speaker
identification algorithm under Bayesian frame work is presented in the section IV.
Text Independent Speaker Identification Using Imfcc Integrated With Ica
www.iosrjournals.org 23 | Page
II. Feature Vectors Extraction
In this section we describe the feature vector extraction of each individual speech spectra. Since a long
time MFCC is considered as a reliable front end for speaker identification because it has coefficients that related
to psychophysical studies human perception of the frequency content of sounds in a nonlinear scale called Mel
scale (Ding(2001))[4]. This can be mathematically expressed as
fmel=2595 log[1+(f/700)] (1)
Where fmel is the subjective pitch in Mel’s corresponding to f in Hz.
But Mel cepstral coefficients are obtained by capturing lower frequency regions and neglecting higher regions.
This leads to loss of more information from the recorded data base. By considering high frequency regions by
using flipping or inverting Me3l bank structure (S.chakroborty (2007)) [5], we retrieve more information.
The flip of the MFCC Mel bank is expressed as
𝜳𝒊(k) = ΨQ+1-i (Ms/2 +1-k) (2)
𝛹𝑖(k) is the inverted Mel scale filter response while Ψi(k) is MFCC filter bank response and Q is the number of
filters in the bank. Here inverted Mel-scale is defined as follows
𝒇mel (f)=2195.2860-2595 log10(1+(4031.25-f/700)) (3)
𝑓mel (f) is subjective pitch in the new scale corresponding to f, actual frequency in Hz.
The block diagram of Inverted Mel frequency cepstral coefficients is shown in below figure 1.From the
block diagram of IMFCC the frequency bands are positioned logarithmically (on Inverted Mel scale) by inverted
Mel filter bank structure, which approximated the human auditory systems response (Ruche chaudhary (2012)
[6]). In this IMFCC captures high frequency bands of the Inverted Mel scale, getting more efficiency than
MFCC. These vector coefficients follow Independent component analysis. ICA is linear and not necessarily
orthogonal. It extracts independent components even with smaller magnitudes. Analysis of Text independent
speaker identification model is a challenging task because this system comprises highly complex model with
huge number of feature vectors. Under this circumstance dimension reduction of data plays a major role for
obtaining better identification results. This can be achieved by ICA algorithm.
Fig 1
ICA aims at extracting a set of statistically independent vectors from the matrix of training data the Inverted
Mel cepstral feature vectors derived from the original voice sample. It leads to find directions of minimal mutual
information by capturing certain correlations among the frequencies present in the spectral based representation
of the speech signal. All these aims are achieved by ICA in the form of linear combination of basic filter
functions specific to each person.
The signal X is used as proper Mel-cepstral based representation of original signal and the data can be observed
as a set of multivariate time series resulting from a linear mixing process A of independent functions S (
Hyvarinen, (2001))[7][8].
Linear combination of such sources or functions can be summarized as (Cardoso, (1996)) [9]
Ax=S (4)
There is one problem in ICA to determine both the excitation signal S and the scalars A and the only one known
component is the matrix of IMFCC coefficients of the input speech signal. S can be computed as follows
(hyvarinen, (1997)) [10]
S=Ax-1 (5)
A can be computed by considering X as a vector of observations, where each observation is expressed as a
linear combination of independent component s. In order to estimate one of the independent components. A
linear combination of xi is chosen such that (Hyvarinen, (1997)) [10], (Michael, 2002[11])
Text Independent Speaker Identification Using Imfcc Integrated With Ica
www.iosrjournals.org 24 | Page
Y=WT
X (6)
With respect to these conditions stated in equation (4) and equation (5), the linear combination represented in
equation (6) is a true combination of independent components if w was one of the columns of the inverse of A.
After pre processing and whitening, the final equation is given as (Michael,(1999))[12]
S = Y=WT
X = 𝑾 px (7)
Fast ICA algorithm is used to estimate Wi which constitutes the rows of W. since the components are considered
to be statistically independent, the variance between them is high. The following steps are used to estimate W
1. Choose an initial random guess for W
2. Iterate: W<= E{xg(WT
X)}- E{g’(WT
X)}W
3. Normalize: W<=
𝑊
||𝑊||
4. If not converged , go back to step 2
Trained Speaker
Model
Fig 2
Once W is estimated the final step is to project the signal in to the space created by ICA
New dataset=Wica* mean adjusted original data
Where , Wica is the transformation matrix obtained from Fast ICA algorithm.
The extraction of Feature vectors for speaker identification is done in two steps:
1. Compute IMFCC’s and
2. Apply ICA to transform them to get new feature vectors
The computation steps for extracting the new feature vectors as follows
Step1:
i. Take the fourier transform of a signal
ii. Map the powers of the spectrum obtained above on to the inverted mel scale(IMFCC), using hamming
window
iii. Take the logs of the powers at each of the Inverted Mel frequencies
iv. Take the discrete cosine transform of the list of the inverted Mel log powers, as it were a signal
Text Independent Speaker Identification Using Imfcc Integrated With Ica
www.iosrjournals.org 25 | Page
v. The IMFCCs are the amplitudes of the resulting spectrum
Step2:
vi. Apply ICA transform to Mel frequency cepstral coefficients to get the new feature vectors of each speaker
speech spectra
III. Speaker Identification Model With Generalized Gaussian Distribution:
In this section we briefly describe the speaker identification process. Figure 1 depicts the Text
Independent Speaker Identification model. In this dotted lines represents training phase and bold line represents
testing phase.
The Literature shows that probabilistic models like GGMM for have yielded better performance
results for training both text-dependent and text-independent speaker recognition applications. Due to
the probabilistic property of a GGMM, it can also be applied to speaker identification applications in the
presence of different noises increasing the channel robustness and therefore more suited to this research.
Here it is assumed that the feature vectors (after preprocessing the IMFCC with ICA) follow a
multivariate Generalized Gaussian Mixer model. It is reasonable to assume the acoustic space corresponding to
a speaker voice can be characterized by acoustic classes representing the some broad phonetic events such as
vowels, nasals. These acoustic classes reflect some general speaker dependent vocal tract configurations that are
useful for charactering speaker identity. The shape of each spectra in turn is represented by the mean of its
component density and variation of the average spectral shape represented by co-variance matrix. Hence the
entire speech spectra of the each individual speaker can be characterized as an M component Finite Multivariate
Generalized Gaussian Mixer model.
The Probability density function of the each individual speaker speech spectra is represented as follows:
𝒑(𝒙 𝒕|λ) = 𝜶𝒊 𝒃𝒊
𝑴
𝒊=𝟏 (𝒙 𝒕|λ) (8)
Where, 𝑥𝑡 = (𝑥𝑡𝑖𝑗 ) j=1,2,….D; i=1,2,….M; t=1,2….T is a D dimensional random vector representing
IMFCC vector.
λ is the parametric set such λ= (µ, 𝝆, ∑)
𝛼𝑖 is the component weight such that 𝛂𝐢 = 𝟏𝐌
𝐢=𝟏
𝑏𝑖(𝑥𝑡 | λ) is the probability density function of ith
acoustic class represented by the new vectors of the speech data
the D-dimensional Generalized Gaussian (GG) distribution (M.bicego et al (2008)(13)). It is in the form of the
following
𝒃𝒊(𝒙 𝒕| (µ, 𝝆, ∑)) =
[𝐝𝐞𝐭⁡( )]−𝟏 𝟐
[𝒛 𝝆 𝑨 𝝆,𝝈 ] 𝑫exp −
−𝟏 𝟐(𝒙 𝒕−𝝁 𝒊
𝑨(𝝆,𝝈) 𝝆
(9)
Where z (𝝆) =
𝟐
𝝆
𝚪
𝟏
𝝆
and 𝑨 𝝆, 𝝈 =
𝚪 𝟏 𝝆
𝚪 𝟑 𝝆
The parameter 𝜇𝑖 is the mean vector and the summation of 𝐴(𝜌) is a scaling parameter which allows a variance
var (𝜎2
) and 𝜌 is the shape of the parameter when ρ=1, the Generalized Gaussian corresponds to laplacian or
double exponential Distribution. When ρ=2, the Generalized Gaussian corresponds to a Gaussian distribution. In
limiting case ρ→ + ∞. The model can have one covariance matrix per a Generalized Gaussian density of the
acoustic class of each speaker. The diagonal convergence matrix is used for speaker model, based on the
previous studies. This results diagonal covariance matrix for the feature vector, the features are independent and
the probability density function of the feature vector is
𝒃𝒊(𝒙 𝒕|λ) =
𝐞𝐱𝐩⁡−
𝒙 𝒊𝒋−𝝁 𝒊𝒋
𝑨(𝝆 𝒊𝒋−𝝈 𝒊𝒋
𝝆 𝒊𝒋
𝟐
𝝆 𝒊𝒋
𝚪 𝟏+
𝟏
𝝆 𝒊𝒋
𝑨(𝝆 𝒊𝒋,𝝈 𝒊𝒋)
𝑫
𝒋=𝟏 = 𝒇𝒊𝒋
𝑫
𝒋=𝟏 (𝒙 𝒕𝒊𝒋) (10)
To find the estimate of model parameters αi μij and ij for i= 1,2,3 …,M, j=1,2,…,D, we maximize the expected
value b y using log likelihood function.the parameters are given by Armando.j el at (2003)[14] for each speech
spectra
The following are the updated equations of the parameters of EM algorithm as given by sailaja et al (2010)[15]
𝜶𝒊
(𝒍+𝟏)
=
𝟏
𝑻
𝜶𝒊
𝒍 𝒃 𝒊(𝒙 𝒕,𝝀 𝒍 )
𝜶𝒊
𝒍 𝒃 𝒊(𝒙 𝒕,𝝀 𝒍 )𝑴
𝒊=𝟏
𝑻
𝒕=𝟏 (11)
Where 𝜆 𝑙
= (𝜇𝑖𝑗
(𝑙)
,𝛼𝑖𝑗
(𝑙)
) are the estimates obtained.
Text Independent Speaker Identification Using Imfcc Integrated With Ica
www.iosrjournals.org 26 | Page
The updated equation for estimating 𝜇𝑖𝑗 is
𝝁𝒊𝒋
𝒍+𝟏
=
𝒕 𝟏 𝒙 𝒕 𝝀 𝒍 𝑨 𝑵,𝝆 𝟏
(𝒙 𝒕𝒊𝒋−𝝁 𝒕𝒊𝒋)
𝑻
𝒕=𝟏
𝒕 𝟏
𝑻
𝒕=𝟏 𝒙 𝒕,𝝀 𝒍 𝑨 𝑵,𝝆 𝒊𝒋
(12)
where 𝐴(𝑁, 𝜌𝑖𝑗 ) is a function which must be equal to unity for 𝜌𝑖 = 2and must be equal to
1
𝜌 𝑖𝑗 −1
for 𝜌𝑖 ≠ 1 in the
case of N=2, we have also observed that 𝐴(𝑁, 𝜌𝑖𝑗 ) must be an increasing function of𝜌𝑖𝑗 .
The updated equation for estimating 𝛼𝑖𝑗 is
𝝈𝒊𝒋
(𝒍+𝟏)
=
𝒕 𝟏(𝒙 𝒕 𝝀(𝒍))
𝚪
𝟑
𝝆 𝒊𝒋
𝝆 𝒕 𝚪
𝟏
𝝆 𝒕
𝒙 𝒕𝒊𝒋−𝝁 𝒊𝒋
(𝒍)
𝟏
𝝆 𝒊𝒋𝑵
𝒊=𝟏
𝒕 𝟏(𝑻
𝒕=𝟏 𝒙 𝒕 𝝀(𝒍))
𝟏
𝝆 𝒊𝒋
(13)
`
IV. Speaker Identification Using Bayes’ Decision Rule:
From each test speaker, we obtained the extracted feature vectors were applied to the function “ICA
TRANSFORM” and were estimated into the space of ICA created by the associated speaker with unique
speaker ID. The new feature vectors of the test utterances and the trained models were fed to a Byes classifier
for identification applications which employ large group of data sets and the corresponding test speaker was
identified. (Domingo’s, (1997))[16][17].
𝑝 𝑖
𝑥𝑡
, 𝜆 is the posteriori probability for an acoustic class i and is defined by the following equation
(Reynolds,(1995))[6].
𝐩 𝐢
𝐱 𝐭
, 𝛌 =
𝐩𝐢 𝐛𝐢 𝐱 𝐭
^−
𝐩 𝐤 𝐛 𝐤 𝐱 𝐭
^−𝐌
𝐤=𝟏
(14)
𝒔 = 𝐦𝐚𝐱 𝟏<𝒌<𝒔 𝒑𝒍 𝝀 𝒌 𝑿 = 𝐚𝐫𝐠 𝐦𝐚𝐱 𝟏<𝒌<𝒔[𝒑𝒍 𝝀 𝒌 𝑿 𝒑 𝒓(𝝀 𝒌)] (15)
Finally we computes S using the logarithms and independence between the observations.
V. Expermental Results:
The developed model, IMFCC with integration of ICA with GGMM performance is evaluated. An
experiment was conducted on 30 speakers, for each speaker 10 utterances with each of duration 6sec are
recorded by high quality microphone. Out of this voice corpus, first, fifth and seventh sessions are used for
testing and remaining sessions are used for training. By using front end process explained in section II, IMFCC
feature vectors were obtained which were again refined by using ICA. The global model for each speaker
density is estimated by using the derived parameters. With the test data set, the efficiency of the developed
model is studied by identifying the speaker with the speaker identification algorithm given in section IV. The
average percentage of identification was computed and the results are tabulated.
From the table 1, it is observed that the average percentage of correct identification of speaker for the
developed model is 98.3±10. The percentage correctness for Generalized Gaussian Mixer Model with
Integrated ICA using MFCC as a feature extraction is 97.8 ±12. This clearly shows that the speaker
Identification model with IMFCC in a front end processing is having higher average percentage of correct
identification than the other models.
Table1.Average percentage of correct Identification using MFCC and IMFCC feature vectors
Model MFCC IMFCC
Embedded ICA with GGMM 97.8±12 98.3±10
V. Conclusion:
In this paper we proposed and developed a Text Independent Speaker Identification with IMFC
coefficients as a Feature vectors. Experimental Results shows that this method gives tremendous Improvement
and it can detect the correct speaker among 30 speakers with 10 utterances. Therefore this
developed model will be applicable to real time applications.
Text Independent Speaker Identification Using Imfcc Integrated With Ica
www.iosrjournals.org 27 | Page
References:
[1]. N M ramaligeswararao, Dr.V.Sailaja & Dr.k.Srinivasa Rao(2011) “Text Independent Speaker Identification using Integrated
Independent Component Analysis with Generalized Gaussian Mixture Model”International Journal of Advanced Computer Science
and Applications vol.2, No12, 2011pp.85-91
[2]. Ben Gold and Nelson Morgan (2002), “speech and audio processing “part 4 chapter 14 pp.189-203 John wills and sons
[3]. Hyvarinen,(1999), “Fast and Robust Fixed Point Algorithm for ICA” proceedings of the IEEE transactions on neural
networks,Volume.10,no.3,pp.626-634.
[4]. Heidelberg (2003), “ICANN/ICONIP’03 of the 2003 joint international conference on Artificial neural networks/neural information
processing”.
[5]. S.chakroborty. a. roy and g. saha , “ improved closed ser text independent speaker identification by combining MFCC with evidence
from flipped filter banks ” International journal of signal processing vol.4 , no.2 ,pp.114-121,2007.ISSN 1304-4478
[6]. Ruche chaudhary, National technical research organization ,”performance study of Text Independent speaker Identification system
using MFCC and IMFCC for telephone and microphone speeches” International Journal of computing and Business
Research(IJCBR),vol. 3 issue 2 My 2012
[7]. Hyvarinen, (2001), “Independent component analysis, john Wiley and sons.
[8]. Hyvarinen,(1999), “Fast and Robust Fixed Point Algorithm for ICA” proceedings of the IEEE transactions on neural
networks,Volume.10,no.3,pp.626-634.
[9]. Cardoso,(1996) “Equivarant adaptive source separation IEEE Transaction on signal processing”vol.44.no.12.pp-3017-3030
[10]. Hyvarinen.(1997) “A family of fixed point algorithms for independent component analysis” proceedings of the IEEE international
conference on acoustics speech and signal processing (ICASSP),VOLUME5,pp.3917-3920.
[11]. Heidelberg (2003), “ICANN/ICONIP’03 of the 2003 joint international conference on Artificial neural networks/neural information
processing”.
[12]. Ning Wang , P. C. Ching(2011), “Nengheng Zheng, and Tan Lee, “Robust Speaker Recognition Using Denoised Vocal Source and
Vocal Tract Features speaker verification,” IEEE Transaction on Audio Speech and Language processing, Vol. 1, No. 2, pp. 25-35.
[13]. Md M. Bicego, D Gonzalez, E Grosso and Alba Castro(2008) “Generalized Gaussian distribution for sequential Data
Classification” IEE Heidelberg (2003), “ICANN/ICONIP’03 of the 2003 joint international conference on Artificial neural
networks/neural information processing”.
[14]. Armando. J et al (2003), “A practical procedure to estimate the shape Parameters in the Generalized Gaussian distribution
[15]. V Sailaja, K Srinivasa Rao & K V V S Reddy(2010), “Text Independent Speaker Identification Model with Finite Multivariate
Generalized Gaussian Mixture Model and Hierarchical Clustering Algorithm ” International Journal of Computer applications Vol.
11, No. 11, , pp. 25-31.
[16]. Domingo(1997), “Speaker Recognition”, A Tutorial", Proceedings of the IEEE, Vol. 85, no. 9
[17]. Domingo’s, (1997), “On the optimality of the simple Bayesian classifier under zero-one loss,” Machine Learning, vol. 29, pp.
103-130, 1997

More Related Content

What's hot

Ag04606202206
Ag04606202206Ag04606202206
Ag04606202206
IJERA Editor
 
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
IJECEIAES
 
Content Based Image Retrieval Using 2-D Discrete Wavelet Transform
Content Based Image Retrieval Using 2-D Discrete Wavelet TransformContent Based Image Retrieval Using 2-D Discrete Wavelet Transform
Content Based Image Retrieval Using 2-D Discrete Wavelet Transform
IOSR Journals
 
Architecture neural network deep optimizing based on self organizing feature ...
Architecture neural network deep optimizing based on self organizing feature ...Architecture neural network deep optimizing based on self organizing feature ...
Architecture neural network deep optimizing based on self organizing feature ...
journalBEEI
 
Image fusion using nsct denoising and target extraction for visual surveillance
Image fusion using nsct denoising and target extraction for visual surveillanceImage fusion using nsct denoising and target extraction for visual surveillance
Image fusion using nsct denoising and target extraction for visual surveillance
eSAT Publishing House
 
Change Detection from Remotely Sensed Images Based on Stationary Wavelet Tran...
Change Detection from Remotely Sensed Images Based on Stationary Wavelet Tran...Change Detection from Remotely Sensed Images Based on Stationary Wavelet Tran...
Change Detection from Remotely Sensed Images Based on Stationary Wavelet Tran...
IJECEIAES
 
IRJET- Diabetic Haemorrhage Detection using DWT and Elliptical LBP
IRJET- Diabetic Haemorrhage Detection using DWT and Elliptical LBPIRJET- Diabetic Haemorrhage Detection using DWT and Elliptical LBP
IRJET- Diabetic Haemorrhage Detection using DWT and Elliptical LBP
IRJET Journal
 
Face recognition using selected topographical features
Face recognition using selected topographical features Face recognition using selected topographical features
Face recognition using selected topographical features
IJECEIAES
 
F EATURE S ELECTION USING F ISHER ’ S R ATIO T ECHNIQUE FOR A UTOMATIC ...
F EATURE  S ELECTION USING  F ISHER ’ S  R ATIO  T ECHNIQUE FOR  A UTOMATIC  ...F EATURE  S ELECTION USING  F ISHER ’ S  R ATIO  T ECHNIQUE FOR  A UTOMATIC  ...
F EATURE S ELECTION USING F ISHER ’ S R ATIO T ECHNIQUE FOR A UTOMATIC ...
IJCI JOURNAL
 
Paper id 252014146
Paper id 252014146Paper id 252014146
Paper id 252014146IJRAT
 
face recognition system using LBP
face recognition system using LBPface recognition system using LBP
face recognition system using LBP
Marwan H. Noman
 
Beyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image Retrieval
Beyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image RetrievalBeyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image Retrieval
Beyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image Retrieval
Association of Scientists, Developers and Faculties
 
Full parameterization process
Full parameterization processFull parameterization process
Full parameterization process
Osama Awad
 
Ch4201557563
Ch4201557563Ch4201557563
Ch4201557563
IJERA Editor
 
Paper id 27201451
Paper id 27201451Paper id 27201451
Paper id 27201451
IJRAT
 
ANOVA and Fisher Criterion based Feature Selection for Lower Dimensional Univ...
ANOVA and Fisher Criterion based Feature Selection for Lower Dimensional Univ...ANOVA and Fisher Criterion based Feature Selection for Lower Dimensional Univ...
ANOVA and Fisher Criterion based Feature Selection for Lower Dimensional Univ...
CSCJournals
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
Detection of leaf diseases and classification using digital image processing
Detection of leaf diseases and classification using digital image processingDetection of leaf diseases and classification using digital image processing
Detection of leaf diseases and classification using digital image processing
Naeem Shehzad
 

What's hot (19)

Ag04606202206
Ag04606202206Ag04606202206
Ag04606202206
 
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
 
Content Based Image Retrieval Using 2-D Discrete Wavelet Transform
Content Based Image Retrieval Using 2-D Discrete Wavelet TransformContent Based Image Retrieval Using 2-D Discrete Wavelet Transform
Content Based Image Retrieval Using 2-D Discrete Wavelet Transform
 
Architecture neural network deep optimizing based on self organizing feature ...
Architecture neural network deep optimizing based on self organizing feature ...Architecture neural network deep optimizing based on self organizing feature ...
Architecture neural network deep optimizing based on self organizing feature ...
 
Image fusion using nsct denoising and target extraction for visual surveillance
Image fusion using nsct denoising and target extraction for visual surveillanceImage fusion using nsct denoising and target extraction for visual surveillance
Image fusion using nsct denoising and target extraction for visual surveillance
 
Change Detection from Remotely Sensed Images Based on Stationary Wavelet Tran...
Change Detection from Remotely Sensed Images Based on Stationary Wavelet Tran...Change Detection from Remotely Sensed Images Based on Stationary Wavelet Tran...
Change Detection from Remotely Sensed Images Based on Stationary Wavelet Tran...
 
IRJET- Diabetic Haemorrhage Detection using DWT and Elliptical LBP
IRJET- Diabetic Haemorrhage Detection using DWT and Elliptical LBPIRJET- Diabetic Haemorrhage Detection using DWT and Elliptical LBP
IRJET- Diabetic Haemorrhage Detection using DWT and Elliptical LBP
 
Face recognition using selected topographical features
Face recognition using selected topographical features Face recognition using selected topographical features
Face recognition using selected topographical features
 
F EATURE S ELECTION USING F ISHER ’ S R ATIO T ECHNIQUE FOR A UTOMATIC ...
F EATURE  S ELECTION USING  F ISHER ’ S  R ATIO  T ECHNIQUE FOR  A UTOMATIC  ...F EATURE  S ELECTION USING  F ISHER ’ S  R ATIO  T ECHNIQUE FOR  A UTOMATIC  ...
F EATURE S ELECTION USING F ISHER ’ S R ATIO T ECHNIQUE FOR A UTOMATIC ...
 
Paper id 252014146
Paper id 252014146Paper id 252014146
Paper id 252014146
 
face recognition system using LBP
face recognition system using LBPface recognition system using LBP
face recognition system using LBP
 
Beyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image Retrieval
Beyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image RetrievalBeyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image Retrieval
Beyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image Retrieval
 
Full parameterization process
Full parameterization processFull parameterization process
Full parameterization process
 
Ch4201557563
Ch4201557563Ch4201557563
Ch4201557563
 
Paper id 27201451
Paper id 27201451Paper id 27201451
Paper id 27201451
 
ANOVA and Fisher Criterion based Feature Selection for Lower Dimensional Univ...
ANOVA and Fisher Criterion based Feature Selection for Lower Dimensional Univ...ANOVA and Fisher Criterion based Feature Selection for Lower Dimensional Univ...
ANOVA and Fisher Criterion based Feature Selection for Lower Dimensional Univ...
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
FULL PAPER.PDF
FULL PAPER.PDFFULL PAPER.PDF
FULL PAPER.PDF
 
Detection of leaf diseases and classification using digital image processing
Detection of leaf diseases and classification using digital image processingDetection of leaf diseases and classification using digital image processing
Detection of leaf diseases and classification using digital image processing
 

Viewers also liked

Asymptotic Behavior of Solutions of Nonlinear Neutral Delay Forced Impulsive ...
Asymptotic Behavior of Solutions of Nonlinear Neutral Delay Forced Impulsive ...Asymptotic Behavior of Solutions of Nonlinear Neutral Delay Forced Impulsive ...
Asymptotic Behavior of Solutions of Nonlinear Neutral Delay Forced Impulsive ...
IOSR Journals
 
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
IOSR Journals
 
A Convergence Theorem Associated With a Pair of Second Order Differential Equ...
A Convergence Theorem Associated With a Pair of Second Order Differential Equ...A Convergence Theorem Associated With a Pair of Second Order Differential Equ...
A Convergence Theorem Associated With a Pair of Second Order Differential Equ...
IOSR Journals
 
Revisiting A Panicked Securitization Market
Revisiting A Panicked Securitization MarketRevisiting A Panicked Securitization Market
Revisiting A Panicked Securitization Market
IOSR Journals
 
Some Soliton Solutions of Non Linear Partial Differential Equations by Tan-Co...
Some Soliton Solutions of Non Linear Partial Differential Equations by Tan-Co...Some Soliton Solutions of Non Linear Partial Differential Equations by Tan-Co...
Some Soliton Solutions of Non Linear Partial Differential Equations by Tan-Co...
IOSR Journals
 
Heat Capacity of BN and GaN binary semiconductor under high Pressure-Temperat...
Heat Capacity of BN and GaN binary semiconductor under high Pressure-Temperat...Heat Capacity of BN and GaN binary semiconductor under high Pressure-Temperat...
Heat Capacity of BN and GaN binary semiconductor under high Pressure-Temperat...
IOSR Journals
 
Exploring Values and Value Streams by BPM method solved by Lean Management tool
Exploring Values and Value Streams by BPM method solved by Lean Management toolExploring Values and Value Streams by BPM method solved by Lean Management tool
Exploring Values and Value Streams by BPM method solved by Lean Management tool
IOSR Journals
 
Application of Langmuir-Hinshelwood Model to Bioregeneration of Activated Car...
Application of Langmuir-Hinshelwood Model to Bioregeneration of Activated Car...Application of Langmuir-Hinshelwood Model to Bioregeneration of Activated Car...
Application of Langmuir-Hinshelwood Model to Bioregeneration of Activated Car...
IOSR Journals
 
Complex Project Crashing Algorithm
Complex Project Crashing AlgorithmComplex Project Crashing Algorithm
Complex Project Crashing Algorithm
IOSR Journals
 
A Utility Interactive Electricity Generation Schemes with Renewable Resources
A Utility Interactive Electricity Generation Schemes with Renewable ResourcesA Utility Interactive Electricity Generation Schemes with Renewable Resources
A Utility Interactive Electricity Generation Schemes with Renewable Resources
IOSR Journals
 
Non Revenue Water Reduction- A Tool for Achiving 24x7 Water Supply
Non Revenue Water Reduction- A Tool for Achiving 24x7 Water SupplyNon Revenue Water Reduction- A Tool for Achiving 24x7 Water Supply
Non Revenue Water Reduction- A Tool for Achiving 24x7 Water Supply
IOSR Journals
 
Parametric Optimization of Eicher 11.10 Chassis Frame for Weight Reduction Us...
Parametric Optimization of Eicher 11.10 Chassis Frame for Weight Reduction Us...Parametric Optimization of Eicher 11.10 Chassis Frame for Weight Reduction Us...
Parametric Optimization of Eicher 11.10 Chassis Frame for Weight Reduction Us...
IOSR Journals
 
Uniform particle distribution by a newer method in composite metal of Al/SiC
Uniform particle distribution by a newer method in composite metal of Al/SiCUniform particle distribution by a newer method in composite metal of Al/SiC
Uniform particle distribution by a newer method in composite metal of Al/SiC
IOSR Journals
 
A012610106
A012610106A012610106
A012610106
IOSR Journals
 
Rehabilitation Process and Persons with Physical Dysfunctions
Rehabilitation Process and Persons with Physical DysfunctionsRehabilitation Process and Persons with Physical Dysfunctions
Rehabilitation Process and Persons with Physical Dysfunctions
IOSR Journals
 
Performance Comparison of AODV, DSR and LAR1 in Mobile Ad-hoc Network based o...
Performance Comparison of AODV, DSR and LAR1 in Mobile Ad-hoc Network based o...Performance Comparison of AODV, DSR and LAR1 in Mobile Ad-hoc Network based o...
Performance Comparison of AODV, DSR and LAR1 in Mobile Ad-hoc Network based o...
IOSR Journals
 
Object Elimination and Reconstruction Using an Effective Inpainting Method
Object Elimination and Reconstruction Using an Effective Inpainting MethodObject Elimination and Reconstruction Using an Effective Inpainting Method
Object Elimination and Reconstruction Using an Effective Inpainting Method
IOSR Journals
 
Has Hiroshima Bombing Continued
Has Hiroshima Bombing ContinuedHas Hiroshima Bombing Continued
Has Hiroshima Bombing Continued
IOSR Journals
 
A017410108
A017410108A017410108
A017410108
IOSR Journals
 
The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9
IOSR Journals
 

Viewers also liked (20)

Asymptotic Behavior of Solutions of Nonlinear Neutral Delay Forced Impulsive ...
Asymptotic Behavior of Solutions of Nonlinear Neutral Delay Forced Impulsive ...Asymptotic Behavior of Solutions of Nonlinear Neutral Delay Forced Impulsive ...
Asymptotic Behavior of Solutions of Nonlinear Neutral Delay Forced Impulsive ...
 
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
 
A Convergence Theorem Associated With a Pair of Second Order Differential Equ...
A Convergence Theorem Associated With a Pair of Second Order Differential Equ...A Convergence Theorem Associated With a Pair of Second Order Differential Equ...
A Convergence Theorem Associated With a Pair of Second Order Differential Equ...
 
Revisiting A Panicked Securitization Market
Revisiting A Panicked Securitization MarketRevisiting A Panicked Securitization Market
Revisiting A Panicked Securitization Market
 
Some Soliton Solutions of Non Linear Partial Differential Equations by Tan-Co...
Some Soliton Solutions of Non Linear Partial Differential Equations by Tan-Co...Some Soliton Solutions of Non Linear Partial Differential Equations by Tan-Co...
Some Soliton Solutions of Non Linear Partial Differential Equations by Tan-Co...
 
Heat Capacity of BN and GaN binary semiconductor under high Pressure-Temperat...
Heat Capacity of BN and GaN binary semiconductor under high Pressure-Temperat...Heat Capacity of BN and GaN binary semiconductor under high Pressure-Temperat...
Heat Capacity of BN and GaN binary semiconductor under high Pressure-Temperat...
 
Exploring Values and Value Streams by BPM method solved by Lean Management tool
Exploring Values and Value Streams by BPM method solved by Lean Management toolExploring Values and Value Streams by BPM method solved by Lean Management tool
Exploring Values and Value Streams by BPM method solved by Lean Management tool
 
Application of Langmuir-Hinshelwood Model to Bioregeneration of Activated Car...
Application of Langmuir-Hinshelwood Model to Bioregeneration of Activated Car...Application of Langmuir-Hinshelwood Model to Bioregeneration of Activated Car...
Application of Langmuir-Hinshelwood Model to Bioregeneration of Activated Car...
 
Complex Project Crashing Algorithm
Complex Project Crashing AlgorithmComplex Project Crashing Algorithm
Complex Project Crashing Algorithm
 
A Utility Interactive Electricity Generation Schemes with Renewable Resources
A Utility Interactive Electricity Generation Schemes with Renewable ResourcesA Utility Interactive Electricity Generation Schemes with Renewable Resources
A Utility Interactive Electricity Generation Schemes with Renewable Resources
 
Non Revenue Water Reduction- A Tool for Achiving 24x7 Water Supply
Non Revenue Water Reduction- A Tool for Achiving 24x7 Water SupplyNon Revenue Water Reduction- A Tool for Achiving 24x7 Water Supply
Non Revenue Water Reduction- A Tool for Achiving 24x7 Water Supply
 
Parametric Optimization of Eicher 11.10 Chassis Frame for Weight Reduction Us...
Parametric Optimization of Eicher 11.10 Chassis Frame for Weight Reduction Us...Parametric Optimization of Eicher 11.10 Chassis Frame for Weight Reduction Us...
Parametric Optimization of Eicher 11.10 Chassis Frame for Weight Reduction Us...
 
Uniform particle distribution by a newer method in composite metal of Al/SiC
Uniform particle distribution by a newer method in composite metal of Al/SiCUniform particle distribution by a newer method in composite metal of Al/SiC
Uniform particle distribution by a newer method in composite metal of Al/SiC
 
A012610106
A012610106A012610106
A012610106
 
Rehabilitation Process and Persons with Physical Dysfunctions
Rehabilitation Process and Persons with Physical DysfunctionsRehabilitation Process and Persons with Physical Dysfunctions
Rehabilitation Process and Persons with Physical Dysfunctions
 
Performance Comparison of AODV, DSR and LAR1 in Mobile Ad-hoc Network based o...
Performance Comparison of AODV, DSR and LAR1 in Mobile Ad-hoc Network based o...Performance Comparison of AODV, DSR and LAR1 in Mobile Ad-hoc Network based o...
Performance Comparison of AODV, DSR and LAR1 in Mobile Ad-hoc Network based o...
 
Object Elimination and Reconstruction Using an Effective Inpainting Method
Object Elimination and Reconstruction Using an Effective Inpainting MethodObject Elimination and Reconstruction Using an Effective Inpainting Method
Object Elimination and Reconstruction Using an Effective Inpainting Method
 
Has Hiroshima Bombing Continued
Has Hiroshima Bombing ContinuedHas Hiroshima Bombing Continued
Has Hiroshima Bombing Continued
 
A017410108
A017410108A017410108
A017410108
 
The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9The Real Time Drowisness Detection Using Arm 9
The Real Time Drowisness Detection Using Arm 9
 

Similar to Text Independent Speaker Identification Using Imfcc Integrated With Ica

SYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITION
SYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITIONSYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITION
SYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITION
cscpconf
 
D04812125
D04812125D04812125
D04812125
IOSR-JEN
 
A Text-Independent Speaker Identification System based on The Zak Transform
A Text-Independent Speaker Identification System based on The Zak TransformA Text-Independent Speaker Identification System based on The Zak Transform
A Text-Independent Speaker Identification System based on The Zak Transform
CSCJournals
 
Designing an Efficient Multimodal Biometric System using Palmprint and Speech...
Designing an Efficient Multimodal Biometric System using Palmprint and Speech...Designing an Efficient Multimodal Biometric System using Palmprint and Speech...
Designing an Efficient Multimodal Biometric System using Palmprint and Speech...
IDES Editor
 
F43063841
F43063841F43063841
F43063841
IJERA Editor
 
Single Channel Speech De-noising Using Kernel Independent Component Analysis...
	Single Channel Speech De-noising Using Kernel Independent Component Analysis...	Single Channel Speech De-noising Using Kernel Independent Component Analysis...
Single Channel Speech De-noising Using Kernel Independent Component Analysis...
theijes
 
D111823
D111823D111823
D010512126
D010512126D010512126
D010512126
IOSR Journals
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker Recognition
CSCJournals
 
IRJET- A Review of Music Analysis Techniques
IRJET- A Review of Music Analysis TechniquesIRJET- A Review of Music Analysis Techniques
IRJET- A Review of Music Analysis Techniques
IRJET Journal
 
Comparative study to realize an automatic speaker recognition system
Comparative study to realize an automatic speaker recognition system Comparative study to realize an automatic speaker recognition system
Comparative study to realize an automatic speaker recognition system
IJECEIAES
 
Intelligent Arabic letters speech recognition system based on mel frequency c...
Intelligent Arabic letters speech recognition system based on mel frequency c...Intelligent Arabic letters speech recognition system based on mel frequency c...
Intelligent Arabic letters speech recognition system based on mel frequency c...
IJECEIAES
 
Reduction of Side Lobes by Using Complementary Codes for Radar Application
Reduction of Side Lobes by Using Complementary Codes for Radar ApplicationReduction of Side Lobes by Using Complementary Codes for Radar Application
Reduction of Side Lobes by Using Complementary Codes for Radar Application
IOSR Journals
 
Reduction of Side Lobes by Using Complementary Codes for Radar Application
Reduction of Side Lobes by Using Complementary Codes for Radar ApplicationReduction of Side Lobes by Using Complementary Codes for Radar Application
Reduction of Side Lobes by Using Complementary Codes for Radar Application
IOSR Journals
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural network
eSAT Journals
 
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
CSCJournals
 
Ijecet 06 09_010
Ijecet 06 09_010Ijecet 06 09_010
Ijecet 06 09_010
IAEME Publication
 
Speaker and Speech Recognition for Secured Smart Home Applications
Speaker and Speech Recognition for Secured Smart Home ApplicationsSpeaker and Speech Recognition for Secured Smart Home Applications
Speaker and Speech Recognition for Secured Smart Home Applications
Roger Gomes
 
Path Loss Prediction by Robust Regression Methods
Path Loss Prediction by Robust Regression MethodsPath Loss Prediction by Robust Regression Methods
Path Loss Prediction by Robust Regression Methods
ijceronline
 

Similar to Text Independent Speaker Identification Using Imfcc Integrated With Ica (20)

SYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITION
SYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITIONSYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITION
SYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITION
 
D04812125
D04812125D04812125
D04812125
 
A Text-Independent Speaker Identification System based on The Zak Transform
A Text-Independent Speaker Identification System based on The Zak TransformA Text-Independent Speaker Identification System based on The Zak Transform
A Text-Independent Speaker Identification System based on The Zak Transform
 
Designing an Efficient Multimodal Biometric System using Palmprint and Speech...
Designing an Efficient Multimodal Biometric System using Palmprint and Speech...Designing an Efficient Multimodal Biometric System using Palmprint and Speech...
Designing an Efficient Multimodal Biometric System using Palmprint and Speech...
 
F43063841
F43063841F43063841
F43063841
 
Single Channel Speech De-noising Using Kernel Independent Component Analysis...
	Single Channel Speech De-noising Using Kernel Independent Component Analysis...	Single Channel Speech De-noising Using Kernel Independent Component Analysis...
Single Channel Speech De-noising Using Kernel Independent Component Analysis...
 
D111823
D111823D111823
D111823
 
D010512126
D010512126D010512126
D010512126
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker Recognition
 
IRJET- A Review of Music Analysis Techniques
IRJET- A Review of Music Analysis TechniquesIRJET- A Review of Music Analysis Techniques
IRJET- A Review of Music Analysis Techniques
 
Comparative study to realize an automatic speaker recognition system
Comparative study to realize an automatic speaker recognition system Comparative study to realize an automatic speaker recognition system
Comparative study to realize an automatic speaker recognition system
 
Intelligent Arabic letters speech recognition system based on mel frequency c...
Intelligent Arabic letters speech recognition system based on mel frequency c...Intelligent Arabic letters speech recognition system based on mel frequency c...
Intelligent Arabic letters speech recognition system based on mel frequency c...
 
Reduction of Side Lobes by Using Complementary Codes for Radar Application
Reduction of Side Lobes by Using Complementary Codes for Radar ApplicationReduction of Side Lobes by Using Complementary Codes for Radar Application
Reduction of Side Lobes by Using Complementary Codes for Radar Application
 
Reduction of Side Lobes by Using Complementary Codes for Radar Application
Reduction of Side Lobes by Using Complementary Codes for Radar ApplicationReduction of Side Lobes by Using Complementary Codes for Radar Application
Reduction of Side Lobes by Using Complementary Codes for Radar Application
 
E0622730
E0622730E0622730
E0622730
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural network
 
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
 
Ijecet 06 09_010
Ijecet 06 09_010Ijecet 06 09_010
Ijecet 06 09_010
 
Speaker and Speech Recognition for Secured Smart Home Applications
Speaker and Speech Recognition for Secured Smart Home ApplicationsSpeaker and Speech Recognition for Secured Smart Home Applications
Speaker and Speech Recognition for Secured Smart Home Applications
 
Path Loss Prediction by Robust Regression Methods
Path Loss Prediction by Robust Regression MethodsPath Loss Prediction by Robust Regression Methods
Path Loss Prediction by Robust Regression Methods
 

More from IOSR Journals

A011140104
A011140104A011140104
A011140104
IOSR Journals
 
M0111397100
M0111397100M0111397100
M0111397100
IOSR Journals
 
L011138596
L011138596L011138596
L011138596
IOSR Journals
 
K011138084
K011138084K011138084
K011138084
IOSR Journals
 
J011137479
J011137479J011137479
J011137479
IOSR Journals
 
I011136673
I011136673I011136673
I011136673
IOSR Journals
 
G011134454
G011134454G011134454
G011134454
IOSR Journals
 
H011135565
H011135565H011135565
H011135565
IOSR Journals
 
F011134043
F011134043F011134043
F011134043
IOSR Journals
 
E011133639
E011133639E011133639
E011133639
IOSR Journals
 
D011132635
D011132635D011132635
D011132635
IOSR Journals
 
C011131925
C011131925C011131925
C011131925
IOSR Journals
 
B011130918
B011130918B011130918
B011130918
IOSR Journals
 
A011130108
A011130108A011130108
A011130108
IOSR Journals
 
I011125160
I011125160I011125160
I011125160
IOSR Journals
 
H011124050
H011124050H011124050
H011124050
IOSR Journals
 
G011123539
G011123539G011123539
G011123539
IOSR Journals
 
F011123134
F011123134F011123134
F011123134
IOSR Journals
 
E011122530
E011122530E011122530
E011122530
IOSR Journals
 
D011121524
D011121524D011121524
D011121524
IOSR Journals
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Recently uploaded

block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
Divya Somashekar
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
FluxPrime1
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
ankuprajapati0525
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
ydteq
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
AafreenAbuthahir2
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
BrazilAccount1
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
AmarGB2
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
AJAYKUMARPUND1
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
fxintegritypublishin
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 

Recently uploaded (20)

block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 

Text Independent Speaker Identification Using Imfcc Integrated With Ica

  • 1. IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-ISSN: 2278-2834,p- ISSN: 2278-8735. Volume 7, Issue 5 (Sep. - Oct. 2013), PP 22-27 www.iosrjournals.org www.iosrjournals.org 22 | Page Text Independent Speaker Identification Using Imfcc Integrated With Ica P.Suryakumari1 , N M Ramaligeswararao2 , K.Jyothi3 , Dr.V.Sailaja4 1,2,3,4, (ECE, Godavari Institute of Engineering & Technology/ Jntuk, India) Abstract: Over the years, more research work has been reported in literature regarding text independent speaker identification using MFC coefficients. MFCC is one of the best methods modeled on human auditory system. Murali et al (2011) [1] has developed a Text independent speaker identification using MFC coefficients which follows Generalized Gaussian mixer model. MFCC, because of its filter bank structure it captures the characteristics of information more effectively in lower frequency region than higher region, because of this, valuable information in high frequency region may be lost. In this paper we rectify the above problem by retrieving the information in high frequency region by inverting the Mel bank structure. The dimensionality and dependency of above features were reduced by integrating with ICA. Here Text Independent Speaker Identification system is developed by using Generalized Gaussian Mixer Model .By the experimentation, it was observed that this model outperforms the earlier existing models. Keywords: Independent Component Analysis; Generalized Gaussian Mixer Model; Inverted Mel frequency cepstral coefficients; Bayesian classifier; EM algorithm. I. Introduction: From the past knowledge it is evident that physiological structure of a vocal tract is different for different persons, each one having different features. Based on these features, we can differentiate one persons voice from others(Ben gold (2002)[2]). The Mel frequency cepstral coefficients features was first proposed for speech recognition by S.B.Davis and P.Mermelstein . Over the decades though it was most widely used, there was a drawback in its filter bank structure, which is able to capture the information in low frequency regions than higher frequency regions.As the information contained in the higher frequency regions may be missed, proportionally the average efficiency of identification of speaker may be reduced. To overcome the this problem Mr.S.Chakraborty (2007) [16] proposed a flipped MFCC filter bank using Inverted Mel scale by considering the High frequency region coefficients. Here the main objective is to capture the information which has been missed by the MFCC by using a new filter bank structure when it is obtained by flipping or inverting the MFCC filters bank structure (s.chakroborty(2007))[16]. In this feature extraction only few inverted Mel cepstral coefficients are considered and the remaining coefficients are dropped as insignificant due to high dimensionality problems. Ignoring the dependences and dropping some of the Mel cepstral coefficients may lead to falsification of model. It is needed to reduce the dimensionality and avoid dependence among others to having a robust model for speakers. This can be achieved by integrating the Inverted Mel frequency cepstral coefficients of each speaker’s speech spectra with Independent component analysis (ICA).The integration of ICA with Inverted Mel frequency cepstral coefficients will make the feature vector more robust. The main aim of the ICA is to find out linear representation of non-Gaussian data so that the components are statistically independent. The required assumption in applying ICA is that the observed signals are linearly mixed. It helps in capturing some of the essential features of data in many applications including Automatic Speaker Recognition Systems for high dimensional multivariate analysis (Hyvarinen(2001))[3]. ICA is a powerful tool to separate mixed signals mainly find in speech analysis results in computational and conceptual simplicity. Hence in this paper a text independent speaker identification method is developed and analyzed by integrating independent component analysis with Inverted Mel frequency cepstarl coefficient and using Generalized Gaussian Mixer model. The procedure for extracting the feature vectors of the speaker speech spectra using inverted Mel frequency cepstral coefficients and independent component analysis is covered in Section II. Assuming that each individual speech spectra in feature vector follows a Generalized Gaussian Mixer Model and the estimation of the model parameters is obtained by using EM algorithm. The speaker identification algorithm under Bayesian frame work is presented in the section IV.
  • 2. Text Independent Speaker Identification Using Imfcc Integrated With Ica www.iosrjournals.org 23 | Page II. Feature Vectors Extraction In this section we describe the feature vector extraction of each individual speech spectra. Since a long time MFCC is considered as a reliable front end for speaker identification because it has coefficients that related to psychophysical studies human perception of the frequency content of sounds in a nonlinear scale called Mel scale (Ding(2001))[4]. This can be mathematically expressed as fmel=2595 log[1+(f/700)] (1) Where fmel is the subjective pitch in Mel’s corresponding to f in Hz. But Mel cepstral coefficients are obtained by capturing lower frequency regions and neglecting higher regions. This leads to loss of more information from the recorded data base. By considering high frequency regions by using flipping or inverting Me3l bank structure (S.chakroborty (2007)) [5], we retrieve more information. The flip of the MFCC Mel bank is expressed as 𝜳𝒊(k) = ΨQ+1-i (Ms/2 +1-k) (2) 𝛹𝑖(k) is the inverted Mel scale filter response while Ψi(k) is MFCC filter bank response and Q is the number of filters in the bank. Here inverted Mel-scale is defined as follows 𝒇mel (f)=2195.2860-2595 log10(1+(4031.25-f/700)) (3) 𝑓mel (f) is subjective pitch in the new scale corresponding to f, actual frequency in Hz. The block diagram of Inverted Mel frequency cepstral coefficients is shown in below figure 1.From the block diagram of IMFCC the frequency bands are positioned logarithmically (on Inverted Mel scale) by inverted Mel filter bank structure, which approximated the human auditory systems response (Ruche chaudhary (2012) [6]). In this IMFCC captures high frequency bands of the Inverted Mel scale, getting more efficiency than MFCC. These vector coefficients follow Independent component analysis. ICA is linear and not necessarily orthogonal. It extracts independent components even with smaller magnitudes. Analysis of Text independent speaker identification model is a challenging task because this system comprises highly complex model with huge number of feature vectors. Under this circumstance dimension reduction of data plays a major role for obtaining better identification results. This can be achieved by ICA algorithm. Fig 1 ICA aims at extracting a set of statistically independent vectors from the matrix of training data the Inverted Mel cepstral feature vectors derived from the original voice sample. It leads to find directions of minimal mutual information by capturing certain correlations among the frequencies present in the spectral based representation of the speech signal. All these aims are achieved by ICA in the form of linear combination of basic filter functions specific to each person. The signal X is used as proper Mel-cepstral based representation of original signal and the data can be observed as a set of multivariate time series resulting from a linear mixing process A of independent functions S ( Hyvarinen, (2001))[7][8]. Linear combination of such sources or functions can be summarized as (Cardoso, (1996)) [9] Ax=S (4) There is one problem in ICA to determine both the excitation signal S and the scalars A and the only one known component is the matrix of IMFCC coefficients of the input speech signal. S can be computed as follows (hyvarinen, (1997)) [10] S=Ax-1 (5) A can be computed by considering X as a vector of observations, where each observation is expressed as a linear combination of independent component s. In order to estimate one of the independent components. A linear combination of xi is chosen such that (Hyvarinen, (1997)) [10], (Michael, 2002[11])
  • 3. Text Independent Speaker Identification Using Imfcc Integrated With Ica www.iosrjournals.org 24 | Page Y=WT X (6) With respect to these conditions stated in equation (4) and equation (5), the linear combination represented in equation (6) is a true combination of independent components if w was one of the columns of the inverse of A. After pre processing and whitening, the final equation is given as (Michael,(1999))[12] S = Y=WT X = 𝑾 px (7) Fast ICA algorithm is used to estimate Wi which constitutes the rows of W. since the components are considered to be statistically independent, the variance between them is high. The following steps are used to estimate W 1. Choose an initial random guess for W 2. Iterate: W<= E{xg(WT X)}- E{g’(WT X)}W 3. Normalize: W<= 𝑊 ||𝑊|| 4. If not converged , go back to step 2 Trained Speaker Model Fig 2 Once W is estimated the final step is to project the signal in to the space created by ICA New dataset=Wica* mean adjusted original data Where , Wica is the transformation matrix obtained from Fast ICA algorithm. The extraction of Feature vectors for speaker identification is done in two steps: 1. Compute IMFCC’s and 2. Apply ICA to transform them to get new feature vectors The computation steps for extracting the new feature vectors as follows Step1: i. Take the fourier transform of a signal ii. Map the powers of the spectrum obtained above on to the inverted mel scale(IMFCC), using hamming window iii. Take the logs of the powers at each of the Inverted Mel frequencies iv. Take the discrete cosine transform of the list of the inverted Mel log powers, as it were a signal
  • 4. Text Independent Speaker Identification Using Imfcc Integrated With Ica www.iosrjournals.org 25 | Page v. The IMFCCs are the amplitudes of the resulting spectrum Step2: vi. Apply ICA transform to Mel frequency cepstral coefficients to get the new feature vectors of each speaker speech spectra III. Speaker Identification Model With Generalized Gaussian Distribution: In this section we briefly describe the speaker identification process. Figure 1 depicts the Text Independent Speaker Identification model. In this dotted lines represents training phase and bold line represents testing phase. The Literature shows that probabilistic models like GGMM for have yielded better performance results for training both text-dependent and text-independent speaker recognition applications. Due to the probabilistic property of a GGMM, it can also be applied to speaker identification applications in the presence of different noises increasing the channel robustness and therefore more suited to this research. Here it is assumed that the feature vectors (after preprocessing the IMFCC with ICA) follow a multivariate Generalized Gaussian Mixer model. It is reasonable to assume the acoustic space corresponding to a speaker voice can be characterized by acoustic classes representing the some broad phonetic events such as vowels, nasals. These acoustic classes reflect some general speaker dependent vocal tract configurations that are useful for charactering speaker identity. The shape of each spectra in turn is represented by the mean of its component density and variation of the average spectral shape represented by co-variance matrix. Hence the entire speech spectra of the each individual speaker can be characterized as an M component Finite Multivariate Generalized Gaussian Mixer model. The Probability density function of the each individual speaker speech spectra is represented as follows: 𝒑(𝒙 𝒕|λ) = 𝜶𝒊 𝒃𝒊 𝑴 𝒊=𝟏 (𝒙 𝒕|λ) (8) Where, 𝑥𝑡 = (𝑥𝑡𝑖𝑗 ) j=1,2,….D; i=1,2,….M; t=1,2….T is a D dimensional random vector representing IMFCC vector. λ is the parametric set such λ= (µ, 𝝆, ∑) 𝛼𝑖 is the component weight such that 𝛂𝐢 = 𝟏𝐌 𝐢=𝟏 𝑏𝑖(𝑥𝑡 | λ) is the probability density function of ith acoustic class represented by the new vectors of the speech data the D-dimensional Generalized Gaussian (GG) distribution (M.bicego et al (2008)(13)). It is in the form of the following 𝒃𝒊(𝒙 𝒕| (µ, 𝝆, ∑)) = [𝐝𝐞𝐭⁡( )]−𝟏 𝟐 [𝒛 𝝆 𝑨 𝝆,𝝈 ] 𝑫exp − −𝟏 𝟐(𝒙 𝒕−𝝁 𝒊 𝑨(𝝆,𝝈) 𝝆 (9) Where z (𝝆) = 𝟐 𝝆 𝚪 𝟏 𝝆 and 𝑨 𝝆, 𝝈 = 𝚪 𝟏 𝝆 𝚪 𝟑 𝝆 The parameter 𝜇𝑖 is the mean vector and the summation of 𝐴(𝜌) is a scaling parameter which allows a variance var (𝜎2 ) and 𝜌 is the shape of the parameter when ρ=1, the Generalized Gaussian corresponds to laplacian or double exponential Distribution. When ρ=2, the Generalized Gaussian corresponds to a Gaussian distribution. In limiting case ρ→ + ∞. The model can have one covariance matrix per a Generalized Gaussian density of the acoustic class of each speaker. The diagonal convergence matrix is used for speaker model, based on the previous studies. This results diagonal covariance matrix for the feature vector, the features are independent and the probability density function of the feature vector is 𝒃𝒊(𝒙 𝒕|λ) = 𝐞𝐱𝐩⁡− 𝒙 𝒊𝒋−𝝁 𝒊𝒋 𝑨(𝝆 𝒊𝒋−𝝈 𝒊𝒋 𝝆 𝒊𝒋 𝟐 𝝆 𝒊𝒋 𝚪 𝟏+ 𝟏 𝝆 𝒊𝒋 𝑨(𝝆 𝒊𝒋,𝝈 𝒊𝒋) 𝑫 𝒋=𝟏 = 𝒇𝒊𝒋 𝑫 𝒋=𝟏 (𝒙 𝒕𝒊𝒋) (10) To find the estimate of model parameters αi μij and ij for i= 1,2,3 …,M, j=1,2,…,D, we maximize the expected value b y using log likelihood function.the parameters are given by Armando.j el at (2003)[14] for each speech spectra The following are the updated equations of the parameters of EM algorithm as given by sailaja et al (2010)[15] 𝜶𝒊 (𝒍+𝟏) = 𝟏 𝑻 𝜶𝒊 𝒍 𝒃 𝒊(𝒙 𝒕,𝝀 𝒍 ) 𝜶𝒊 𝒍 𝒃 𝒊(𝒙 𝒕,𝝀 𝒍 )𝑴 𝒊=𝟏 𝑻 𝒕=𝟏 (11) Where 𝜆 𝑙 = (𝜇𝑖𝑗 (𝑙) ,𝛼𝑖𝑗 (𝑙) ) are the estimates obtained.
  • 5. Text Independent Speaker Identification Using Imfcc Integrated With Ica www.iosrjournals.org 26 | Page The updated equation for estimating 𝜇𝑖𝑗 is 𝝁𝒊𝒋 𝒍+𝟏 = 𝒕 𝟏 𝒙 𝒕 𝝀 𝒍 𝑨 𝑵,𝝆 𝟏 (𝒙 𝒕𝒊𝒋−𝝁 𝒕𝒊𝒋) 𝑻 𝒕=𝟏 𝒕 𝟏 𝑻 𝒕=𝟏 𝒙 𝒕,𝝀 𝒍 𝑨 𝑵,𝝆 𝒊𝒋 (12) where 𝐴(𝑁, 𝜌𝑖𝑗 ) is a function which must be equal to unity for 𝜌𝑖 = 2and must be equal to 1 𝜌 𝑖𝑗 −1 for 𝜌𝑖 ≠ 1 in the case of N=2, we have also observed that 𝐴(𝑁, 𝜌𝑖𝑗 ) must be an increasing function of𝜌𝑖𝑗 . The updated equation for estimating 𝛼𝑖𝑗 is 𝝈𝒊𝒋 (𝒍+𝟏) = 𝒕 𝟏(𝒙 𝒕 𝝀(𝒍)) 𝚪 𝟑 𝝆 𝒊𝒋 𝝆 𝒕 𝚪 𝟏 𝝆 𝒕 𝒙 𝒕𝒊𝒋−𝝁 𝒊𝒋 (𝒍) 𝟏 𝝆 𝒊𝒋𝑵 𝒊=𝟏 𝒕 𝟏(𝑻 𝒕=𝟏 𝒙 𝒕 𝝀(𝒍)) 𝟏 𝝆 𝒊𝒋 (13) ` IV. Speaker Identification Using Bayes’ Decision Rule: From each test speaker, we obtained the extracted feature vectors were applied to the function “ICA TRANSFORM” and were estimated into the space of ICA created by the associated speaker with unique speaker ID. The new feature vectors of the test utterances and the trained models were fed to a Byes classifier for identification applications which employ large group of data sets and the corresponding test speaker was identified. (Domingo’s, (1997))[16][17]. 𝑝 𝑖 𝑥𝑡 , 𝜆 is the posteriori probability for an acoustic class i and is defined by the following equation (Reynolds,(1995))[6]. 𝐩 𝐢 𝐱 𝐭 , 𝛌 = 𝐩𝐢 𝐛𝐢 𝐱 𝐭 ^− 𝐩 𝐤 𝐛 𝐤 𝐱 𝐭 ^−𝐌 𝐤=𝟏 (14) 𝒔 = 𝐦𝐚𝐱 𝟏<𝒌<𝒔 𝒑𝒍 𝝀 𝒌 𝑿 = 𝐚𝐫𝐠 𝐦𝐚𝐱 𝟏<𝒌<𝒔[𝒑𝒍 𝝀 𝒌 𝑿 𝒑 𝒓(𝝀 𝒌)] (15) Finally we computes S using the logarithms and independence between the observations. V. Expermental Results: The developed model, IMFCC with integration of ICA with GGMM performance is evaluated. An experiment was conducted on 30 speakers, for each speaker 10 utterances with each of duration 6sec are recorded by high quality microphone. Out of this voice corpus, first, fifth and seventh sessions are used for testing and remaining sessions are used for training. By using front end process explained in section II, IMFCC feature vectors were obtained which were again refined by using ICA. The global model for each speaker density is estimated by using the derived parameters. With the test data set, the efficiency of the developed model is studied by identifying the speaker with the speaker identification algorithm given in section IV. The average percentage of identification was computed and the results are tabulated. From the table 1, it is observed that the average percentage of correct identification of speaker for the developed model is 98.3±10. The percentage correctness for Generalized Gaussian Mixer Model with Integrated ICA using MFCC as a feature extraction is 97.8 ±12. This clearly shows that the speaker Identification model with IMFCC in a front end processing is having higher average percentage of correct identification than the other models. Table1.Average percentage of correct Identification using MFCC and IMFCC feature vectors Model MFCC IMFCC Embedded ICA with GGMM 97.8±12 98.3±10 V. Conclusion: In this paper we proposed and developed a Text Independent Speaker Identification with IMFC coefficients as a Feature vectors. Experimental Results shows that this method gives tremendous Improvement and it can detect the correct speaker among 30 speakers with 10 utterances. Therefore this developed model will be applicable to real time applications.
  • 6. Text Independent Speaker Identification Using Imfcc Integrated With Ica www.iosrjournals.org 27 | Page References: [1]. N M ramaligeswararao, Dr.V.Sailaja & Dr.k.Srinivasa Rao(2011) “Text Independent Speaker Identification using Integrated Independent Component Analysis with Generalized Gaussian Mixture Model”International Journal of Advanced Computer Science and Applications vol.2, No12, 2011pp.85-91 [2]. Ben Gold and Nelson Morgan (2002), “speech and audio processing “part 4 chapter 14 pp.189-203 John wills and sons [3]. Hyvarinen,(1999), “Fast and Robust Fixed Point Algorithm for ICA” proceedings of the IEEE transactions on neural networks,Volume.10,no.3,pp.626-634. [4]. Heidelberg (2003), “ICANN/ICONIP’03 of the 2003 joint international conference on Artificial neural networks/neural information processing”. [5]. S.chakroborty. a. roy and g. saha , “ improved closed ser text independent speaker identification by combining MFCC with evidence from flipped filter banks ” International journal of signal processing vol.4 , no.2 ,pp.114-121,2007.ISSN 1304-4478 [6]. Ruche chaudhary, National technical research organization ,”performance study of Text Independent speaker Identification system using MFCC and IMFCC for telephone and microphone speeches” International Journal of computing and Business Research(IJCBR),vol. 3 issue 2 My 2012 [7]. Hyvarinen, (2001), “Independent component analysis, john Wiley and sons. [8]. Hyvarinen,(1999), “Fast and Robust Fixed Point Algorithm for ICA” proceedings of the IEEE transactions on neural networks,Volume.10,no.3,pp.626-634. [9]. Cardoso,(1996) “Equivarant adaptive source separation IEEE Transaction on signal processing”vol.44.no.12.pp-3017-3030 [10]. Hyvarinen.(1997) “A family of fixed point algorithms for independent component analysis” proceedings of the IEEE international conference on acoustics speech and signal processing (ICASSP),VOLUME5,pp.3917-3920. [11]. Heidelberg (2003), “ICANN/ICONIP’03 of the 2003 joint international conference on Artificial neural networks/neural information processing”. [12]. Ning Wang , P. C. Ching(2011), “Nengheng Zheng, and Tan Lee, “Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features speaker verification,” IEEE Transaction on Audio Speech and Language processing, Vol. 1, No. 2, pp. 25-35. [13]. Md M. Bicego, D Gonzalez, E Grosso and Alba Castro(2008) “Generalized Gaussian distribution for sequential Data Classification” IEE Heidelberg (2003), “ICANN/ICONIP’03 of the 2003 joint international conference on Artificial neural networks/neural information processing”. [14]. Armando. J et al (2003), “A practical procedure to estimate the shape Parameters in the Generalized Gaussian distribution [15]. V Sailaja, K Srinivasa Rao & K V V S Reddy(2010), “Text Independent Speaker Identification Model with Finite Multivariate Generalized Gaussian Mixture Model and Hierarchical Clustering Algorithm ” International Journal of Computer applications Vol. 11, No. 11, , pp. 25-31. [16]. Domingo(1997), “Speaker Recognition”, A Tutorial", Proceedings of the IEEE, Vol. 85, no. 9 [17]. Domingo’s, (1997), “On the optimality of the simple Bayesian classifier under zero-one loss,” Machine Learning, vol. 29, pp. 103-130, 1997