Abstract: Over the years, more research work has been reported in literature regarding text independent
speaker identification using MFC coefficients. MFCC is one of the best methods modeled on human auditory
system. Murali et al (2011) [1] has developed a Text independent speaker identification using MFC coefficients
which follows Generalized Gaussian mixer model. MFCC, because of its filter bank structure it captures the
characteristics of information more effectively in lower frequency region than higher region, because of this,
valuable information in high frequency region may be lost. In this paper we rectify the above problem by
retrieving the information in high frequency region by inverting the Mel bank structure. The dimensionality and
dependency of above features were reduced by integrating with ICA. Here Text Independent Speaker
Identification system is developed by using Generalized Gaussian Mixer Model .By the experimentation, it was
observed that this model outperforms the earlier existing models.
Keywords: Independent Component Analysis; Generalized Gaussian Mixer Model; Inverted Mel frequency
cepstral coefficients; Bayesian classifier; EM algorithm.
Multilinear Kernel Mapping for Feature Dimension Reduction in Content Based M...ijma
In the process of content-based multimedia retrieval, multimedia information is processed in order to
obtain descriptive features. Descriptive representation of features, results in a huge feature count, which
results in processing overhead. To reduce this descriptive feature overhead, various approaches have been
used to dimensional reduction, among which PCA and LDA are the most used methods. However, these
methods do not reflect the significance of feature content in terms of inter-relation among all dataset
features. To achieve a dimension reduction based on histogram transformation, features with low
significance can be eliminated. In this paper, we propose a feature dimensional reduction approaches to
achieve the dimension reduction approach based on a multi-linear kernel (MLK) modeling. A benchmark
dataset for the experimental work is taken and the proposed work is observed to be improved in analysis in
comparison to the conventional system.
A New Approach for Segmentation of Fused Images using Cluster based ThresholdingIDES Editor
This paper proposes the new segmentation technique
with cluster based method. In this, the multi source medical
images like MRI (Magnetic Resonance Imaging), CT
(computed tomography) & PET (positron emission
tomography) are fused and then segmented using cluster based
thresholding approach. The edge details of an image have
become an essential technique in clinical and researchoriented
applications. The more edge details of the fused image
have obtainable with this method. The objective of the
clustering process is to partition a fused image coefficients
into a number of clusters having similar features. These
features are useful to generate the threshold value for further
segmentation of fused image. Finally the segmented output
is compared with standard FCM method and modified Otsu
method. Experimental results have shown that the proposed
cluster based thresholding method is able to effectively extract
important edge details of fused image.
FACE RECOGNITION USING DIFFERENT LOCAL FEATURES WITH DIFFERENT DISTANCE TECHN...IJCSEIT Journal
A face recognition system using different local features with different distance measures is proposed in this
paper. Proposed method is fast and gives accurate detection. Feature vector is based on Eigen values,
Eigen vectors, and diagonal vectors of sub images. Images are partitioned into sub images to detect local
features. Sub partitions are rearranged into vertically and horizontally matrices. Eigen values, Eigenvector
and diagonal vectors are computed for these matrices. Global feature vector is generated for face
recognition. Experiments are performed on benchmark face YALE database. Results indicate that the
proposed method gives better recognition performance in terms of average recognized rate and retrieval
time compared to the existing methods.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
SYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITIONcsandit
Emotional state recognition through speech is being a very interesting research topic nowadays.
Using subliminal information of speech, it is possible to recognize the emotional state of the
person. One of the main problems in the design of automatic emotion recognition systems is the
small number of available patterns. This fact makes the learning process more difficult, due to
the generalization problems that arise under these conditions.
In this work we propose a solution to this problem consisting in enlarging the training set
through the creation the new virtual patterns. In the case of emotional speech, most of the
emotional information is included in speed and pitch variations. So, a change in the average
pitch that does not modify neither the speed nor the pitch variations does not affect the
expressed emotion. Thus, we use this prior information in order to create new patterns applying
a pitch shift modification in the feature extraction process of the classification system. For this
purpose, we propose a frequency scaling modification of the Mel Frequency Cepstral
Coefficients, used to classify the emotion. This proposed process allows us to synthetically
increase the number of available patterns in thetraining set, thus increasing the generalization
capability of the system and reducing the test error.
Spectral Density Oriented Feature Coding For Pattern Recognition ApplicationIJERDJOURNAL
ABSTRACT:- The significant of multi spectral band resolution is explored towards selection of feature coefficients based on its energy density. Toward the feature representiaon in transformed domain, multi wavelet transformations were used for finer spectral representation. However, due to a large feature count these features are not optimal under low resource computing system. In the recognition units, running with low resources a new coding approach of feature selection, considering the band spectral density is developed. The effective selection of feature element, based on its spectral density achieve two objective of pattern recognition, the feature coefficient representiaon is minimized, hence leading to lower resource requirement, and dominant feature representation, resulting in higher retrieval performance.
Multilinear Kernel Mapping for Feature Dimension Reduction in Content Based M...ijma
In the process of content-based multimedia retrieval, multimedia information is processed in order to
obtain descriptive features. Descriptive representation of features, results in a huge feature count, which
results in processing overhead. To reduce this descriptive feature overhead, various approaches have been
used to dimensional reduction, among which PCA and LDA are the most used methods. However, these
methods do not reflect the significance of feature content in terms of inter-relation among all dataset
features. To achieve a dimension reduction based on histogram transformation, features with low
significance can be eliminated. In this paper, we propose a feature dimensional reduction approaches to
achieve the dimension reduction approach based on a multi-linear kernel (MLK) modeling. A benchmark
dataset for the experimental work is taken and the proposed work is observed to be improved in analysis in
comparison to the conventional system.
A New Approach for Segmentation of Fused Images using Cluster based ThresholdingIDES Editor
This paper proposes the new segmentation technique
with cluster based method. In this, the multi source medical
images like MRI (Magnetic Resonance Imaging), CT
(computed tomography) & PET (positron emission
tomography) are fused and then segmented using cluster based
thresholding approach. The edge details of an image have
become an essential technique in clinical and researchoriented
applications. The more edge details of the fused image
have obtainable with this method. The objective of the
clustering process is to partition a fused image coefficients
into a number of clusters having similar features. These
features are useful to generate the threshold value for further
segmentation of fused image. Finally the segmented output
is compared with standard FCM method and modified Otsu
method. Experimental results have shown that the proposed
cluster based thresholding method is able to effectively extract
important edge details of fused image.
FACE RECOGNITION USING DIFFERENT LOCAL FEATURES WITH DIFFERENT DISTANCE TECHN...IJCSEIT Journal
A face recognition system using different local features with different distance measures is proposed in this
paper. Proposed method is fast and gives accurate detection. Feature vector is based on Eigen values,
Eigen vectors, and diagonal vectors of sub images. Images are partitioned into sub images to detect local
features. Sub partitions are rearranged into vertically and horizontally matrices. Eigen values, Eigenvector
and diagonal vectors are computed for these matrices. Global feature vector is generated for face
recognition. Experiments are performed on benchmark face YALE database. Results indicate that the
proposed method gives better recognition performance in terms of average recognized rate and retrieval
time compared to the existing methods.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
SYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITIONcsandit
Emotional state recognition through speech is being a very interesting research topic nowadays.
Using subliminal information of speech, it is possible to recognize the emotional state of the
person. One of the main problems in the design of automatic emotion recognition systems is the
small number of available patterns. This fact makes the learning process more difficult, due to
the generalization problems that arise under these conditions.
In this work we propose a solution to this problem consisting in enlarging the training set
through the creation the new virtual patterns. In the case of emotional speech, most of the
emotional information is included in speed and pitch variations. So, a change in the average
pitch that does not modify neither the speed nor the pitch variations does not affect the
expressed emotion. Thus, we use this prior information in order to create new patterns applying
a pitch shift modification in the feature extraction process of the classification system. For this
purpose, we propose a frequency scaling modification of the Mel Frequency Cepstral
Coefficients, used to classify the emotion. This proposed process allows us to synthetically
increase the number of available patterns in thetraining set, thus increasing the generalization
capability of the system and reducing the test error.
Spectral Density Oriented Feature Coding For Pattern Recognition ApplicationIJERDJOURNAL
ABSTRACT:- The significant of multi spectral band resolution is explored towards selection of feature coefficients based on its energy density. Toward the feature representiaon in transformed domain, multi wavelet transformations were used for finer spectral representation. However, due to a large feature count these features are not optimal under low resource computing system. In the recognition units, running with low resources a new coding approach of feature selection, considering the band spectral density is developed. The effective selection of feature element, based on its spectral density achieve two objective of pattern recognition, the feature coefficient representiaon is minimized, hence leading to lower resource requirement, and dominant feature representation, resulting in higher retrieval performance.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...IJECEIAES
The objective of low-resolution face recognition is to identify faces from small size or poor quality images with varying pose, illumination, expression, etc. In this work, we propose a robust low face recognition technique based on one-dimensional Hidden Markov Models. Features of each facial image are extracted using three steps: firstly, both Gabor filters and Histogram of Oriented Gradients (HOG) descriptor are calculated. Secondly, the size of these features is reduced using the Linear Discriminant Analysis (LDA) method in order to remove redundant information. Finally, the reduced features are combined using Canonical Correlation Analysis (CCA) method. Unlike existing techniques using HMMs, in which authors consider each state to represent one facial region (eyes, nose, mouth, etc), the proposed system employs 1D-HMMs without any prior knowledge about the localization of interest regions in the facial image. Performance of the proposed method will be measured using the AR database.
Architecture neural network deep optimizing based on self organizing feature ...journalBEEI
Forward neural network (FNN) execution relying on the algorithm of training and architecture selection. Different parameters using for nip out the architecture of FNN such as the connections number among strata, neurons hidden number in each strata hidden and hidden strata number. Feature architectural combinations exponential could be uncontrollable manually so specific architecture can be design automatically by using special algorithm which build system with ability generalization better. Determination of architecture FNN can be done by using the algorithm of optimization numerous. In this paper methodology new proposes achievement where FNN neurons respective with hidden layers estimation work where in this work collect algorithm training self organizing feature map (SOFM) with advantages to explain how the best architectural selected automatically by SOFM from criteria error testing based on architecture populated. Different size of dataset benchmark of 4 classifications tested for approach proposed.
Image fusion using nsct denoising and target extraction for visual surveillanceeSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Change Detection from Remotely Sensed Images Based on Stationary Wavelet Tran...IJECEIAES
The major issue of concern in change detection process is the accuracy of the algorithm to recover changed and unchanged pixels. The fusion rules presented in the existing methods could not integrate the features accurately which results in more number of false alarms and speckle noise in the output image. This paper proposes an algorithm which fuses two multi-temporal images through proposed set of fusion rules in stationary wavelet transform. In the first step, the source images obtained from log ratio and mean ratio operators are decomposed into three high frequency sub-bands and one low frequency sub-band by stationary wavelet transform. Then, proposed fusion rules for low and high frequency sub-bands are applied on the coefficient maps to get the fused wavelet coefficients map. The fused image is recovered by applying the inverse stationary wavelet transform (ISWT) on the fused coefficient map. Finally, the changed and unchanged areas are classified using Fuzzy c means clustering. The performance of the algorithm is calculated in terms of percentage correct classification (PCC), overall error (OE) and Kappa coefficient (K ). The qualitative and quantitative results prove that the proposed method offers least error, highest accuracy and Kappa value as compare to its preexistences.
Face recognition using selected topographical features IJECEIAES
This paper represents a new features selection method to improve an existed feature type. Topographical (TGH) features provide large set of features by assigning each image pixel to the related feature depending on image gradient and Hessian matrix. Such type of features was handled by a proposed features selection method. A face recognition feature selector (FRFS) method is presented to inspect TGH features. FRFS depends in its main concept on linear discriminant analysis (LDA) technique, which is used in evaluating features efficiency. FRFS studies feature behavior over a dataset of images to determine the level of its performance. At the end, each feature is assigned to its related level of performance with different levels of performance over the whole image. Depending on a chosen threshold, the highest set of features is selected to be classified by SVM classifier.
F EATURE S ELECTION USING F ISHER ’ S R ATIO T ECHNIQUE FOR A UTOMATIC ...IJCI JOURNAL
Automatic Speech Recognition (ASR) involves mainly
two steps; feature extraction and classification
(pattern recognition). Mel Frequency Cepstral Coeff
icient (MFCC) is used as one of the prominent featu
re
extraction techniques in ASR. Usually, the set of a
ll 12 MFCC coefficients is used as the feature vect
or in
the classification step. But the question is whethe
r the same or improved classification accuracy can
be
achieved by using a subset of 12 MFCC as feature ve
ctor. In this paper, Fisher’s ratio technique is us
ed for
selecting a subset of 12 MFCC coefficients that con
tribute more in discriminating a pattern. The selec
ted
coefficients are used in classification with Hidden
Markov Model (HMM) algorithm. The classification
accuracies that we get by using 12 coefficients and
by using the selected coefficients are compare
One fundamental problem in large scale image retrieval with the bag-of-features is its lack of spatial information, which affects accuracy of image retrieval. Depending on distribution of local features in an image, we propose a novel adaptive Hilbert-scan strategy which computes weight of each path at increasingly fine resolutions. Owing to merits of this strategy, spatial information of object will be preserved more precisely at Hilbert order. Extensive experiments on Caltech-256 show that our method obtains higher accuracy.
Although fuzzy systems demonstrate their ability to
solve different kinds of problems in various applications, there is an increasing interest on developing solid mathematical implementations suitable for control applications such as that used in fuzzy logic controllers (FLC). It is well known that, wide range of parameters is needed to be specified before the construction of a fuzzy system. To simplify in a systematic way the design and construction of a general fuzzy system, and without loss for generality a full parameterization process for a singleton type FLC is proposed in this paper. The resented methodology is very helpful in developing a universal computing algorithm for a standard fuzzy like PID controllers. An illustrative example shows the simplicity of applying the new paradigm.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
ANOVA and Fisher Criterion based Feature Selection for Lower Dimensional Univ...CSCJournals
Unethical uses of data hiding methods have made Image Steganalysis a very important area of
research work in the field of Digital Investigations. Effectiveness of any Image Steganalysis
algorithm depends on feature selection and feature reduction. The goal of this paper is to develop
a reduced dimensional merged feature set for universal image steganalysis using Fisher Criterion
and ANOVA techniques. Statistical features extracted from wavelet subbands and binary
similarity patterns extracted from DCT of an image are merged to make combined feature set.
Fisher criterion and ANOVA test are applied to evaluate the combined feature vector score and
then only those features are selected which are found sensitive in both feature selection methods.
These reduced dimensional 15-D feature vector is used to train SVM classifier with RBF kernel.
The proposed algorithm is tested against steganography methods like F5, Outguess and LSB
based method. Stego images are generated using widely available stego tools for two standard
image databases: CorelDraw and BSDS500. Results are further validated using 10 fold cross
validation process. The proposed algorithm achieves overall 97% detection accuracy against
various steganography methods
International Journal of Engineering Research and Development (IJERD)IJERD Editor
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals,
yahoo journals, bing journals, International Journal of Engineering Research and Development, google journals, hard copy of journal
Detection of leaf diseases and classification using digital image processingNaeem Shehzad
In this presentation you can learn how to find leaf disease using k mean algorithm and gray level co-occurrence matrix and support vector machine with complete results.
In this presentation , I mention all the data in very convenient way . I hope you can take it easy.
Thank you
Asymptotic Behavior of Solutions of Nonlinear Neutral Delay Forced Impulsive ...IOSR Journals
Sufficient conditions are obtained for every solution of first order nonlinear neutral delay forced
impulsive differential equations with positive and negative coefficients tends to a constant as t ∞.
Mathematics Subject Classification [MSC 2010]:34A37
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...IOSR Journals
Abstract- This paper deals with residual musical noise which results from the perceptual speech enhancement
type algorithms and especially using wiener filtering approach. Perceptual speech enhancement techniques
perform better than the non perceptual techniques, most of them still return a trouble residual musical noise.
This is due to that only noise above the noise masking threshold (NMT) is filtered out then noise below the noise
masking threshold (NMT) can become audible if its maskers are filtered. It can affect the performance of
perceptual speech enhancement method that process the audible noise only (Residual noise is still present). In
order to overcome this drawback a new speech enhancement technique is proposed here.The main aim here is
to improve the enhanced speech signal quality provided by perceptual wiener filtering and by controlling the
latter via a second filter regarded as a psychoacoustically motivated weighting factor. The simulation results
gives the information that the performance is improved compared to other perceptual speech enhancement
methods.
A Convergence Theorem Associated With a Pair of Second Order Differential Equ...IOSR Journals
We consider the second order matrix differential equation
M 0, 0 x Where M is a second-order matrix differential operator and is a vector having two components. In this
paper we prove a convergence theorem for the vector function 1 2 ( ) ( ) ( ) f x f x f x which is continuous in
0 x and of bounded variation in 0 x , when p(x) and q(x) tend to as x tend to .
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...IJECEIAES
The objective of low-resolution face recognition is to identify faces from small size or poor quality images with varying pose, illumination, expression, etc. In this work, we propose a robust low face recognition technique based on one-dimensional Hidden Markov Models. Features of each facial image are extracted using three steps: firstly, both Gabor filters and Histogram of Oriented Gradients (HOG) descriptor are calculated. Secondly, the size of these features is reduced using the Linear Discriminant Analysis (LDA) method in order to remove redundant information. Finally, the reduced features are combined using Canonical Correlation Analysis (CCA) method. Unlike existing techniques using HMMs, in which authors consider each state to represent one facial region (eyes, nose, mouth, etc), the proposed system employs 1D-HMMs without any prior knowledge about the localization of interest regions in the facial image. Performance of the proposed method will be measured using the AR database.
Architecture neural network deep optimizing based on self organizing feature ...journalBEEI
Forward neural network (FNN) execution relying on the algorithm of training and architecture selection. Different parameters using for nip out the architecture of FNN such as the connections number among strata, neurons hidden number in each strata hidden and hidden strata number. Feature architectural combinations exponential could be uncontrollable manually so specific architecture can be design automatically by using special algorithm which build system with ability generalization better. Determination of architecture FNN can be done by using the algorithm of optimization numerous. In this paper methodology new proposes achievement where FNN neurons respective with hidden layers estimation work where in this work collect algorithm training self organizing feature map (SOFM) with advantages to explain how the best architectural selected automatically by SOFM from criteria error testing based on architecture populated. Different size of dataset benchmark of 4 classifications tested for approach proposed.
Image fusion using nsct denoising and target extraction for visual surveillanceeSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Change Detection from Remotely Sensed Images Based on Stationary Wavelet Tran...IJECEIAES
The major issue of concern in change detection process is the accuracy of the algorithm to recover changed and unchanged pixels. The fusion rules presented in the existing methods could not integrate the features accurately which results in more number of false alarms and speckle noise in the output image. This paper proposes an algorithm which fuses two multi-temporal images through proposed set of fusion rules in stationary wavelet transform. In the first step, the source images obtained from log ratio and mean ratio operators are decomposed into three high frequency sub-bands and one low frequency sub-band by stationary wavelet transform. Then, proposed fusion rules for low and high frequency sub-bands are applied on the coefficient maps to get the fused wavelet coefficients map. The fused image is recovered by applying the inverse stationary wavelet transform (ISWT) on the fused coefficient map. Finally, the changed and unchanged areas are classified using Fuzzy c means clustering. The performance of the algorithm is calculated in terms of percentage correct classification (PCC), overall error (OE) and Kappa coefficient (K ). The qualitative and quantitative results prove that the proposed method offers least error, highest accuracy and Kappa value as compare to its preexistences.
Face recognition using selected topographical features IJECEIAES
This paper represents a new features selection method to improve an existed feature type. Topographical (TGH) features provide large set of features by assigning each image pixel to the related feature depending on image gradient and Hessian matrix. Such type of features was handled by a proposed features selection method. A face recognition feature selector (FRFS) method is presented to inspect TGH features. FRFS depends in its main concept on linear discriminant analysis (LDA) technique, which is used in evaluating features efficiency. FRFS studies feature behavior over a dataset of images to determine the level of its performance. At the end, each feature is assigned to its related level of performance with different levels of performance over the whole image. Depending on a chosen threshold, the highest set of features is selected to be classified by SVM classifier.
F EATURE S ELECTION USING F ISHER ’ S R ATIO T ECHNIQUE FOR A UTOMATIC ...IJCI JOURNAL
Automatic Speech Recognition (ASR) involves mainly
two steps; feature extraction and classification
(pattern recognition). Mel Frequency Cepstral Coeff
icient (MFCC) is used as one of the prominent featu
re
extraction techniques in ASR. Usually, the set of a
ll 12 MFCC coefficients is used as the feature vect
or in
the classification step. But the question is whethe
r the same or improved classification accuracy can
be
achieved by using a subset of 12 MFCC as feature ve
ctor. In this paper, Fisher’s ratio technique is us
ed for
selecting a subset of 12 MFCC coefficients that con
tribute more in discriminating a pattern. The selec
ted
coefficients are used in classification with Hidden
Markov Model (HMM) algorithm. The classification
accuracies that we get by using 12 coefficients and
by using the selected coefficients are compare
One fundamental problem in large scale image retrieval with the bag-of-features is its lack of spatial information, which affects accuracy of image retrieval. Depending on distribution of local features in an image, we propose a novel adaptive Hilbert-scan strategy which computes weight of each path at increasingly fine resolutions. Owing to merits of this strategy, spatial information of object will be preserved more precisely at Hilbert order. Extensive experiments on Caltech-256 show that our method obtains higher accuracy.
Although fuzzy systems demonstrate their ability to
solve different kinds of problems in various applications, there is an increasing interest on developing solid mathematical implementations suitable for control applications such as that used in fuzzy logic controllers (FLC). It is well known that, wide range of parameters is needed to be specified before the construction of a fuzzy system. To simplify in a systematic way the design and construction of a general fuzzy system, and without loss for generality a full parameterization process for a singleton type FLC is proposed in this paper. The resented methodology is very helpful in developing a universal computing algorithm for a standard fuzzy like PID controllers. An illustrative example shows the simplicity of applying the new paradigm.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
ANOVA and Fisher Criterion based Feature Selection for Lower Dimensional Univ...CSCJournals
Unethical uses of data hiding methods have made Image Steganalysis a very important area of
research work in the field of Digital Investigations. Effectiveness of any Image Steganalysis
algorithm depends on feature selection and feature reduction. The goal of this paper is to develop
a reduced dimensional merged feature set for universal image steganalysis using Fisher Criterion
and ANOVA techniques. Statistical features extracted from wavelet subbands and binary
similarity patterns extracted from DCT of an image are merged to make combined feature set.
Fisher criterion and ANOVA test are applied to evaluate the combined feature vector score and
then only those features are selected which are found sensitive in both feature selection methods.
These reduced dimensional 15-D feature vector is used to train SVM classifier with RBF kernel.
The proposed algorithm is tested against steganography methods like F5, Outguess and LSB
based method. Stego images are generated using widely available stego tools for two standard
image databases: CorelDraw and BSDS500. Results are further validated using 10 fold cross
validation process. The proposed algorithm achieves overall 97% detection accuracy against
various steganography methods
International Journal of Engineering Research and Development (IJERD)IJERD Editor
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals,
yahoo journals, bing journals, International Journal of Engineering Research and Development, google journals, hard copy of journal
Detection of leaf diseases and classification using digital image processingNaeem Shehzad
In this presentation you can learn how to find leaf disease using k mean algorithm and gray level co-occurrence matrix and support vector machine with complete results.
In this presentation , I mention all the data in very convenient way . I hope you can take it easy.
Thank you
Asymptotic Behavior of Solutions of Nonlinear Neutral Delay Forced Impulsive ...IOSR Journals
Sufficient conditions are obtained for every solution of first order nonlinear neutral delay forced
impulsive differential equations with positive and negative coefficients tends to a constant as t ∞.
Mathematics Subject Classification [MSC 2010]:34A37
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...IOSR Journals
Abstract- This paper deals with residual musical noise which results from the perceptual speech enhancement
type algorithms and especially using wiener filtering approach. Perceptual speech enhancement techniques
perform better than the non perceptual techniques, most of them still return a trouble residual musical noise.
This is due to that only noise above the noise masking threshold (NMT) is filtered out then noise below the noise
masking threshold (NMT) can become audible if its maskers are filtered. It can affect the performance of
perceptual speech enhancement method that process the audible noise only (Residual noise is still present). In
order to overcome this drawback a new speech enhancement technique is proposed here.The main aim here is
to improve the enhanced speech signal quality provided by perceptual wiener filtering and by controlling the
latter via a second filter regarded as a psychoacoustically motivated weighting factor. The simulation results
gives the information that the performance is improved compared to other perceptual speech enhancement
methods.
A Convergence Theorem Associated With a Pair of Second Order Differential Equ...IOSR Journals
We consider the second order matrix differential equation
M 0, 0 x Where M is a second-order matrix differential operator and is a vector having two components. In this
paper we prove a convergence theorem for the vector function 1 2 ( ) ( ) ( ) f x f x f x which is continuous in
0 x and of bounded variation in 0 x , when p(x) and q(x) tend to as x tend to .
Revisiting A Panicked Securitization MarketIOSR Journals
With the passage of Finance Bill 2013 on April 30 in Lok Sabha proposing to Levy a 30% distribution tax on the investors in securitization deals through special purpose vehicles, there is a stir in the securitization market. The principal investors (banks) were paying the tax on their net income from the securitization transaction through SPVs. Now, they will be taxed on the gross income as per the new Finance Bill. The new securitization guidelines issued in May 2012 dipped the volume of fresh issue to Rs. 28,400 crore from Rs. 44,500 crore in the preceding fiscal.
Some Soliton Solutions of Non Linear Partial Differential Equations by Tan-Co...IOSR Journals
Tan-cot method is applied to get exact soliton solutions of non-linear partial differential equations notably generalized Benjamin-Bona-Mahony, Zakharov-Kuznetsov Benjamin-Bona-Mahony, Kadomtsov-Petviashvilli Benjamin-Bona-Mahony and Korteweg-de Vries equations, which are important evolution equations with wide variety of physical applications. Elastic behavior and soliton fusion/fission is shown graphically and discussed physically as far as possible
Heat Capacity of BN and GaN binary semiconductor under high Pressure-Temperat...IOSR Journals
In this paper, we have calculated the molar heat capacity for cubic zinc blende (cZB) BN and GaN binary semiconductors at high pressure-temperature (PT). For the calculation of heat capacity, we firstly obtained the Debye temperature (ϴD) variation with temperature and at higher temperature it becomes constant with temperature in quasi-harmonic approximation limits. We have also calculated the static Debye temperature (ϴD) from elastic constant for the both BN and GaN binary semiconductors. The elastic constants are calculated from the energy-strain relation using plane wave method in DFT approach. All the calculated results are well consistence with experimental and reported data
Exploring Values and Value Streams by BPM method solved by Lean Management toolIOSR Journals
This paper suggests a continuous improvement plan that can satisfy customer’s value and eliminate
waste in the enterprise business process. In order to explore the applicability of lean management principles in
the enterprise business process, the five fundamental concepts (specify value, identify the value stream, flow,
pull, and perfection) of lean management are being used as a stable and proved approach. In addition, Business
Process Management is applied as a new method to constantly improve the elimination of waste in the
enterprise business process. This can be accomplished by the lean management concept. Moreover business
process problems, such as overlapping work, redoing work, communication gaps, inflexible processes, and
obscure processes, have the possibility of being solved by lean management.
Lean management has traditionally been adopted by manufacturing industries to improve operations through
the identification and elimination of all forms of waste basically. The construction industry has also adopted this
philosophy, primarily in the field of projects. In order to increase an organization’s competitiveness and
productivity, lean management is needed in the any business process as well as in the field. The intent of this
work is to explore a method of introducing lean management which continuously improves any business
processes. The five fundamental concepts (specify value, identify the value stream, flow, pull, and perfection) of
lean management as an approach are being adapted in this project to improve quality of the processes. Hence
the main objective of this paper to apply the tool of lean and six sigma management to improve the elimination
of waste in the enterprises business process. Followed by a literature review which provides a brief summary of
lean thinking and six sigma along with challenges might face while implementing. A case study follows that
demonstrates how; Business Process Management is applied as a tool, method to constantly improve the
business process.
Application of Langmuir-Hinshelwood Model to Bioregeneration of Activated Car...IOSR Journals
Environmental pollution, high cost and high energy consumption associated with thermal regeneration of activated carbon polluted with hydrocarbon necessitated the search for a better way of regenerating activated carbon, bioregeneration. Spent granular activated carbon was regenerated having been initially characterized using cultured Pseudomonas Putida. The rate of bioregeneration was studied by varying the volume of bacteria from 10ml, 20ml, 30ml and 40ml. The regeneration temperature was also varied from 25oC to ambient temperature of 27oC, 35oC and further at 40 and 45oC over a period of 21 days. The experimental results showed clear correlation when validated using the Langmuir-Hinshelwood kinetic model. The experiment at ambient temperature showed a negative correlation due to the fluctuation in the ambient temperature unlike all other experiment where temperature was controlled in an autoclave machine.
Crashing is the procedure by which project duration can be shorten up by expediting selective
activities with in the project. But it requires allocating more resources than usual to compress an activity’s
duration, which in turns increases the budget of that activity. So, crashing is basically a time-cost trade-off by
which specific deadline can be achieved. The traditional method of crashing only considers average activity
times for the calculation of the critical path, ignoring the stochastic nature of activity time. This report is written
to develop an algorithm for optimum crashing method to minimize the required cost while attaining a specified
completion time.
A Utility Interactive Electricity Generation Schemes with Renewable ResourcesIOSR Journals
Abstract: In the recent year’s power utility of coastal areas are experiencing relatively large quantum of solar and wind energy. If the wind is heavy it might produce larger sea waves of high energy contents. The electricity needs of a township or a village situated in a coastal area can be partially fulfilled by installing a modular mini electricity generating unit and an intensified solar heat extractor in buildings. Also, installation of medium sized windmill plant, solar heated steam turbine electricity generator and sea wave energy extracting plants could fulfill the rest of the electricity needs of the township. Here we discuss the regulation of the voltage and frequency of a stand-alone fixed-pitch wind energy conversion system (WECS) based on a self-excited squirrel-cage induction machine. The characteristics of the wind turbine, self-excited generator, and the ratings of the VSI are considered in order to determine the load range for which voltageand frequency can be regulated for a given wind speed range. Keywords: Solar panel, solar tracker, solar water heater,renewable energy, wind mills, induction generator, load management.
Non Revenue Water Reduction- A Tool for Achiving 24x7 Water SupplyIOSR Journals
The availability of water at cheaper rates promotes the wasteful use of water. People give little or no
attention towards conservation of water. For sustenance of any water supply scheme it is essential that
revenue collected should be sufficient to maintain O & M cost and further development activities. Presently
there is major portion of Non Revenue Water (NRW) in the developing countries and there is urgent need to
curb it for efficient functioning of water supply schemes
Parametric Optimization of Eicher 11.10 Chassis Frame for Weight Reduction Us...IOSR Journals
Abstract: The chassis serves as a backbone for supporting the body and different parts of the automobile. It
should be rigid enough to withstand the shock, twist, vibration and other stresses. Along with strength, an
important consideration in chassis design is to have adequate bending stiffness. The main objective of the
research is to obtain the minimum weight of Eicher 11.10 chassis frame. The chassis frame is made of two side
members joined with a series of cross members. The number of cross members, their locations, cross-section
and the sizes of the side and the cross members becomes the design variables. The chassis frame model is to be
developed in Solid works and analyzed using Ansys. Since the no. of parameters and levels are more, the
probable models are too many. So, to select optimum parameters among them large no of modeling and analysis
work is involved which consumes more time. To overcome this problem TAGUCHI method along with FEA is
use. The weight reduction of the side bar is achieved by changing the Parameters using orthogonal array. Then
FEA is performed on those models to get the best solution. This method can save material used, production cost
and time.
Keywords: Parametric optimization, Chassis frame, FE analysis, FEA-DOE hybrid modeling, Weight
reduction
Uniform particle distribution by a newer method in composite metal of Al/SiCIOSR Journals
Abstract: Preparation of composites of metal with ceramic particle reinforced through the casting process
is not uniform because of poor wet ability. The major difficulty is to get a uniform distribution of
reinforcement especially in higher volume fractions. An innovative method of producing cast composites is
tried in present study to overcome this problem we need homogeneity of matrix. The method involves multi
axis rotation of liquid aluminum and silicon carbide particulates packed in a steel pipe inside a rotating drum.
Up to 65 % volume of the metal (aluminum)is incorporated by SIC by this technique. Physical Properties
like hardness, micro hardness, densities and microstructures have been studied. The distribution of
particles as well the mechanical properties are better as compared to that of stir cast composites with similar
volume fraction of silicon carbide reinforcement. The composite with 65-volume percentage of silicon carbide
of particulates showed a Rockwell Hardness value of 67Rb.In few locations the microstructure showed a
non-uniform distribution which can be neglected . There were segregation of silicon carbide particles at a
particular location and the hardness obtained there was much higher. The particle distribution is a result
of the combined influence of random mixing of particles and liquid aluminum and the solidification pattern
obtained.
Key word: Multi axis rotation, microstructure, MMC, Al- SIC matrix
Rehabilitation Process and Persons with Physical DysfunctionsIOSR Journals
Abstract: The main purpose of this study is to determine rehabilitation process and persons with physical
dysfunctions. To achieve the purpose of this study, three hypotheses were formulated. Ex-post facto research
design was adopted for the study. A sample of one hundred persons with disabilities was randomly selected for
the study. The selection was done through the simple random sampling technique. This was to give equal and
independent opportunity to all the respondents to be selected for the study. The questionnaire was the major
instrument used for data collection. The instrument was subjected to both face and content validation by expert
in measurement and evaluation. The reliability estimate of the instrument was established through the test-retest
reliability method Pearson product correlation analysis and independent t-test were employed were adopted to
test the hypotheses at .05 level of significance. The result of the analysis reveals that rehabilitation significantly
relates with persons with orthopedic and neurological impairments. The result also revealed that there is a
significant difference between male and female disabled persons in their perception of rehabilitation of persons
with other health impairments.
Keywords: Rehabilitation process, persons, physical, dysfunctions.
Performance Comparison of AODV, DSR and LAR1 in Mobile Ad-hoc Network based o...IOSR Journals
Abstract: In the last couple of years, the use of wireless networks has become more and more popular. A
MANET is a collection of self-organizing mobile nodes which is infrastructure less, autonomous, and standalone
networks. Each node in a MANET is free to move independently in any direction and will therefore change its
links to other devices frequently. Each must forward traffic unrelated to its own use and therefore be a router.
Simulation result has been obtained by a performance comparison of three routing protocols i.e. Ad hoc Ondemand
Distance Vector (AODV), Dynamic Source Routing (DSR), Location Aided Routing (LAR1) against
Simulation time. The Result is obtained using QualNet simulator version 6.1. Different protocols are evaluated
based on measures such as Average End to End delay (s), Average Jitter(s), and Packet delivery ratio.
Keywords: MANET, AODV, DSR, LAR1, QualNet 6.1
Object Elimination and Reconstruction Using an Effective Inpainting MethodIOSR Journals
Abstract: Three major problems have been found in the existing algorithms of image inpainting:
Reconstruction of large regions, Preference of filling-in and Choice of best exemplars to synthesize the missing
region. The proposed algorithm introduces two ideas that deal with these problems preserving edge continuity
along with decrease in error propagation. The proposed algorithm introduces a modified priority computation
in order to generate better edges in the omitted region and to reduce the transmission of errors in the resultant
image a novel way to find optimal exemplar has been proposed. This proposal optimizes the reconstruction
process and increases the accuracy. The proposed algorithm removes blurness and builds edges efficiently
while reconstructing large target region.
Keywords: Image inpainting, texture synthesis, Image Completion, exemplar-based method
There is increased use of nuclear energy after the Second World War which results in increase in artificial radioactivity on our planet. The objective of this article is to show the estimated amount of artificial radioactivity on earth surface and its effect by comparing it with the radioactive decay which took place in Hiroshima bombing by ‘little boy’. There is estimated amount of 100 trillion curie of radioactivity on earth surface for human use. This man made radioactivity on earth surface has capacity to change temperature of the earth by 0.97oC if heat is evenly distributed and unfortunately Hiroshima bombing did not stop but continued at rate of 39 ‘little boy’ bombing of the earth per second. Every second 39 atomic bombs of ‘little boy’ size are dropped. One can see the large amount of bombing taking place on the earth by artificial radioactivity and the bombing should be stopped and further analysis of artificial radioactivity should also be done
The Real Time Drowisness Detection Using Arm 9IOSR Journals
Abstract: The project is to monitor the driver’s eye movement by using webcam and EOG channel respectively. Embedded project is to design and develop a low cost feature which is based on embedded platform for finding the driver drowsiness. Specifically, our Embedded System includes a webcam placed on the steering column which is capable to capture the eye movements and EOG placed at the forehead of the Driver to find out the visual activity. If the driver is not paying attention on the road ahead and a dangerous situation is detected, the system will warn the driver by giving the warning sounds. Embedded System uses ARM9 32-bit micro controller has a feature of image processing technique as well as Analog to Digital Conversion. Image processing is any form of signal processing for which the input is an image, such as a photograph or video frame; the output of image processing may be either an image or a set of characteristics or parameters related to the image. Keywords:ARM 9, EOGSensor, Webcam, GSM Modem.
SYNTHETICAL ENLARGEMENT OF MFCC BASED TRAINING SETS FOR EMOTION RECOGNITIONcscpconf
Emotional state recognition through speech is being a very interesting research topic nowadays.
Using subliminal information of speech, it is possible to recognize the emotional state of the
person. One of the main problems in the design of automatic emotion recognition systems is the
small number of available patterns. This fact makes the learning process more difficult, due to
the generalization problems that arise under these conditions.
In this work we propose a solution to this problem consisting in enlarging the training set
through the creation the new virtual patterns. In the case of emotional speech, most of the
emotional information is included in speed and pitch variations. So, a change in the average
pitch that does not modify neither the speed nor the pitch variations does not affect the
expressed emotion. Thus, we use this prior information in order to create new patterns applying
a pitch shift modification in the feature extraction process of the classification system. For this
purpose, we propose a frequency scaling modification of the Mel Frequency Cepstral
Coefficients, used to classify the emotion. This proposed process allows us to synthetically
increase the number of available patterns in thetraining set, thus increasing the generalization
capability of the system and reducing the test error.
A Text-Independent Speaker Identification System based on The Zak TransformCSCJournals
A novel text-independent speaker identification system based on the Zak transform is implemented. The data used in this paper are drawn from the ELSDSR database. The efficiency of identification approaches 91.3% using single test file and 100% using two test files. The method shows comparable efficiency results with the well known MFCC method with an advantage of being faster in both modeling and identification.
Designing an Efficient Multimodal Biometric System using Palmprint and Speech...IDES Editor
This paper proposes a multimodal biometric system
using palmprint and speech signal. In this paper, we propose a
novel approaches for both the modalities. We extract the
features using Subband Cepstral Coefficients for speech signal
and Modified Canonical method for palmprint. The individual
feature score are passed to the fusion level. Also we have
proposed a new fusion method called weighted score. This
system is tested on clean and degraded database collected by
the authors for more than 300 subjects. The results show
significant improvement in the recognition rate.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Single Channel Speech De-noising Using Kernel Independent Component Analysis...theijes
The International Journal of Engineering & Science is aimed at providing a platform for researchers, engineers, scientists, or educators to publish their original research results, to exchange new ideas, to disseminate information in innovative designs, engineering experiences and technological skills. It is also the Journal's objective to promote engineering and technology education. All papers submitted to the Journal will be blind peer-reviewed. Only original articles will be published.
Wavelet Based Noise Robust Features for Speaker RecognitionCSCJournals
Extraction and selection of the best parametric representation of acoustic signal is the most important task in designing any speaker recognition system. A wide range of possibilities exists for parametrically representing the speech signal such as Linear Prediction Coding (LPC) ,Mel frequency Cepstrum coefficients (MFCC) and others. MFCC are currently the most popular choice for any speaker recognition system, though one of the shortcomings of MFCC is that the signal is assumed to be stationary within the given time frame and is therefore unable to analyze the non-stationary signal. Therefore it is not suitable for noisy speech signals. To overcome this problem several researchers used different types of AM-FM modulation/demodulation techniques for extracting features from speech signal. In some approaches it is proposed to use the wavelet filterbanks for extracting the features. In this paper a technique for extracting the features by combining the above mentioned approaches is proposed. Features are extracted from the envelope of the signal and then passed through wavelet filterbank. It is found that the proposed method outperforms the existing feature extraction techniques.
Comparative study to realize an automatic speaker recognition system IJECEIAES
In this research, we present an automatic speaker recognition system based on adaptive orthogonal transformations. To obtain the informative features with a minimum dimension from the input signals, we created an adaptive operator, which helped to identify the speaker’s voice in a fast and efficient manner. We test the efficiency and the performance of our method by comparing it with another approach, mel-frequency cepstral coefficients (MFCCs), which is widely used by researchers as their feature extraction method. The experimental results show the importance of creating the adaptive operator, which gives added value to the proposed approach. The performance of the system achieved 96.8% accuracy using Fourier transform as a compression method and 98.1% using Correlation as a compression method.
Intelligent Arabic letters speech recognition system based on mel frequency c...IJECEIAES
Speech recognition is one of the important applications of artificial intelligence (AI). Speech recognition aims to recognize spoken words regardless of who is speaking to them. The process of voice recognition involves extracting meaningful features from spoken words and then classifying these features into their classes. This paper presents a neural network classification system for Arabic letters. The paper will study the effect of changing the multi-layer perceptron (MLP) artificial neural network (ANN) properties to obtain an optimized performance. The proposed system consists of two main stages; first, the recorded spoken letters are transformed from the time domain into the frequency domain using fast Fourier transform (FFT), and features are extracted using mel frequency cepstral coefficients (MFCC). Second, the extracted features are then classified using the MLP ANN with back-propagation (BP) learning algorithm. The obtained results show that the proposed system along with the extracted features can classify Arabic spoken letters using two neural network hidden layers with an accuracy of around 86%.
Reduction of Side Lobes by Using Complementary Codes for Radar ApplicationIOSR Journals
Abstract: The analysis of new types of Complementary direct sequence complex signals which have synthesized
with well – know code sequences like Barker, Walsh, Golay, and complementary codes. Build on the
autocorrelation function (ACF) and ambiguity function (AF) of signals, the numerical method estimates the
volume of side lobes separately for each signal. The results obtained show that the signals, which have a low
volume of side lobes, means of approximately zero in by using complementary codes with compare to different
codes.
Keywords: Ambiguity function, complementary codes, autocorrelation function etc.
Reduction of Side Lobes by Using Complementary Codes for Radar ApplicationIOSR Journals
The analysis of new types of Complementary direct sequence complex signals which have synthesized
with well – know code sequences like Barker, Walsh, Golay, and complementary codes. Build on the
autocorrelation function (ACF) and ambiguity function (AF) of signals, the numerical method estimates the
volume of side lobes separately for each signal. The results obtained show that the signals, which have a low
volume of side lobes, means of approximately zero in by using complementary codes with compare to different
codes
Isolated words recognition using mfcc, lpc and neural networkeSAT Journals
Abstract Automatic speech recognition is an important topic of speech processing. This paper presents the use of an Artificial Neural Network (ANN) for isolated word recognition. The Pre-processing is done and voiced speech is detected based on energy and zero crossing rates (ZCR). The proposed approach used in speech recognition is Mel Frequency Cepstral Coefficients (MFCC) and combine features of both MFCC and Linear Predictive Coding (LPC). The back-propagation is used as a classifier. The recognition accuracy is increased when combine features of both LPC and MFCC are used as compared to only MFCC approach using Neural Network as a classifier.. Keywords: Pre-processing, Mel frequency Cepstral Coefficient (MFCC), Linear Predictive Coding (LPC), Artificial Neural Network (ANN).
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...CSCJournals
voice activity detector (VAD) is used to separate the speech data included parts from silence parts of the signal. In this paper a new VAD algorithm is represented on the basis of singular value decomposition. There are two sections to perform the feature vector extraction. In first section voiced frames are separated from unvoiced and silence frames. In second section unvoiced frames are silence frames. To perform the above sections, first, windowing the noisy signal then Hankel’s matrix is formed for each frame. The basis of statistical feature extraction of purposed system is slope of singular value curve related to each frame by using linear regression. It is shown that the slope of singular values curve per different SNRs in voiced frames is more than the other types and this property can be to achieve the goal the first part can be used. High similarity between feature vector of unvoiced and silence frame caused to approach for separation of the two categories above cannot be used. So in the second part, the frequency characteristics for identification of unvoiced frames from silent frames have been used. Simulation results show that high speed and accuracy are the advantages of the proposed system.
Speaker and Speech Recognition for Secured Smart Home ApplicationsRoger Gomes
The paper published in discusses implementation of a robust text-independent speaker recognition system using MFCC extraction of feature vectors its matching using VQ and optimization using LBG, further a text dependent speech recognition system using the DTW algorithm's implementation is discussed in the context of home automation.
Path Loss Prediction by Robust Regression Methodsijceronline
International Journal of Computational Engineering Research (IJCER) is dedicated to protecting personal information and will make every reasonable effort to handle collected information appropriately. All information collected, as well as related requests, will be handled as carefully and efficiently as possible in accordance with IJCER standards for integrity and objectivity.
Similar to Text Independent Speaker Identification Using Imfcc Integrated With Ica (20)
Water scarcity is the lack of fresh water resources to meet the standard water demand. There are two type of water scarcity. One is physical. The other is economic water scarcity.
Hierarchical Digital Twin of a Naval Power SystemKerry Sado
A hierarchical digital twin of a Naval DC power system has been developed and experimentally verified. Similar to other state-of-the-art digital twins, this technology creates a digital replica of the physical system executed in real-time or faster, which can modify hardware controls. However, its advantage stems from distributing computational efforts by utilizing a hierarchical structure composed of lower-level digital twin blocks and a higher-level system digital twin. Each digital twin block is associated with a physical subsystem of the hardware and communicates with a singular system digital twin, which creates a system-level response. By extracting information from each level of the hierarchy, power system controls of the hardware were reconfigured autonomously. This hierarchical digital twin development offers several advantages over other digital twins, particularly in the field of naval power systems. The hierarchical structure allows for greater computational efficiency and scalability while the ability to autonomously reconfigure hardware controls offers increased flexibility and responsiveness. The hierarchical decomposition and models utilized were well aligned with the physical twin, as indicated by the maximum deviations between the developed digital twin hierarchy and the hardware.
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdffxintegritypublishin
Advancements in technology unveil a myriad of electrical and electronic breakthroughs geared towards efficiently harnessing limited resources to meet human energy demands. The optimization of hybrid solar PV panels and pumped hydro energy supply systems plays a pivotal role in utilizing natural resources effectively. This initiative not only benefits humanity but also fosters environmental sustainability. The study investigated the design optimization of these hybrid systems, focusing on understanding solar radiation patterns, identifying geographical influences on solar radiation, formulating a mathematical model for system optimization, and determining the optimal configuration of PV panels and pumped hydro storage. Through a comparative analysis approach and eight weeks of data collection, the study addressed key research questions related to solar radiation patterns and optimal system design. The findings highlighted regions with heightened solar radiation levels, showcasing substantial potential for power generation and emphasizing the system's efficiency. Optimizing system design significantly boosted power generation, promoted renewable energy utilization, and enhanced energy storage capacity. The study underscored the benefits of optimizing hybrid solar PV panels and pumped hydro energy supply systems for sustainable energy usage. Optimizing the design of solar PV panels and pumped hydro energy supply systems as examined across diverse climatic conditions in a developing country, not only enhances power generation but also improves the integration of renewable energy sources and boosts energy storage capacities, particularly beneficial for less economically prosperous regions. Additionally, the study provides valuable insights for advancing energy research in economically viable areas. Recommendations included conducting site-specific assessments, utilizing advanced modeling tools, implementing regular maintenance protocols, and enhancing communication among system components.
Immunizing Image Classifiers Against Localized Adversary Attacksgerogepatton
This paper addresses the vulnerability of deep learning models, particularly convolutional neural networks
(CNN)s, to adversarial attacks and presents a proactive training technique designed to counter them. We
introduce a novel volumization algorithm, which transforms 2D images into 3D volumetric representations.
When combined with 3D convolution and deep curriculum learning optimization (CLO), itsignificantly improves
the immunity of models against localized universal attacks by up to 40%. We evaluate our proposed approach
using contemporary CNN architectures and the modified Canadian Institute for Advanced Research (CIFAR-10
and CIFAR-100) and ImageNet Large Scale Visual Recognition Challenge (ILSVRC12) datasets, showcasing
accuracy improvements over previous techniques. The results indicate that the combination of the volumetric
input and curriculum learning holds significant promise for mitigating adversarial attacks without necessitating
adversary training.
About
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Technical Specifications
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
Key Features
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface
• Compatible with MAFI CCR system
• Copatiable with IDM8000 CCR
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
Application
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Text Independent Speaker Identification Using Imfcc Integrated With Ica
1. IOSR Journal of Electronics and Communication Engineering (IOSR-JECE)
e-ISSN: 2278-2834,p- ISSN: 2278-8735. Volume 7, Issue 5 (Sep. - Oct. 2013), PP 22-27
www.iosrjournals.org
www.iosrjournals.org 22 | Page
Text Independent Speaker Identification Using Imfcc Integrated
With Ica
P.Suryakumari1
, N M Ramaligeswararao2
, K.Jyothi3
, Dr.V.Sailaja4
1,2,3,4,
(ECE, Godavari Institute of Engineering & Technology/ Jntuk, India)
Abstract: Over the years, more research work has been reported in literature regarding text independent
speaker identification using MFC coefficients. MFCC is one of the best methods modeled on human auditory
system. Murali et al (2011) [1] has developed a Text independent speaker identification using MFC coefficients
which follows Generalized Gaussian mixer model. MFCC, because of its filter bank structure it captures the
characteristics of information more effectively in lower frequency region than higher region, because of this,
valuable information in high frequency region may be lost. In this paper we rectify the above problem by
retrieving the information in high frequency region by inverting the Mel bank structure. The dimensionality and
dependency of above features were reduced by integrating with ICA. Here Text Independent Speaker
Identification system is developed by using Generalized Gaussian Mixer Model .By the experimentation, it was
observed that this model outperforms the earlier existing models.
Keywords: Independent Component Analysis; Generalized Gaussian Mixer Model; Inverted Mel frequency
cepstral coefficients; Bayesian classifier; EM algorithm.
I. Introduction:
From the past knowledge it is evident that physiological structure of a vocal tract is different for
different persons, each one having different features. Based on these features, we can differentiate one persons
voice from others(Ben gold (2002)[2]). The Mel frequency cepstral coefficients features was first proposed for
speech recognition by S.B.Davis and P.Mermelstein . Over the decades though it was most widely used, there
was a drawback in its filter bank structure, which is able to capture the information in low frequency regions
than higher frequency regions.As the information contained in the higher frequency regions may be missed,
proportionally the average efficiency of identification of speaker may be reduced. To overcome the this problem
Mr.S.Chakraborty (2007) [16] proposed a flipped MFCC filter bank using Inverted Mel scale by considering the
High frequency region coefficients.
Here the main objective is to capture the information which has been missed by the MFCC by using a
new filter bank structure when it is obtained by flipping or inverting the MFCC filters bank structure
(s.chakroborty(2007))[16]. In this feature extraction only few inverted Mel cepstral coefficients are considered
and the remaining coefficients are dropped as insignificant due to high dimensionality problems. Ignoring the
dependences and dropping some of the Mel cepstral coefficients may lead to falsification of model. It is needed
to reduce the dimensionality and avoid dependence among others to having a robust model for speakers. This
can be achieved by integrating the Inverted Mel frequency cepstral coefficients of each speaker’s speech spectra
with Independent component analysis (ICA).The integration of ICA with Inverted Mel frequency cepstral
coefficients will make the feature vector more robust.
The main aim of the ICA is to find out linear representation of non-Gaussian data so that the
components are statistically independent. The required assumption in applying ICA is that the observed signals
are linearly mixed. It helps in capturing some of the essential features of data in many applications
including Automatic Speaker
Recognition Systems for high dimensional multivariate analysis (Hyvarinen(2001))[3]. ICA is a
powerful tool to separate mixed signals mainly find in speech analysis results in computational and conceptual
simplicity.
Hence in this paper a text independent speaker identification method is developed and analyzed by
integrating independent component analysis with Inverted Mel frequency cepstarl coefficient and using
Generalized Gaussian Mixer model. The procedure for extracting the feature vectors of the speaker speech
spectra using inverted Mel frequency cepstral coefficients and independent component analysis is covered in
Section II. Assuming that each individual speech spectra in feature vector follows a Generalized Gaussian Mixer
Model and the estimation of the model parameters is obtained by using EM algorithm. The speaker
identification algorithm under Bayesian frame work is presented in the section IV.
2. Text Independent Speaker Identification Using Imfcc Integrated With Ica
www.iosrjournals.org 23 | Page
II. Feature Vectors Extraction
In this section we describe the feature vector extraction of each individual speech spectra. Since a long
time MFCC is considered as a reliable front end for speaker identification because it has coefficients that related
to psychophysical studies human perception of the frequency content of sounds in a nonlinear scale called Mel
scale (Ding(2001))[4]. This can be mathematically expressed as
fmel=2595 log[1+(f/700)] (1)
Where fmel is the subjective pitch in Mel’s corresponding to f in Hz.
But Mel cepstral coefficients are obtained by capturing lower frequency regions and neglecting higher regions.
This leads to loss of more information from the recorded data base. By considering high frequency regions by
using flipping or inverting Me3l bank structure (S.chakroborty (2007)) [5], we retrieve more information.
The flip of the MFCC Mel bank is expressed as
𝜳𝒊(k) = ΨQ+1-i (Ms/2 +1-k) (2)
𝛹𝑖(k) is the inverted Mel scale filter response while Ψi(k) is MFCC filter bank response and Q is the number of
filters in the bank. Here inverted Mel-scale is defined as follows
𝒇mel (f)=2195.2860-2595 log10(1+(4031.25-f/700)) (3)
𝑓mel (f) is subjective pitch in the new scale corresponding to f, actual frequency in Hz.
The block diagram of Inverted Mel frequency cepstral coefficients is shown in below figure 1.From the
block diagram of IMFCC the frequency bands are positioned logarithmically (on Inverted Mel scale) by inverted
Mel filter bank structure, which approximated the human auditory systems response (Ruche chaudhary (2012)
[6]). In this IMFCC captures high frequency bands of the Inverted Mel scale, getting more efficiency than
MFCC. These vector coefficients follow Independent component analysis. ICA is linear and not necessarily
orthogonal. It extracts independent components even with smaller magnitudes. Analysis of Text independent
speaker identification model is a challenging task because this system comprises highly complex model with
huge number of feature vectors. Under this circumstance dimension reduction of data plays a major role for
obtaining better identification results. This can be achieved by ICA algorithm.
Fig 1
ICA aims at extracting a set of statistically independent vectors from the matrix of training data the Inverted
Mel cepstral feature vectors derived from the original voice sample. It leads to find directions of minimal mutual
information by capturing certain correlations among the frequencies present in the spectral based representation
of the speech signal. All these aims are achieved by ICA in the form of linear combination of basic filter
functions specific to each person.
The signal X is used as proper Mel-cepstral based representation of original signal and the data can be observed
as a set of multivariate time series resulting from a linear mixing process A of independent functions S (
Hyvarinen, (2001))[7][8].
Linear combination of such sources or functions can be summarized as (Cardoso, (1996)) [9]
Ax=S (4)
There is one problem in ICA to determine both the excitation signal S and the scalars A and the only one known
component is the matrix of IMFCC coefficients of the input speech signal. S can be computed as follows
(hyvarinen, (1997)) [10]
S=Ax-1 (5)
A can be computed by considering X as a vector of observations, where each observation is expressed as a
linear combination of independent component s. In order to estimate one of the independent components. A
linear combination of xi is chosen such that (Hyvarinen, (1997)) [10], (Michael, 2002[11])
3. Text Independent Speaker Identification Using Imfcc Integrated With Ica
www.iosrjournals.org 24 | Page
Y=WT
X (6)
With respect to these conditions stated in equation (4) and equation (5), the linear combination represented in
equation (6) is a true combination of independent components if w was one of the columns of the inverse of A.
After pre processing and whitening, the final equation is given as (Michael,(1999))[12]
S = Y=WT
X = 𝑾 px (7)
Fast ICA algorithm is used to estimate Wi which constitutes the rows of W. since the components are considered
to be statistically independent, the variance between them is high. The following steps are used to estimate W
1. Choose an initial random guess for W
2. Iterate: W<= E{xg(WT
X)}- E{g’(WT
X)}W
3. Normalize: W<=
𝑊
||𝑊||
4. If not converged , go back to step 2
Trained Speaker
Model
Fig 2
Once W is estimated the final step is to project the signal in to the space created by ICA
New dataset=Wica* mean adjusted original data
Where , Wica is the transformation matrix obtained from Fast ICA algorithm.
The extraction of Feature vectors for speaker identification is done in two steps:
1. Compute IMFCC’s and
2. Apply ICA to transform them to get new feature vectors
The computation steps for extracting the new feature vectors as follows
Step1:
i. Take the fourier transform of a signal
ii. Map the powers of the spectrum obtained above on to the inverted mel scale(IMFCC), using hamming
window
iii. Take the logs of the powers at each of the Inverted Mel frequencies
iv. Take the discrete cosine transform of the list of the inverted Mel log powers, as it were a signal
4. Text Independent Speaker Identification Using Imfcc Integrated With Ica
www.iosrjournals.org 25 | Page
v. The IMFCCs are the amplitudes of the resulting spectrum
Step2:
vi. Apply ICA transform to Mel frequency cepstral coefficients to get the new feature vectors of each speaker
speech spectra
III. Speaker Identification Model With Generalized Gaussian Distribution:
In this section we briefly describe the speaker identification process. Figure 1 depicts the Text
Independent Speaker Identification model. In this dotted lines represents training phase and bold line represents
testing phase.
The Literature shows that probabilistic models like GGMM for have yielded better performance
results for training both text-dependent and text-independent speaker recognition applications. Due to
the probabilistic property of a GGMM, it can also be applied to speaker identification applications in the
presence of different noises increasing the channel robustness and therefore more suited to this research.
Here it is assumed that the feature vectors (after preprocessing the IMFCC with ICA) follow a
multivariate Generalized Gaussian Mixer model. It is reasonable to assume the acoustic space corresponding to
a speaker voice can be characterized by acoustic classes representing the some broad phonetic events such as
vowels, nasals. These acoustic classes reflect some general speaker dependent vocal tract configurations that are
useful for charactering speaker identity. The shape of each spectra in turn is represented by the mean of its
component density and variation of the average spectral shape represented by co-variance matrix. Hence the
entire speech spectra of the each individual speaker can be characterized as an M component Finite Multivariate
Generalized Gaussian Mixer model.
The Probability density function of the each individual speaker speech spectra is represented as follows:
𝒑(𝒙 𝒕|λ) = 𝜶𝒊 𝒃𝒊
𝑴
𝒊=𝟏 (𝒙 𝒕|λ) (8)
Where, 𝑥𝑡 = (𝑥𝑡𝑖𝑗 ) j=1,2,….D; i=1,2,….M; t=1,2….T is a D dimensional random vector representing
IMFCC vector.
λ is the parametric set such λ= (µ, 𝝆, ∑)
𝛼𝑖 is the component weight such that 𝛂𝐢 = 𝟏𝐌
𝐢=𝟏
𝑏𝑖(𝑥𝑡 | λ) is the probability density function of ith
acoustic class represented by the new vectors of the speech data
the D-dimensional Generalized Gaussian (GG) distribution (M.bicego et al (2008)(13)). It is in the form of the
following
𝒃𝒊(𝒙 𝒕| (µ, 𝝆, ∑)) =
[𝐝𝐞𝐭( )]−𝟏 𝟐
[𝒛 𝝆 𝑨 𝝆,𝝈 ] 𝑫exp −
−𝟏 𝟐(𝒙 𝒕−𝝁 𝒊
𝑨(𝝆,𝝈) 𝝆
(9)
Where z (𝝆) =
𝟐
𝝆
𝚪
𝟏
𝝆
and 𝑨 𝝆, 𝝈 =
𝚪 𝟏 𝝆
𝚪 𝟑 𝝆
The parameter 𝜇𝑖 is the mean vector and the summation of 𝐴(𝜌) is a scaling parameter which allows a variance
var (𝜎2
) and 𝜌 is the shape of the parameter when ρ=1, the Generalized Gaussian corresponds to laplacian or
double exponential Distribution. When ρ=2, the Generalized Gaussian corresponds to a Gaussian distribution. In
limiting case ρ→ + ∞. The model can have one covariance matrix per a Generalized Gaussian density of the
acoustic class of each speaker. The diagonal convergence matrix is used for speaker model, based on the
previous studies. This results diagonal covariance matrix for the feature vector, the features are independent and
the probability density function of the feature vector is
𝒃𝒊(𝒙 𝒕|λ) =
𝐞𝐱𝐩−
𝒙 𝒊𝒋−𝝁 𝒊𝒋
𝑨(𝝆 𝒊𝒋−𝝈 𝒊𝒋
𝝆 𝒊𝒋
𝟐
𝝆 𝒊𝒋
𝚪 𝟏+
𝟏
𝝆 𝒊𝒋
𝑨(𝝆 𝒊𝒋,𝝈 𝒊𝒋)
𝑫
𝒋=𝟏 = 𝒇𝒊𝒋
𝑫
𝒋=𝟏 (𝒙 𝒕𝒊𝒋) (10)
To find the estimate of model parameters αi μij and ij for i= 1,2,3 …,M, j=1,2,…,D, we maximize the expected
value b y using log likelihood function.the parameters are given by Armando.j el at (2003)[14] for each speech
spectra
The following are the updated equations of the parameters of EM algorithm as given by sailaja et al (2010)[15]
𝜶𝒊
(𝒍+𝟏)
=
𝟏
𝑻
𝜶𝒊
𝒍 𝒃 𝒊(𝒙 𝒕,𝝀 𝒍 )
𝜶𝒊
𝒍 𝒃 𝒊(𝒙 𝒕,𝝀 𝒍 )𝑴
𝒊=𝟏
𝑻
𝒕=𝟏 (11)
Where 𝜆 𝑙
= (𝜇𝑖𝑗
(𝑙)
,𝛼𝑖𝑗
(𝑙)
) are the estimates obtained.
5. Text Independent Speaker Identification Using Imfcc Integrated With Ica
www.iosrjournals.org 26 | Page
The updated equation for estimating 𝜇𝑖𝑗 is
𝝁𝒊𝒋
𝒍+𝟏
=
𝒕 𝟏 𝒙 𝒕 𝝀 𝒍 𝑨 𝑵,𝝆 𝟏
(𝒙 𝒕𝒊𝒋−𝝁 𝒕𝒊𝒋)
𝑻
𝒕=𝟏
𝒕 𝟏
𝑻
𝒕=𝟏 𝒙 𝒕,𝝀 𝒍 𝑨 𝑵,𝝆 𝒊𝒋
(12)
where 𝐴(𝑁, 𝜌𝑖𝑗 ) is a function which must be equal to unity for 𝜌𝑖 = 2and must be equal to
1
𝜌 𝑖𝑗 −1
for 𝜌𝑖 ≠ 1 in the
case of N=2, we have also observed that 𝐴(𝑁, 𝜌𝑖𝑗 ) must be an increasing function of𝜌𝑖𝑗 .
The updated equation for estimating 𝛼𝑖𝑗 is
𝝈𝒊𝒋
(𝒍+𝟏)
=
𝒕 𝟏(𝒙 𝒕 𝝀(𝒍))
𝚪
𝟑
𝝆 𝒊𝒋
𝝆 𝒕 𝚪
𝟏
𝝆 𝒕
𝒙 𝒕𝒊𝒋−𝝁 𝒊𝒋
(𝒍)
𝟏
𝝆 𝒊𝒋𝑵
𝒊=𝟏
𝒕 𝟏(𝑻
𝒕=𝟏 𝒙 𝒕 𝝀(𝒍))
𝟏
𝝆 𝒊𝒋
(13)
`
IV. Speaker Identification Using Bayes’ Decision Rule:
From each test speaker, we obtained the extracted feature vectors were applied to the function “ICA
TRANSFORM” and were estimated into the space of ICA created by the associated speaker with unique
speaker ID. The new feature vectors of the test utterances and the trained models were fed to a Byes classifier
for identification applications which employ large group of data sets and the corresponding test speaker was
identified. (Domingo’s, (1997))[16][17].
𝑝 𝑖
𝑥𝑡
, 𝜆 is the posteriori probability for an acoustic class i and is defined by the following equation
(Reynolds,(1995))[6].
𝐩 𝐢
𝐱 𝐭
, 𝛌 =
𝐩𝐢 𝐛𝐢 𝐱 𝐭
^−
𝐩 𝐤 𝐛 𝐤 𝐱 𝐭
^−𝐌
𝐤=𝟏
(14)
𝒔 = 𝐦𝐚𝐱 𝟏<𝒌<𝒔 𝒑𝒍 𝝀 𝒌 𝑿 = 𝐚𝐫𝐠 𝐦𝐚𝐱 𝟏<𝒌<𝒔[𝒑𝒍 𝝀 𝒌 𝑿 𝒑 𝒓(𝝀 𝒌)] (15)
Finally we computes S using the logarithms and independence between the observations.
V. Expermental Results:
The developed model, IMFCC with integration of ICA with GGMM performance is evaluated. An
experiment was conducted on 30 speakers, for each speaker 10 utterances with each of duration 6sec are
recorded by high quality microphone. Out of this voice corpus, first, fifth and seventh sessions are used for
testing and remaining sessions are used for training. By using front end process explained in section II, IMFCC
feature vectors were obtained which were again refined by using ICA. The global model for each speaker
density is estimated by using the derived parameters. With the test data set, the efficiency of the developed
model is studied by identifying the speaker with the speaker identification algorithm given in section IV. The
average percentage of identification was computed and the results are tabulated.
From the table 1, it is observed that the average percentage of correct identification of speaker for the
developed model is 98.3±10. The percentage correctness for Generalized Gaussian Mixer Model with
Integrated ICA using MFCC as a feature extraction is 97.8 ±12. This clearly shows that the speaker
Identification model with IMFCC in a front end processing is having higher average percentage of correct
identification than the other models.
Table1.Average percentage of correct Identification using MFCC and IMFCC feature vectors
Model MFCC IMFCC
Embedded ICA with GGMM 97.8±12 98.3±10
V. Conclusion:
In this paper we proposed and developed a Text Independent Speaker Identification with IMFC
coefficients as a Feature vectors. Experimental Results shows that this method gives tremendous Improvement
and it can detect the correct speaker among 30 speakers with 10 utterances. Therefore this
developed model will be applicable to real time applications.
6. Text Independent Speaker Identification Using Imfcc Integrated With Ica
www.iosrjournals.org 27 | Page
References:
[1]. N M ramaligeswararao, Dr.V.Sailaja & Dr.k.Srinivasa Rao(2011) “Text Independent Speaker Identification using Integrated
Independent Component Analysis with Generalized Gaussian Mixture Model”International Journal of Advanced Computer Science
and Applications vol.2, No12, 2011pp.85-91
[2]. Ben Gold and Nelson Morgan (2002), “speech and audio processing “part 4 chapter 14 pp.189-203 John wills and sons
[3]. Hyvarinen,(1999), “Fast and Robust Fixed Point Algorithm for ICA” proceedings of the IEEE transactions on neural
networks,Volume.10,no.3,pp.626-634.
[4]. Heidelberg (2003), “ICANN/ICONIP’03 of the 2003 joint international conference on Artificial neural networks/neural information
processing”.
[5]. S.chakroborty. a. roy and g. saha , “ improved closed ser text independent speaker identification by combining MFCC with evidence
from flipped filter banks ” International journal of signal processing vol.4 , no.2 ,pp.114-121,2007.ISSN 1304-4478
[6]. Ruche chaudhary, National technical research organization ,”performance study of Text Independent speaker Identification system
using MFCC and IMFCC for telephone and microphone speeches” International Journal of computing and Business
Research(IJCBR),vol. 3 issue 2 My 2012
[7]. Hyvarinen, (2001), “Independent component analysis, john Wiley and sons.
[8]. Hyvarinen,(1999), “Fast and Robust Fixed Point Algorithm for ICA” proceedings of the IEEE transactions on neural
networks,Volume.10,no.3,pp.626-634.
[9]. Cardoso,(1996) “Equivarant adaptive source separation IEEE Transaction on signal processing”vol.44.no.12.pp-3017-3030
[10]. Hyvarinen.(1997) “A family of fixed point algorithms for independent component analysis” proceedings of the IEEE international
conference on acoustics speech and signal processing (ICASSP),VOLUME5,pp.3917-3920.
[11]. Heidelberg (2003), “ICANN/ICONIP’03 of the 2003 joint international conference on Artificial neural networks/neural information
processing”.
[12]. Ning Wang , P. C. Ching(2011), “Nengheng Zheng, and Tan Lee, “Robust Speaker Recognition Using Denoised Vocal Source and
Vocal Tract Features speaker verification,” IEEE Transaction on Audio Speech and Language processing, Vol. 1, No. 2, pp. 25-35.
[13]. Md M. Bicego, D Gonzalez, E Grosso and Alba Castro(2008) “Generalized Gaussian distribution for sequential Data
Classification” IEE Heidelberg (2003), “ICANN/ICONIP’03 of the 2003 joint international conference on Artificial neural
networks/neural information processing”.
[14]. Armando. J et al (2003), “A practical procedure to estimate the shape Parameters in the Generalized Gaussian distribution
[15]. V Sailaja, K Srinivasa Rao & K V V S Reddy(2010), “Text Independent Speaker Identification Model with Finite Multivariate
Generalized Gaussian Mixture Model and Hierarchical Clustering Algorithm ” International Journal of Computer applications Vol.
11, No. 11, , pp. 25-31.
[16]. Domingo(1997), “Speaker Recognition”, A Tutorial", Proceedings of the IEEE, Vol. 85, no. 9
[17]. Domingo’s, (1997), “On the optimality of the simple Bayesian classifier under zero-one loss,” Machine Learning, vol. 29, pp.
103-130, 1997