VOICE RECOGNITION SYSTEM

Shailendra Singh Tiwari
Computer Science Engineering
SGSITS Engineering College, Indore (M.P.)
singhtiwari.shailendra@gmail.com
AB-24108
R.N.: O801CS123D13

Abstract

Automatic speech recognition (ASR) has made great strides with the development of digital signal processing hardware and software. Despite these advances, machines cannot match the performance of their human counterparts in accuracy and speed, especially in speaker-independent speech recognition. A significant portion of current speech recognition research is therefore focused on the speaker-independent problem, both because of its wide range of applications and because of the limitations of the available techniques. This report briefly discusses the signal modeling approach to speech recognition, followed by an overview of the basic operations involved in signal modeling. Commonly used temporal and spectral analysis techniques of feature extraction are then discussed in detail.

Introduction

A speech recognition system performs two fundamental operations: signal modeling and pattern matching. Signal modeling is the process of converting the speech signal into a set of parameters. Pattern matching is the task of finding the parameter set in memory that most closely matches the parameter set obtained from the input speech signal.

Signal Modeling

Signal modeling has three objectives: to obtain perceptually meaningful parameters, i.e. parameters analogous to those used by the human auditory system; to obtain invariant parameters, i.e. parameters that are robust to variations in channel, speaker and transducer; and to obtain parameters that capture spectral dynamics, or changes of the spectrum with time. Signal modeling involves the basic operations described below.

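The pattern-matching operation described in the Introduction can be illustrated with a minimal sketch. It assumes that each stored parameter set is a fixed-length list of numbers and that "closely matches" means smallest Euclidean distance; the report does not specify a distance measure, and the names `match_template` and `templates`, as well as the numeric values, are hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length parameter vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def match_template(templates, observed):
    """Return the label of the stored parameter set closest to `observed`.

    `templates` maps a label to a stored parameter vector (assumed layout).
    """
    return min(templates, key=lambda label: euclidean(templates[label], observed))


# Hypothetical stored parameter sets for two words.
templates = {
    "yes": [1.0, 0.2, 0.1],
    "no": [0.1, 0.9, 0.4],
}

print(match_template(templates, [0.9, 0.3, 0.0]))  # prints "yes"
```

A practical recognizer compares sequences of parameter vectors rather than single vectors, so this sketch only shows the nearest-match idea.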

Spectral shaping

Spectral shaping is the process of converting the speech signal from a sound pressure wave to a digital signal, and of emphasizing important frequency components in the signal.

Feature extraction

Feature extraction is the process of obtaining different features, such as power, pitch and vocal tract configuration, from the speech signal.

Parametric transformation

Parameter transformation is the process of converting these features into signal parameters through differentiation and concatenation. Statistical modeling then converts the parameters into signal observation vectors.

Feature Extraction

In speaker-independent speech recognition, a premium is placed on extracting features that are somewhat invariant to changes in the speaker, so feature extraction involves analysis of the speech signal. Broadly, feature extraction techniques are classified as temporal analysis and spectral analysis. In temporal analysis the speech waveform itself is used for analysis; in spectral analysis a spectral representation of the speech signal is used.

Conclusions

The basic operations in a speech recognition system have been discussed briefly. Different temporal and spectral analysis techniques for feature extraction have been studied in detail, and the following conclusions are drawn.

Temporal analysis techniques involve less computation and are easy to implement, but they are limited to determining simple speech parameters such as power, energy and periodicity. Finding vocal tract parameters requires spectral analysis techniques.

A critical band filter bank decomposes the speech signal into a discrete set of spectral samples containing information similar to that presented to the higher levels of processing in the auditory system.

Cepstral analysis separates the speech signal into a component representing the excitation source and a component representing the vocal tract impulse response, so it provides information about pitch and vocal tract configuration, but it is computationally more intensive. Mel cepstral analysis has the decorrelating property of cepstral analysis and also includes some aspects of audition.

LPC analysis provides a compact representation of the vocal tract configuration with relatively simple computation compared to cepstral analysis. To minimize analysis complexity it assumes an all-pole model of the speech production system. Speech has zeros due to nasals, so for such sounds the results are not as good as for vowels, but they remain reasonably acceptable if the order of the model is sufficiently high.
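The temporal analysis parameters named in the conclusions (energy and periodicity) can be computed directly from the waveform. The following is a minimal sketch, not the report's own implementation: it assumes a single frame of samples held in a Python list, estimates periodicity from the peak of the autocorrelation, and uses an 8 kHz sample rate and a 50 to 400 Hz pitch search range chosen only for illustration. All function names are hypothetical.

```python
import math


def short_time_energy(frame):
    """Energy of one frame: the sum of squared samples."""
    return sum(s * s for s in frame)


def autocorrelation(frame, lag):
    """Autocorrelation of the frame at a given lag (in samples)."""
    return sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))


def estimate_pitch(frame, sample_rate, fmin=50.0, fmax=400.0):
    """Estimate the fundamental frequency in Hz from the autocorrelation peak.

    The search range fmin..fmax is an assumption made for this sketch.
    """
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    best_lag = max(range(min_lag, max_lag + 1),
                   key=lambda lag: autocorrelation(frame, lag))
    return sample_rate / best_lag


# Synthetic test frame: 50 ms of a 200 Hz sine sampled at 8 kHz.
sample_rate = 8000
frame = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(400)]

print(short_time_energy(frame))            # close to 200
print(estimate_pitch(frame, sample_rate))  # 200.0
```

Both quantities need only multiplications and additions over the frame, which matches the observation that temporal analysis is computationally cheap. They carry no information about the vocal tract configuration, for which the spectral techniques discussed above are required.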
Acknowledgement

I wish to express my sincere gratitude to Prof. Puja Gupta for her constant guidance throughout the course of the computer workshop, and for many useful discussions which enabled me to understand the subtleties of the subject.

References

L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals. Englewood Cliffs, New Jersey: Prentice-Hall, 1978.
D. O'Shaughnessy, Speech Communication: Human and Machine. India: University Press, 2001.
