Speaker identification system with voice controlled functionality

946 views

Published on

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
946
On SlideShare
0
From Embeds
0
Number of Embeds
4
Actions
Shares
0
Downloads
61
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Speaker identification system with voice controlled functionality

  1. 1. SPEAKER IDENTIFICATION SYSTEM WITH VOICE – CONTROLLED FUNCTIONALITY
  2. 2. IntroductionObjective  To develop a speaker identificationsystem and control the system using a person’svoice.Platform  MatlabImplementation of Artificial Neural Networks(ANN) for pattern classificationFeature extraction  MFCC 2
  3. 3. Experimental Setup Sound Recorder Feature ExtractionSpeech Wav File MFCC Artificial Neural Network Subsystem Test Train 3
  4. 4. Signal ProcessingBuilt - in MATLAB function‘wavrecord.m’ The recorded samples serve as input tothe next stage, which is the Mel –Frequency Cepstral Analysis. 4
  5. 5. Feature ExtractionMel – Frequency Cepstral Coefficients (MFCC) MFCCs are based on the known variation of thehuman ear’s critical bandwidths with frequencyLinear at low frequencies and logarithmic at highfrequencies 5
  6. 6. MFCC Block DiagramSpeech Frame Frame Windowing FFT Blocking Mel Cepstrum Mel Mel–Freq. Spectrum Cepstrum Wrapping Spectrum 6
  7. 7. Steps of MFCC1. Frame Blocking2. Windowing3. Fast Fourier Transform (FFT )4. Mel–Frequency Wrapping5. Cepstrum Auditory Toolbox - mfcc.mceps=mfcc(input, sampling rate, [frame rate]) 7
  8. 8. Artificial Neural Networks (ANN)General models of how human brain processesinformation.Layered architecture  Consists of nodescorresponding to neurons and of weightscorresponding to connections between neurons“Learning” rule  Weights are adjusted on thebasis of a series of training patterns 8
  9. 9. Probabilistic Neural Network (PNN) Feed – forward neural network Provides a general technique to solve pattern classification problems Develops distribution function to estimate the likelihood of an input pattern being within several given categories. Created in MATLAB using ‘newpnn’ net = newpnn(p,t) 9
  10. 10. Schematic Diagram 10
  11. 11. ConclusionImplementation difficult due to variabilityin speech signalPossible improvement using noisecancellation techniques  Weiner Filter,Adaptive Filters 11
  12. 12. ReferencesL.Rabiner, B. H. Juang – Fundamentals of SpeechRecognitionC. P. Lim, S.C. Woo – Speech Recognition usingNeural Networks. IEEE Trans. on Acoustics,Speech and Signal Processing - 2000.Khalid Saeed and Mohammed Kheir Nammous –A Speech and Speaker Identification System.IEEE Trans. on Industrial Electronics - 2007. 12
  13. 13. 13

×