SHREEJEE INSTITUTE OF TECHNOLOGY AND MANAGEMENT
Speaker Recognition
• Guided by: Mr. Prakash Singh Panwar
• By: Rajpal Singh Chouhan
• EC Branch, 1st Year
What is Speaker Recognition?
Speaker recognition is the process of automatically recognizing who is speaking on the basis of speaker-specific information contained in the speech signal.
Speaker Recognition = Speaker Identification + Speaker Verification
Speaker Identification
• Whose voice is this?
Speaker Verification
• Synonyms: authentication, detection.
• User claims an identity.
• System task: Accept or reject identity claim.
• Example: "Is this Ahmad's voice?"
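The accept/reject task above can be sketched as a distance test against the claimed speaker's model. This is a minimal illustration, not the deck's implementation: the function names, the three-element vectors and the threshold of 0.5 are all assumptions made for the example.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def verify(test_vector, claimed_model, threshold):
    """Accept the identity claim if the test vector lies within
    `threshold` of the claimed speaker's model (hypothetical rule)."""
    return euclidean(test_vector, claimed_model) <= threshold

# Hypothetical enrolled model for the claimed speaker.
ahmad_model = [1.0, 2.0, 3.0]

print(verify([1.1, 2.0, 2.9], ahmad_model, threshold=0.5))  # close to the model
print(verify([4.0, 0.0, 1.0], ahmad_model, threshold=0.5))  # far from the model
```

In practice the threshold trades false accepts against false rejects, so it would be tuned on held-out recordings rather than fixed by hand.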
Model of Speaker Recognizer
• Fig. 1: Simple model of a speaker recognizer.
[Figure labels: "Hello, Mr. John"; "U Permitted to Access"]
The Structure of Speaker Recognizer
• Figure 2: Functional scheme of an ASR system.
[Figure blocks: Feature Extraction, Feature Vector, Training Mode, Recognition, Speaker Modeling, Classification, Decision Logic, Speaker #ID, Speaker_1]
Speech Signal Analysis
Feature Extraction
• The aim is to extract the voice features to distinguish different phonemes of a language.
[Figure: grid of numeric feature values]
MFCC extraction
• Processing chain: speech signal x(n) → Pre-emphasis → x'(n) → Window → x_t(n) → DFT → X_t(k) → Mel filter banks → Y_t(m) → Log(| |^2) → IDFT → y_t^(m)(k) → MFCC
• MFCC stands for Mel-frequency cepstral coefficients, a representation of the short-term power spectrum of a sound used in audio processing.
• The MFCCs are the amplitudes of the spectrum produced by the final IDFT step.
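The chain above can be sketched for a single frame using only the Python standard library. Several details are assumptions that the slides do not specify: the pre-emphasis coefficient 0.97, the 2595·log10(1 + f/700) mel formula, 20 triangular filters, 12 output coefficients, taking the power spectrum before the mel filters, and using a DCT-II as the final inverse transform (a common choice). Pre-emphasis is applied per frame here for brevity.

```python
import cmath
import math

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def pre_emphasis(x, alpha=0.97):
    """x'(n) = x(n) - alpha * x(n-1); the first sample is kept as is."""
    return [x[0]] + [x[n] - alpha * x[n - 1] for n in range(1, len(x))]

def hamming(frame):
    """Multiply the frame by a Hamming window."""
    size = len(frame)
    return [s * (0.54 - 0.46 * math.cos(2 * math.pi * n / (size - 1)))
            for n, s in enumerate(frame)]

def power_spectrum(frame):
    """|X_t(k)|^2 for k = 0 .. N/2, using a direct (slow) DFT."""
    size = len(frame)
    return [abs(sum(frame[n] * cmath.exp(-2j * math.pi * k * n / size)
                    for n in range(size))) ** 2
            for k in range(size // 2 + 1)]

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters with centres evenly spaced on the mel scale."""
    top = hz_to_mel(sample_rate / 2)
    mels = [i * top / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int((n_fft + 1) * mel_to_hz(m) / sample_rate) for m in mels]
    bank = []
    for m in range(1, n_filters + 1):
        left, center, right = bins[m - 1], bins[m], bins[m + 1]
        filt = [0.0] * (n_fft // 2 + 1)
        for k in range(left, center):      # rising edge
            filt[k] = (k - left) / (center - left)
        for k in range(center, right):     # falling edge
            filt[k] = (right - k) / (right - center)
        bank.append(filt)
    return bank

def dct_ii(values, n_coeffs):
    """DCT-II, standing in for the IDFT block of the diagram."""
    size = len(values)
    return [sum(v * math.cos(math.pi * k * (n + 0.5) / size)
                for n, v in enumerate(values))
            for k in range(n_coeffs)]

def mfcc_frame(frame, sample_rate=8000, n_filters=20, n_coeffs=12):
    spec = power_spectrum(hamming(pre_emphasis(frame)))
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    energies = [sum(w * p for w, p in zip(filt, spec)) for filt in bank]
    log_energies = [math.log(e + 1e-10) for e in energies]  # avoid log(0)
    return dct_ii(log_energies, n_coeffs)

# Hypothetical test frame: 256 samples of a 440 Hz tone at 8 kHz.
frame = [math.sin(2 * math.pi * 440 * n / 8000) for n in range(256)]
coeffs = mfcc_frame(frame)
print(len(coeffs))
```

A real system would replace the direct DFT with an FFT and process every frame of the utterance; the structure of the chain stays the same.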
• Figure panels: speech waveform of the phoneme "ae"; after pre-emphasis and Hamming windowing; power spectrum; MFCC.
Speech Signal to Feature Vector
[Figure: speech signal mapped to a grid of numeric feature values]
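One way to picture the step from signal to feature vectors is to cut the signal into overlapping frames and compute one vector per frame. The sketch below is illustrative only: the frame length, hop size and the two toy features (energy and zero-crossing count) are assumptions standing in for the MFCCs used in the deck.

```python
def split_into_frames(signal, frame_len, hop):
    """Return overlapping frames; a trailing partial frame is dropped."""
    return [signal[start:start + frame_len]
            for start in range(0, len(signal) - frame_len + 1, hop)]

def toy_features(frame):
    """Hypothetical stand-in for MFCCs: [energy, zero-crossing count]."""
    energy = sum(s * s for s in frame)
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return [energy, crossings]

# Hypothetical 10-sample signal, framed with 50% overlap.
signal = [0, 1, 0, -1, 0, 1, 0, -1, 0, 1]
frames = split_into_frames(signal, frame_len=4, hop=2)
vectors = [toy_features(f) for f in frames]

for f, v in zip(frames, vectors):
    print(f, "->", v)
```

Each utterance therefore becomes a sequence of short vectors, which is the form the later modeling and classification stages work on.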
Vector Quantization (VQ)
• Aim of VQ: representation of large amounts of data by (few) prototype vectors.
• Example: identification and grouping of similar data into clusters.
• Each feature vector is assigned to the closest prototype w (similarity or distance measure, e.g. Euclidean distance).
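The idea of representing many vectors by a few prototypes can be sketched with plain k-means, used here as a simple stand-in because the slides do not name a codebook training algorithm. The sample data, the codebook size of 2 and the choice of the first k vectors as starting prototypes are assumptions made for the example.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest(vector, codebook):
    """Index of the prototype closest to the vector."""
    return min(range(len(codebook)),
               key=lambda i: euclidean(vector, codebook[i]))

def train_codebook(vectors, k, iterations=10):
    """Plain k-means: start from the first k vectors, then alternate
    between assigning vectors to prototypes and re-averaging them."""
    codebook = [list(v) for v in vectors[:k]]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for v in vectors:
            clusters[nearest(v, codebook)].append(v)
        for i, members in enumerate(clusters):
            if members:  # keep the old prototype if a cluster is empty
                codebook[i] = [sum(col) / len(members)
                               for col in zip(*members)]
    return codebook

# Hypothetical 2-D feature vectors forming two obvious groups.
data = [[0.0, 0.0], [10.0, 10.0], [0.0, 1.0],
        [1.0, 0.0], [10.0, 9.0], [9.0, 10.0]]
codebook = train_codebook(data, k=2)
print(codebook)
print(nearest([8.0, 8.0], codebook))
```

Six vectors are summarized by two prototypes, and a new vector is described by the index of its nearest prototype.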
Database Creation Process
[Figure: Speaker #1, Speaker #2 and Speaker #3 are stored in the database; the system responds "Hello, Speaker #1" and "Hello, Speaker #2".]
Speaker Identification
[Figure: an unknown speaker (Speaker #?) is compared with database entries #1, #2 and #3 and identified as Speaker #1.]
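Identification against a database of enrolled speakers can be sketched as picking the speaker whose VQ codebook fits the unknown utterance best. This is a hedged illustration: the scoring rule (lowest average distance to the nearest prototype) is a common choice for VQ-based systems but is not confirmed by the slides, and the codebooks and test vectors below are invented.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def average_distortion(vectors, codebook):
    """Mean distance from each vector to its closest prototype."""
    return sum(min(euclidean(v, w) for w in codebook)
               for v in vectors) / len(vectors)

def identify(vectors, database):
    """Return the speaker whose codebook gives the lowest distortion."""
    return min(database,
               key=lambda speaker: average_distortion(vectors, database[speaker]))

# Hypothetical database: one small codebook per enrolled speaker.
database = {
    "Speaker #1": [[0.0, 0.0], [1.0, 1.0]],
    "Speaker #2": [[5.0, 5.0], [6.0, 6.0]],
    "Speaker #3": [[-4.0, 3.0], [-5.0, 2.0]],
}

# Hypothetical feature vectors from the unknown utterance.
unknown = [[0.2, 0.1], [0.9, 1.2]]
print(identify(unknown, database))
```

This is closed-set identification: the answer is always one of the enrolled speakers, even when the true speaker is not in the database.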
Speaker Verification
[Figure: a voice claimed to be Speaker #1 is checked against the database (#1, #2, #3); result: Accept.]
Database Creation Condition
Table 1: Database description.
Parameter            Characteristics
Language             Bangla
No. of speakers      5
Speech type          Sentence reading
Recording condition  Normal room conditions
Audio length         60-90 seconds
Audio type           Stereo
Sample format        16-bit PCM
Sampling frequency   8 kHz
Bit rate             1411 kbps
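For uncompressed PCM, the bit rate is the sampling frequency times the bits per sample times the number of channels. The short check below applies that formula to the table's settings. The function name is hypothetical, and the 44.1 kHz case is included only for comparison: it is the rate at which 16-bit stereo PCM gives about 1411 kbps, and the slides do not say which figure reflects the actual recordings.

```python
def pcm_bit_rate_kbps(sample_rate_hz, bits_per_sample, channels):
    """Bit rate of uncompressed PCM audio in kilobits per second."""
    return sample_rate_hz * bits_per_sample * channels / 1000

# Settings listed in Table 1: 8 kHz, 16-bit, stereo.
print(pcm_bit_rate_kbps(8000, 16, 2))

# For comparison: 44.1 kHz, 16-bit, stereo.
print(pcm_bit_rate_kbps(44100, 16, 2))
```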
Speaker Recognition Result
Table 3: Test results for the speaker recognition system.
Speaker     No. of inputs  Correct  Incorrect  Accuracy
Speaker_1   5              5        0          100%
Speaker_2   9              8        1          88.89%
Speaker_3   6              6        0          100%
Speaker_3   12             11       1          91.67%
Speaker_4   8              8        0          100%
Speaker_5   10             10       0          100%
Total       50             48       2          96%
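The accuracy column is the number of correct decisions divided by the number of inputs. As a quick check, the sketch below recomputes it from the counts in the table; the row labels are copied as they appear in the source, and the variable and function names are only illustrative.

```python
# (speaker label, number of inputs, number correct), copied from the table.
results = [
    ("Speaker_1", 5, 5),
    ("Speaker_2", 9, 8),
    ("Speaker_3", 6, 6),
    ("Speaker_3", 12, 11),
    ("Speaker_4", 8, 8),
    ("Speaker_5", 10, 10),
]

def accuracy(correct, total):
    """Percentage of inputs that were recognized correctly."""
    return 100.0 * correct / total

for name, total, correct in results:
    print(name, total, correct, total - correct,
          f"{accuracy(correct, total):.2f}%")

total_inputs = sum(total for _, total, _ in results)
total_correct = sum(correct for _, _, correct in results)
print("Total", total_inputs, total_correct,
      total_inputs - total_correct,
      f"{accuracy(total_correct, total_inputs):.2f}%")
```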
Applications
• Transaction authentication
– Toll fraud prevention
– Telephone credit card purchases
– Telephone brokerage (e.g., stock trading)
• Access control
– Physical facilities
– Computers and data networks
• Information retrieval
– Customer information for call centers
– Audio indexing (speech skimming device)
• Forensics
– Voice sample matching

Speaker recognition system by abhishek mahajan
