A Survey on
“Speaker Recognition System”
Under the Guidance of :
Prof.S.M.Hatture
Objectives…
• Introduction
• Speaker Recognition
• Literature Survey
• Issues and Challenges
• Conclusion
Introduction
• To create new services that will make our everyday
lives more secure.
• For forensic purposes.
Speaker Recognition
Literature Survey
1. CASA-Based Robust Speaker Identification
(Computational Auditory Scene Analysis)
2. Independent component analysis and MLLR
transformation for speaker identification
• Independent Component Analysis (ICA).
• Principal Component Analysis (PCA).
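As a rough illustration of the PCA step named above, the following sketch projects feature vectors onto their top principal components. It is not taken from the surveyed paper: the function name `pca_project`, the random data, and the choice of 13-dimensional frames reduced to 4 dimensions are all assumptions made for the example.

```python
import numpy as np

def pca_project(features, n_components):
    """Project feature vectors (one per row) onto their top principal components."""
    centered = features - features.mean(axis=0)
    # Covariance across feature dimensions; eigh returns eigenvalues in ascending order.
    cov = np.cov(centered, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1][:n_components]
    components = eigvecs[:, order]          # (dim, n_components), orthonormal columns
    return centered @ components, components

# Hypothetical data: 200 frames of 13-dimensional features.
rng = np.random.default_rng(0)
frames = rng.normal(size=(200, 13))
reduced, components = pca_project(frames, n_components=4)
print(reduced.shape)
```

The projected columns are ordered by decreasing variance, so truncating to the first few keeps the directions that explain most of the spread in the data.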
3. Towards noise-robust speaker recognition
using probabilistic linear discriminant analysis
• Probabilistic linear discriminant analysis
• Additive noise
4. Weighted LDA techniques for i-vector
based speaker verification.
• Improving i-vector speaker verification in presence of
high inter session variability.
• Interview-interview condition.
• Telephone-telephone condition.
5. An Overview of Speaker Identification:
Accuracy and Robustness Issues
Two methods
• Speaker identification
• Speaker verification
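The difference between the two tasks can be sketched in a few lines. Identification picks the best-matching speaker from an enrolled set, while verification accepts or rejects one claimed identity against a threshold. The cosine score, the threshold of 0.7, the speaker names, and the three-dimensional vectors are all hypothetical choices for illustration, not something the surveyed paper prescribes.

```python
import numpy as np

def cosine(a, b):
    """Cosine similarity between two speaker representation vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def identify(test_vec, enrolled):
    """Closed-set identification: return the enrolled speaker with the best score."""
    return max(enrolled, key=lambda name: cosine(test_vec, enrolled[name]))

def verify(test_vec, claimed_vec, threshold=0.7):
    """Verification: accept the claimed identity only if the score clears a threshold."""
    return cosine(test_vec, claimed_vec) >= threshold

# Hypothetical enrolled speakers and test utterance.
enrolled = {
    "alice": np.array([1.0, 0.1, 0.0]),
    "bob": np.array([0.0, 1.0, 0.2]),
}
test_vec = np.array([0.9, 0.2, 0.0])

print(identify(test_vec, enrolled))
print(verify(test_vec, enrolled["alice"]), verify(test_vec, enrolled["bob"]))
```

Identification is an N-way decision whose difficulty grows with the number of enrolled speakers; verification is always a binary decision, and its behavior depends on where the threshold is set.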
6. Cross-pollination of normalization techniques
from speaker to face authentication using
Gaussian mixture models.
7. Front-End Factor Analysis for Speaker
Verification
• This paper proposes a new way of combining joint factor
analysis (JFA) and SVMs for speaker verification.
8. Parallel transformation network feature for
speaker recognition
• Transformation network (TN) features with SVM modeling,
used to achieve language independence and remove the need
for accurate speech recognition.
9. Statistical Pattern Recognition Techniques for
Speaker Verification
10. Speaker Identification within Whispered Speech
Audio Streams
• Whisper is an alternative speech production mode used by
subjects in natural conversation to protect privacy.
• Whispered speech is a natural mode of speech communication.
11. A comparison of approaches for modeling prosodic
features in speaker recognition.
• It addresses the task of text-independent speaker
verification.
• Prosodic features.
12. Fusion Methods for Boosting Performance of
Speaker Identification Systems
1. Feature extraction.
2. Classification tasks.
13. Source-normalized LDA for robust speaker
recognition using i-vectors from multiple speech
sources
• Improves the robustness of i-vector-based speaker
recognition.
• Proposes a source-normalized LDA algorithm for i-vectors
drawn from multiple speech sources.
14. A study on Universal Background Model training
in Speaker Verification
• Systematic analysis of speaker verification system
performance.
• Rigorous methods such as the IFS scheme are used to
estimate similarity.
15. Speaker Identification Using Instantaneous
Frequencies
• Introduces a new set of descriptors that capture the
identity of the speaker well.
• Provides robustness with respect to changes in
recording channel and speaking style.
16. Codebook Design Method for Noise Robust
Speaker Identification based on Genetic
Algorithm
• A genetic algorithm is proposed for designing a codebook
for noise-robust speaker identification.
[Figure: paradigm of the proposed codebook design method.]
17. Enhanced speaker recognition based on intra-
modal fusion and accent modeling.
• Intra-modal fusion.
• Accent modeling.
18. Discriminant NAP for SVM Speaker Recognition
• Nuisance Attribute Projection (NAP) provides an
effective method of removing the unwanted session
variability in a Support Vector Machine (SVM) based
speaker recognition system by removing the principal
components of this variability.
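The projection described above can be sketched as follows, assuming the session variability is estimated from vectors that have already had their speaker mean subtracted. The helper name `nap_projection`, the 6-dimensional toy data, and the single nuisance direction are assumptions for illustration; the discriminant variant proposed in the paper is not reproduced here.

```python
import numpy as np

def nap_projection(session_diffs, n_nuisance):
    """Build P = I - U U^T, where the columns of U span the top principal
    directions of within-speaker (session) variability."""
    # Rows of session_diffs are vectors minus their own speaker's mean.
    _, _, vt = np.linalg.svd(session_diffs, full_matrices=False)
    u = vt[:n_nuisance].T                      # (dim, n_nuisance), orthonormal columns
    return np.eye(u.shape[0]) - u @ u.T

# Hypothetical data: session variability lies mostly along the first axis.
rng = np.random.default_rng(0)
dim = 6
nuisance_dir = np.zeros(dim)
nuisance_dir[0] = 1.0
diffs = rng.normal(size=(50, 1)) * 5.0 * nuisance_dir + 0.01 * rng.normal(size=(50, dim))

P = nap_projection(diffs, n_nuisance=1)
cleaned = diffs @ P                            # P is symmetric, so this projects each row
print(diffs.var(), cleaned.var())
```

Because P is an orthogonal projection, applying it to both enrollment and test vectors before SVM training and scoring removes the same nuisance subspace from each.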
19. A Speech-and-Speaker Identification System:
Feature Extraction, Description and Classification of
Speech-Signal Image
• A speech-and-speaker (SAS) identification system
based on spoken Arabic digit recognition.
20. In-Set/Out-of-Set Speaker Recognition Under
Sparse Enrollment
• The problem of in-set speaker recognition is
addressed with the constraints of low enrollment (5 s)
and test material (2–8 s) and in-set group sizes
ranging from 15–45 speakers.
• An algorithm is proposed that uses an in-set
speaker’s cohort set to make up for the sparse (e.g., 5
s per speaker) enrollment data.
21. Analysis of Speech Recognition Techniques for
use in a Non-Speech Sound Recognition System
• Analyzes the different techniques used for speech
recognition and identifies those that can be used for
non-speech sound recognition.
22. Speaker verification for home security system
• A reliable speaker verification algorithm is used in
home security.
23. An Efficient Scoring Algorithm for Gaussian
Mixture Model Based Speaker Identification
• The use of GMMs for speaker identification was
shown to provide superior performance.
Graphical illustration of the observation vector recording
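For context, GMM-based identification scores a test utterance against each speaker's mixture model and picks the highest average log-likelihood. The sketch below shows only this baseline scoring, which an efficient algorithm would aim to speed up; it does not implement the paper's method. The diagonal covariances, the two toy speaker models, and the names `gmm_avg_loglik` and `identify_speaker` are assumptions for illustration.

```python
import numpy as np

def gmm_avg_loglik(frames, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM."""
    diff = frames[:, None, :] - means[None, :, :]                      # (T, M, D)
    log_norm = -0.5 * np.sum(np.log(2 * np.pi * variances), axis=1)    # (M,)
    log_comp = log_norm - 0.5 * np.sum(diff ** 2 / variances, axis=2)  # (T, M)
    log_mix = log_comp + np.log(weights)
    # Numerically stable log-sum-exp over the mixture components.
    peak = log_mix.max(axis=1, keepdims=True)
    frame_ll = peak[:, 0] + np.log(np.exp(log_mix - peak).sum(axis=1))
    return float(frame_ll.mean())

def identify_speaker(frames, models):
    """Score the frames against every speaker model and return the best match."""
    scores = {name: gmm_avg_loglik(frames, *params) for name, params in models.items()}
    return max(scores, key=scores.get), scores

# Hypothetical two-component, two-dimensional models: (weights, means, variances).
models = {
    "spk_a": (np.array([0.5, 0.5]), np.array([[0.0, 0.0], [1.0, 1.0]]), np.ones((2, 2))),
    "spk_b": (np.array([0.5, 0.5]), np.array([[5.0, 5.0], [6.0, 6.0]]), np.ones((2, 2))),
}
rng = np.random.default_rng(0)
test_frames = rng.normal(loc=0.5, scale=1.0, size=(100, 2))
best, scores = identify_speaker(test_frames, models)
print(best)
```

The cost of this baseline grows with the number of frames, mixture components, and enrolled speakers, which is why faster scoring schemes are of interest for large speaker sets.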
24. Speaker Recognition: A Tutorial
• Speech processing is a diverse field with many
applications.
25. Speaker Identification Based on the Use of
Robust Cepstral Features Obtained from Pole-Zero
Transfer Functions
• Attempts to alleviate the mismatch between training and
testing conditions.
• Proposes a new feature, a linear predictive cepstrum
derived from a pole-zero transfer function.
26. Speaker Verification Using Mixture Decomposition
Discrimination
• Mixture decomposition discrimination (MDD) is based on
the idea that, when modeling speech using hidden Markov
models (HMM), different speakers speaking the same word
would cause different HMM mixture components to
dominate.
27. Recent Advances in the Automatic Recognition of
Audiovisual Speech
28. Unsupervised Speaker Recognition Based on
Competition Between Self-Organizing Maps
• Clusters the speakers in an unlabeled and unsegmented
conversation when no a priori knowledge about the identity
of the participants is given.
29. Speaker Recognition with Polynomial Classifiers
• Polynomial-based classifier that achieves high accuracy at low
complexity.
• It has several advantages:
1. Polynomial classifier scoring yields a system which is highly
computationally scalable with the number of speakers.
2. A new training algorithm is proposed which is discriminative,
handles large data sets, and has low memory usage.
3. The output of the polynomial classifier is easily incorporated
into a statistical framework allowing it to be combined with
other techniques such as HMM.
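A minimal sketch of the polynomial classifier idea: expand each feature frame into monomials, fit one weight vector per speaker, and score an utterance by averaging the per-frame outputs. Plain least squares is swapped in here for simplicity; it is not the new training algorithm the paper proposes. The function names, the second-order expansion, and the synthetic two-speaker data are assumptions.

```python
import numpy as np
from itertools import combinations_with_replacement

def poly_expand(x, degree=2):
    """All monomials of the input features up to the given degree, plus a bias term."""
    cols = [np.ones(len(x))]
    for d in range(1, degree + 1):
        for idx in combinations_with_replacement(range(x.shape[1]), d):
            cols.append(np.prod(x[:, list(idx)], axis=1))
    return np.stack(cols, axis=1)

def train(frames_by_speaker, degree=2):
    """One-vs-rest least-squares fit: target 1 for the speaker's frames, 0 otherwise."""
    names = list(frames_by_speaker)
    X = np.vstack([poly_expand(frames_by_speaker[n], degree) for n in names])
    weights = {}
    for n in names:
        target = np.concatenate(
            [np.full(len(frames_by_speaker[m]), float(m == n)) for m in names]
        )
        weights[n] = np.linalg.lstsq(X, target, rcond=None)[0]
    return weights

def score(frames, w, degree=2):
    """Utterance score: the average classifier output over all frames."""
    return float((poly_expand(frames, degree) @ w).mean())

# Hypothetical training data: two speakers with different feature means.
rng = np.random.default_rng(0)
train_data = {
    "spk_a": rng.normal(loc=0.0, size=(300, 2)),
    "spk_b": rng.normal(loc=3.0, size=(300, 2)),
}
weights = train(train_data)
test_frames = rng.normal(loc=3.0, size=(100, 2))
scores = {n: score(test_frames, w) for n, w in weights.items()}
print(max(scores, key=scores.get))
```

Since the score is linear in the expanded features, the expansion of a test utterance can be averaged once and then compared with each speaker's weight vector by a single inner product, which is one way to read the scalability claim in the list above.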
30. Automatic Verbal Information Verification for
User Authentication
An example of verbal information verification by asking
sequential questions.
Issues and challenges…
• Robustness
• Portability
• Adaptation
• Language modeling
• Confidence measure
• Out-of-vocabulary words
• Prosody
Conclusion…
• Problems remain with speaker-generated variability and with
variability in channel and recording conditions.
• It is very important to investigate feature parameters that
are stable over time, insensitive to the variation of
speaking manner, including the speaking rate and level,
and robust against variations in voice quality due to
causes such as voice disguise or colds.
• Studies on ways to automatically extract the speech
periods of each person separately from a dialogue
involving more than two people have recently appeared as
an extension of speaker recognition technology.
Thank You…

More Related Content

What's hot

Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition systemAlok Tiwari
 
Visual speech to text conversion applicable to telephone communication
Visual speech to text conversion  applicable  to telephone communicationVisual speech to text conversion  applicable  to telephone communication
Visual speech to text conversion applicable to telephone communicationSwathi Venugopal
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceIlhaan Marwat
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionRichie
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice RecognitionAmrita More
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overviewVarun Jain
 
speech processing basics
speech processing basicsspeech processing basics
speech processing basicssivakumar m
 
Deep Learning for Speech Recognition - Vikrant Singh Tomar
Deep Learning for Speech Recognition - Vikrant Singh TomarDeep Learning for Speech Recognition - Vikrant Singh Tomar
Deep Learning for Speech Recognition - Vikrant Singh TomarWithTheBest
 
Mel frequency cepstral coefficient (mfcc)
Mel frequency cepstral coefficient (mfcc)Mel frequency cepstral coefficient (mfcc)
Mel frequency cepstral coefficient (mfcc)BushraShaikh44
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentationhimanshubhatti
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCCHira Shaukat
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overviewsajanazoya
 
Text independent speaker recognition system
Text independent speaker recognition systemText independent speaker recognition system
Text independent speaker recognition systemDeepesh Lekhak
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech RecognitionAhmed Moawad
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition systemAlok Tiwari
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminarDiptimaya Sarangi
 

What's hot (20)

Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Visual speech to text conversion applicable to telephone communication
Visual speech to text conversion  applicable  to telephone communicationVisual speech to text conversion  applicable  to telephone communication
Visual speech to text conversion applicable to telephone communication
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice Recognition
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overview
 
speech processing basics
speech processing basicsspeech processing basics
speech processing basics
 
Deep Learning for Speech Recognition - Vikrant Singh Tomar
Deep Learning for Speech Recognition - Vikrant Singh TomarDeep Learning for Speech Recognition - Vikrant Singh Tomar
Deep Learning for Speech Recognition - Vikrant Singh Tomar
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Mel frequency cepstral coefficient (mfcc)
Mel frequency cepstral coefficient (mfcc)Mel frequency cepstral coefficient (mfcc)
Mel frequency cepstral coefficient (mfcc)
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCC
 
Speech Recognition System
Speech Recognition SystemSpeech Recognition System
Speech Recognition System
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overview
 
Text independent speaker recognition system
Text independent speaker recognition systemText independent speaker recognition system
Text independent speaker recognition system
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
SPEAKER VERIFICATION
SPEAKER VERIFICATIONSPEAKER VERIFICATION
SPEAKER VERIFICATION
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 

Viewers also liked

iVector vs GMM/UBM for Automatic Speaker Recognition system
iVector vs GMM/UBM for Automatic Speaker Recognition system iVector vs GMM/UBM for Automatic Speaker Recognition system
iVector vs GMM/UBM for Automatic Speaker Recognition system Walid Bouaffou
 
Car audio stereo system
Car audio stereo systemCar audio stereo system
Car audio stereo systemaudio bsd
 
Text-Independent Speaker Verification Report
Text-Independent Speaker Verification ReportText-Independent Speaker Verification Report
Text-Independent Speaker Verification ReportCody Ray
 
Expert car audio
Expert car audioExpert car audio
Expert car audioexpertcar
 
Microprocessor-Based Automatic Door Opener
Microprocessor-Based Automatic Door OpenerMicroprocessor-Based Automatic Door Opener
Microprocessor-Based Automatic Door OpenerJitendra Choudhary
 
document for Voice banking system mini project
document for Voice banking system mini projectdocument for Voice banking system mini project
document for Voice banking system mini projectJal Pari
 
VOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEMVOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEMNikhil Ravi
 
Introduction to machine learningunsupervised learning
Introduction to machine learningunsupervised learningIntroduction to machine learningunsupervised learning
Introduction to machine learningunsupervised learningSardar Alam
 
The Main Concepts of Speech Recognition
The Main Concepts of Speech RecognitionThe Main Concepts of Speech Recognition
The Main Concepts of Speech Recognition子毅 楊
 
Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...
Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...
Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...Universitat Politècnica de Catalunya
 
Speaker Recognition using Gaussian Mixture Model
Speaker Recognition using Gaussian Mixture Model Speaker Recognition using Gaussian Mixture Model
Speaker Recognition using Gaussian Mixture Model Saurab Dulal
 
Lecture 18: Gaussian Mixture Models and Expectation Maximization
Lecture 18: Gaussian Mixture Models and Expectation MaximizationLecture 18: Gaussian Mixture Models and Expectation Maximization
Lecture 18: Gaussian Mixture Models and Expectation Maximizationbutest
 
Wireless mobile charger
Wireless mobile chargerWireless mobile charger
Wireless mobile chargerMaruf Priyo
 
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...Universitat Politècnica de Catalunya
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySeminar Links
 
Artificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemArtificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemREHMAT ULLAH
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 

Viewers also liked (20)

iVector vs GMM/UBM for Automatic Speaker Recognition system
iVector vs GMM/UBM for Automatic Speaker Recognition system iVector vs GMM/UBM for Automatic Speaker Recognition system
iVector vs GMM/UBM for Automatic Speaker Recognition system
 
Car audio stereo system
Car audio stereo systemCar audio stereo system
Car audio stereo system
 
Text-Independent Speaker Verification Report
Text-Independent Speaker Verification ReportText-Independent Speaker Verification Report
Text-Independent Speaker Verification Report
 
Basic Car Audio
Basic Car AudioBasic Car Audio
Basic Car Audio
 
Expert car audio
Expert car audioExpert car audio
Expert car audio
 
Microprocessor-Based Automatic Door Opener
Microprocessor-Based Automatic Door OpenerMicroprocessor-Based Automatic Door Opener
Microprocessor-Based Automatic Door Opener
 
document for Voice banking system mini project
document for Voice banking system mini projectdocument for Voice banking system mini project
document for Voice banking system mini project
 
VOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEMVOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEM
 
Introduction to machine learningunsupervised learning
Introduction to machine learningunsupervised learningIntroduction to machine learningunsupervised learning
Introduction to machine learningunsupervised learning
 
The Main Concepts of Speech Recognition
The Main Concepts of Speech RecognitionThe Main Concepts of Speech Recognition
The Main Concepts of Speech Recognition
 
Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...
Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...
Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...
 
Speaker Recognition using Gaussian Mixture Model
Speaker Recognition using Gaussian Mixture Model Speaker Recognition using Gaussian Mixture Model
Speaker Recognition using Gaussian Mixture Model
 
Lecture 18: Gaussian Mixture Models and Expectation Maximization
Lecture 18: Gaussian Mixture Models and Expectation MaximizationLecture 18: Gaussian Mixture Models and Expectation Maximization
Lecture 18: Gaussian Mixture Models and Expectation Maximization
 
Wireless mobile charger
Wireless mobile chargerWireless mobile charger
Wireless mobile charger
 
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Wireless charger
Wireless chargerWireless charger
Wireless charger
 
GSM Presentation
GSM PresentationGSM Presentation
GSM Presentation
 
Artificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemArtificial intelligence Speech recognition system
Artificial intelligence Speech recognition system
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 

Similar to A Survey on Speaker Recognition System

DEVELOPMENT OF SPEAKER VERIFICATION UNDER LIMITED DATA AND CONDITION
DEVELOPMENT OF SPEAKER VERIFICATION  UNDER LIMITED DATA AND CONDITIONDEVELOPMENT OF SPEAKER VERIFICATION  UNDER LIMITED DATA AND CONDITION
DEVELOPMENT OF SPEAKER VERIFICATION UNDER LIMITED DATA AND CONDITIONniranjan kumar
 
Text Independent Speaker recognitom framework for detecting criminals.ppt
Text Independent Speaker recognitom framework for detecting criminals.pptText Independent Speaker recognitom framework for detecting criminals.ppt
Text Independent Speaker recognitom framework for detecting criminals.pptGrace136708
 
Teaching Machines to Listen: An Introduction to Automatic Speech Recognition
Teaching Machines to Listen: An Introduction to Automatic Speech RecognitionTeaching Machines to Listen: An Introduction to Automatic Speech Recognition
Teaching Machines to Listen: An Introduction to Automatic Speech RecognitionZachary S. Brown
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)inventionjournals
 
Deep Learning for Automatic Speaker Recognition
Deep Learning for Automatic Speaker RecognitionDeep Learning for Automatic Speaker Recognition
Deep Learning for Automatic Speaker RecognitionSai Kiran Kadam
 
Deep Learning - Speaker Verification, Sound Event Detection
Deep Learning - Speaker Verification, Sound Event DetectionDeep Learning - Speaker Verification, Sound Event Detection
Deep Learning - Speaker Verification, Sound Event DetectionSai Kiran Kadam
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueCSCJournals
 
Sequence to sequence model speech recognition
Sequence to sequence model speech recognitionSequence to sequence model speech recognition
Sequence to sequence model speech recognitionAditya Kumar Khare
 
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based ModelReal-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Modeladil raja
 
Real Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationReal Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationIDES Editor
 
AI_attachment.pptx prepared for all students
AI_attachment.pptx prepared for all  studentsAI_attachment.pptx prepared for all  students
AI_attachment.pptx prepared for all studentstalldesalegn
 
Speaker recognition system by abhishek mahajan
Speaker recognition system by abhishek mahajanSpeaker recognition system by abhishek mahajan
Speaker recognition system by abhishek mahajanAbhishek Mahajan
 
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...Agile Testing Alliance
 
Metamorphic Testing Thesis Defense.pptx
Metamorphic Testing Thesis Defense.pptxMetamorphic Testing Thesis Defense.pptx
Metamorphic Testing Thesis Defense.pptxentertainmentweekly11
 
Lip Reading.pptx
Lip Reading.pptxLip Reading.pptx
Lip Reading.pptxNivethaT15
 
Giab workshop intro 180125
Giab workshop intro 180125Giab workshop intro 180125
Giab workshop intro 180125GenomeInABottle
 
Voice recognition system
Voice recognition systemVoice recognition system
Voice recognition systemavinash raibole
 
Dstc6 an introduction
Dstc6 an introductionDstc6 an introduction
Dstc6 an introductionhkh
 

Similar to A Survey on Speaker Recognition System (20)

DEVELOPMENT OF SPEAKER VERIFICATION UNDER LIMITED DATA AND CONDITION
DEVELOPMENT OF SPEAKER VERIFICATION  UNDER LIMITED DATA AND CONDITIONDEVELOPMENT OF SPEAKER VERIFICATION  UNDER LIMITED DATA AND CONDITION
DEVELOPMENT OF SPEAKER VERIFICATION UNDER LIMITED DATA AND CONDITION
 
Text Independent Speaker recognitom framework for detecting criminals.ppt
Text Independent Speaker recognitom framework for detecting criminals.pptText Independent Speaker recognitom framework for detecting criminals.ppt
Text Independent Speaker recognitom framework for detecting criminals.ppt
 
Teaching Machines to Listen: An Introduction to Automatic Speech Recognition
Teaching Machines to Listen: An Introduction to Automatic Speech RecognitionTeaching Machines to Listen: An Introduction to Automatic Speech Recognition
Teaching Machines to Listen: An Introduction to Automatic Speech Recognition
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
Deep Learning for Automatic Speaker Recognition
Deep Learning for Automatic Speaker RecognitionDeep Learning for Automatic Speaker Recognition
Deep Learning for Automatic Speaker Recognition
 
Deep Learning - Speaker Verification, Sound Event Detection
Deep Learning - Speaker Verification, Sound Event DetectionDeep Learning - Speaker Verification, Sound Event Detection
Deep Learning - Speaker Verification, Sound Event Detection
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
 
Sequence to sequence model speech recognition
Sequence to sequence model speech recognitionSequence to sequence model speech recognition
Sequence to sequence model speech recognition
 
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based ModelReal-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
 
Real Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationReal Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and Validation
 
AI_attachment.pptx prepared for all students
AI_attachment.pptx prepared for all  studentsAI_attachment.pptx prepared for all  students
AI_attachment.pptx prepared for all students
 
Speaker recognition system by abhishek mahajan
Speaker recognition system by abhishek mahajanSpeaker recognition system by abhishek mahajan
Speaker recognition system by abhishek mahajan
 
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...
 
Metamorphic Testing Thesis Defense.pptx
Metamorphic Testing Thesis Defense.pptxMetamorphic Testing Thesis Defense.pptx
Metamorphic Testing Thesis Defense.pptx
 
Lip Reading.pptx
Lip Reading.pptxLip Reading.pptx
Lip Reading.pptx
 
Giab workshop intro 180125
Giab workshop intro 180125Giab workshop intro 180125
Giab workshop intro 180125
 
Text summarization
Text summarization Text summarization
Text summarization
 
Voice recognition system
Voice recognition systemVoice recognition system
Voice recognition system
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
Dstc6 an introduction
Dstc6 an introductionDstc6 an introduction
Dstc6 an introduction
 

Recently uploaded

Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

A Survey on Speaker Recognition System

  • 1. A Survey on “Speaker Recognition System” Under the Guidance of: Prof. S. M. Hatture
  • 2. Objectives… • Introduction • Speaker Recognition • Literature Survey • Issues and Challenges • Conclusion
  • 3. Introduction • To create new services that will make our everyday lives more secure. • For forensic purposes.
  • 6. 1. CASA-Based Robust Speaker Identification (Computational Auditory Scene Analysis)
  • 7. 2. Independent component analysis and MLLR transformation for speaker identification • Independent Component Analysis (ICA). • Principal Component Analysis (PCA). 3. Towards noise-robust speaker recognition using probabilistic linear discriminant analysis • Probabilistic linear discriminant analysis • Additive noise
  • 8. 4. Weighted LDA techniques for i-vector based speaker verification. • Improving i-vector speaker verification in the presence of high inter-session variability. • Interview-interview condition. • Telephone-telephone condition. 5. An Overview of Speaker Identification: Accuracy and Robustness Issues Two methods • Speaker identification • Speaker verification
  • 9. 6. Cross-pollination of normalization techniques from speaker to face authentication using Gaussian mixture models.
  • 10. 7. Front-End Factor Analysis for Speaker Verification • This paper proposes a new way of combining JFA and SVMs for speaker verification. 8. Parallel transformation network feature for speaker recognition • TN features with SVM modeling, in order to become language independent and overcome the need for accurate speech recognition.
  • 11. 9. Statistical Pattern Recognition Techniques for Speaker Verification 10. Speaker Identification within Whispered Speech Audio Streams • Whisper is an alternative speech production mode used by subjects in natural conversation to protect privacy. • Whispered speech is a natural mode of speech information.
  • 12. 11. A comparison of approaches for modeling prosodic features in speaker recognition. • It addresses the task of text-independent speaker verification. • Prosodic features. 12. Fusion Methods for Boosting Performance of Speaker Identification Systems 1. Feature extraction. 2. Classification tasks.
  • 13. 13. Source-normalized LDA for robust speaker recognition using i-vectors from multiple speech sources • Improves the robustness of i-vector-based speaker recognition. • A source-normalized algorithm improves the robustness of i-vector-based speaker recognition. 14. A study on Universal Background Model training in Speaker Verification • Systematic analysis of speaker verification system performance. • Rigorous methods such as the IFS scheme are used to estimate similarity.
  • 14. 15. Speaker Identification Using Instantaneous Frequencies • Introduces a new set of descriptors that capture the identity of the speaker well. • Provides robustness with respect to changes in recording channel and speaking style. 16. Codebook Design Method for Noise Robust Speaker Identification based on Genetic Algorithm • A genetic algorithm is proposed for designing a codebook for noise-robust speaker identification.
  • 15. Paradigm of the proposed codebook design method.
  • 16. 17. Enhanced speaker recognition based on intra-modal fusion and accent modeling. • Intra-modal fusion. • Accent modeling. 18. Discriminant NAP for SVM Speaker Recognition • Nuisance Attribute Projection (NAP) provides an effective method of removing the unwanted session variability in a Support Vector Machine (SVM) based speaker recognition system by removing the principal components of this variability.
  • 17. 19. A Speech-and-Speaker Identification System: Feature Extraction, Description and Classification of Speech-Signal Image • A speech-and-speaker (SAS) identification system based on spoken Arabic digit recognition.
  • 18. 20. In-Set/Out-of-Set Speaker Recognition Under Sparse Enrollment • The problem of in-set speaker recognition is addressed with the constraints of low enrollment (5 s) and test material (2–8 s) and in-set group sizes ranging from 15–45 speakers. • An algorithm is proposed that uses an in-set speaker’s cohort set to make up for the sparse (e.g., 5 s per speaker) enrollment data. 21. Analysis of Speech Recognition Techniques for use in a Non-Speech Sound Recognition System • Analyzes the different techniques used for speech recognition and identifies those that can be used for non-speech sound recognition.
  • 19. 22. Speaker verification for home security system • A reliable speaker verification algorithm is used in home security.
  • 20. 23. An Efficient Scoring Algorithm for Gaussian Mixture Model Based Speaker Identification • The use of GMM for speaker identification was shown to provide superior performance. • Graphical illustration of the observation vector recording.
  • 21. 24. Speaker Recognition: A Tutorial • Speech processing is a diverse field with many applications.
  • 22. 25. Speaker Identification Based on the Use of Robust Cepstral Features Obtained from Pole-Zero Transfer Functions • An attempt is made to alleviate the mismatch between training and testing conditions. • Proposes a new feature, the linear predictive cepstrum derived from a pole-zero transfer function. 26. Speaker Verification Using Mixture Decomposition Discrimination • Mixture decomposition discrimination (MDD) is based on the idea that, when modeling speech using hidden Markov models (HMM), different speakers speaking the same word would cause different HMM mixture components to dominate.
  • 23. 27. Recent Advances in the Automatic Recognition of Audiovisual Speech
  • 24. 28. Unsupervised Speaker Recognition Based on Competition Between Self-Organizing Maps • Clusters the speakers from an unlabeled and unsegmented conversation when no a priori knowledge about the identity of the participants is given. 29. Speaker Recognition with Polynomial Classifiers • A polynomial-based classifier achieves high accuracy at low complexity. • It has several advantages. 1. Polynomial classifier scoring yields a system which is highly computationally scalable with the number of speakers. 2. A new training algorithm is proposed which is discriminative, handles large data sets, and has low memory usage. 3. The output of the polynomial classifier is easily incorporated into a statistical framework, allowing it to be combined with other techniques such as HMM.
  • 25. 30. Automatic Verbal Information Verification for User Authentication • An example of verbal information verification by asking sequential questions.
  • 26. Issues and challenges… • Robustness • Portability • Adaptation • Language modeling • Confidence measure • Out of vocabulary words • Prosody
  • 27. Conclusion… • Problems remain with speaker-generated variability and with variability in channel and recording conditions. • It is very important to investigate feature parameters that are stable over time, insensitive to the variation of speaking manner, including the speaking rate and level, and robust against variations in voice quality due to causes such as voice disguise or colds. • Studies on ways to automatically extract the speech periods of each person separately from a dialogue involving more than two people have recently appeared as an extension of speaker recognition technology.
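
The Nuisance Attribute Projection idea from item 18 (removing the principal components of session variability) can be illustrated with a minimal NumPy sketch. This is not the surveyed paper's implementation: the function names `nap_projection` and `apply_nap`, the use of per-speaker mean subtraction to estimate session variability, and the choice of an SVD are all assumptions made for illustration.

```python
import numpy as np


def nap_projection(features, speaker_ids, n_nuisance):
    """Estimate a basis for the nuisance (session) subspace.

    features:    (n_utterances, dim) array, one feature vector per utterance.
    speaker_ids: (n_utterances,) array of speaker labels.
    n_nuisance:  number of principal directions to treat as nuisance.
    Returns a (dim, n_nuisance) matrix with orthonormal columns.
    """
    features = np.asarray(features, dtype=float)
    speaker_ids = np.asarray(speaker_ids)

    # Assumption: session variability is what remains after removing
    # each speaker's own mean from that speaker's utterances.
    residuals = np.empty_like(features)
    for spk in np.unique(speaker_ids):
        mask = speaker_ids == spk
        residuals[mask] = features[mask] - features[mask].mean(axis=0)

    # Right singular vectors of the residuals are the principal
    # directions of the within-speaker variability.
    _, _, vt = np.linalg.svd(residuals, full_matrices=False)
    return vt[:n_nuisance].T


def apply_nap(features, nuisance_basis):
    """Project features onto the complement of the nuisance subspace."""
    features = np.asarray(features, dtype=float)
    # x' = x - U (U^T x), written for row vectors.
    return features - (features @ nuisance_basis) @ nuisance_basis.T
```

In an SVM-based system the projected vectors would then be passed to the classifier in place of the raw ones; how many directions to remove (`n_nuisance`) is a tuning choice that this sketch leaves open.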