SlideShare a Scribd company logo
1 of 29
Noise robust speech recognition  through Compressed Sensing Jort Gemmeke Bert Cranen Centre for Language and Speech Technology Radboud University Nijmegen – The Netherlands
Speech recognition “ one” Speech recognition Feature extraction
Speech recognition “ one” Speech recognition Feature extraction ?? Speech recognition Feature extraction
Noise robust speech recognition “ one” Speech recognition Speech restoration Feature extraction pre- processing
Missing Data Imputation Clean speech  Noisy speech  Mask Frequency   time   “ one” “ one” time   time  
Missing Data Imputation Masked speech  Restored Speech Frequency   time   time   Imputation
Sparse representation one
Sparse representation … one eight four six one zero two seven one four one nine
Sparse representation … one eight four six one zero two seven one four one nine
Sparse representation … one eight four six one zero two seven one four one nine
Compressed Sensing ,[object Object]
Compressed Sensing ,[object Object],[object Object]
Compressed Sensing
Compressed Sensing … eight four six one zero two one seven four one nine
Compressed Sensing … eight four six one zero two one seven four one nine
Imputation one one four one
Imputation one one four one one one four one
Imputation one one four one one one four one
Experiments ,[object Object],[object Object],[object Object],[object Object],[object Object]
Results Consonant Challenge Method Test set 1 Clean 2 comp 3 8 spk 4 SSN 5 fact 6 MSSN 7 3 spk baseline 86.7 7.6 5.0 5.5 3.9 8.9 5.5 oracle mask - 44.8 43.0 36.0 39.6 41.7 40.9 estimated mask - 7.3 9.9 7.8 9.6 7.0 6.0
Summary ,[object Object],[object Object],[object Object],[object Object]
Questions… 8
Vector representation
linear representation
Basis of examples
Reliable and unreliable part
sparse representation
sparse representation
Sparse Imputation

More Related Content

Viewers also liked

Practical Natural Language Processing
Practical Natural Language ProcessingPractical Natural Language Processing
Practical Natural Language Processing
Jaganadh Gopinadhan
 

Viewers also liked (18)

speech processing basics
speech processing basicsspeech processing basics
speech processing basics
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Neural Network and NLP
Neural Network and NLPNeural Network and NLP
Neural Network and NLP
 
Neural Language Model Tutorial
Neural Language Model Tutorial Neural Language Model Tutorial
Neural Language Model Tutorial
 
Language Modeling Tutorial
Language Modeling Tutorial Language Modeling Tutorial
Language Modeling Tutorial
 
Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...
Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...
Speech Recognition with Deep Neural Networks (D3L2 Deep Learning for Speech a...
 
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...
 
(Deep) Neural Networks在 NLP 和 Text Mining 总结
(Deep) Neural Networks在 NLP 和 Text Mining 总结(Deep) Neural Networks在 NLP 和 Text Mining 总结
(Deep) Neural Networks在 NLP 和 Text Mining 总结
 
Practical Natural Language Processing
Practical Natural Language ProcessingPractical Natural Language Processing
Practical Natural Language Processing
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Artificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemArtificial intelligence Speech recognition system
Artificial intelligence Speech recognition system
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 
AWS re:Invent 2016: Deep Learning in Alexa (MAC202)
AWS re:Invent 2016: Deep Learning in Alexa (MAC202)AWS re:Invent 2016: Deep Learning in Alexa (MAC202)
AWS re:Invent 2016: Deep Learning in Alexa (MAC202)
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
IBM cognitive service introduction
IBM cognitive service introductionIBM cognitive service introduction
IBM cognitive service introduction
 

Similar to Interspeech Gemmeke 2008 V6

Moshe Yudkowsky's Presentation at Emerging Communication Conference & Awards ...
Moshe Yudkowsky's Presentation at Emerging Communication Conference & Awards ...Moshe Yudkowsky's Presentation at Emerging Communication Conference & Awards ...
Moshe Yudkowsky's Presentation at Emerging Communication Conference & Awards ...
eCommConf
 

Similar to Interspeech Gemmeke 2008 V6 (17)

Asr
AsrAsr
Asr
 
Asr
AsrAsr
Asr
 
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
 
Thesis
ThesisThesis
Thesis
 
李宏毅/當語音處理遇上深度學習
李宏毅/當語音處理遇上深度學習李宏毅/當語音處理遇上深度學習
李宏毅/當語音處理遇上深度學習
 
SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK
 
Moshe Yudkowsky's Presentation at Emerging Communication Conference & Awards ...
Moshe Yudkowsky's Presentation at Emerging Communication Conference & Awards ...Moshe Yudkowsky's Presentation at Emerging Communication Conference & Awards ...
Moshe Yudkowsky's Presentation at Emerging Communication Conference & Awards ...
 
Bengali Sign Language
Bengali Sign LanguageBengali Sign Language
Bengali Sign Language
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech E...
A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech E...A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech E...
A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech E...
 
Esophageal Speech Recognition using Artificial Neural Network (ANN)
Esophageal Speech Recognition using Artificial Neural Network (ANN)Esophageal Speech Recognition using Artificial Neural Network (ANN)
Esophageal Speech Recognition using Artificial Neural Network (ANN)
 
Wreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognitionWreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognition
 
Entering the Fourth Dimension of OCR with Tesseract - Talk from Voxxed Days B...
Entering the Fourth Dimension of OCR with Tesseract - Talk from Voxxed Days B...Entering the Fourth Dimension of OCR with Tesseract - Talk from Voxxed Days B...
Entering the Fourth Dimension of OCR with Tesseract - Talk from Voxxed Days B...
 
Voice Recognition System using Template Matching
Voice Recognition System using Template MatchingVoice Recognition System using Template Matching
Voice Recognition System using Template Matching
 
An efficient peak valley detection based vad algorithm for robust detection o...
An efficient peak valley detection based vad algorithm for robust detection o...An efficient peak valley detection based vad algorithm for robust detection o...
An efficient peak valley detection based vad algorithm for robust detection o...
 
AN EFFICIENT PEAK VALLEY DETECTION BASED VAD ALGORITHM FOR ROBUST DETECTION O...
AN EFFICIENT PEAK VALLEY DETECTION BASED VAD ALGORITHM FOR ROBUST DETECTION O...AN EFFICIENT PEAK VALLEY DETECTION BASED VAD ALGORITHM FOR ROBUST DETECTION O...
AN EFFICIENT PEAK VALLEY DETECTION BASED VAD ALGORITHM FOR ROBUST DETECTION O...
 
An efficient peak valley detection based vad algorithm for robust detection o...
An efficient peak valley detection based vad algorithm for robust detection o...An efficient peak valley detection based vad algorithm for robust detection o...
An efficient peak valley detection based vad algorithm for robust detection o...
 

Interspeech Gemmeke 2008 V6

Editor's Notes

  1. Logo erbij