SlideShare a Scribd company logo
www.projectsatbangalore.com 09591912372
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 22, NO. 4, APRIL 2014 745
An Overview of Noise-Robust Automatic
Speech Recognition
Jinyu Li, Member, IEEE, Li Deng, Fellow, IEEE, Yifan Gong, Senior Member, IEEE, and
Reinhold Haeb-Umbach, Senior Member, IEEE
Abstract—New waves of consumer-centric applications, such as
voice search and voice interaction with mobile devices and home
entertainment systems, increasingly require automatic speech
recognition (ASR) to be robust to the full range of real-world
noise and other acoustic distorting conditions. Despite its practical
importance, however, the inherent links between and distinctions
among the myriad of methods for noise-robust ASR have yet
to be carefully studied in order to advance the field further. To
this end, it is critical to establish a solid, consistent, and common
mathematical foundation for noise-robust ASR, which is lacking
at present. This article is intended to fill this gap and to provide
a thorough overview of modern noise-robust techniques for ASR
developed over the past 30 years. We emphasize methods that are
proven to be successful and that are likely to sustain or expand
their future applicability. We distill key insights from our com-
prehensive overview in this field and take a fresh look at a few
old problems, which nevertheless are still highly relevant today.
Specifically, we have analyzed and categorized a wide range of
noise-robust techniques using five different criteria: 1) feature-do-
main vs. model-domain processing, 2) the use of prior knowledge
about the acoustic environment distortion, 3) the use of explicit
environment-distortion models, 4) deterministic vs. uncertainty
processing, and 5) the use of acoustic models trained jointly with
the same feature enhancement or model adaptation process used
in the testing stage. With this taxonomy-oriented review, we equip
the reader with the insight to choose among techniques and with
the awareness of the performance-complexity tradeoffs. The
pros and cons of using different noise-robust ASR techniques in
practical application scenarios are provided as a guide to inter-
ested practitioners. The current challenges and future research
directions in this field is also carefully analyzed.
Index Terms—Speech recognition, noise, robustness, distortion
modeling, compensation, uncertainty processing, joint model
training .
I. INTRODUCTION
AUTOMATIC speech recognition (ASR) is the process
and the related technology for converting the speech
signal into its corresponding sequence of words or other
linguistic entities by means of algorithms implemented in a
Manuscript received May 21, 2013; revised October 09, 2013; accepted Jan-
uary 23, 2014. Date of publication February 05, 2014; date of current version
February 19, 2014. The associate editor coordinating the review of this manu-
script and approving it for publication was Prof. DeLiang Wang.
J. Li, L. Deng, and Y. Gong are with Microsoft Corporation, Redmond,
WA 98052-6399 USA (e-mail: jinyli@microsoft.com; deng@microsoft.com;
ygong@microsoft.com).
R. Haeb-Umbach is with the Department of Communications Engi-
neering, University of Paderborn, 33098 Paderborn, Germany (e-mail:
haeb@nt.uni-paderborn.de).
Color versions of one or more of the figures in this paper are available online
at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TASLP.2014.2304637
device, a computer, or computer clusters [1], [2]. Historically,
ASR applications have included voice dialing, call routing,
interactive voice response, data entry and dictation, voice
command and control, structured document creation (e.g.,
medical and legal transcriptions), appliance control by voice,
computer-aided language learning, content-based spoken audio
search, and robotics. More recently, with the exponential
growth of big data and computing power, ASR technology has
advanced to the stage where more challenging applications are
becoming a reality. Examples are voice search and interactions
with mobile devices (e.g., Siri on iPhone, Bing voice search
on winPhone, and Google Now on Android), voice control
in home entertainment systems (e.g., Kinect on xBox), and
various speech-centric information processing applications
capitalizing on downstream processing of ASR outputs [3]. For
such large-scale, real-world applications, noise robustness is
becoming an increasingly important core technology since ASR
needs to work in much more difficult acoustic environments
than in the past [4].
A large number of noise-robust ASR methods, in the order
of hundreds, have been proposed and published over the past
30 years or so, and many of them have created significant
impact on either research or commercial use. Such accumu-
lated knowledge deserves thorough examination not only to
define the state of the art in this field from a fresh and unifying
perspective but also to point to fruitful future directions in the
field. Nevertheless, a well-organized framework for relating
and analyzing these methods is conspicuously missing. The
existing survey papers [5]–[13] in noise-robust ASR either do
not cover all recent advances in the field or focus only on a
specific sub-area. Although there are also few recent books
[14], [15], they are collections of topics with each chapter
written by different authors and it is hard to provide a unified
view across all topics. Given the importance of noise-robust
ASR, the time is ripe to analyze and unify the solutions. In
this paper, we elaborate on the basic concepts in noise-robust
ASR and develop categorization criteria and unifying themes.
Specifically, we hierarchically classify the major and signifi-
cant noise-robust ASR methods using a consistent and unifying
mathematical language. We establish their interrelations and
differentiate among important techniques, and discuss current
technical challenges and future research directions. This paper
also identifies relatively promising, short-term new research
areas based on a careful analysis of successful methods, which
can serve as a reference for future algorithm development in
the field. Furthermore, in the literature spanning over 30 years
on noise-robust ASR, there is inconsistent use of basic concepts
2329-9290 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

More Related Content

Viewers also liked

Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
anshu shrivastava
 
Búsqueda avanzada en google
Búsqueda avanzada en googleBúsqueda avanzada en google
Búsqueda avanzada en google
Ramón Gambin Rivero
 
Giuseppe Navarria - Mobile senza Unity: il caso SBK
Giuseppe Navarria - Mobile senza Unity: il caso SBKGiuseppe Navarria - Mobile senza Unity: il caso SBK
Giuseppe Navarria - Mobile senza Unity: il caso SBK
Codemotion
 
Menjar Escombraires
Menjar EscombrairesMenjar Escombraires
Menjar EscombrairesNatudelrey
 
Siltflux Project: Literature Review, Sediment Database & Preliminary Analysis...
Siltflux Project: Literature Review, Sediment Database & Preliminary Analysis...Siltflux Project: Literature Review, Sediment Database & Preliminary Analysis...
Siltflux Project: Literature Review, Sediment Database & Preliminary Analysis...
Environmental Protection Agency, Ireland
 
Case Study Analysis: Cash Flow Productivity at PepsiCo: Communicating Value t...
Case Study Analysis: Cash Flow Productivity at PepsiCo: Communicating Value t...Case Study Analysis: Cash Flow Productivity at PepsiCo: Communicating Value t...
Case Study Analysis: Cash Flow Productivity at PepsiCo: Communicating Value t...
Akash Patil
 
Füzi Otília és Farkas Bertalan Péter: Formativ Értékelés
Füzi Otília és Farkas Bertalan Péter: Formativ ÉrtékelésFüzi Otília és Farkas Bertalan Péter: Formativ Értékelés
Füzi Otília és Farkas Bertalan Péter: Formativ Értékelés
Sulinetwork
 
РуСофт - система видеотрансляций
РуСофт - система видеотрансляцийРуСофт - система видеотрансляций
РуСофт - система видеотрансляций
Lidiya Myalkina
 
A Parallel Data Distribution Management Algorithm
A Parallel Data Distribution Management AlgorithmA Parallel Data Distribution Management Algorithm
A Parallel Data Distribution Management Algorithm
Gabriele D'Angelo
 
спортивные достижения 2 б кл
спортивные  достижения 2 б клспортивные  достижения 2 б кл
спортивные достижения 2 б клArfenia Sarkissian
 
Case Study Analysis: Cineplex Entertainment: The Loyalty Program
Case Study Analysis: Cineplex Entertainment: The Loyalty ProgramCase Study Analysis: Cineplex Entertainment: The Loyalty Program
Case Study Analysis: Cineplex Entertainment: The Loyalty Program
Akash Patil
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCC
Hira Shaukat
 

Viewers also liked (12)

Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Búsqueda avanzada en google
Búsqueda avanzada en googleBúsqueda avanzada en google
Búsqueda avanzada en google
 
Giuseppe Navarria - Mobile senza Unity: il caso SBK
Giuseppe Navarria - Mobile senza Unity: il caso SBKGiuseppe Navarria - Mobile senza Unity: il caso SBK
Giuseppe Navarria - Mobile senza Unity: il caso SBK
 
Menjar Escombraires
Menjar EscombrairesMenjar Escombraires
Menjar Escombraires
 
Siltflux Project: Literature Review, Sediment Database & Preliminary Analysis...
Siltflux Project: Literature Review, Sediment Database & Preliminary Analysis...Siltflux Project: Literature Review, Sediment Database & Preliminary Analysis...
Siltflux Project: Literature Review, Sediment Database & Preliminary Analysis...
 
Case Study Analysis: Cash Flow Productivity at PepsiCo: Communicating Value t...
Case Study Analysis: Cash Flow Productivity at PepsiCo: Communicating Value t...Case Study Analysis: Cash Flow Productivity at PepsiCo: Communicating Value t...
Case Study Analysis: Cash Flow Productivity at PepsiCo: Communicating Value t...
 
Füzi Otília és Farkas Bertalan Péter: Formativ Értékelés
Füzi Otília és Farkas Bertalan Péter: Formativ ÉrtékelésFüzi Otília és Farkas Bertalan Péter: Formativ Értékelés
Füzi Otília és Farkas Bertalan Péter: Formativ Értékelés
 
РуСофт - система видеотрансляций
РуСофт - система видеотрансляцийРуСофт - система видеотрансляций
РуСофт - система видеотрансляций
 
A Parallel Data Distribution Management Algorithm
A Parallel Data Distribution Management AlgorithmA Parallel Data Distribution Management Algorithm
A Parallel Data Distribution Management Algorithm
 
спортивные достижения 2 б кл
спортивные  достижения 2 б клспортивные  достижения 2 б кл
спортивные достижения 2 б кл
 
Case Study Analysis: Cineplex Entertainment: The Loyalty Program
Case Study Analysis: Cineplex Entertainment: The Loyalty ProgramCase Study Analysis: Cineplex Entertainment: The Loyalty Program
Case Study Analysis: Cineplex Entertainment: The Loyalty Program
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCC
 

Similar to An Overview of Noise-Robust Automatic Speech Recognition

Recent advances in LVCSR : A benchmark comparison of performances
Recent advances in LVCSR : A benchmark comparison of performancesRecent advances in LVCSR : A benchmark comparison of performances
Recent advances in LVCSR : A benchmark comparison of performances
IJECEIAES
 
Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...
karthik annam
 
The Evaluation of a Code-Switched Sepedi-English Automatic Speech Recognition...
The Evaluation of a Code-Switched Sepedi-English Automatic Speech Recognition...The Evaluation of a Code-Switched Sepedi-English Automatic Speech Recognition...
The Evaluation of a Code-Switched Sepedi-English Automatic Speech Recognition...
IJCI JOURNAL
 
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
IJMER
 
Automatic Speech Recognition and Machine Learning for Robotic Arm in Surgery
Automatic Speech Recognition and Machine Learning for Robotic Arm in SurgeryAutomatic Speech Recognition and Machine Learning for Robotic Arm in Surgery
Automatic Speech Recognition and Machine Learning for Robotic Arm in Surgery
DR.P.S.JAGADEESH KUMAR
 
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
IRJET Journal
 
AUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYAUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEY
IJCERT
 
Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Sp...
Advances in Automatic Speech Recognition: From Audio-Only  To Audio-Visual Sp...Advances in Automatic Speech Recognition: From Audio-Only  To Audio-Visual Sp...
Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Sp...
IOSR Journals
 
AI for voice recognition.pptx
AI for voice recognition.pptxAI for voice recognition.pptx
AI for voice recognition.pptx
JhalakDashora
 
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
IRJET Journal
 
Comparison and Analysis Of LDM and LMS for an Application of a Speech
Comparison and Analysis Of LDM and LMS for an Application of a SpeechComparison and Analysis Of LDM and LMS for an Application of a Speech
Comparison and Analysis Of LDM and LMS for an Application of a Speech
CSCJournals
 
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATIONSPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
IRJET Journal
 
Emotional telugu speech signals classification based on k nn classifier
Emotional telugu speech signals classification based on k nn classifierEmotional telugu speech signals classification based on k nn classifier
Emotional telugu speech signals classification based on k nn classifier
eSAT Publishing House
 
Emotional telugu speech signals classification based on k nn classifier
Emotional telugu speech signals classification based on k nn classifierEmotional telugu speech signals classification based on k nn classifier
Emotional telugu speech signals classification based on k nn classifier
eSAT Journals
 
GLOVE BASED GESTURE RECOGNITION USING IR SENSOR
GLOVE BASED GESTURE RECOGNITION USING IR SENSORGLOVE BASED GESTURE RECOGNITION USING IR SENSOR
GLOVE BASED GESTURE RECOGNITION USING IR SENSOR
IRJET Journal
 
Bayesian distance metric learning and its application in automatic speaker re...
Bayesian distance metric learning and its application in automatic speaker re...Bayesian distance metric learning and its application in automatic speaker re...
Bayesian distance metric learning and its application in automatic speaker re...
IJECEIAES
 
Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...
ijnlc
 
H42045359
H42045359H42045359
H42045359
IJERA Editor
 
Industrial Applications of Automatic Speech Recognition Systems
Industrial Applications of Automatic Speech Recognition SystemsIndustrial Applications of Automatic Speech Recognition Systems
Industrial Applications of Automatic Speech Recognition Systems
IJERA Editor
 
ASL Fingerspelling Recognition Using Hybrid Deep Learning Architecture
ASL Fingerspelling Recognition Using Hybrid Deep Learning ArchitectureASL Fingerspelling Recognition Using Hybrid Deep Learning Architecture
ASL Fingerspelling Recognition Using Hybrid Deep Learning Architecture
IRJET Journal
 

Similar to An Overview of Noise-Robust Automatic Speech Recognition (20)

Recent advances in LVCSR : A benchmark comparison of performances
Recent advances in LVCSR : A benchmark comparison of performancesRecent advances in LVCSR : A benchmark comparison of performances
Recent advances in LVCSR : A benchmark comparison of performances
 
Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...
 
The Evaluation of a Code-Switched Sepedi-English Automatic Speech Recognition...
The Evaluation of a Code-Switched Sepedi-English Automatic Speech Recognition...The Evaluation of a Code-Switched Sepedi-English Automatic Speech Recognition...
The Evaluation of a Code-Switched Sepedi-English Automatic Speech Recognition...
 
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
 
Automatic Speech Recognition and Machine Learning for Robotic Arm in Surgery
Automatic Speech Recognition and Machine Learning for Robotic Arm in SurgeryAutomatic Speech Recognition and Machine Learning for Robotic Arm in Surgery
Automatic Speech Recognition and Machine Learning for Robotic Arm in Surgery
 
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
 
AUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYAUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEY
 
Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Sp...
Advances in Automatic Speech Recognition: From Audio-Only  To Audio-Visual Sp...Advances in Automatic Speech Recognition: From Audio-Only  To Audio-Visual Sp...
Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Sp...
 
AI for voice recognition.pptx
AI for voice recognition.pptxAI for voice recognition.pptx
AI for voice recognition.pptx
 
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
 
Comparison and Analysis Of LDM and LMS for an Application of a Speech
Comparison and Analysis Of LDM and LMS for an Application of a SpeechComparison and Analysis Of LDM and LMS for an Application of a Speech
Comparison and Analysis Of LDM and LMS for an Application of a Speech
 
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATIONSPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
 
Emotional telugu speech signals classification based on k nn classifier
Emotional telugu speech signals classification based on k nn classifierEmotional telugu speech signals classification based on k nn classifier
Emotional telugu speech signals classification based on k nn classifier
 
Emotional telugu speech signals classification based on k nn classifier
Emotional telugu speech signals classification based on k nn classifierEmotional telugu speech signals classification based on k nn classifier
Emotional telugu speech signals classification based on k nn classifier
 
GLOVE BASED GESTURE RECOGNITION USING IR SENSOR
GLOVE BASED GESTURE RECOGNITION USING IR SENSORGLOVE BASED GESTURE RECOGNITION USING IR SENSOR
GLOVE BASED GESTURE RECOGNITION USING IR SENSOR
 
Bayesian distance metric learning and its application in automatic speaker re...
Bayesian distance metric learning and its application in automatic speaker re...Bayesian distance metric learning and its application in automatic speaker re...
Bayesian distance metric learning and its application in automatic speaker re...
 
Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...Taxonomy extraction from automotive natural language requirements using unsup...
Taxonomy extraction from automotive natural language requirements using unsup...
 
H42045359
H42045359H42045359
H42045359
 
Industrial Applications of Automatic Speech Recognition Systems
Industrial Applications of Automatic Speech Recognition SystemsIndustrial Applications of Automatic Speech Recognition Systems
Industrial Applications of Automatic Speech Recognition Systems
 
ASL Fingerspelling Recognition Using Hybrid Deep Learning Architecture
ASL Fingerspelling Recognition Using Hybrid Deep Learning ArchitectureASL Fingerspelling Recognition Using Hybrid Deep Learning Architecture
ASL Fingerspelling Recognition Using Hybrid Deep Learning Architecture
 

More from Projectsatbangalore

Iot projects-2018-2019
Iot projects-2018-2019Iot projects-2018-2019
Iot projects-2018-2019
Projectsatbangalore
 
Voltage Support Control Strategies for Static Synchronous Compensators Under ...
Voltage Support Control Strategies for Static Synchronous Compensators Under ...Voltage Support Control Strategies for Static Synchronous Compensators Under ...
Voltage Support Control Strategies for Static Synchronous Compensators Under ...
Projectsatbangalore
 
Two-Stage Power Conversion Architecture Suitable for Wide Range Input Voltage
Two-Stage Power Conversion Architecture Suitable for Wide Range Input VoltageTwo-Stage Power Conversion Architecture Suitable for Wide Range Input Voltage
Two-Stage Power Conversion Architecture Suitable for Wide Range Input Voltage
Projectsatbangalore
 
Topology and Modulation for a New Multilevel Diode-Clamped Matrix Converter
Topology and Modulation for a New Multilevel Diode-Clamped Matrix ConverterTopology and Modulation for a New Multilevel Diode-Clamped Matrix Converter
Topology and Modulation for a New Multilevel Diode-Clamped Matrix Converter
Projectsatbangalore
 
Texture descriptor based on local combination adaptive ternary pattern
Texture descriptor based on local combination adaptive ternary patternTexture descriptor based on local combination adaptive ternary pattern
Texture descriptor based on local combination adaptive ternary pattern
Projectsatbangalore
 
Similarity validation based nonlocal means image denoising
Similarity validation based nonlocal means image denoisingSimilarity validation based nonlocal means image denoising
Similarity validation based nonlocal means image denoising
Projectsatbangalore
 
Preamble-Based LMMSE Channel Estimation in OFDM/OQAM Modulation
Preamble-Based LMMSE Channel Estimation in OFDM/OQAM ModulationPreamble-Based LMMSE Channel Estimation in OFDM/OQAM Modulation
Preamble-Based LMMSE Channel Estimation in OFDM/OQAM Modulation
Projectsatbangalore
 
Particle filter framework for salient object detection in videos
Particle filter framework for salient object detection in videosParticle filter framework for salient object detection in videos
Particle filter framework for salient object detection in videos
Projectsatbangalore
 
New Three-Phase Multilevel Inverter With Reduced Number of Power Electronic C...
New Three-Phase Multilevel Inverter With Reduced Number of Power Electronic C...New Three-Phase Multilevel Inverter With Reduced Number of Power Electronic C...
New Three-Phase Multilevel Inverter With Reduced Number of Power Electronic C...
Projectsatbangalore
 
MIMO-OFDM PAPR Reduction by Residue Number System
MIMO-OFDM PAPR Reduction by Residue Number SystemMIMO-OFDM PAPR Reduction by Residue Number System
MIMO-OFDM PAPR Reduction by Residue Number System
Projectsatbangalore
 
Learning View-Model Joint Relevance for 3D Object Retrieval
Learning View-Model Joint Relevance for 3D Object RetrievalLearning View-Model Joint Relevance for 3D Object Retrieval
Learning View-Model Joint Relevance for 3D Object Retrieval
Projectsatbangalore
 
Improving Power Quality in Cascade Multilevel Converters Based on Single-Phas...
Improving Power Quality in Cascade Multilevel Converters Based on Single-Phas...Improving Power Quality in Cascade Multilevel Converters Based on Single-Phas...
Improving Power Quality in Cascade Multilevel Converters Based on Single-Phas...
Projectsatbangalore
 
High-Voltage Gain Boost Converter Based on Three-State Commutation Cell for B...
High-Voltage Gain Boost Converter Based on Three-State Commutation Cell for B...High-Voltage Gain Boost Converter Based on Three-State Commutation Cell for B...
High-Voltage Gain Boost Converter Based on Three-State Commutation Cell for B...
Projectsatbangalore
 
High Efficiency Resonant dc/dc Converter Utilizing a Resistance Compression N...
High Efficiency Resonant dc/dc Converter Utilizing a Resistance Compression N...High Efficiency Resonant dc/dc Converter Utilizing a Resistance Compression N...
High Efficiency Resonant dc/dc Converter Utilizing a Resistance Compression N...
Projectsatbangalore
 
Four-Way Microstrip-Based Power Combining for Microwave Outphasing Power Ampl...
Four-Way Microstrip-Based Power Combining for Microwave Outphasing Power Ampl...Four-Way Microstrip-Based Power Combining for Microwave Outphasing Power Ampl...
Four-Way Microstrip-Based Power Combining for Microwave Outphasing Power Ampl...
Projectsatbangalore
 
Excitation Synchronous Wind Power Generators With Maximum Power Tracking Scheme
Excitation Synchronous Wind Power Generators With Maximum Power Tracking SchemeExcitation Synchronous Wind Power Generators With Maximum Power Tracking Scheme
Excitation Synchronous Wind Power Generators With Maximum Power Tracking Scheme
Projectsatbangalore
 
Efficient Road Detection and Tracking for Unmanned Aerial Vehicle
Efficient Road Detection and Tracking for Unmanned Aerial VehicleEfficient Road Detection and Tracking for Unmanned Aerial Vehicle
Efficient Road Detection and Tracking for Unmanned Aerial Vehicle
Projectsatbangalore
 
Design of Single-Switch Inverters for Variable Resistance / Load Modulation O...
Design of Single-Switch Inverters for Variable Resistance / Load Modulation O...Design of Single-Switch Inverters for Variable Resistance / Load Modulation O...
Design of Single-Switch Inverters for Variable Resistance / Load Modulation O...
Projectsatbangalore
 
Dc microgrid for wind and solar power integration
Dc microgrid for wind and solar power integrationDc microgrid for wind and solar power integration
Dc microgrid for wind and solar power integration
Projectsatbangalore
 
Control of a Bidirectional Converter to Interface Ultracapacitor with Renewab...
Control of a Bidirectional Converter to Interface Ultracapacitor with Renewab...Control of a Bidirectional Converter to Interface Ultracapacitor with Renewab...
Control of a Bidirectional Converter to Interface Ultracapacitor with Renewab...
Projectsatbangalore
 

More from Projectsatbangalore (20)

Iot projects-2018-2019
Iot projects-2018-2019Iot projects-2018-2019
Iot projects-2018-2019
 
Voltage Support Control Strategies for Static Synchronous Compensators Under ...
Voltage Support Control Strategies for Static Synchronous Compensators Under ...Voltage Support Control Strategies for Static Synchronous Compensators Under ...
Voltage Support Control Strategies for Static Synchronous Compensators Under ...
 
Two-Stage Power Conversion Architecture Suitable for Wide Range Input Voltage
Two-Stage Power Conversion Architecture Suitable for Wide Range Input VoltageTwo-Stage Power Conversion Architecture Suitable for Wide Range Input Voltage
Two-Stage Power Conversion Architecture Suitable for Wide Range Input Voltage
 
Topology and Modulation for a New Multilevel Diode-Clamped Matrix Converter
Topology and Modulation for a New Multilevel Diode-Clamped Matrix ConverterTopology and Modulation for a New Multilevel Diode-Clamped Matrix Converter
Topology and Modulation for a New Multilevel Diode-Clamped Matrix Converter
 
Texture descriptor based on local combination adaptive ternary pattern
Texture descriptor based on local combination adaptive ternary patternTexture descriptor based on local combination adaptive ternary pattern
Texture descriptor based on local combination adaptive ternary pattern
 
Similarity validation based nonlocal means image denoising
Similarity validation based nonlocal means image denoisingSimilarity validation based nonlocal means image denoising
Similarity validation based nonlocal means image denoising
 
Preamble-Based LMMSE Channel Estimation in OFDM/OQAM Modulation
Preamble-Based LMMSE Channel Estimation in OFDM/OQAM ModulationPreamble-Based LMMSE Channel Estimation in OFDM/OQAM Modulation
Preamble-Based LMMSE Channel Estimation in OFDM/OQAM Modulation
 
Particle filter framework for salient object detection in videos
Particle filter framework for salient object detection in videosParticle filter framework for salient object detection in videos
Particle filter framework for salient object detection in videos
 
New Three-Phase Multilevel Inverter With Reduced Number of Power Electronic C...
New Three-Phase Multilevel Inverter With Reduced Number of Power Electronic C...New Three-Phase Multilevel Inverter With Reduced Number of Power Electronic C...
New Three-Phase Multilevel Inverter With Reduced Number of Power Electronic C...
 
MIMO-OFDM PAPR Reduction by Residue Number System
MIMO-OFDM PAPR Reduction by Residue Number SystemMIMO-OFDM PAPR Reduction by Residue Number System
MIMO-OFDM PAPR Reduction by Residue Number System
 
Learning View-Model Joint Relevance for 3D Object Retrieval
Learning View-Model Joint Relevance for 3D Object RetrievalLearning View-Model Joint Relevance for 3D Object Retrieval
Learning View-Model Joint Relevance for 3D Object Retrieval
 
Improving Power Quality in Cascade Multilevel Converters Based on Single-Phas...
Improving Power Quality in Cascade Multilevel Converters Based on Single-Phas...Improving Power Quality in Cascade Multilevel Converters Based on Single-Phas...
Improving Power Quality in Cascade Multilevel Converters Based on Single-Phas...
 
High-Voltage Gain Boost Converter Based on Three-State Commutation Cell for B...
High-Voltage Gain Boost Converter Based on Three-State Commutation Cell for B...High-Voltage Gain Boost Converter Based on Three-State Commutation Cell for B...
High-Voltage Gain Boost Converter Based on Three-State Commutation Cell for B...
 
High Efficiency Resonant dc/dc Converter Utilizing a Resistance Compression N...
High Efficiency Resonant dc/dc Converter Utilizing a Resistance Compression N...High Efficiency Resonant dc/dc Converter Utilizing a Resistance Compression N...
High Efficiency Resonant dc/dc Converter Utilizing a Resistance Compression N...
 
Four-Way Microstrip-Based Power Combining for Microwave Outphasing Power Ampl...
Four-Way Microstrip-Based Power Combining for Microwave Outphasing Power Ampl...Four-Way Microstrip-Based Power Combining for Microwave Outphasing Power Ampl...
Four-Way Microstrip-Based Power Combining for Microwave Outphasing Power Ampl...
 
Excitation Synchronous Wind Power Generators With Maximum Power Tracking Scheme
Excitation Synchronous Wind Power Generators With Maximum Power Tracking SchemeExcitation Synchronous Wind Power Generators With Maximum Power Tracking Scheme
Excitation Synchronous Wind Power Generators With Maximum Power Tracking Scheme
 
Efficient Road Detection and Tracking for Unmanned Aerial Vehicle
Efficient Road Detection and Tracking for Unmanned Aerial VehicleEfficient Road Detection and Tracking for Unmanned Aerial Vehicle
Efficient Road Detection and Tracking for Unmanned Aerial Vehicle
 
Design of Single-Switch Inverters for Variable Resistance / Load Modulation O...
Design of Single-Switch Inverters for Variable Resistance / Load Modulation O...Design of Single-Switch Inverters for Variable Resistance / Load Modulation O...
Design of Single-Switch Inverters for Variable Resistance / Load Modulation O...
 
Dc microgrid for wind and solar power integration
Dc microgrid for wind and solar power integrationDc microgrid for wind and solar power integration
Dc microgrid for wind and solar power integration
 
Control of a Bidirectional Converter to Interface Ultracapacitor with Renewab...
Control of a Bidirectional Converter to Interface Ultracapacitor with Renewab...Control of a Bidirectional Converter to Interface Ultracapacitor with Renewab...
Control of a Bidirectional Converter to Interface Ultracapacitor with Renewab...
 

Recently uploaded

BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
Katrina Pritchard
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
Krassimira Luka
 
B. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdfB. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdf
BoudhayanBhattachari
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Denish Jangid
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
imrankhan141184
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
GeorgeMilliken2
 
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPLAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
RAHUL
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
Nicholas Montgomery
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
Nguyen Thanh Tu Collection
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
HajraNaeem15
 
Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47
MysoreMuleSoftMeetup
 
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Leena Ghag-Sakpal
 
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
PECB
 
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdfIGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
Amin Marwan
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
haiqairshad
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
eBook.com.bd (প্রয়োজনীয় বাংলা বই)
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
History of Stoke Newington
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
Nicholas Montgomery
 

Recently uploaded (20)

BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
 
B. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdfB. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdf
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
 
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPLAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
 
Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47
 
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
 
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
 
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdfIGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
 

An Overview of Noise-Robust Automatic Speech Recognition

  • 1. www.projectsatbangalore.com 09591912372 IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 22, NO. 4, APRIL 2014 745 An Overview of Noise-Robust Automatic Speech Recognition Jinyu Li, Member, IEEE, Li Deng, Fellow, IEEE, Yifan Gong, Senior Member, IEEE, and Reinhold Haeb-Umbach, Senior Member, IEEE Abstract—New waves of consumer-centric applications, such as voice search and voice interaction with mobile devices and home entertainment systems, increasingly require automatic speech recognition (ASR) to be robust to the full range of real-world noise and other acoustic distorting conditions. Despite its practical importance, however, the inherent links between and distinctions among the myriad of methods for noise-robust ASR have yet to be carefully studied in order to advance the field further. To this end, it is critical to establish a solid, consistent, and common mathematical foundation for noise-robust ASR, which is lacking at present. This article is intended to fill this gap and to provide a thorough overview of modern noise-robust techniques for ASR developed over the past 30 years. We emphasize methods that are proven to be successful and that are likely to sustain or expand their future applicability. We distill key insights from our com- prehensive overview in this field and take a fresh look at a few old problems, which nevertheless are still highly relevant today. Specifically, we have analyzed and categorized a wide range of noise-robust techniques using five different criteria: 1) feature-do- main vs. model-domain processing, 2) the use of prior knowledge about the acoustic environment distortion, 3) the use of explicit environment-distortion models, 4) deterministic vs. uncertainty processing, and 5) the use of acoustic models trained jointly with the same feature enhancement or model adaptation process used in the testing stage. With this taxonomy-oriented review, we equip the reader with the insight to choose among techniques and with the awareness of the performance-complexity tradeoffs. The pros and cons of using different noise-robust ASR techniques in practical application scenarios are provided as a guide to inter- ested practitioners. The current challenges and future research directions in this field is also carefully analyzed. Index Terms—Speech recognition, noise, robustness, distortion modeling, compensation, uncertainty processing, joint model training . I. INTRODUCTION AUTOMATIC speech recognition (ASR) is the process and the related technology for converting the speech signal into its corresponding sequence of words or other linguistic entities by means of algorithms implemented in a Manuscript received May 21, 2013; revised October 09, 2013; accepted Jan- uary 23, 2014. Date of publication February 05, 2014; date of current version February 19, 2014. The associate editor coordinating the review of this manu- script and approving it for publication was Prof. DeLiang Wang. J. Li, L. Deng, and Y. Gong are with Microsoft Corporation, Redmond, WA 98052-6399 USA (e-mail: jinyli@microsoft.com; deng@microsoft.com; ygong@microsoft.com). R. Haeb-Umbach is with the Department of Communications Engi- neering, University of Paderborn, 33098 Paderborn, Germany (e-mail: haeb@nt.uni-paderborn.de). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TASLP.2014.2304637 device, a computer, or computer clusters [1], [2]. Historically, ASR applications have included voice dialing, call routing, interactive voice response, data entry and dictation, voice command and control, structured document creation (e.g., medical and legal transcriptions), appliance control by voice, computer-aided language learning, content-based spoken audio search, and robotics. More recently, with the exponential growth of big data and computing power, ASR technology has advanced to the stage where more challenging applications are becoming a reality. Examples are voice search and interactions with mobile devices (e.g., Siri on iPhone, Bing voice search on winPhone, and Google Now on Android), voice control in home entertainment systems (e.g., Kinect on xBox), and various speech-centric information processing applications capitalizing on downstream processing of ASR outputs [3]. For such large-scale, real-world applications, noise robustness is becoming an increasingly important core technology since ASR needs to work in much more difficult acoustic environments than in the past [4]. A large number of noise-robust ASR methods, in the order of hundreds, have been proposed and published over the past 30 years or so, and many of them have created significant impact on either research or commercial use. Such accumu- lated knowledge deserves thorough examination not only to define the state of the art in this field from a fresh and unifying perspective but also to point to fruitful future directions in the field. Nevertheless, a well-organized framework for relating and analyzing these methods is conspicuously missing. The existing survey papers [5]–[13] in noise-robust ASR either do not cover all recent advances in the field or focus only on a specific sub-area. Although there are also few recent books [14], [15], they are collections of topics with each chapter written by different authors and it is hard to provide a unified view across all topics. Given the importance of noise-robust ASR, the time is ripe to analyze and unify the solutions. In this paper, we elaborate on the basic concepts in noise-robust ASR and develop categorization criteria and unifying themes. Specifically, we hierarchically classify the major and signifi- cant noise-robust ASR methods using a consistent and unifying mathematical language. We establish their interrelations and differentiate among important techniques, and discuss current technical challenges and future research directions. This paper also identifies relatively promising, short-term new research areas based on a careful analysis of successful methods, which can serve as a reference for future algorithm development in the field. Furthermore, in the literature spanning over 30 years on noise-robust ASR, there is inconsistent use of basic concepts 2329-9290 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.