2. COURSE OBJECTIVES:
• Study the fundamentals of speech signal and
extracs various speech features
• Understand different speech coding
techniques for speech compression
applications
• Learn to build speech enhancement, text-to-
speech synthesis system
3. UNIT I FUNDAMENTALS OF SPEECH
6
• The Human speech production mechanism,
Discrete-Time model of speech production,
Speech perception - human auditory system,
Phonetics - articulatory phonetics, acoustic
phonetics, and auditory phonetics, Categorization
of speech sounds, Spectrographic analysis of
speech sounds, Pitch frequency, Pitch period
measurement using spectral and cepstral domain,
Formants, Evaluation of Formants for voiced and
unvoiced speech.
4. UNIT II SPEECH FEATURES AND
DISTORTION MEASURES 6
• Significance of speech features in speech-
based applications, Speech Features – Cepstral
Coefficients, Mel Frequency Cepstral
Coefficients (MFCCs), Perceptual Linear
Prediction (PLP), Log Frequency Power
Coefficients (LFPCs), Speech distortion
measures–Simplified distance measure, LPC-
based distance measure, Spectral distortion
measure, Perceptual distortion measure. 112
5. UNIT III SPEECH CODING
6
• Need for speech coding, Waveform coding of
speech – PCM, Adaptive PCM, DPCM, ADPCM,
Delta Modulation, Adaptive Delta Modulation,
G.726 Standard for ADPCM, Parametric
Speech Coding – Channel Vocoders, Linear
Prediction Based Vocoders, Code Excited
Linear Prediction (CELP) based Vocoders,
Sinusoidal speech coding techniques, Hybrid
coder, Transform domain coding of speech
6. UNIT IV SPEECH ENHANCEMENT
6
• Classes of Speech Enhancement Algorithms,
Spectral-Subtractive Algorithms - Multiband
Spectral Subtraction, MMSE Spectral Subtraction
Algorithm, Spectral Subtraction Based on
Perceptual Properties, Wiener Filtering - Wiener
Filters in the Time Domain, Wiener Filters in the
Frequency Domain, Wiener Filters for Noise
Reduction, Maximum-Likelihood Estimators,
Bayesian Estimators, MMSE and Log-MMSE
Estimator, Subspace Algorithms.
7. UNIT V SPEECH SYNTHESIS AND
APPLICATION 6
• A Text-to-Speech systems (TTS), Synthesizers
technologies – Concatenative synthesis, Use of
Formants for concatenative synthesis, Use of
LPC for concatenative synthesis, HMM-based
synthesis, Sinewave synthesis, Speech
transformations, Watermarking for
authentication of a speech, Emotion
recognition from speech.
30 PERIODS