SlideShare a Scribd company logo
1 of 33
Speech Coding
Speech Coding Techniques
What’s the need for speech
coding ?
Necessary in order to represent human speech in a
digital form
Applications: mobile/telephone communication, voice
over IP
Code efficiency (high quality, fewer bits) is a must
Components of a speech coding
system
Speech Encoding System
5
Example of coding techniques
ZIP: no transformation nor quantization, apply VLC (LZW)
to the stream of letters (symbols) in a file directly, lossless
coding
PCM for speech: no transformation, quantize the speech
samples directly, apply fixed length binary coding
ADPCM for speech: apply prediction to original samples,
the predictor is adapted from one speech frame to the next,
quantize the prediction error, error symbols coded using
fixed length binary coding
JPEG for image: apply discrete cosine transform to blocks
of image pixels, quantize the transformed coefficients,
code the quantized coefficients using variable length
coding (runlength + Huffman coding)
The aim of speech coding is to enhance the quality of a speech
signal at a particular bit-rate or to minimize the bit-rate at a given
quality.
The bit-rate at which the speech is to be transmitted or stored
depends on the rate of transmission or storage, the computation of
coding the digital speech signal and the quality of the speech signal
required.
7
Low bit-rate
High speech quality
Robustness to different speakers/languages
Channel errors
Low memory requirements
Less computational complexity
Low coding delay
8
Speech Encoding System
9
10
Necessity
Efficient speech-coding techniques
Digital streams of ones and zeros
The lower the bandwidth, the lower the quality
Processing power
The better quality (for a given bandwidth) uses a more
complex algorithm
A balance between quality and cost
Voice Quality
Bandwidth is easily quantified
Voice quality is subjective
MOS, Mean Opinion Score
ITU-T Recommendation P.800
Excellent – 5
Good – 4
Fair – 3
Poor – 2
Bad – 1
A minimum of 30 people
Listen to voice samples or in conversations
About Speech
Speech
Air pushed from the lungs past the vocal cords and along
the vocal tract
The basic vibrations – vocal cords
The sound is altered by the disposition of the vocal tract
( tongue and mouth)
Model the vocal tract as a filter
The shape changes relatively slowly
The vibrations at the vocal cords
The excitation signal
14
Speech Compression
Handling speech with other media information such as text, images,
video, and data is the essential part of multimedia applications
The ideal speech coder has a low bit-rate, high perceived quality,
low signal delay, and low complexity.
Delay
Less than 150 ms one-way end-to-end delay for a conversation
Processing (coding) delay, network delay
Complexity
Computational complexity of speech coders depends on
algorithms
Contributes to achievable bit-rate and processing delay
15
Speech coding
Standard voice channel:
analog: 4 kHz slot (~ 40 dB SNR)
digital: 64 Kbps = 8 bit µ-law x 8 kHz
How to compress?
Exploit redundancy
signal assumed to be a single voice, not any waveform
Code only what is needed
intelligibility
speaker identification
Source-filter decomposition
vocal tract shape & fundamental frequency change slowly
16
Taxonomy of Speech Coders
Speech Coders
Waveform Coders Source Coders
Time Domain:
PCM, ADPCM
Frequency Domain:
e.g. Sub-band coder,
Adaptive transform
coder
Linear
Predictive
Coder
Vocoder
Speech coders are classified based on the
bit-rate at which they produce output with
reasonable quality and on the type of
coding techniques used for coding the
speech signal.
17
Classification by Bit-Rate
Type of coder Bit-Rate Range
High bit-rate coders >15 Kbps
Medium bit-rate coders 5 to 15 Kbps
Low bit-rate coders 2 to 5 Kbps
Very Low bit-rate coders <2 Kbps
18
Classification by
Coding Techniques
19
 Waveform coders
 Parametric coders
 Hybrid coders
Types of Speech Codecs
Waveform codecs,source codecs (also known as vocoders),and
hybrid codecs.
21
The ancestor: Channel Vocoder (1940s-1960s)
Source-filter decomposition
filterbank breaks into spectral bands
transmit slowly-changing energy in each band
10-20 bands, perceptually spaced
Downsampling
Excitation with a pitch / noise model
22
LPC encoding
The classic source-filter model
Compression gains:
filter parameters are ~slowly changing
excitation can be represented many ways
23
Linear Predictive Code
Model speech production system
as an auto-regressive model:
Model parameters are computed
for speech segment (~30 ms).
Parameters {a(k); k=1:p} are found
by solving a Toeplitz system of
equations.
Transfer function
To encode speech, one may
transmit the quantized parameters
{a(k)} and G or equivalent
parameter set.
The model order is 8-10 in most
speech coding standards.
)()()()(
1
neknskans
p
k
+−= ∑= ∑=
−
−
== p
k
k
zka
G
zE
zS
zH
1
)(1
)(
)(
)(
unvoiced
G
v/u
voiced
N
random
sequence
generator
periodic
pulse
train
generator
× H(z) = 1
1− ∑ akz-k
P
k = 1
u[n]
Vocal Tract Model
Excitation parameters
v/u : voiced/ unvoiced
N : pitch for voiced
G : signal gain
→ excitation signal u[n]
Vocal Tract parameters
{ak
} : LPC coefficients
→formant structure of
speech signals
24
Excitation parameters
v/u : voiced/ unvoiced
N : pitch for voiced
G : signal gain
→ excitation signal u[n]
Vocal Tract parameters
{ak
} : LPC coefficients
→formant structure of
speech signals
unvoiced
G
v/u
voiced
N
random
sequence
generator
periodic
pulse
train
generator
× H(z) = 1
1− ∑ akz-k
P
k = 1
u[n]
Vocal Tract Model
25
LPC Speech Coder
Buffer
LPC
filter
Voice/
Un-voice
Pitch
Analysis
Encoder Decoder
Synthesizer
Excitation
Channel
26
Encoding LPC filter parameters
For ‘communications quality’:
8 kHz sampling (4 kHz bandwidth)
~10th order LPC (up to 5 pole pairs)
update every 20-30 ms → 300 - 500 param/s
Representation & quantization
{ai} - poor distribution,
can’t interpolate
reflection coefficients {ki}:
guaranteed stable
log area ratios (LAR) - stable
Bit allocation (filter):
GSM (13 kbps):
8 LARs x 3-6 bits / 20 ms = 1.8 Kbps
27
Excitation
Excitation as LPC residual is already better than raw signal:
save several bits/sample, still > 32 Kbps
Crude model: U/V flag + pitch period
~ 7 bits / 5 ms = 1.4 Kbps → LPC10 @ 2.4 Kbps
28
CELP
Code excited linear predictive (CELP) speech coding.
White noise input does not give satisfactory results:
the residue sequence still contains important information for
speech synthesis
it is necessary to send the residue to receiving end too.
To save space, use vector quantization (VQ) technique to encode
the residue sequence
Hence the name “code excited”.
In CELP, each code book is a linear vector containing 0 or ±1
each code word length is 60 samples
successive code words are overlapped by 58 samples
a linear search is performed to find the best code words as input
to the LPC model.
29
CELP
Represent excitation with codebook
e.g. 512 sparse excitation vectors
linear search for minimum weighted error?
30
GSM Speech Encoder
Hamming
Window
Segmentation
Pre-emphasis
Short
Term
Prediction
LPC
Inverse
Filter
Long
Term
Prediction
+ LPF
Grid
Selection
MUX
Gain, pitch
LAR coefficients
Speech
input
Pre-processing STP LTP Regular pulse excitation
(RPE)
20ms
Order = 8
31
GSM Decoding
RPE
Decoding
LTP
Synthesis
STP
Synthesis
Post-
Processing
De-Mux
Pitch, gain
LAR Coefficients
32
Implementation Issues
Tasks:
LPC analysis filter to calculate
the coefficients
Long term prediction for pitch
analysis need to find delay D
and gain
VQ search during CELP
encoding – Most time
consuming
FIR filtering for pre- and post
processing
Often implemented in DSP chips
for embedded applications (e.g.
cell phone).
The parameter quantization part
needs bit-level operation.
Know more !!
33

More Related Content

What's hot

Frequency hopping spread spectrum
Frequency hopping spread spectrumFrequency hopping spread spectrum
Frequency hopping spread spectrumHarshit Gupta
 
NYQUIST CRITERION FOR ZERO ISI
NYQUIST CRITERION FOR ZERO ISINYQUIST CRITERION FOR ZERO ISI
NYQUIST CRITERION FOR ZERO ISIFAIZAN SHAFI
 
Concept of Diversity & Fading (wireless communication)
Concept of Diversity & Fading (wireless communication)Concept of Diversity & Fading (wireless communication)
Concept of Diversity & Fading (wireless communication)Omkar Rane
 
Small scale fading
Small scale fadingSmall scale fading
Small scale fadingAJAL A J
 
Introduction To Wireless Fading Channels
Introduction To Wireless Fading ChannelsIntroduction To Wireless Fading Channels
Introduction To Wireless Fading ChannelsNitin Jain
 
Chap 5 (small scale fading)
Chap 5 (small scale fading)Chap 5 (small scale fading)
Chap 5 (small scale fading)asadkhan1327
 
10. types of small scale fading
10. types of small scale fading10. types of small scale fading
10. types of small scale fadingJAIGANESH SEKAR
 
Delta modulation
Delta modulationDelta modulation
Delta modulationmpsrekha83
 
4.5 equalizers and its types
4.5   equalizers and its types4.5   equalizers and its types
4.5 equalizers and its typesJAIGANESH SEKAR
 
Pulse modulation
Pulse modulationPulse modulation
Pulse modulationstk_gpg
 
Link power and rise time budget analysis
Link power and rise time budget analysisLink power and rise time budget analysis
Link power and rise time budget analysisCKSunith1
 
Travelling Wave, Broadband Antennas, Frequency-independent Antennas
Travelling Wave, Broadband Antennas,  Frequency-independent AntennasTravelling Wave, Broadband Antennas,  Frequency-independent Antennas
Travelling Wave, Broadband Antennas, Frequency-independent AntennasRoma Rico Flores
 

What's hot (20)

Speech coding techniques
Speech coding techniquesSpeech coding techniques
Speech coding techniques
 
Frequency hopping spread spectrum
Frequency hopping spread spectrumFrequency hopping spread spectrum
Frequency hopping spread spectrum
 
NYQUIST CRITERION FOR ZERO ISI
NYQUIST CRITERION FOR ZERO ISINYQUIST CRITERION FOR ZERO ISI
NYQUIST CRITERION FOR ZERO ISI
 
Concept of Diversity & Fading (wireless communication)
Concept of Diversity & Fading (wireless communication)Concept of Diversity & Fading (wireless communication)
Concept of Diversity & Fading (wireless communication)
 
Small scale fading
Small scale fadingSmall scale fading
Small scale fading
 
Introduction To Wireless Fading Channels
Introduction To Wireless Fading ChannelsIntroduction To Wireless Fading Channels
Introduction To Wireless Fading Channels
 
Chap 5 (small scale fading)
Chap 5 (small scale fading)Chap 5 (small scale fading)
Chap 5 (small scale fading)
 
IS-95 Cdma
IS-95 CdmaIS-95 Cdma
IS-95 Cdma
 
Pulse modulation
Pulse modulationPulse modulation
Pulse modulation
 
Equalization
EqualizationEqualization
Equalization
 
OFDM
OFDMOFDM
OFDM
 
10. types of small scale fading
10. types of small scale fading10. types of small scale fading
10. types of small scale fading
 
Delta modulation
Delta modulationDelta modulation
Delta modulation
 
4.5 equalizers and its types
4.5   equalizers and its types4.5   equalizers and its types
4.5 equalizers and its types
 
Pulse modulation
Pulse modulationPulse modulation
Pulse modulation
 
Link power and rise time budget analysis
Link power and rise time budget analysisLink power and rise time budget analysis
Link power and rise time budget analysis
 
Travelling Wave, Broadband Antennas, Frequency-independent Antennas
Travelling Wave, Broadband Antennas,  Frequency-independent AntennasTravelling Wave, Broadband Antennas,  Frequency-independent Antennas
Travelling Wave, Broadband Antennas, Frequency-independent Antennas
 
Line coding
Line codingLine coding
Line coding
 
Modulation
ModulationModulation
Modulation
 
Multiple Access
Multiple AccessMultiple Access
Multiple Access
 

Similar to Speech encoding techniques

Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compressionneeraj9217
 
Digital Transmission Fundamentals
Digital Transmission FundamentalsDigital Transmission Fundamentals
Digital Transmission FundamentalsAisu
 
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.pptZeyadAlabsy
 
adaptive multirate speech coding
adaptive multirate speech codingadaptive multirate speech coding
adaptive multirate speech codingAbhiram Subhagan
 
Voice over IP (VoIP)
Voice over IP (VoIP)Voice over IP (VoIP)
Voice over IP (VoIP)Peter R. Egli
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive CodingSrishti Kakade
 
3. digital transmission fundamentals
3. digital transmission fundamentals3. digital transmission fundamentals
3. digital transmission fundamentalsRovin Valencia
 
Multimedia Compression and Communication
Multimedia Compression and CommunicationMultimedia Compression and Communication
Multimedia Compression and CommunicationBenesh Selvanesan
 
Low power fpga solution for dab audio decoder
Low power fpga solution for dab audio decoderLow power fpga solution for dab audio decoder
Low power fpga solution for dab audio decodereSAT Publishing House
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPCDisha Modi
 
Digitization of Audio.ppt
Digitization of Audio.pptDigitization of Audio.ppt
Digitization of Audio.pptVideoguy
 
TeleCom Lecture 07.ppt
TeleCom Lecture 07.pptTeleCom Lecture 07.ppt
TeleCom Lecture 07.pptRiyaBatool
 

Similar to Speech encoding techniques (20)

Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compression
 
Ijetr021253
Ijetr021253Ijetr021253
Ijetr021253
 
Digital Transmission Fundamentals
Digital Transmission FundamentalsDigital Transmission Fundamentals
Digital Transmission Fundamentals
 
add9.5.ppt
add9.5.pptadd9.5.ppt
add9.5.ppt
 
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
 
adaptive multirate speech coding
adaptive multirate speech codingadaptive multirate speech coding
adaptive multirate speech coding
 
Digital audio
Digital audioDigital audio
Digital audio
 
Voice over IP (VoIP)
Voice over IP (VoIP)Voice over IP (VoIP)
Voice over IP (VoIP)
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive Coding
 
3. digital transmission fundamentals
3. digital transmission fundamentals3. digital transmission fundamentals
3. digital transmission fundamentals
 
Mk3422222228
Mk3422222228Mk3422222228
Mk3422222228
 
Multimedia Compression and Communication
Multimedia Compression and CommunicationMultimedia Compression and Communication
Multimedia Compression and Communication
 
Multimedia Services: Audio
Multimedia Services: AudioMultimedia Services: Audio
Multimedia Services: Audio
 
Low power fpga solution for dab audio decoder
Low power fpga solution for dab audio decoderLow power fpga solution for dab audio decoder
Low power fpga solution for dab audio decoder
 
Speaker Segmentation (2006)
Speaker Segmentation (2006)Speaker Segmentation (2006)
Speaker Segmentation (2006)
 
05 signal encodingtechniques
05 signal encodingtechniques05 signal encodingtechniques
05 signal encodingtechniques
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPC
 
40120140504002
4012014050400240120140504002
40120140504002
 
Digitization of Audio.ppt
Digitization of Audio.pptDigitization of Audio.ppt
Digitization of Audio.ppt
 
TeleCom Lecture 07.ppt
TeleCom Lecture 07.pptTeleCom Lecture 07.ppt
TeleCom Lecture 07.ppt
 

More from Hemaraja Nayaka S

More from Hemaraja Nayaka S (15)

Anatomy and Neurophysiology of Swallowing
Anatomy and Neurophysiology of SwallowingAnatomy and Neurophysiology of Swallowing
Anatomy and Neurophysiology of Swallowing
 
surface dyslexia
surface dyslexia surface dyslexia
surface dyslexia
 
1. models of word recognition
1. models of word recognition1. models of word recognition
1. models of word recognition
 
Speech technology basics
Speech technology   basicsSpeech technology   basics
Speech technology basics
 
Management of articulation
Management of articulationManagement of articulation
Management of articulation
 
stuttering &amp; nnf
stuttering &amp; nnfstuttering &amp; nnf
stuttering &amp; nnf
 
Articulatory dynamics in sttg
Articulatory dynamics in sttgArticulatory dynamics in sttg
Articulatory dynamics in sttg
 
Linguistic n prosodic basis
Linguistic n prosodic basisLinguistic n prosodic basis
Linguistic n prosodic basis
 
Physiological basis of fluency disorders
Physiological basis of fluency disordersPhysiological basis of fluency disorders
Physiological basis of fluency disorders
 
1. fluency introduction
1. fluency introduction1. fluency introduction
1. fluency introduction
 
laryngeal dynamics in stuttering
 laryngeal  dynamics in stuttering laryngeal  dynamics in stuttering
laryngeal dynamics in stuttering
 
Rhythm of speech
Rhythm of speech Rhythm of speech
Rhythm of speech
 
Disfluency types
Disfluency typesDisfluency types
Disfluency types
 
1. fluency introduction
1. fluency introduction1. fluency introduction
1. fluency introduction
 
Neuroanatomy of language functions
Neuroanatomy of language functionsNeuroanatomy of language functions
Neuroanatomy of language functions
 

Recently uploaded

Low Rate Call Girls Patna Anika 8250192130 Independent Escort Service Patna
Low Rate Call Girls Patna Anika 8250192130 Independent Escort Service PatnaLow Rate Call Girls Patna Anika 8250192130 Independent Escort Service Patna
Low Rate Call Girls Patna Anika 8250192130 Independent Escort Service Patnamakika9823
 
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...Miss joya
 
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...Miss joya
 
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Miss joya
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...Garima Khatri
 
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual NeedsBangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual NeedsGfnyt
 
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Servicevidya singh
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...astropune
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore EscortsCall Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escortsvidya singh
 
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...indiancallgirl4rent
 
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...narwatsonia7
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...Taniya Sharma
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escortsaditipandeya
 
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on DeliveryCall Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Deliverynehamumbai
 
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...narwatsonia7
 
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service CoimbatoreCall Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatorenarwatsonia7
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 

Recently uploaded (20)

Low Rate Call Girls Patna Anika 8250192130 Independent Escort Service Patna
Low Rate Call Girls Patna Anika 8250192130 Independent Escort Service PatnaLow Rate Call Girls Patna Anika 8250192130 Independent Escort Service Patna
Low Rate Call Girls Patna Anika 8250192130 Independent Escort Service Patna
 
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
 
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
 
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
 
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual NeedsBangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
 
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
 
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore EscortsCall Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
 
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
 
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
 
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Servicesauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
 
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on DeliveryCall Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
 
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
 
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service CoimbatoreCall Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
 

Speech encoding techniques

  • 3. What’s the need for speech coding ? Necessary in order to represent human speech in a digital form Applications: mobile/telephone communication, voice over IP Code efficiency (high quality, fewer bits) is a must
  • 4. Components of a speech coding system
  • 6. Example of coding techniques ZIP: no transformation nor quantization, apply VLC (LZW) to the stream of letters (symbols) in a file directly, lossless coding PCM for speech: no transformation, quantize the speech samples directly, apply fixed length binary coding ADPCM for speech: apply prediction to original samples, the predictor is adapted from one speech frame to the next, quantize the prediction error, error symbols coded using fixed length binary coding JPEG for image: apply discrete cosine transform to blocks of image pixels, quantize the transformed coefficients, code the quantized coefficients using variable length coding (runlength + Huffman coding)
  • 7. The aim of speech coding is to enhance the quality of a speech signal at a particular bit-rate or to minimize the bit-rate at a given quality. The bit-rate at which the speech is to be transmitted or stored depends on the rate of transmission or storage, the computation of coding the digital speech signal and the quality of the speech signal required. 7
  • 8. Low bit-rate High speech quality Robustness to different speakers/languages Channel errors Low memory requirements Less computational complexity Low coding delay 8
  • 10. 10
  • 11. Necessity Efficient speech-coding techniques Digital streams of ones and zeros The lower the bandwidth, the lower the quality Processing power The better quality (for a given bandwidth) uses a more complex algorithm A balance between quality and cost
  • 12. Voice Quality Bandwidth is easily quantified Voice quality is subjective MOS, Mean Opinion Score ITU-T Recommendation P.800 Excellent – 5 Good – 4 Fair – 3 Poor – 2 Bad – 1 A minimum of 30 people Listen to voice samples or in conversations
  • 13. About Speech Speech Air pushed from the lungs past the vocal cords and along the vocal tract The basic vibrations – vocal cords The sound is altered by the disposition of the vocal tract ( tongue and mouth) Model the vocal tract as a filter The shape changes relatively slowly The vibrations at the vocal cords The excitation signal
  • 14. 14 Speech Compression Handling speech with other media information such as text, images, video, and data is the essential part of multimedia applications The ideal speech coder has a low bit-rate, high perceived quality, low signal delay, and low complexity. Delay Less than 150 ms one-way end-to-end delay for a conversation Processing (coding) delay, network delay Complexity Computational complexity of speech coders depends on algorithms Contributes to achievable bit-rate and processing delay
  • 15. 15 Speech coding Standard voice channel: analog: 4 kHz slot (~ 40 dB SNR) digital: 64 Kbps = 8 bit µ-law x 8 kHz How to compress? Exploit redundancy signal assumed to be a single voice, not any waveform Code only what is needed intelligibility speaker identification Source-filter decomposition vocal tract shape & fundamental frequency change slowly
  • 16. 16 Taxonomy of Speech Coders Speech Coders Waveform Coders Source Coders Time Domain: PCM, ADPCM Frequency Domain: e.g. Sub-band coder, Adaptive transform coder Linear Predictive Coder Vocoder
  • 17. Speech coders are classified based on the bit-rate at which they produce output with reasonable quality and on the type of coding techniques used for coding the speech signal. 17
  • 18. Classification by Bit-Rate Type of coder Bit-Rate Range High bit-rate coders >15 Kbps Medium bit-rate coders 5 to 15 Kbps Low bit-rate coders 2 to 5 Kbps Very Low bit-rate coders <2 Kbps 18
  • 19. Classification by Coding Techniques 19  Waveform coders  Parametric coders  Hybrid coders
  • 20. Types of Speech Codecs Waveform codecs,source codecs (also known as vocoders),and hybrid codecs.
  • 21. 21 The ancestor: Channel Vocoder (1940s-1960s) Source-filter decomposition filterbank breaks into spectral bands transmit slowly-changing energy in each band 10-20 bands, perceptually spaced Downsampling Excitation with a pitch / noise model
  • 22. 22 LPC encoding The classic source-filter model Compression gains: filter parameters are ~slowly changing excitation can be represented many ways
  • 23. 23 Linear Predictive Code Model speech production system as an auto-regressive model: Model parameters are computed for speech segment (~30 ms). Parameters {a(k); k=1:p} are found by solving a Toeplitz system of equations. Transfer function To encode speech, one may transmit the quantized parameters {a(k)} and G or equivalent parameter set. The model order is 8-10 in most speech coding standards. )()()()( 1 neknskans p k +−= ∑= ∑= − − == p k k zka G zE zS zH 1 )(1 )( )( )( unvoiced G v/u voiced N random sequence generator periodic pulse train generator × H(z) = 1 1− ∑ akz-k P k = 1 u[n] Vocal Tract Model Excitation parameters v/u : voiced/ unvoiced N : pitch for voiced G : signal gain → excitation signal u[n] Vocal Tract parameters {ak } : LPC coefficients →formant structure of speech signals
  • 24. 24 Excitation parameters v/u : voiced/ unvoiced N : pitch for voiced G : signal gain → excitation signal u[n] Vocal Tract parameters {ak } : LPC coefficients →formant structure of speech signals unvoiced G v/u voiced N random sequence generator periodic pulse train generator × H(z) = 1 1− ∑ akz-k P k = 1 u[n] Vocal Tract Model
  • 26. 26 Encoding LPC filter parameters For ‘communications quality’: 8 kHz sampling (4 kHz bandwidth) ~10th order LPC (up to 5 pole pairs) update every 20-30 ms → 300 - 500 param/s Representation & quantization {ai} - poor distribution, can’t interpolate reflection coefficients {ki}: guaranteed stable log area ratios (LAR) - stable Bit allocation (filter): GSM (13 kbps): 8 LARs x 3-6 bits / 20 ms = 1.8 Kbps
  • 27. 27 Excitation Excitation as LPC residual is already better than raw signal: save several bits/sample, still > 32 Kbps Crude model: U/V flag + pitch period ~ 7 bits / 5 ms = 1.4 Kbps → LPC10 @ 2.4 Kbps
  • 28. 28 CELP Code excited linear predictive (CELP) speech coding. White noise input does not give satisfactory results: the residue sequence still contains important information for speech synthesis it is necessary to send the residue to receiving end too. To save space, use vector quantization (VQ) technique to encode the residue sequence Hence the name “code excited”. In CELP, each code book is a linear vector containing 0 or ±1 each code word length is 60 samples successive code words are overlapped by 58 samples a linear search is performed to find the best code words as input to the LPC model.
  • 29. 29 CELP Represent excitation with codebook e.g. 512 sparse excitation vectors linear search for minimum weighted error?
  • 30. 30 GSM Speech Encoder Hamming Window Segmentation Pre-emphasis Short Term Prediction LPC Inverse Filter Long Term Prediction + LPF Grid Selection MUX Gain, pitch LAR coefficients Speech input Pre-processing STP LTP Regular pulse excitation (RPE) 20ms Order = 8
  • 32. 32 Implementation Issues Tasks: LPC analysis filter to calculate the coefficients Long term prediction for pitch analysis need to find delay D and gain VQ search during CELP encoding – Most time consuming FIR filtering for pre- and post processing Often implemented in DSP chips for embedded applications (e.g. cell phone). The parameter quantization part needs bit-level operation.