SlideShare a Scribd company logo
1 of 39
SPEECH CODING
TECHNIQUES
∑
∞
−∞=
−=
n
nTtts )()( δ
∑
∞
−∞=
−=
=
n
c
cs
nTttx
tstxtx
)()(
)()()(
δ
∑
∞
−∞=
Ω−Ω=Ω
k
sk
T
jS )(
2
)( δ
π
Ω
)( ΩjXc
NΩNΩ−
ΩSΩSΩ− 0
Ω
)( ΩjXc
NΩNΩ− SΩSΩ
)( NS Ω−Ω
 Nyquist sampling theorem
QUANTIZATION (SCALAR
QUANTIZATION)
v1 v2 vk+1 vL
m0= -A m1 m2 …… mk mk+1 mL−1 mL=A
· Assume | x[n] | ≤ A
divide the range [ −A , A ] into L quantization levels
{ J1
, J2
, …… Jk
,….. JL
}
Jk
: [mk-1,mk
]
L = 2
R
each quantization level Jk
is represented by a value vk
S = U Jk
, V = { v1
, v2
, …… vk
,….. vL
}
Jk+1
COMPANDING
F(x)
x[n]
Uniform
Quantization
F−1
(x)
x[n]
Uniform
Decoder
^
Compressor …1101…1101… Expandor
Compressor + Expandor → Compandor
F(x) is to specify the non-uniform quantization
characteristics
BINARY ENCODING
BINARY ENCODING
• Binary encoding: to represent a finite set of symbols using
binary codewords.
• Fixed length coding: N levels represented by (int) log2(N)
bits.
• Variable length coding (VLC): more frequently appearing
symbols represented by shorter codewords (Huffman,
arithmetic, LZW=zip).
• The minimum number of bits required to represent a source
is bounded by its entropy
TYPES OF SPEECH CODECS
• Waveform codecs,source codecs (also known as
vocoders),and hybrid codecs.
WAVEFORM-BASED CODERS
• Non-predictive coding (uniform or non-uniform): samples
are encoded independently; PCM
• Predictive coding: samples are encoded as difference from
other samples; LCP or Differential PCM (DPCM)
PCM (PULSE CODE
MODULATION)
• In PCM each sample of the signal is quantized to one of the
amplitude levels, where B is the number of bits used to
represent each sample
• The bitrate of the encoded signal will be : B*F bps where F
is the sample frequency
• The quantized waveform is modeled as:
where q(n) is the quantization noise
B
2
)()()(~ nqnsns +=
PREDICTIVE CODING (LPC
OR DPCM)
• Observation: Adjacent samples are often similar
• Predictive coding:
• Predict the current sample from previous samples, quantize and
code the prediction error, instead of the original sample.
• If the prediction is accurate most of the time, the prediction error is
concentrated near zeros and can be coded with fewer bits than the
original signal
• Usually a linear predictor is used (linear predictive coding):
∑=
−∗=
p
k
kp knxanx
1
)()(
PREDICTIVE ENCODER
DIAGRAM
PREDICTIVE DECODER
DIAGRAM
SPEECH SOURCE MODEL AND
SOURCE CODING
unvoiced
G
v/u
voiced
N
random
sequence
generator
periodic
pulse
train
generator
× G(z) = 1
1− ∑ akz-k
P
k = 1
x[n]
G(z), G(ω), g[n]
u[n]
Excitation
Vocal Tract
Model
Excitation parameters
v/u : voiced/ unvoiced
N : pitch for voiced
G : signal gain
→ excitation signal u[n]
Vocal Tract parameters
{ak
} : LPC coefficients
→formant structure of
speech signals
A good approximation,
though not precise enough
LPC VOCODER(VOICE CODER)
x[n]
LPC
Analysis
{ ak }
N , G
v/u
Encoder
…
11011…
N by pitch detection
v/u by voicing detection
Decoder
{ ak }
N , G
v/u
receiver
…
11011…
g[n]
G(z)
Ex
x[n]
{ak
} can be non-uniform or vector
quantized to reduce bit rate further
SPEECH CODING
CHARACTERISTICS
• Speech coders are lossy coders, i.e. the decoded signal is
different from the original
• The goal in speech coding is to minimize the distortion at a given
bit rate, or minimize the bit rate to reach a given distortion
• Metrics in speech coding:
• Objective measure of distortion is SNR (Signal to noise ratio);
SNR does not correlate well with perceived speech quality
• Subjective measure - MOS (mean opinion score):
• 5: excellent
• 4: good
• 3: fair
• 2: poor
• 1: bad
G.711
• The most commonplace codec
• Used in circuit-switched telephone network
• PCM, Pulse-Code Modulation
• If uniform quantization
• 12 bits * 8 k/sec = 96 kbps
• Non-uniform quantization
• 65 kbps DS0 rate
•
• North America
• A-law
• Other countries, a little friendlier to
lower signal levels
• An MOS of about 4.3
law−µ
ADPCM(ADAPTIVE
DIFFERENTIAL PCM)
• DPCM and ADPCM.
• ADPCM : Adaptive Prediction in DPCM
Adaptive Quantization
Adaptive Quantization
• Quantization level ∆ varies with local signal level
• ∆[n] = aσx
[n]
• σx
[n] : locally estimated standard deviation of x[n]
• G.721:ADPCM-coded speech at 32Kbps.
• G.726(A-law or )
• 16,24,32,40Kbps
• MOS 4.0 , at 32Kbps
law−µ
ANALYSIS-BY-SYNTHESIS (ABS)
CODECS
• Hybrid codec
• Fill the gap between waveform and source codecs
• The most successful and commonly used
• Time-domain AbS codecs
• Not a simple two-state, voiced/unvoiced
• Different excitation signals are attempted
• Closest to the original waveform is selected
• MPE, Multi-Pulse Excited
• RPE, Regular-Pulse Excited
• CELP, Code-Excited Linear Predictive
G.728 LD-CELP
• CELP codecs
• A filter; its characteristics change over time
• A codebook of acoustic vectors
• A vector = a set of elements representing various char.
of the excitation
• Transmit
• Filter coefficients, gain, a pointer to the vector chosen
• Low Delay CELP
• Backward-adaptive coder
• Use previous samples to determine filter coefficients
• Operates on five samples at a time
• Delay < 1 ms
• Only the pointer is transmitted

1024 vectors in the code book

10-bit pointer (index)

16 kbps
 LD-CELP encoder
 Minimize a frequency-weighted mean-square error
 LD-CELP decoder
 An MOS score of about 3.9
 One-quarter of G.711 bandwidth
G.723.1 ACELP
 6.3 or 5.3 kbps
 Both mandatory
 Can change from one to another during a conversation
 The coder
 A band-limited input speech signal
 Sampled at 8 KHz, 16-bit uniform PCM quantization
 Operate on blocks of 240 samples at a time
 A look-ahead of 7.5 ms
 A total algorithmic delay of 37.5 ms + other delays
 A high-pass filter to remove any DC component
 G.723.1 Annex A
 Silence Insertion Description (SID) frames
of size four octets
 The two lsbs of the first octet
 00 6.3kbps 24 octets/frame
 01 5.3kbps 20
 10 SID frame 4
 An MOS of about 3.8
 At least 37.5 ms delay
G.729
 8 kbps
 Input frames of 10 ms, 80 samples for 8 KHz
sampling rate
 5 ms look-ahead
 Algorithmic delay of 15 ms
 An 80-bit frame for 10 ms of speech
 A complex codec
 G.729.A (Annex A), a number of simplifications
 Same frame structure
 Encoder/decoder, G.729/G.729.A
 Slightly lower quality
 G.729.B
 VAD, Voice Activity Detection

Based on analysis of several parameters of the input

The current frames plus two preceding frames
 DTX, Discontinuous Transmission

Send nothing or send an SID frame

SID frame contains information to generate comfort noise
 CNG, Comfort Noise Generation
 G.729, an MOS of about 4.0
 G.729A an MOS of about 3.7
Other Codecs
 CDMA QCELP defined in IS-733
 Variable-rate coder
 Two most common rates

The high rate, 13.3 kbps

A lower rate, 6.2 kbps
 Silence suppression
 For use with RTP, RFC 2658
 GSM Enhanced Full-Rate (EFR)
 GSM 06.60
 An enhanced version of GSM Full-Rate
 ACELP-based codec
 The same bit rate and the same overall
packing structure

12.2 kbps
 Support discontinuous transmission
 For use with RTP, RFC 1890
 GSM Adaptive Multi-Rate (AMR) codec
 GSM 06.90
 Eight different modes
 4.75 kbps to 12.2 kbps
 12.2 kbps, GSM EFR
 7.4 kbps, IS-641 (TDMA cellular systems)
 Change the mode at any time
 Offer discontinuous transmission
 The coding choice of many 3G wireless
networks
 The MOS values are for laboratory
conditions
 G.711 does not deal with lost packets
 G.729 can accommodate a lost frame by
interpolating from previous frames

But cause errors in subsequent speech frames
 Processing Power
 G.728 or G.729, 40 MIPS
 G.726 10 MIPS
 Cascaded Codecs
 E.g., G.711 stream -> G.729
encoder/decoder
 Might not even come close to G.729
 Each coder only generate an
approximate of the incoming signal
Tones, Signal, and DTMF
Digits
 The hybrid codecs are optimized for human
speech
 Other data may need to be transmitted
 Tones: fax tones, dialing tone, busy tone
 DTMF digits for two-stage dialing or voice-mail
 G.711 is OK
 G.723.1 and G.729 can be unintelligible
 The ingress gateway needs to intercept
 The tones and DTMT digits
 Use an external signaling system

Easy at the start of a call

Difficult in the middle of a call
 Encode the tones differently form the speech

Send them along the same media path

An RTP packet provides the name of the tone and the
duration

Or, a dynamic RTP profile; an RTP packet containing the
frequency, volume and the duration

RFC 2198
 An RTP payload format for redundant audio data
 Sending both types of RTP payload
 RTP Payload Format for DTMF Digits
 An Internet Draft
 Both methods described before
 A large number of tones and events

DTMF digits, a busy tone, a congestion tone, a
ringing tone, etc.
 The named events
 E: the end of the tone, R: reserved
 Payload format
FINIS
DISCRETE TIME LTI SYSTEMS:
THE CONVOLUTION SUM
∑
+∞
−∞=
−=
k
knkxnx ][][][ δ
0 1 2
0 1 0 1 2 3
h[n]
x[n] y[n]
n
n n
1
0.5
2
0.5
2.5
2
∑
+∞
−∞=
−=
k
knhkxny ][][][
FREQUENCY-DOMAIN
REPRESENTATION OF
SAMPLING
∑
∞
−∞=
−=
n
nTtts )()( δ
∑
∞
−∞=
−=
=
n
c
cs
nTttx
tstxtx
)()(
)()()(
δ
∑
∞
−∞=
Ω−Ω=Ω
k
sk
T
jS )(
2
)( δ
π
Ω
)( ΩjXc
NΩNΩ−
ΩSΩSΩ− 0
Ω
)( ΩjXc
NΩNΩ− SΩSΩ
)( NS Ω−Ω
SPEECH SOURCE MODEL AND
SOURCE CODING
• Vocal Tract Model
∑=
=−+
p
k
k nxknxanu
1
][][)(
)(
)(
1
1
)(
1
zU
zX
za
zG p
k
k
k
=
−
=
∑=
−

More Related Content

What's hot

What's hot (20)

Coherent and Non-coherent detection of ASK, FSK AND QASK
Coherent and Non-coherent detection of ASK, FSK AND QASKCoherent and Non-coherent detection of ASK, FSK AND QASK
Coherent and Non-coherent detection of ASK, FSK AND QASK
 
IS-95 Cdma
IS-95 CdmaIS-95 Cdma
IS-95 Cdma
 
Ec 2401 wireless communication unit 2
Ec 2401 wireless communication   unit 2Ec 2401 wireless communication   unit 2
Ec 2401 wireless communication unit 2
 
Equalization
EqualizationEqualization
Equalization
 
Companding and DPCM and ADPCM
Companding and DPCM and ADPCMCompanding and DPCM and ADPCM
Companding and DPCM and ADPCM
 
Digital modulation techniques...
Digital modulation techniques...Digital modulation techniques...
Digital modulation techniques...
 
MINIMUM SHIFT KEYING(MSK)
MINIMUM SHIFT KEYING(MSK)MINIMUM SHIFT KEYING(MSK)
MINIMUM SHIFT KEYING(MSK)
 
Waveform coding
Waveform codingWaveform coding
Waveform coding
 
Satellite link design
Satellite link designSatellite link design
Satellite link design
 
Unit I.fundamental of Programmable DSP
Unit I.fundamental of Programmable DSPUnit I.fundamental of Programmable DSP
Unit I.fundamental of Programmable DSP
 
Chap 5 (small scale fading)
Chap 5 (small scale fading)Chap 5 (small scale fading)
Chap 5 (small scale fading)
 
Small scale fading
Small scale fadingSmall scale fading
Small scale fading
 
Subband Coding
Subband CodingSubband Coding
Subband Coding
 
Adaptive filter
Adaptive filterAdaptive filter
Adaptive filter
 
Linear block coding
Linear block codingLinear block coding
Linear block coding
 
signal and channel bandwidth
signal and channel bandwidthsignal and channel bandwidth
signal and channel bandwidth
 
Adaptive linear equalizer
Adaptive linear equalizerAdaptive linear equalizer
Adaptive linear equalizer
 
Digital modulation
Digital modulationDigital modulation
Digital modulation
 
Propagation Model
Propagation ModelPropagation Model
Propagation Model
 
Convolutional codes
Convolutional codesConvolutional codes
Convolutional codes
 

Similar to Speech coding techniques

Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...
Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...
Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...Rohde & Schwarz North America
 
Digital Earth Station
Digital Earth Station  Digital Earth Station
Digital Earth Station Susmita Pandey
 
Analog to Digital Converters and Data Acquisition Systems
Analog to Digital Converters and Data Acquisition SystemsAnalog to Digital Converters and Data Acquisition Systems
Analog to Digital Converters and Data Acquisition SystemsMathivanan Natarajan
 
All Digital Phase Lock Loop 03 12 09
All Digital Phase Lock Loop 03 12 09All Digital Phase Lock Loop 03 12 09
All Digital Phase Lock Loop 03 12 09imranbashir
 
Unit II OFDM.pdf
Unit II OFDM.pdfUnit II OFDM.pdf
Unit II OFDM.pdfvpshinde2
 
Multiband Transceivers - [Chapter 4] Design Parameters of Wireless Radios
Multiband Transceivers - [Chapter 4] Design Parameters of Wireless RadiosMultiband Transceivers - [Chapter 4] Design Parameters of Wireless Radios
Multiband Transceivers - [Chapter 4] Design Parameters of Wireless RadiosSimen Li
 
Signal and image processing on satellite communication using MATLAB
Signal and image processing on satellite communication using MATLABSignal and image processing on satellite communication using MATLAB
Signal and image processing on satellite communication using MATLABEmbedded Plus Trichy
 
Lect2 up390 (100329)
Lect2 up390 (100329)Lect2 up390 (100329)
Lect2 up390 (100329)aicdesign
 
171220027 Kaushal Verma.pdf
171220027 Kaushal Verma.pdf171220027 Kaushal Verma.pdf
171220027 Kaushal Verma.pdfsaiusa5444
 
Lecture3 signal encoding_in_wireless
Lecture3  signal encoding_in_wirelessLecture3  signal encoding_in_wireless
Lecture3 signal encoding_in_wirelessYahya Alzidi
 
ADC Conveter Performance and Limitations.ppt
ADC Conveter Performance and Limitations.pptADC Conveter Performance and Limitations.ppt
ADC Conveter Performance and Limitations.pptBEVARAVASUDEVAAP1813
 
Digital communication
Digital communicationDigital communication
Digital communicationmeashi
 
Voice over IP (VoIP)
Voice over IP (VoIP)Voice over IP (VoIP)
Voice over IP (VoIP)Peter R. Egli
 

Similar to Speech coding techniques (20)

Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...
Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...
Synchronous Time / Frequency Domain Measurements Using a Digital Oscilloscope...
 
Digital Earth Station
Digital Earth Station  Digital Earth Station
Digital Earth Station
 
Analog to Digital Converters and Data Acquisition Systems
Analog to Digital Converters and Data Acquisition SystemsAnalog to Digital Converters and Data Acquisition Systems
Analog to Digital Converters and Data Acquisition Systems
 
add9.5.ppt
add9.5.pptadd9.5.ppt
add9.5.ppt
 
GPS Signals (1)
GPS Signals (1)GPS Signals (1)
GPS Signals (1)
 
All Digital Phase Lock Loop 03 12 09
All Digital Phase Lock Loop 03 12 09All Digital Phase Lock Loop 03 12 09
All Digital Phase Lock Loop 03 12 09
 
Unit II OFDM.pdf
Unit II OFDM.pdfUnit II OFDM.pdf
Unit II OFDM.pdf
 
Multiband Transceivers - [Chapter 4] Design Parameters of Wireless Radios
Multiband Transceivers - [Chapter 4] Design Parameters of Wireless RadiosMultiband Transceivers - [Chapter 4] Design Parameters of Wireless Radios
Multiband Transceivers - [Chapter 4] Design Parameters of Wireless Radios
 
Pass band transmission
Pass band transmission Pass band transmission
Pass band transmission
 
Digital audio
Digital audioDigital audio
Digital audio
 
Signal and image processing on satellite communication using MATLAB
Signal and image processing on satellite communication using MATLABSignal and image processing on satellite communication using MATLAB
Signal and image processing on satellite communication using MATLAB
 
Gps signals
Gps signalsGps signals
Gps signals
 
Lect2 up390 (100329)
Lect2 up390 (100329)Lect2 up390 (100329)
Lect2 up390 (100329)
 
171220027 Kaushal Verma.pdf
171220027 Kaushal Verma.pdf171220027 Kaushal Verma.pdf
171220027 Kaushal Verma.pdf
 
Lecture3 signal encoding_in_wireless
Lecture3  signal encoding_in_wirelessLecture3  signal encoding_in_wireless
Lecture3 signal encoding_in_wireless
 
ADC Conveter Performance and Limitations.ppt
ADC Conveter Performance and Limitations.pptADC Conveter Performance and Limitations.ppt
ADC Conveter Performance and Limitations.ppt
 
IMT Advanced
IMT AdvancedIMT Advanced
IMT Advanced
 
Digital communication
Digital communicationDigital communication
Digital communication
 
Spread Spectrum.ppt
Spread Spectrum.pptSpread Spectrum.ppt
Spread Spectrum.ppt
 
Voice over IP (VoIP)
Voice over IP (VoIP)Voice over IP (VoIP)
Voice over IP (VoIP)
 

More from Hemaraja Nayaka S

More from Hemaraja Nayaka S (15)

Anatomy and Neurophysiology of Swallowing
Anatomy and Neurophysiology of SwallowingAnatomy and Neurophysiology of Swallowing
Anatomy and Neurophysiology of Swallowing
 
surface dyslexia
surface dyslexia surface dyslexia
surface dyslexia
 
1. models of word recognition
1. models of word recognition1. models of word recognition
1. models of word recognition
 
Speech technology basics
Speech technology   basicsSpeech technology   basics
Speech technology basics
 
Management of articulation
Management of articulationManagement of articulation
Management of articulation
 
stuttering &amp; nnf
stuttering &amp; nnfstuttering &amp; nnf
stuttering &amp; nnf
 
Articulatory dynamics in sttg
Articulatory dynamics in sttgArticulatory dynamics in sttg
Articulatory dynamics in sttg
 
Linguistic n prosodic basis
Linguistic n prosodic basisLinguistic n prosodic basis
Linguistic n prosodic basis
 
Physiological basis of fluency disorders
Physiological basis of fluency disordersPhysiological basis of fluency disorders
Physiological basis of fluency disorders
 
1. fluency introduction
1. fluency introduction1. fluency introduction
1. fluency introduction
 
laryngeal dynamics in stuttering
 laryngeal  dynamics in stuttering laryngeal  dynamics in stuttering
laryngeal dynamics in stuttering
 
Rhythm of speech
Rhythm of speech Rhythm of speech
Rhythm of speech
 
Disfluency types
Disfluency typesDisfluency types
Disfluency types
 
1. fluency introduction
1. fluency introduction1. fluency introduction
1. fluency introduction
 
Neuroanatomy of language functions
Neuroanatomy of language functionsNeuroanatomy of language functions
Neuroanatomy of language functions
 

Recently uploaded

College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort ServiceCollege Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort ServiceNehru place Escorts
 
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls ServiceCall Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Servicesonalikaur4
 
Asthma Review - GINA guidelines summary 2024
Asthma Review - GINA guidelines summary 2024Asthma Review - GINA guidelines summary 2024
Asthma Review - GINA guidelines summary 2024Gabriel Guevara MD
 
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiCall Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiNehru place Escorts
 
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service MumbaiLow Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbaisonalikaur4
 
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...narwatsonia7
 
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment Booking
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment BookingCall Girl Koramangala | 7001305949 At Low Cost Cash Payment Booking
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment Bookingnarwatsonia7
 
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service JaipurHigh Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipurparulsinha
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hosur Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking ModelsMumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Modelssonalikaur4
 
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...Miss joya
 
Call Girl Indore Vrinda 9907093804 Independent Escort Service Indore
Call Girl Indore Vrinda 9907093804 Independent Escort Service IndoreCall Girl Indore Vrinda 9907093804 Independent Escort Service Indore
Call Girl Indore Vrinda 9907093804 Independent Escort Service IndoreRiya Pathan
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...CALL GIRLS
 
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...narwatsonia7
 
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort ServiceCall Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Serviceparulsinha
 

Recently uploaded (20)

College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort ServiceCollege Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
 
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls ServiceCall Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Service
 
Asthma Review - GINA guidelines summary 2024
Asthma Review - GINA guidelines summary 2024Asthma Review - GINA guidelines summary 2024
Asthma Review - GINA guidelines summary 2024
 
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiCall Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
 
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service MumbaiLow Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
 
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
 
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment Booking
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment BookingCall Girl Koramangala | 7001305949 At Low Cost Cash Payment Booking
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment Booking
 
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service JaipurHigh Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
 
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hosur Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hosur Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
 
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking ModelsMumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
 
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
 
Call Girl Indore Vrinda 9907093804 Independent Escort Service Indore
Call Girl Indore Vrinda 9907093804 Independent Escort Service IndoreCall Girl Indore Vrinda 9907093804 Independent Escort Service Indore
Call Girl Indore Vrinda 9907093804 Independent Escort Service Indore
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
 
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
 
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort ServiceCall Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
 

Speech coding techniques

  • 2. ∑ ∞ −∞= −= n nTtts )()( δ ∑ ∞ −∞= −= = n c cs nTttx tstxtx )()( )()()( δ ∑ ∞ −∞= Ω−Ω=Ω k sk T jS )( 2 )( δ π Ω )( ΩjXc NΩNΩ− ΩSΩSΩ− 0 Ω )( ΩjXc NΩNΩ− SΩSΩ )( NS Ω−Ω  Nyquist sampling theorem
  • 3. QUANTIZATION (SCALAR QUANTIZATION) v1 v2 vk+1 vL m0= -A m1 m2 …… mk mk+1 mL−1 mL=A · Assume | x[n] | ≤ A divide the range [ −A , A ] into L quantization levels { J1 , J2 , …… Jk ,….. JL } Jk : [mk-1,mk ] L = 2 R each quantization level Jk is represented by a value vk S = U Jk , V = { v1 , v2 , …… vk ,….. vL } Jk+1
  • 4. COMPANDING F(x) x[n] Uniform Quantization F−1 (x) x[n] Uniform Decoder ^ Compressor …1101…1101… Expandor Compressor + Expandor → Compandor F(x) is to specify the non-uniform quantization characteristics
  • 6. BINARY ENCODING • Binary encoding: to represent a finite set of symbols using binary codewords. • Fixed length coding: N levels represented by (int) log2(N) bits. • Variable length coding (VLC): more frequently appearing symbols represented by shorter codewords (Huffman, arithmetic, LZW=zip). • The minimum number of bits required to represent a source is bounded by its entropy
  • 7. TYPES OF SPEECH CODECS • Waveform codecs,source codecs (also known as vocoders),and hybrid codecs.
  • 8.
  • 9. WAVEFORM-BASED CODERS • Non-predictive coding (uniform or non-uniform): samples are encoded independently; PCM • Predictive coding: samples are encoded as difference from other samples; LCP or Differential PCM (DPCM)
  • 10. PCM (PULSE CODE MODULATION) • In PCM each sample of the signal is quantized to one of the amplitude levels, where B is the number of bits used to represent each sample • The bitrate of the encoded signal will be : B*F bps where F is the sample frequency • The quantized waveform is modeled as: where q(n) is the quantization noise B 2 )()()(~ nqnsns +=
  • 11. PREDICTIVE CODING (LPC OR DPCM) • Observation: Adjacent samples are often similar • Predictive coding: • Predict the current sample from previous samples, quantize and code the prediction error, instead of the original sample. • If the prediction is accurate most of the time, the prediction error is concentrated near zeros and can be coded with fewer bits than the original signal • Usually a linear predictor is used (linear predictive coding): ∑= −∗= p k kp knxanx 1 )()(
  • 14. SPEECH SOURCE MODEL AND SOURCE CODING unvoiced G v/u voiced N random sequence generator periodic pulse train generator × G(z) = 1 1− ∑ akz-k P k = 1 x[n] G(z), G(ω), g[n] u[n] Excitation Vocal Tract Model Excitation parameters v/u : voiced/ unvoiced N : pitch for voiced G : signal gain → excitation signal u[n] Vocal Tract parameters {ak } : LPC coefficients →formant structure of speech signals A good approximation, though not precise enough
  • 15. LPC VOCODER(VOICE CODER) x[n] LPC Analysis { ak } N , G v/u Encoder … 11011… N by pitch detection v/u by voicing detection Decoder { ak } N , G v/u receiver … 11011… g[n] G(z) Ex x[n] {ak } can be non-uniform or vector quantized to reduce bit rate further
  • 16. SPEECH CODING CHARACTERISTICS • Speech coders are lossy coders, i.e. the decoded signal is different from the original • The goal in speech coding is to minimize the distortion at a given bit rate, or minimize the bit rate to reach a given distortion • Metrics in speech coding: • Objective measure of distortion is SNR (Signal to noise ratio); SNR does not correlate well with perceived speech quality • Subjective measure - MOS (mean opinion score): • 5: excellent • 4: good • 3: fair • 2: poor • 1: bad
  • 17. G.711 • The most commonplace codec • Used in circuit-switched telephone network • PCM, Pulse-Code Modulation • If uniform quantization • 12 bits * 8 k/sec = 96 kbps • Non-uniform quantization • 65 kbps DS0 rate • • North America • A-law • Other countries, a little friendlier to lower signal levels • An MOS of about 4.3 law−µ
  • 18. ADPCM(ADAPTIVE DIFFERENTIAL PCM) • DPCM and ADPCM. • ADPCM : Adaptive Prediction in DPCM Adaptive Quantization Adaptive Quantization • Quantization level ∆ varies with local signal level • ∆[n] = aσx [n] • σx [n] : locally estimated standard deviation of x[n] • G.721:ADPCM-coded speech at 32Kbps. • G.726(A-law or ) • 16,24,32,40Kbps • MOS 4.0 , at 32Kbps law−µ
  • 19. ANALYSIS-BY-SYNTHESIS (ABS) CODECS • Hybrid codec • Fill the gap between waveform and source codecs • The most successful and commonly used • Time-domain AbS codecs • Not a simple two-state, voiced/unvoiced • Different excitation signals are attempted • Closest to the original waveform is selected • MPE, Multi-Pulse Excited • RPE, Regular-Pulse Excited • CELP, Code-Excited Linear Predictive
  • 20. G.728 LD-CELP • CELP codecs • A filter; its characteristics change over time • A codebook of acoustic vectors • A vector = a set of elements representing various char. of the excitation • Transmit • Filter coefficients, gain, a pointer to the vector chosen • Low Delay CELP • Backward-adaptive coder • Use previous samples to determine filter coefficients • Operates on five samples at a time • Delay < 1 ms • Only the pointer is transmitted
  • 21.  1024 vectors in the code book  10-bit pointer (index)  16 kbps  LD-CELP encoder  Minimize a frequency-weighted mean-square error
  • 22.  LD-CELP decoder  An MOS score of about 3.9  One-quarter of G.711 bandwidth
  • 23. G.723.1 ACELP  6.3 or 5.3 kbps  Both mandatory  Can change from one to another during a conversation  The coder  A band-limited input speech signal  Sampled at 8 KHz, 16-bit uniform PCM quantization  Operate on blocks of 240 samples at a time  A look-ahead of 7.5 ms  A total algorithmic delay of 37.5 ms + other delays  A high-pass filter to remove any DC component
  • 24.  G.723.1 Annex A  Silence Insertion Description (SID) frames of size four octets  The two lsbs of the first octet  00 6.3kbps 24 octets/frame  01 5.3kbps 20  10 SID frame 4  An MOS of about 3.8  At least 37.5 ms delay
  • 25. G.729  8 kbps  Input frames of 10 ms, 80 samples for 8 KHz sampling rate  5 ms look-ahead  Algorithmic delay of 15 ms  An 80-bit frame for 10 ms of speech  A complex codec  G.729.A (Annex A), a number of simplifications  Same frame structure  Encoder/decoder, G.729/G.729.A  Slightly lower quality
  • 26.  G.729.B  VAD, Voice Activity Detection  Based on analysis of several parameters of the input  The current frames plus two preceding frames  DTX, Discontinuous Transmission  Send nothing or send an SID frame  SID frame contains information to generate comfort noise  CNG, Comfort Noise Generation  G.729, an MOS of about 4.0  G.729A an MOS of about 3.7
  • 27. Other Codecs  CDMA QCELP defined in IS-733  Variable-rate coder  Two most common rates  The high rate, 13.3 kbps  A lower rate, 6.2 kbps  Silence suppression  For use with RTP, RFC 2658
  • 28.  GSM Enhanced Full-Rate (EFR)  GSM 06.60  An enhanced version of GSM Full-Rate  ACELP-based codec  The same bit rate and the same overall packing structure  12.2 kbps  Support discontinuous transmission  For use with RTP, RFC 1890
  • 29.  GSM Adaptive Multi-Rate (AMR) codec  GSM 06.90  Eight different modes  4.75 kbps to 12.2 kbps  12.2 kbps, GSM EFR  7.4 kbps, IS-641 (TDMA cellular systems)  Change the mode at any time  Offer discontinuous transmission  The coding choice of many 3G wireless networks
  • 30.  The MOS values are for laboratory conditions  G.711 does not deal with lost packets  G.729 can accommodate a lost frame by interpolating from previous frames  But cause errors in subsequent speech frames  Processing Power  G.728 or G.729, 40 MIPS  G.726 10 MIPS
  • 31.  Cascaded Codecs  E.g., G.711 stream -> G.729 encoder/decoder  Might not even come close to G.729  Each coder only generate an approximate of the incoming signal
  • 32. Tones, Signal, and DTMF Digits  The hybrid codecs are optimized for human speech  Other data may need to be transmitted  Tones: fax tones, dialing tone, busy tone  DTMF digits for two-stage dialing or voice-mail  G.711 is OK  G.723.1 and G.729 can be unintelligible  The ingress gateway needs to intercept  The tones and DTMT digits  Use an external signaling system
  • 33.  Easy at the start of a call  Difficult in the middle of a call  Encode the tones differently form the speech  Send them along the same media path  An RTP packet provides the name of the tone and the duration  Or, a dynamic RTP profile; an RTP packet containing the frequency, volume and the duration  RFC 2198  An RTP payload format for redundant audio data  Sending both types of RTP payload
  • 34.  RTP Payload Format for DTMF Digits  An Internet Draft  Both methods described before  A large number of tones and events  DTMF digits, a busy tone, a congestion tone, a ringing tone, etc.  The named events  E: the end of the tone, R: reserved
  • 36. FINIS
  • 37. DISCRETE TIME LTI SYSTEMS: THE CONVOLUTION SUM ∑ +∞ −∞= −= k knkxnx ][][][ δ 0 1 2 0 1 0 1 2 3 h[n] x[n] y[n] n n n 1 0.5 2 0.5 2.5 2 ∑ +∞ −∞= −= k knhkxny ][][][
  • 38. FREQUENCY-DOMAIN REPRESENTATION OF SAMPLING ∑ ∞ −∞= −= n nTtts )()( δ ∑ ∞ −∞= −= = n c cs nTttx tstxtx )()( )()()( δ ∑ ∞ −∞= Ω−Ω=Ω k sk T jS )( 2 )( δ π Ω )( ΩjXc NΩNΩ− ΩSΩSΩ− 0 Ω )( ΩjXc NΩNΩ− SΩSΩ )( NS Ω−Ω
  • 39. SPEECH SOURCE MODEL AND SOURCE CODING • Vocal Tract Model ∑= =−+ p k k nxknxanu 1 ][][)( )( )( 1 1 )( 1 zU zX za zG p k k k = − = ∑= −