SlideShare a Scribd company logo
Speech Compression                                    Uncompressed audio data rates
• Recommended Reading: J. Harrington and S.                     • Voice: 8000samples/sec, 8bits/sample,
  Cassidy, “Techniques in Speech Acoustics”,                      mono
  Kluwer, 1999
                                                                                 = 64000bits/sec (64kbps)
• Contents
                                                                • CD: 44100samples/sec, 16bits/sample,
  – Uncompressed audio data rates
                                                                  stereo
  – ADPCM
  – SB-ADPCM
                                                                         =1411200bits/sec (~1.5Mbps)
  – LPC




  ADPCM (Adaptive Differential PCM)                                                    ADPCM
• Uses the statistical properties of human speech (=> not           Measured                                   Transmitted
  compatible with fax/modem signals)                                value                                      value
• Makes a prediction about the size of the next sample, based                                  Adaptive
  on previous info                                                                             quantiser
                                                                             -
• Transmitter then sends only the difference between real
  value and predicted value
                                                                                               Predictor
• Receiver uses the same prediction algorithm, together with
  the differences to reconstruct the speech data
• Enables the data rate to be reduced to 32kbps
• Used on international telephone links
• Specified in G.721, G.722, G.723, G.726, G.727




SB-APDCM (Sub-band ADPCM)                                                           SB-ADPCM
• Given 64kbps: ADPCM could produce                                                   Upper sub-band
                                                                                      ADPCM encoder
  better than toll voice quality (eg radio)
                                                                       Input 4-7KHz                  16kbps
                                                                                                              MUX
• Sub-bands are 0-4kHz (given 48kbps), 4-                              filters
  7kHz (given 16kbps)                                                                 Lower sub-band
                                                                                      ADPCM encoder
• Low band contains more audio energy, high                                        50Hz-4KHz         48kbps
  band contains intelligibility info.
• Standardised in G.722                                          Analogue                                      Digital signal
                                                                 signal in                                     out




                                                                                                                                1
Linear Predictive Coding (LPC)                                                        LPC
• Introduced in the 1960s                                      • coefficients (‘a’s) correspond to those of a vocal
• nth signal sample is represented as a linear                   tract filter and the error signal (‘e’) corresponds to
  combination of the previous p samples, plus a                  a source signal
  residual representing the prediction error:                  • Source signal will approximate either a voiced
                                                                 signal (which looks like a series of impulses) or a
x(n) = a1x(n-1) + a2x(n-2) + … + apx(n-p) + e(n)
                                                                 white noise source
                                                               • So, LPC involves “exciting” a source signal with a
• If the error (‘e’) is small enough, we can just
  transmit the coefficients (‘a’s)                               vocal tract filter




            Impulses and Filters                                        LPC – Autocorrelation
                                                               • Minimise the error signal by choosing optimal
                                                                 coefficients (‘a’s)
                                                               • Use the autocorrelation criteria (aka root mean
                                                                 squared criterion):




                                                                                                      for 1<=j<=p,
                                                               where R is the autocorrelation of x(n) defined as
                                                                R(i) = E[x(n)x(n-i)]




  LPC – Solving the autocorrelation
                                                                                       LPC
              formula
• In matrix form the equation can be written as                • Used in:
                      R*a=r
                                                                  – GSM (Groupe Speciale Mobile) (Residual
  where the autocorrelation matrix R is a symmetric Toeplitz        Pulse Excited-LPC) (13kbps)
  matrix with elements ri,j = R(i - j), vector r is the
  autocorrelation vector rj = R(j), and vector a is the           – LD-CELP (Low-Delay Code Excited Linear
  parameter vector of ai                                            Prediction) (G.728) (16kbps)
• An algorithm by N. Levinson (proposed in 1947) and              – CS-ACELP (Conjugate Structure-Algebraic
  modified by J. Durbin (in 1959) recursively calculates the        CELP) (G.729) (8kbps)
  solution to the Toeplitz matrix.
• GSM coder uses an integer version of the Schur recursion        – MP-MLQ (Multi Pulse – Maximum Likelihood
  (1917)                                                            Quantisation) (G.723.1) (6.3kbps)…




                                                                                                                          2

More Related Content

What's hot

Line coding
Line codingLine coding
Line coding
Gagan Randhawa
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compression
neeraj9217
 
Audio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound GenerationAudio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound Generation
a3labdsp
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPC
Disha Modi
 
Digital Transmission Fundamentals
Digital Transmission FundamentalsDigital Transmission Fundamentals
Digital Transmission Fundamentals
Aisu
 
Lect2 up400 (100329)
Lect2 up400 (100329)Lect2 up400 (100329)
Lect2 up400 (100329)
aicdesign
 
Presentation ct
Presentation ctPresentation ct
Presentation ct
surabhisushant
 
3D Spatial Response
3D Spatial Response3D Spatial Response
3D Spatial Response
Ramin Anushiravani
 
A BICMOS chipset for a DVB-H front-end receiver
A BICMOS chipset for a DVB-H front-end receiverA BICMOS chipset for a DVB-H front-end receiver
A BICMOS chipset for a DVB-H front-end receiver
RFIC-IUMA
 
Digital Audio
Digital AudioDigital Audio
Digital Audio
Magic Finger Lounge
 
iDiff 2008 conference #04 IP-Racine FSSG
iDiff 2008 conference #04 IP-Racine   FSSGiDiff 2008 conference #04 IP-Racine   FSSG
iDiff 2008 conference #04 IP-Racine FSSG
Benoit Michel
 
Data bit rate_by_abhishek_wadhwa
Data bit rate_by_abhishek_wadhwaData bit rate_by_abhishek_wadhwa
Data bit rate_by_abhishek_wadhwa
Abhishek Wadhwa
 
CivcomIntelDuoBinary-TODC-OFC2006
CivcomIntelDuoBinary-TODC-OFC2006CivcomIntelDuoBinary-TODC-OFC2006
CivcomIntelDuoBinary-TODC-OFC2006
Gabriel (Gabby) Shpirer
 
Introduction To Video Compression
Introduction To Video CompressionIntroduction To Video Compression
Introduction To Video Compression
guestdd7ccca
 
Baud rate is the number of change in signal
Baud rate is the number of change in signalBaud rate is the number of change in signal
Baud rate is the number of change in signal
Abhishek Pathak
 
Application of fourier series
Application of fourier seriesApplication of fourier series
Application of fourier series
Girish Dhareshwar
 
Chap 5
Chap 5Chap 5
Chap 5
Tanuj Patel
 
3. digital transmission fundamentals
3. digital transmission fundamentals3. digital transmission fundamentals
3. digital transmission fundamentals
Rovin Valencia
 
L'explication à propos du règlement sonore
L'explication à propos du règlement sonoreL'explication à propos du règlement sonore
L'explication à propos du règlement sonore
LeSoir.be
 

What's hot (19)

Line coding
Line codingLine coding
Line coding
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compression
 
Audio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound GenerationAudio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound Generation
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPC
 
Digital Transmission Fundamentals
Digital Transmission FundamentalsDigital Transmission Fundamentals
Digital Transmission Fundamentals
 
Lect2 up400 (100329)
Lect2 up400 (100329)Lect2 up400 (100329)
Lect2 up400 (100329)
 
Presentation ct
Presentation ctPresentation ct
Presentation ct
 
3D Spatial Response
3D Spatial Response3D Spatial Response
3D Spatial Response
 
A BICMOS chipset for a DVB-H front-end receiver
A BICMOS chipset for a DVB-H front-end receiverA BICMOS chipset for a DVB-H front-end receiver
A BICMOS chipset for a DVB-H front-end receiver
 
Digital Audio
Digital AudioDigital Audio
Digital Audio
 
iDiff 2008 conference #04 IP-Racine FSSG
iDiff 2008 conference #04 IP-Racine   FSSGiDiff 2008 conference #04 IP-Racine   FSSG
iDiff 2008 conference #04 IP-Racine FSSG
 
Data bit rate_by_abhishek_wadhwa
Data bit rate_by_abhishek_wadhwaData bit rate_by_abhishek_wadhwa
Data bit rate_by_abhishek_wadhwa
 
CivcomIntelDuoBinary-TODC-OFC2006
CivcomIntelDuoBinary-TODC-OFC2006CivcomIntelDuoBinary-TODC-OFC2006
CivcomIntelDuoBinary-TODC-OFC2006
 
Introduction To Video Compression
Introduction To Video CompressionIntroduction To Video Compression
Introduction To Video Compression
 
Baud rate is the number of change in signal
Baud rate is the number of change in signalBaud rate is the number of change in signal
Baud rate is the number of change in signal
 
Application of fourier series
Application of fourier seriesApplication of fourier series
Application of fourier series
 
Chap 5
Chap 5Chap 5
Chap 5
 
3. digital transmission fundamentals
3. digital transmission fundamentals3. digital transmission fundamentals
3. digital transmission fundamentals
 
L'explication à propos du règlement sonore
L'explication à propos du règlement sonoreL'explication à propos du règlement sonore
L'explication à propos du règlement sonore
 

Similar to Z24 4 Speech Compression

add9.5.ppt
add9.5.pptadd9.5.ppt
add9.5.ppt
AshenafiGirma5
 
30 CHL PCM PDH SDH BY SKG
30 CHL PCM PDH SDH BY SKG30 CHL PCM PDH SDH BY SKG
30 CHL PCM PDH SDH BY SKG
Saroj Kumar Gochhayat
 
Speech coding techniques
Speech coding techniquesSpeech coding techniques
Speech coding techniques
Hemaraja Nayaka S
 
DC_PPT.pptx
DC_PPT.pptxDC_PPT.pptx
DC_PPT.pptx
RahulAgarwal505237
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptx
KIRUTHIKAAR2
 
Harmonic speech coding
Harmonic speech codingHarmonic speech coding
Harmonic speech coding
MuthanaALJANABI
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptx
KIRUTHIKAAR2
 
Lpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlabLpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlab
chakravarthy Gopi
 
Ncc2004 ofdm tutorial part ii-apal
Ncc2004 ofdm tutorial   part ii-apalNcc2004 ofdm tutorial   part ii-apal
Ncc2004 ofdm tutorial part ii-apal
Arpan Pal
 
Acoustic echo cancellation
Acoustic echo cancellationAcoustic echo cancellation
Acoustic echo cancellation
chintanajoshi
 
Arithmetic Coding
Arithmetic CodingArithmetic Coding
Arithmetic Coding
anithabalaprabhu
 
Sampling
SamplingSampling
Software-defined white-space cognitive systems: implementation of the spectru...
Software-defined white-space cognitive systems: implementation of the spectru...Software-defined white-space cognitive systems: implementation of the spectru...
Software-defined white-space cognitive systems: implementation of the spectru...
CSP Scarl
 
Analog mixed vlsi notes
Analog mixed vlsi notesAnalog mixed vlsi notes
Analog mixed vlsi notes
athiulla
 
A1mpeg12 2004
A1mpeg12 2004A1mpeg12 2004
A1mpeg12 2004
Thiago Skiba
 
Carrier to Noise Versus Signal to Noise.ppt
Carrier to Noise Versus Signal to Noise.pptCarrier to Noise Versus Signal to Noise.ppt
Carrier to Noise Versus Signal to Noise.ppt
AbdulMaalik17
 
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
ZeyadAlabsy
 
Logsv2
Logsv2Logsv2
Logsv2
douglaslyon
 
05 signal encodingtechniques
05 signal encodingtechniques05 signal encodingtechniques
05 signal encodingtechniques
Orbay Yeşil
 
An audio quality evaluation of digital radio system
An audio quality evaluation of digital radio systemAn audio quality evaluation of digital radio system
An audio quality evaluation of digital radio system
Rojith Thomas
 

Similar to Z24 4 Speech Compression (20)

add9.5.ppt
add9.5.pptadd9.5.ppt
add9.5.ppt
 
30 CHL PCM PDH SDH BY SKG
30 CHL PCM PDH SDH BY SKG30 CHL PCM PDH SDH BY SKG
30 CHL PCM PDH SDH BY SKG
 
Speech coding techniques
Speech coding techniquesSpeech coding techniques
Speech coding techniques
 
DC_PPT.pptx
DC_PPT.pptxDC_PPT.pptx
DC_PPT.pptx
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptx
 
Harmonic speech coding
Harmonic speech codingHarmonic speech coding
Harmonic speech coding
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptx
 
Lpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlabLpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlab
 
Ncc2004 ofdm tutorial part ii-apal
Ncc2004 ofdm tutorial   part ii-apalNcc2004 ofdm tutorial   part ii-apal
Ncc2004 ofdm tutorial part ii-apal
 
Acoustic echo cancellation
Acoustic echo cancellationAcoustic echo cancellation
Acoustic echo cancellation
 
Arithmetic Coding
Arithmetic CodingArithmetic Coding
Arithmetic Coding
 
Sampling
SamplingSampling
Sampling
 
Software-defined white-space cognitive systems: implementation of the spectru...
Software-defined white-space cognitive systems: implementation of the spectru...Software-defined white-space cognitive systems: implementation of the spectru...
Software-defined white-space cognitive systems: implementation of the spectru...
 
Analog mixed vlsi notes
Analog mixed vlsi notesAnalog mixed vlsi notes
Analog mixed vlsi notes
 
A1mpeg12 2004
A1mpeg12 2004A1mpeg12 2004
A1mpeg12 2004
 
Carrier to Noise Versus Signal to Noise.ppt
Carrier to Noise Versus Signal to Noise.pptCarrier to Noise Versus Signal to Noise.ppt
Carrier to Noise Versus Signal to Noise.ppt
 
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
03_04-AnalogDigital-HYanikomeroglu-12Jan2011_14Jan2011_Old1.ppt
 
Logsv2
Logsv2Logsv2
Logsv2
 
05 signal encodingtechniques
05 signal encodingtechniques05 signal encodingtechniques
05 signal encodingtechniques
 
An audio quality evaluation of digital radio system
An audio quality evaluation of digital radio systemAn audio quality evaluation of digital radio system
An audio quality evaluation of digital radio system
 

More from anithabalaprabhu

Shannon Fano
Shannon FanoShannon Fano
Shannon Fano
anithabalaprabhu
 
Ch 04 Arithmetic Coding ( P P T)
Ch 04  Arithmetic  Coding ( P P T)Ch 04  Arithmetic  Coding ( P P T)
Ch 04 Arithmetic Coding ( P P T)
anithabalaprabhu
 
Compression
CompressionCompression
Compression
anithabalaprabhu
 
Datacompression1
Datacompression1Datacompression1
Datacompression1
anithabalaprabhu
 
Speech Compression
Speech CompressionSpeech Compression
Speech Compression
anithabalaprabhu
 
Dictor
DictorDictor
Dictionary Based Compression
Dictionary Based CompressionDictionary Based Compression
Dictionary Based Compression
anithabalaprabhu
 
Module 4 Arithmetic Coding
Module 4 Arithmetic CodingModule 4 Arithmetic Coding
Module 4 Arithmetic Coding
anithabalaprabhu
 
Ch 04 Arithmetic Coding (Ppt)
Ch 04 Arithmetic Coding (Ppt)Ch 04 Arithmetic Coding (Ppt)
Ch 04 Arithmetic Coding (Ppt)
anithabalaprabhu
 
Compression Ii
Compression IiCompression Ii
Compression Ii
anithabalaprabhu
 
06 Arithmetic 1
06 Arithmetic 106 Arithmetic 1
06 Arithmetic 1
anithabalaprabhu
 
Lassy
LassyLassy
Compression Ii
Compression IiCompression Ii
Compression Ii
anithabalaprabhu
 
Lossy
LossyLossy
Planning
PlanningPlanning
Lossless
LosslessLossless
Losseless
LosselessLosseless
Losseless
anithabalaprabhu
 
Lec32
Lec32Lec32
Lec5 Compression
Lec5 CompressionLec5 Compression
Lec5 Compression
anithabalaprabhu
 
Huffman Student
Huffman StudentHuffman Student
Huffman Student
anithabalaprabhu
 

More from anithabalaprabhu (20)

Shannon Fano
Shannon FanoShannon Fano
Shannon Fano
 
Ch 04 Arithmetic Coding ( P P T)
Ch 04  Arithmetic  Coding ( P P T)Ch 04  Arithmetic  Coding ( P P T)
Ch 04 Arithmetic Coding ( P P T)
 
Compression
CompressionCompression
Compression
 
Datacompression1
Datacompression1Datacompression1
Datacompression1
 
Speech Compression
Speech CompressionSpeech Compression
Speech Compression
 
Dictor
DictorDictor
Dictor
 
Dictionary Based Compression
Dictionary Based CompressionDictionary Based Compression
Dictionary Based Compression
 
Module 4 Arithmetic Coding
Module 4 Arithmetic CodingModule 4 Arithmetic Coding
Module 4 Arithmetic Coding
 
Ch 04 Arithmetic Coding (Ppt)
Ch 04 Arithmetic Coding (Ppt)Ch 04 Arithmetic Coding (Ppt)
Ch 04 Arithmetic Coding (Ppt)
 
Compression Ii
Compression IiCompression Ii
Compression Ii
 
06 Arithmetic 1
06 Arithmetic 106 Arithmetic 1
06 Arithmetic 1
 
Lassy
LassyLassy
Lassy
 
Compression Ii
Compression IiCompression Ii
Compression Ii
 
Lossy
LossyLossy
Lossy
 
Planning
PlanningPlanning
Planning
 
Lossless
LosslessLossless
Lossless
 
Losseless
LosselessLosseless
Losseless
 
Lec32
Lec32Lec32
Lec32
 
Lec5 Compression
Lec5 CompressionLec5 Compression
Lec5 Compression
 
Huffman Student
Huffman StudentHuffman Student
Huffman Student
 

Recently uploaded

Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Webinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data WarehouseWebinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data Warehouse
Federico Razzoli
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 

Recently uploaded (20)

Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Webinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data WarehouseWebinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data Warehouse
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 

Z24 4 Speech Compression

  • 1. Speech Compression Uncompressed audio data rates • Recommended Reading: J. Harrington and S. • Voice: 8000samples/sec, 8bits/sample, Cassidy, “Techniques in Speech Acoustics”, mono Kluwer, 1999 = 64000bits/sec (64kbps) • Contents • CD: 44100samples/sec, 16bits/sample, – Uncompressed audio data rates stereo – ADPCM – SB-ADPCM =1411200bits/sec (~1.5Mbps) – LPC ADPCM (Adaptive Differential PCM) ADPCM • Uses the statistical properties of human speech (=> not Measured Transmitted compatible with fax/modem signals) value value • Makes a prediction about the size of the next sample, based Adaptive on previous info quantiser - • Transmitter then sends only the difference between real value and predicted value Predictor • Receiver uses the same prediction algorithm, together with the differences to reconstruct the speech data • Enables the data rate to be reduced to 32kbps • Used on international telephone links • Specified in G.721, G.722, G.723, G.726, G.727 SB-APDCM (Sub-band ADPCM) SB-ADPCM • Given 64kbps: ADPCM could produce Upper sub-band ADPCM encoder better than toll voice quality (eg radio) Input 4-7KHz 16kbps MUX • Sub-bands are 0-4kHz (given 48kbps), 4- filters 7kHz (given 16kbps) Lower sub-band ADPCM encoder • Low band contains more audio energy, high 50Hz-4KHz 48kbps band contains intelligibility info. • Standardised in G.722 Analogue Digital signal signal in out 1
  • 2. Linear Predictive Coding (LPC) LPC • Introduced in the 1960s • coefficients (‘a’s) correspond to those of a vocal • nth signal sample is represented as a linear tract filter and the error signal (‘e’) corresponds to combination of the previous p samples, plus a a source signal residual representing the prediction error: • Source signal will approximate either a voiced signal (which looks like a series of impulses) or a x(n) = a1x(n-1) + a2x(n-2) + … + apx(n-p) + e(n) white noise source • So, LPC involves “exciting” a source signal with a • If the error (‘e’) is small enough, we can just transmit the coefficients (‘a’s) vocal tract filter Impulses and Filters LPC – Autocorrelation • Minimise the error signal by choosing optimal coefficients (‘a’s) • Use the autocorrelation criteria (aka root mean squared criterion): for 1<=j<=p, where R is the autocorrelation of x(n) defined as R(i) = E[x(n)x(n-i)] LPC – Solving the autocorrelation LPC formula • In matrix form the equation can be written as • Used in: R*a=r – GSM (Groupe Speciale Mobile) (Residual where the autocorrelation matrix R is a symmetric Toeplitz Pulse Excited-LPC) (13kbps) matrix with elements ri,j = R(i - j), vector r is the autocorrelation vector rj = R(j), and vector a is the – LD-CELP (Low-Delay Code Excited Linear parameter vector of ai Prediction) (G.728) (16kbps) • An algorithm by N. Levinson (proposed in 1947) and – CS-ACELP (Conjugate Structure-Algebraic modified by J. Durbin (in 1959) recursively calculates the CELP) (G.729) (8kbps) solution to the Toeplitz matrix. • GSM coder uses an integer version of the Schur recursion – MP-MLQ (Multi Pulse – Maximum Likelihood (1917) Quantisation) (G.723.1) (6.3kbps)… 2