SlideShare a Scribd company logo
1 of 13
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 1
IEEE Speech Coding Workshop
Sept 17–20, 2000
Lake Lawn Resort
Delavan, WI
Jean-Marc Valin, Roch Lefebvre
University of Sherbrooke
Bandwidth Extension of Narrowband
Speech for Low Bit-Rate Wideband Coding
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 2
Outline
• Problem statement
• Proposed solution
• System performance
• Discussion
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 3
Problem Statement
• Telephone Band: 300 - 3400 Hz
• AM Band: 50 - 7000 Hz
• How to make sound like with 500 bits/sec?
• We need to recover information from both low and
high-frequency bands
(G.729)
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 4
Proposed Solution
0 1000 2000 3000 4000 5000 6000 7000 8000
10
20
30
40
50
60
70
80
90
100
110
Frequency (Hz)
Amplitude
(dB)
• 1) Do our best to recover the wideband information
from narrowband speech
• 2) Use coding for the information that cannot be
recovered
– Recovered information :
• Low-frequency band
• High-frequency excitation
– Coded information :
• High-frequency spectral
envelope
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 5
System Overview
• Inverse IRM filter is optional
– produces a flat response from 200-3500 Hz
Inverse
IRM
Filter
Low-frequency
regeneration
2
High-frequency
regeneration
8 kHz
narrowband
16 kHz
wideband
50-300 Hz
band
300-3400 Hz
band
3400-8000 Hz
band
Side information
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 6
Low-Frequency Regeneration (1/2)
• Assumptions :
– Only pitch harmonics need to be recovered
• In general, no more than two pitch harmonics below 200 Hz
– Absolute phase is not perceptually relevant
• Frequency of harmonics determined from pitch
analysis
• Amplitudes determined from feed-forward multi-
layer perceptron (output in log domain)
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 7
Low-Frequency Regeneration (2/2)
Low-frequency
harmonic synthesis
Scale
and sum
2
1st harmonic
2nd harmonic
Pitch
analysis
MFCC
calculation
Multi-layer
Perceptron
Pitch delay
Cepstral coefficients
Scale
factors
Narrowband
speech
Low
frequencies
Pitch gain
LP
filter
(1)
(1)
(16)
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 8
High-Frequency Extension
• Excitation-filter model (16 ms frames)
• Problem is separated in two parts
– Excitation extension (no side information)
– Spectral envelope coding (side information)
A(z)
Narrowband
speech
LPC
analysis
Excitation
extension B(z)
1
Spectral envelope
Extension
Side information
(High-frequency spectral envelope)
High-
frequency
band
High
pass
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 9
Excitation Extension
Narrowband
excitation
Absolute
value
Whitening
filter
wideband
excitation
2
High
pass
0 5 10 15 20
-0.5
0
0.5
1
0 2 4 6 8
0
2
4
6
8
10
0 5 10 15 20
0
0.2
0.4
0.6
0.8
1
0 2 4 6 8
0
5
10
15
0 5 10 15 20
-0.2
0
0.2
0.4
0.6
0.8
0 2 4 6 8
0
1
2
3
4
5
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 10
Spectral Envelope Coding
• Spectral envelope calculated from the wideband
LPC coefficients
• Quantization of the 3000-8000 Hz range (40 points)
– Log domain
– 8-bit Vector Quantization (500 bits/s side information,
using 16 ms frames)
• Concatenation with envelope obtained from LPC
analysis on narrowband speech
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 11
Objective results
• Low-frequency band
– 3 dB RMS error on harmonic amplitude
• High-frequency band
– 3.6 dB RMS error on envelope
– No objective measure for excitation extension
(perceptually very close to original)
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 12
Subjective Results
female male
Original wideband
Recovered from
original IRM-filtered speech
Recovered from
G.729 coded speech
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 13
Discussion
• Highlights
– Expand IRM-filtered telephone-band speech to AM band
– Very low side information rate (500 bits/s)
• Areas of improvement
– Use high-band spectral estimation before coding
– Use residual low-frequency information (below 300 Hz)
– Noise robustness
– Post-filtering

More Related Content

Similar to BWE2.ppt

Rt60 | Dewesoft
Rt60 | DewesoftRt60 | Dewesoft
Rt60 | DewesoftDewesoft
 
MULTIPLE INPUT MULTIPLE OUTPUT BY SAIKIRAN PANJALA
MULTIPLE INPUT MULTIPLE OUTPUT BY SAIKIRAN PANJALAMULTIPLE INPUT MULTIPLE OUTPUT BY SAIKIRAN PANJALA
MULTIPLE INPUT MULTIPLE OUTPUT BY SAIKIRAN PANJALASaikiran Panjala
 
Frequency Modulation In Data Transmission
Frequency Modulation In Data TransmissionFrequency Modulation In Data Transmission
Frequency Modulation In Data TransmissionBise Mond
 
Channel Models for Massive MIMO
Channel Models for Massive MIMOChannel Models for Massive MIMO
Channel Models for Massive MIMOCPqD
 
ULTRA WIDE BAND TECHNOLOGY
ULTRA WIDE BAND TECHNOLOGYULTRA WIDE BAND TECHNOLOGY
ULTRA WIDE BAND TECHNOLOGYAJAL A J
 
Speaker Dependent WaveNet Vocoder
Speaker Dependent WaveNet VocoderSpeaker Dependent WaveNet Vocoder
Speaker Dependent WaveNet VocoderAkira Tamamori
 
RF Basics, RF for Non-RF Engineers.pdf
RF Basics, RF for Non-RF Engineers.pdfRF Basics, RF for Non-RF Engineers.pdf
RF Basics, RF for Non-RF Engineers.pdfatsnoida3
 
Magneto-Inductive NEtworked Rescue System (MINERS): Taking Sensor Networks Un...
Magneto-Inductive NEtworked Rescue System (MINERS): Taking Sensor Networks Un...Magneto-Inductive NEtworked Rescue System (MINERS): Taking Sensor Networks Un...
Magneto-Inductive NEtworked Rescue System (MINERS): Taking Sensor Networks Un...acmarkham
 
Physical layer overview
Physical layer overviewPhysical layer overview
Physical layer overviewakruthi k
 
Digital multiplexing and transmission concept
Digital multiplexing and transmission conceptDigital multiplexing and transmission concept
Digital multiplexing and transmission conceptSaroj Kumar Gochhayat
 
fundamentals of radio RF frequency concept.pdf
fundamentals of radio RF frequency concept.pdffundamentals of radio RF frequency concept.pdf
fundamentals of radio RF frequency concept.pdfThobiusJoseph3
 
Dereverberation in the stft and log mel frequency feature domains
Dereverberation in the stft and log mel frequency feature domainsDereverberation in the stft and log mel frequency feature domains
Dereverberation in the stft and log mel frequency feature domainsTakuya Yoshioka
 
Ono sokki Laser Doppler Vibrometer lv1800e_4_denkei
Ono sokki Laser Doppler Vibrometer lv1800e_4_denkeiOno sokki Laser Doppler Vibrometer lv1800e_4_denkei
Ono sokki Laser Doppler Vibrometer lv1800e_4_denkeiNIHON DENKEI SINGAPORE
 
RADIO RECEIVERS.pptx
RADIO RECEIVERS.pptxRADIO RECEIVERS.pptx
RADIO RECEIVERS.pptxCyprianObota
 
NF Digital Lock-in Amplifier LI5600 Series
NF Digital Lock-in Amplifier LI5600 SeriesNF Digital Lock-in Amplifier LI5600 Series
NF Digital Lock-in Amplifier LI5600 SeriesNIHON DENKEI SINGAPORE
 
rffundamentalsseminarrecordingsection1v21588598809158.pdf
rffundamentalsseminarrecordingsection1v21588598809158.pdfrffundamentalsseminarrecordingsection1v21588598809158.pdf
rffundamentalsseminarrecordingsection1v21588598809158.pdfsantanusen30
 

Similar to BWE2.ppt (20)

Rt60 | Dewesoft
Rt60 | DewesoftRt60 | Dewesoft
Rt60 | Dewesoft
 
MULTIPLE INPUT MULTIPLE OUTPUT BY SAIKIRAN PANJALA
MULTIPLE INPUT MULTIPLE OUTPUT BY SAIKIRAN PANJALAMULTIPLE INPUT MULTIPLE OUTPUT BY SAIKIRAN PANJALA
MULTIPLE INPUT MULTIPLE OUTPUT BY SAIKIRAN PANJALA
 
Frequency Modulation In Data Transmission
Frequency Modulation In Data TransmissionFrequency Modulation In Data Transmission
Frequency Modulation In Data Transmission
 
Channel Models for Massive MIMO
Channel Models for Massive MIMOChannel Models for Massive MIMO
Channel Models for Massive MIMO
 
ULTRA WIDE BAND TECHNOLOGY
ULTRA WIDE BAND TECHNOLOGYULTRA WIDE BAND TECHNOLOGY
ULTRA WIDE BAND TECHNOLOGY
 
Lecture 5
Lecture 5Lecture 5
Lecture 5
 
Speaker Dependent WaveNet Vocoder
Speaker Dependent WaveNet VocoderSpeaker Dependent WaveNet Vocoder
Speaker Dependent WaveNet Vocoder
 
RF Basics, RF for Non-RF Engineers.pdf
RF Basics, RF for Non-RF Engineers.pdfRF Basics, RF for Non-RF Engineers.pdf
RF Basics, RF for Non-RF Engineers.pdf
 
Magneto-Inductive NEtworked Rescue System (MINERS): Taking Sensor Networks Un...
Magneto-Inductive NEtworked Rescue System (MINERS): Taking Sensor Networks Un...Magneto-Inductive NEtworked Rescue System (MINERS): Taking Sensor Networks Un...
Magneto-Inductive NEtworked Rescue System (MINERS): Taking Sensor Networks Un...
 
Physical layer overview
Physical layer overviewPhysical layer overview
Physical layer overview
 
Digital multiplexing and transmission concept
Digital multiplexing and transmission conceptDigital multiplexing and transmission concept
Digital multiplexing and transmission concept
 
unit 5 ADC.pptx
unit 5 ADC.pptxunit 5 ADC.pptx
unit 5 ADC.pptx
 
Part 2
Part 2Part 2
Part 2
 
fundamentals of radio RF frequency concept.pdf
fundamentals of radio RF frequency concept.pdffundamentals of radio RF frequency concept.pdf
fundamentals of radio RF frequency concept.pdf
 
ss2.pdf
ss2.pdfss2.pdf
ss2.pdf
 
Dereverberation in the stft and log mel frequency feature domains
Dereverberation in the stft and log mel frequency feature domainsDereverberation in the stft and log mel frequency feature domains
Dereverberation in the stft and log mel frequency feature domains
 
Ono sokki Laser Doppler Vibrometer lv1800e_4_denkei
Ono sokki Laser Doppler Vibrometer lv1800e_4_denkeiOno sokki Laser Doppler Vibrometer lv1800e_4_denkei
Ono sokki Laser Doppler Vibrometer lv1800e_4_denkei
 
RADIO RECEIVERS.pptx
RADIO RECEIVERS.pptxRADIO RECEIVERS.pptx
RADIO RECEIVERS.pptx
 
NF Digital Lock-in Amplifier LI5600 Series
NF Digital Lock-in Amplifier LI5600 SeriesNF Digital Lock-in Amplifier LI5600 Series
NF Digital Lock-in Amplifier LI5600 Series
 
rffundamentalsseminarrecordingsection1v21588598809158.pdf
rffundamentalsseminarrecordingsection1v21588598809158.pdfrffundamentalsseminarrecordingsection1v21588598809158.pdf
rffundamentalsseminarrecordingsection1v21588598809158.pdf
 

More from Rakesh Pogula

lec2 - Modern Processors - SIMD.pptx
lec2 - Modern Processors - SIMD.pptxlec2 - Modern Processors - SIMD.pptx
lec2 - Modern Processors - SIMD.pptxRakesh Pogula
 
Tomas_IWAENC_keynote10.ppt
Tomas_IWAENC_keynote10.pptTomas_IWAENC_keynote10.ppt
Tomas_IWAENC_keynote10.pptRakesh Pogula
 
Combinational Filters.pptx
Combinational Filters.pptxCombinational Filters.pptx
Combinational Filters.pptxRakesh Pogula
 

More from Rakesh Pogula (6)

BWE1.ppt
BWE1.pptBWE1.ppt
BWE1.ppt
 
chapter4.ppt
chapter4.pptchapter4.ppt
chapter4.ppt
 
lec2 - Modern Processors - SIMD.pptx
lec2 - Modern Processors - SIMD.pptxlec2 - Modern Processors - SIMD.pptx
lec2 - Modern Processors - SIMD.pptx
 
Tomas_IWAENC_keynote10.ppt
Tomas_IWAENC_keynote10.pptTomas_IWAENC_keynote10.ppt
Tomas_IWAENC_keynote10.ppt
 
Combinational Filters.pptx
Combinational Filters.pptxCombinational Filters.pptx
Combinational Filters.pptx
 
Research_Wu.pptx
Research_Wu.pptxResearch_Wu.pptx
Research_Wu.pptx
 

Recently uploaded

UNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxUNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxkalpana413121
 
Introduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxIntroduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxhublikarsn
 
fitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptfitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptAfnanAhmad53
 
Introduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfIntroduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfsumitt6_25730773
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesChandrakantDivate1
 
Post office management system project ..pdf
Post office management system project ..pdfPost office management system project ..pdf
Post office management system project ..pdfKamal Acharya
 
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...josephjonse
 
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementGround Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementDr. Deepak Mudgal
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdfKamal Acharya
 
Query optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsQuery optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsmeharikiros2
 
Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxWorksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxMustafa Ahmed
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdfKamal Acharya
 
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using PipesLinux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using PipesRashidFaridChishti
 
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)ChandrakantDivate1
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdfKamal Acharya
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaOmar Fathy
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"mphochane1998
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiessarkmank1
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 

Recently uploaded (20)

UNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxUNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptx
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Introduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxIntroduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptx
 
fitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptfitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .ppt
 
Introduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfIntroduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdf
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To Curves
 
Post office management system project ..pdf
Post office management system project ..pdfPost office management system project ..pdf
Post office management system project ..pdf
 
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
 
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementGround Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth Reinforcement
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Query optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsQuery optimization and processing for advanced database systems
Query optimization and processing for advanced database systems
 
Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxWorksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptx
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using PipesLinux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
 
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 

BWE2.ppt

  • 1. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 1 IEEE Speech Coding Workshop Sept 17–20, 2000 Lake Lawn Resort Delavan, WI Jean-Marc Valin, Roch Lefebvre University of Sherbrooke Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding
  • 2. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 2 Outline • Problem statement • Proposed solution • System performance • Discussion
  • 3. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 3 Problem Statement • Telephone Band: 300 - 3400 Hz • AM Band: 50 - 7000 Hz • How to make sound like with 500 bits/sec? • We need to recover information from both low and high-frequency bands (G.729)
  • 4. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 4 Proposed Solution 0 1000 2000 3000 4000 5000 6000 7000 8000 10 20 30 40 50 60 70 80 90 100 110 Frequency (Hz) Amplitude (dB) • 1) Do our best to recover the wideband information from narrowband speech • 2) Use coding for the information that cannot be recovered – Recovered information : • Low-frequency band • High-frequency excitation – Coded information : • High-frequency spectral envelope
  • 5. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 5 System Overview • Inverse IRM filter is optional – produces a flat response from 200-3500 Hz Inverse IRM Filter Low-frequency regeneration 2 High-frequency regeneration 8 kHz narrowband 16 kHz wideband 50-300 Hz band 300-3400 Hz band 3400-8000 Hz band Side information
  • 6. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 6 Low-Frequency Regeneration (1/2) • Assumptions : – Only pitch harmonics need to be recovered • In general, no more than two pitch harmonics below 200 Hz – Absolute phase is not perceptually relevant • Frequency of harmonics determined from pitch analysis • Amplitudes determined from feed-forward multi- layer perceptron (output in log domain)
  • 7. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 7 Low-Frequency Regeneration (2/2) Low-frequency harmonic synthesis Scale and sum 2 1st harmonic 2nd harmonic Pitch analysis MFCC calculation Multi-layer Perceptron Pitch delay Cepstral coefficients Scale factors Narrowband speech Low frequencies Pitch gain LP filter (1) (1) (16)
  • 8. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 8 High-Frequency Extension • Excitation-filter model (16 ms frames) • Problem is separated in two parts – Excitation extension (no side information) – Spectral envelope coding (side information) A(z) Narrowband speech LPC analysis Excitation extension B(z) 1 Spectral envelope Extension Side information (High-frequency spectral envelope) High- frequency band High pass
  • 9. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 9 Excitation Extension Narrowband excitation Absolute value Whitening filter wideband excitation 2 High pass 0 5 10 15 20 -0.5 0 0.5 1 0 2 4 6 8 0 2 4 6 8 10 0 5 10 15 20 0 0.2 0.4 0.6 0.8 1 0 2 4 6 8 0 5 10 15 0 5 10 15 20 -0.2 0 0.2 0.4 0.6 0.8 0 2 4 6 8 0 1 2 3 4 5
  • 10. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 10 Spectral Envelope Coding • Spectral envelope calculated from the wideband LPC coefficients • Quantization of the 3000-8000 Hz range (40 points) – Log domain – 8-bit Vector Quantization (500 bits/s side information, using 16 ms frames) • Concatenation with envelope obtained from LPC analysis on narrowband speech
  • 11. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 11 Objective results • Low-frequency band – 3 dB RMS error on harmonic amplitude • High-frequency band – 3.6 dB RMS error on envelope – No objective measure for excitation extension (perceptually very close to original)
  • 12. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 12 Subjective Results female male Original wideband Recovered from original IRM-filtered speech Recovered from G.729 coded speech
  • 13. Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 13 Discussion • Highlights – Expand IRM-filtered telephone-band speech to AM band – Very low side information rate (500 bits/s) • Areas of improvement – Use high-band spectral estimation before coding – Use residual low-frequency information (below 300 Hz) – Noise robustness – Post-filtering